- Network Traffic and Congestion Control
- Scientific Computing and Data Management
- Distributed and Parallel Computing Systems
- Complex Network Analysis Techniques
- Internet Traffic Analysis and Secure E-voting
- Research Data Management Practices
- Peer-to-Peer Network Technologies
- Green IT and Sustainability
- Data Quality and Management
- Cloud Computing and Resource Management
- Semantic Web and Ontologies
- Microbial Metabolic Engineering and Bioproduction
- Network Packet Processing and Optimization
- Innovative Human-Technology Interaction
- Service-Oriented Architecture and Web Services
- Protein Structure and Dynamics
- Enzyme Structure and Function
- Context-Aware Activity Recognition Systems
- Interactive and Immersive Displays
- Modular Robots and Swarm Intelligence
- Statistics Education and Methodologies
- Advanced Proteomics Techniques and Applications
- Data Analysis with R
- Mobile Crowdsensing and Crowdsourcing
- E-Government and Public Services
University of Southern California
2001-2025
Marina Del Rey Hospital
2020
University of California, Los Angeles
2012-2016
UCLA Health
2013-2015
Integrated Systems Incorporation (United States)
2001-2002
Carnegie Mellon University
1998
Mercator is a program that uses hop-limited probes-the same primitive used in traceroute-to infer an Internet map. It informed random address probing to carefully exploring the IP space when determining router adjacencies, source-route capable routers wherever possible enhance fidelity of resulting map, and employs novel mechanisms for resolving aliases (interfaces belonging router). This paper describes design these heuristics our experiences with Mercator, presents some preliminary analysis
Following the long-held belief that Internet is hierarchical, network topology generators most widely used by research community, Transit-Stub and Tiers, create networks with a deliberately hierarchical structure. However, in 1999 seminal paper Faloutsos et al. revealed Internet's degree distribution power-law. Because distributions produced Tiers are not power-laws, community has largely dismissed them as inadequate proposed new attempt to generate graphs power-law distributions.Contrary...
The impact of routing policy on Internet paths is poorly understood. In theory, the can inflate shortest-router-hop paths. To our knowledge, extent this inflation has not been previously examined. Using a simplified model in Internet, we obtain approximate indications Our findings suggest that does length significantly. For instance, policy, some 20% are inflated by more than five router-level hops.
One of the many benefits multicast, when compared to traditional unicast, is that multicast reduces overall network load. While importance beyond dispute, there have been surprisingly few attempts quantify multicast's reduction in The only substantial and quantitative effort we are aware Chuang Sirbu [3]. They calculate number links L a delivery tree connecting random source m distinct sites; extensive simulations over range networks suggest L(m) ∝ m0.8. In this paper examine function...
A key challenge for grid computing is creating large-scale, end-to-end scientific applications that draw from pools of specialized components to derive elaborate new results. We develop Pegasus, an AI planning system which integrated into the environment takes a user's highly specified desired results, generates valid workflows take account available resources, and submits execution on grid. also begin extend it as more distributed knowledge-rich architecture.
We present ohmage, a mobile to web platform that records, analyzes, and visualizes data from both prompted experience samples entered by the user, as well continuous streams of passively collected sensors onboard device. ohmage has been used in number research health stu
Participatory sensing (PS) is a distributed data collection and analysis approach where individuals, acting alone or in groups, use their personal mobile devices to systematically explore interesting aspects of lives communities [Burke et al. 2006]. These can be used capture diverse spatiotemporal through both intermittent self-report continuous recording from on-board sensors applications. Ohmage (http://ohmage.org) modular extensible open-source, Web PS platform that records, stores,...
Structures of many complex biological assemblies are increasingly determined using integrative approaches, in which data from multiple experimental methods combined. A standalone system, called PDB-Dev, has been developed for archiving structures and making them publicly available. Here, the standards software tools that support PDB-Dev described along with new updated components data-collection, processing infrastructure. Following FAIR (Findable, Accessible, Interoperable Reusable)...
IHMCIF (github.com/ihmwg/IHMCIF) is a data information framework that supports archiving and disseminating macromolecular structures determined by integrative or hybrid modeling (IHM), making them Findable, Accessible, Interoperable, Reusable (FAIR). an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF) serves as for (PDB) to archive experimentally atomic biological macromolecules their complexes with one another small molecule...
In a recent and much celebrated paper, Faloutsos <i>et al.</i> [6] found that the inter Autonomous System (AS) topology exhibits power-law degree distribution. This result was quite unexpected in networking community, stirred significant interest exploring possible causes of this phenomenon. The work Barabasi [2], its application to network generation Medina [9], have explored promising class models yield strict distributions. These models, which we will refer collectively as...
Following the long-held belief that Internet is hierarchical, network topology generators most widely used by research community, Transit-Stub and Tiers, create networks with a deliberately hierarchical structure. However, in 1999 seminal paper Faloutsos et al. revealed Internet's degree distribution power-law. Because distributions produced Tiers are not power-laws, community has largely dismissed them as inadequate proposed new attempt to generate graphs power-law distributions.Contrary...
No abstract available.
In our previous work, we used a simplified model of routing policy in the Internet to study impact on path-lengths. This prior work suffered from two shortcomings--it was based single snapshot topology, and could generate AS paths that violate peering relationships. this paper, address these shortcomings by re-examining results with respect more recent Internet, improving avoid violation. We find observations regarding path inflation due appear hold both across time sophisticated policy.
Smartphones can capture diverse spatio-temporal data about an individual; including both intermittent self-report, and continuous passive collection from onboard sensors applications. The resulting personal streams support powerful inference the user's state, behavior, well-being environment. However making sense acting on these multi-dimensional, heterogeneous requires iterative intensive exploration of datasets, development customized analysis techniques that are appropriate for a...
One of the many benefits multicast, when compared to traditional unicast, is that multicast reduces overall network load. While importance beyond dispute, there have been surprisingly few attempts quantify multicast's reduction in The only substantial and quantitative effort we are aware Chuang Sirbu [3]. They calculate number links L a delivery tree connecting random source m distinct sites; extensive simulations over range networks suggest L(m) &prop; 0.8 . In this paper examine...
A fundamental task on the Grid is to decide what jobs run computing resources based job or application requirements. Our previous work ontology-based matchmaking discusses a resource mechanism using Semantic Web technologies. We extend our provide dynamic access such capability by building persistent online service. implementation uses Globus Toolkit for service development, and exploits monitoring discovery in infrastructure dynamically discover update information. describe architecture of...
The pace of discovery in eScience is increasingly dependent on a scientist's ability to acquire, curate, integrate, analyze, and share large diverse collections data. It all too common for investigators spend inordinate amounts time developing ad hoc procedures manage their In previous work, we presented DERIVA, Scientific Asset Management System, designed accelerate data driven discovery. this paper, report the use DERIVA number substantial applications. We describe lessons have learned,...
The Common Fund Data Ecosystem (CFDE) has created a flexible system of data federation that enables researchers to discover datasets from across the US National Institutes Health without requiring owners move, reformat, or rehost those data. This is centered on catalog integrates detailed descriptions biomedical individual Programs' Coordination Centers (DCCs) into uniform metadata model can then be indexed and searched centralized portal. Crosscut Metadata Model (C2M2) supports wide variety...
Making sense of data is complex, and the knowledge skills required to understand "Big Data" - many open sources go beyond those taught in traditional introductory statistics courses. The Mobilize project has created implemented a course for secondary students, Introduction Data Science (IDS), that aims develop computational statistical thinking so students can access analyze from variety non-traditional sources. Although does not directly address source data, such are used curriculum, an...
No abstract available.
Database evolution is a notoriously difficult task, and it exacerbated by the necessity to evolve database-dependent applications. As science becomes increasingly dependent on sophisticated data management, need an array of database-driven systems will only intensify. In this paper, we present architecture for data-centric ecosystems that allows components seamlessly co-evolve centralizing models mappings at service pushing model-adaptive interactions database clients. Boundary objects fill...
The foundation of data oriented scientific collaboration is the ability for participants to find, access and reuse created during course an investigation, what has been referred as FAIR principles. In this paper, we describe ERMrest, a collaborative management service that promotes by enabling throughout life cycle. ERMrest RESTful web discovery organizing diverse assets into dynamic entity relationship model. We present details on design implementation its performance use range...