Ganesh Ananthanarayanan

ORCID: 0000-0002-7479-1664
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Cloud Computing and Resource Management
  • Caching and Content Delivery
  • IoT and Edge/Fog Computing
  • Advanced Neural Network Applications
  • Video Surveillance and Tracking Methods
  • Anomaly Detection Techniques and Applications
  • Peer-to-Peer Network Technologies
  • Distributed and Parallel Computing Systems
  • Advanced Data Storage Technologies
  • Image and Video Quality Assessment
  • Data Stream Mining Techniques
  • Software-Defined Networks and 5G
  • Adversarial Robustness in Machine Learning
  • Wireless Networks and Protocols
  • Video Analysis and Summarization
  • Advanced Image and Video Retrieval Techniques
  • Opportunistic and Delay-Tolerant Networks
  • Advanced Malware Detection Techniques
  • Visual Attention and Saliency Detection
  • Parallel Computing and Optimization Techniques
  • Security and Verification in Computing
  • Image Enhancement Techniques
  • Scientific Computing and Data Management
  • Human Pose and Action Recognition
  • Graph Theory and Algorithms

Microsoft (United States)
2007-2024

Seattle University
2022-2024

Microsoft Research (United Kingdom)
2007-2023

University of California, Berkeley
2008-2020

Microsoft Research (India)
2006

Experience froman operational Map-Reduce cluster reveals that outliers significantly prolong job completion. The causes for include run-time contention processor, memory and other resources, disk failures, varying bandwidth congestion along network paths and, imbalance in task workload. We present Mantri, a system monitors tasks culls using cause- resource-aware techniques. Mantri's strategies restarting outliers, network-aware placement of protecting outputs valuable tasks. Using real-time...

10.5555/1924943.1924962 article EN Operating Systems Design and Implementation 2010-10-04

Video analytics will drive a wide range of applications with great potential to impact society. A geographically distributed architecture public clouds and edges that extend down the cameras is only feasible approach meeting strict real-time requirements large-scale live video analytics.

10.1109/mc.2017.3641638 article EN Computer 2017-01-01

Tasks in modern data parallel clusters have highly diverse resource requirements, along CPU, memory, disk and network. Any of these resources may become bottlenecks hence, the likelihood wasting due to fragmentation is now larger. Today's schedulers do not explicitly reduce fragmentation. Worse, since they only allocate cores that ignore (disk network) can be over-allocated leading interference, failures hogging or memory could been used by other tasks. We present Tetris, a cluster scheduler...

10.1145/2619239.2626334 article EN 2014-08-12

Applying deep convolutional neural networks (NN) to video data at scale poses a substantial systems challenge, as improving inference accuracy often requires prohibitive cost in computational resources. While it is promising balance resource and by selecting suitable NN configuration (e.g., the resolution frame rate of input video), one must also address significant dynamics configuration's impact on analytics accuracy. We present Chameleon, controller that dynamically picks best...

10.1145/3230543.3230574 article EN 2018-08-07

To improve data availability and resilience MapReduce frameworks use file systems that replicate uniformly. However, analysis of job logs from a large production cluster shows wide disparity in popularity. Machines racks storing popular content become bottlenecks; thereby increasing the completion times jobs accessing this even when there are machines with spare cycles cluster. address problem, we present Scarlett, system replicates blocks based on their By accurately predicting popularity...

10.1145/1966445.1966472 article EN 2011-04-10

Organizations deploy a hierarchy of clusters - cameras, private clusters, public clouds for analyzing live video feeds from their cameras. Video analytics queries have many implementation options which impact resource demands and accuracy outputs. Our objective is to select the "query plan" implementations (and knobs) place it across merge common components maximize average query accuracy. This challenging task, because we consider multi-resource (network compute) constraints in hierarchical...

10.1109/sec.2018.00016 article EN 2018-10-01

Low latency analytics on geographically distributed datasets (across datacenters, edge clusters) is an upcoming and increasingly important challenge. The dominant approach of aggregating all the data to a single datacenter significantly inflates timeliness analytics. At same time, running queries over geo-distributed inputs using current intra-DC frameworks also leads high query response times because these cannot cope with relatively low variable capacity WAN links. We present Iridium,...

10.1145/2785956.2787505 article EN 2015-08-17

Tasks in modern data parallel clusters have highly diverse resource requirements, along CPU, memory, disk and network. Any of these resources may become bottlenecks hence, the likelihood wasting due to fragmentation is now larger. Today's schedulers do not explicitly reduce fragmentation. Worse, since they only allocate cores that ignore (disk network) can be over-allocated leading interference, failures hogging or memory could been used by other tasks. We present Tetris, a cluster scheduler...

10.1145/2740070.2626334 article EN ACM SIGCOMM Computer Communication Review 2014-08-17

Low latency analytics on geographically distributed datasets (across datacenters, edge clusters) is an upcoming and increasingly important challenge. The dominant approach of aggregating all the data to a single datacenter significantly inflates timeliness analytics. At same time, running queries over geo-distributed inputs using current intra-DC frameworks also leads high query response times because these cannot cope with relatively low variable capacity WAN links. We present Iridium,...

10.1145/2829988.2787505 article EN ACM SIGCOMM Computer Communication Review 2015-08-17

Mobile devices are increasingly equipped with multiple network interfaces complementary characteristics. In particular, the Wi-Fi interface has high throughput and transfer power efficiency, but its idle consumption is prohibitive. this paper we present, Blue-Fi, a sytem that predicts availability of connectivity by using combination bluetooth contact-patterns cell-tower information. This allows device to intelligently switch on only when there available, thus avoiding long periods in state...

10.1145/1555816.1555842 article EN 2009-06-22

In big data analytics, timely results, even if based on only part of the data, are often good enough. For this reason, approximation jobs, which have deadline or error bounds and require a subset their tasks to complete, projected dominate workloads. Straggler an important hurdle when designing approximate analytic frameworks, widely adopted approach deal with them is speculative execution. paper, we present GRASS, carefully uses speculation mitigate impact stragglers in jobs. GRASS's design...

10.5555/2616448.2616475 article EN Networked Systems Design and Implementation 2014-04-02

Given the well-known tradeoffs between fairness, performance, and efficiency, modern cluster schedulers often prefer instantaneous fairness as their primary objective to ensure performance isolation users groups. However, instantaneous, short-term convergence does not result in noticeable long-term benefits. Instead, we propose an altruistic, approach, CARBYNE, where jobs yield fractions of allocated resources without impacting own completion times. We show that leftover collected via...

10.5555/3026877.3026884 article EN Operating Systems Design and Implementation 2016-11-02

As clusters continue to grow in size and complexity, providing scalable predictable performance is an increasingly important challenge. A crucial roadblock achieving stragglers, i.e., tasks that take significantly longer than expected run. At this point, speculative execution has been widely adopted mitigate the impact of stragglers. However, speculation mechanisms are designed operated independently job scheduling when, fact, a copy task direct on resources available for other jobs. In...

10.1145/2785956.2787481 article EN 2015-08-17

Straggler tasks continue to be a major hurdle in achieving faster completion of data intensive applications running on modern data-processing frameworks. Existing straggler mitigation techniques are inefficient due their reactive and replicative nature -- they rely wait-speculate-re-execute mechanism, thus leading delayed detection resource utilization. proactive also over-utilize resources replication. modeling-based approaches hard for production-level adoption modeling errors. We present...

10.1145/2670979.2671005 article EN 2014-11-03

Large volumes of video are continuously recorded by cameras deployed for traffic control and surveillance with the goal answering after fact queries such as: identify frames objects certain classes (cars, bags) from many days video. Current systems processing on large datasets incur either high cost at ingest time or latency query time. We present Focus, a system providing both low-cost low-latency querying datasets. Focus' architecture flexibly effectively divides work between At (on live...

10.5555/3291168.3291188 article EN 2018-10-08

Interactive real-time streaming applications such as audio-video conferencing, online gaming and app streaming, place stringent requirements on the network in terms of delay, jitter, packet loss. Many these inherently involve client-to-client communication, which is particularly challenging since performance need to be met while traversing public wide-area (WAN). This different from typical situation cloud-to-client where WAN can often bypassed by moving a communication end-point cloud...

10.1145/2934872.2934907 article EN 2016-08-01

Recent work has made the case for geo-distributed analytics, where data collected and stored at multiple datacenters edge sites world-wide is analyzed in situ to drive operational management decisions. A key issue such systems ensuring low response times analytics queries issued against data. central determinant of time query execution plan (QEP). Current optimizers do not consider network when deriving QEPs, which a drawback as are connected via WAN links with heterogeneous modest...

10.5555/3026877.3026911 article EN Operating Systems Design and Implementation 2016-11-02

Mobile devices are increasingly equipped with multiple network interfaces: Wireless Local Area Network (WLAN) interfaces for local connectivity and Wide (WWAN) wide-area connectivity. The WWAN typically provides much wider coverage but lower speeds than the WLAN. To address this dichotomy, we present COMBINE, a system collaborative downloading wherein that within WLAN range pool together their links, significantly increasing effective speed available to them.

10.1145/1247660.1247693 article EN 2007-06-13

Providing timely results in the face of rapid growth data volumes has become important for analytical frameworks. For this reason, frameworks increasingly operate on only a subset input data. A key property such sampling is that combinatorially many subsets are present. We present KMN, system leverages these choices to perform data-aware scheduling, i.e., minimize time taken by tasks read their inputs, DAG tasks. KMN not uses co-locate with but also percolates combinatorial downstream...

10.5555/2685048.2685072 article EN Operating Systems Design and Implementation 2014-10-06
Coming Soon ...