NFDI4DS | UHH-SEMS - Publication Details

Sameh Elnikety

ORCID: 0000-0003-3478-2824

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5077856625

Research Areas

Cloud Computing and Resource Management
Parallel Computing and Optimization Techniques
Caching and Content Delivery
Distributed systems and fault tolerance
Advanced Data Storage Technologies
Distributed and Parallel Computing Systems
Data Management and Algorithms
Advanced Database Systems and Queries
Software System Performance and Reliability
Graph Theory and Algorithms
IoT and Edge/Fog Computing
Optimization and Search Problems
Peer-to-Peer Network Technologies
Age of Information Optimization
Advanced Graph Neural Networks
Scientific Computing and Data Management
Service-Oriented Architecture and Web Services
Web Data Mining and Analysis
Advanced Image and Video Retrieval Techniques
Algorithms and Data Compression
Mobile Agent-Based Network Management
Advanced Neural Network Applications
Data Mining Algorithms and Applications
Consumer Market Behavior and Pricing
Blockchain Technology Applications and Security

Microsoft (United States)
2013-2024

Microsoft Research (United Kingdom)
2011-2024

Concordia University
2019

Max Planck Institute for Software Systems
2017

Google (United States)
2017

École Polytechnique Fédérale de Lausanne
2003-2007

Rice University
2003

Migrating server storage to SSDs

OPENALEX - Publications

Dushyanth Narayanan Eno Thereska Austin Donnelly Sameh Elnikety Antony Rowstron

Recently, flash-based solid-state drives (SSDs) have become standard options for laptop and desktop storage, but their impact on enterprise server storage has not been studied. Provisioning is challenging. It requires optimizing the performance, capacity, power reliability needs of expected workload, all while minimizing financial costs. In this paper we analyze a number workload traces from servers in both large small data centers, to decide whether how SSDs should be used support each. We...

10.1145/1519065.1519081 article EN 2009-04-01

A method for transparent admission control and request scheduling in e-commerce web sites

OPENALEX - Publications

Sameh Elnikety Erich Nahum John Tracey Willy Zwaenepoel

This paper presents a method for admission control and request scheduling multiply-tiered e-commerce Web sites, achieving both stable behavior during overload improved response times. Our externally observes execution costs of requests online, distinguishing different types, performs protection preferential using relatively simple measurements straight forward mechanism. Unlike previous proposals, which require extensive changes to the server or operating system, our requires no...

10.1145/988672.988710 article EN 2004-05-17

Database Replication Using Generalized Snapshot Isolation

OPENALEX - Publications

Sameh Elnikety Willy Zwaenepoel Fernando Pedone

Generalized snapshot isolation extends as used in Oracle and other databases a manner suitable for replicated databases. While (conventional) requires that transactions observe the "latest" of database, generalized allows use "older" snapshots, facilitating implementation. We show many desirable properties remain. In particular, read-only never block or abort they do not cause update to abort. Moreover, under certain assumptions on transaction workload execution is serializable. An...

10.1109/reldis.2005.14 article EN 2006-10-11

Orbe

OPENALEX - Publications

Jiaqing Du Sameh Elnikety Amitabha Roy Willy Zwaenepoel

We propose two protocols that provide scalable causal consistency for both partitioned and replicated data stores using dependency matrices (DM) physical clocks. The DM protocol supports basic read update operations uses two-dimensional to track dependencies in a client session. It utilizes the transitivity of causality sparse matrix encoding keep metadata small bounded. DM-Clock extends support read-only transactions loosely synchronized

10.1145/2523616.2523628 article EN 2013-10-01

Swayam

OPENALEX - Publications

Arpan Gujarati Sameh Elnikety Yuxiong He Kathryn S. McKinley Björn B. Brandenburg

Developers use Machine Learning (ML) platforms to train ML models and then deploy these as web services for inference (prediction). A key challenge platform providers is guarantee response-time Service Level Agreements (SLAs) workloads while maximizing resource efficiency. Swayam a fully distributed autoscaling framework that exploits characteristics of production deliver on the dual efficiency SLA compliance. Our contributions are (1) model-based takes into account SLAs workload...

10.1145/3135974.3135993 article EN 2017-11-30

Faster and Cheaper Serverless Computing on Harvested Resources

OPENALEX - Publications

Yanqi Zhang Íñigo Goiri Gohar Irfan Chaudhry Rodrigo Fonseca Sameh Elnikety and 2 more

Serverless computing is becoming increasingly popular due to its ease of programming, fast elasticity, and fine-grained billing. However, the serverless provider still needs provision, manage, pay IaaS for virtual machines (VMs) hosting platform. This ties cost platform underlying VMs. One way significantly reduce use spare resources, which cloud providers rent at a massive discount. Harvest VMs offer such cheap resources: they grow shrink harvest all unallocated CPU cores in their host...

10.1145/3477132.3483580 article EN 2021-10-19

Performance comparison of middleware architectures for generating dynamic web content

OPENALEX - Publications

Emmanuel Cecchet Anupam Chanda Sameh Elnikety Julie Marguerite Willy Zwaenepoel

10.5555/1515915.1515933 article EN ACM/IFIP/USENIX international conference on Middleware 2003-06-16

Predictive parallelization

OPENALEX - Publications

Myeongjae Jeon Saehoon Kim Seung-won Hwang Yuxiong He Sameh Elnikety and 2 more

Web search engines are optimized to reduce the high-percentile response time consistently provide fast responses almost all user queries. This is a challenging task because query workload exhibits large variability, consisting of many short-running queries and few long-running that significantly impact time. With modern multicore servers, parallelizing processing an individual promising solution execution time, but it gives limited benefits compared sequential since most see little or no...

10.1145/2600428.2609572 article EN 2014-07-03

Few-to-Many

OPENALEX - Publications

M. E. Haque Yong hun Eom Yuxiong He Sameh Elnikety Ricardo Bianchini and 1 more

Interactive services, such as Web search, recommendations, games, and finance, must respond quickly to satisfy customers. Achieving this goal requires optimizing tail (e.g., 99th+ percentile) latency. Although every server is multicore, parallelizing individual requests reduce latency challenging because (1) service demand unknown when arrive; (2) blindly all oversubscribes hardware resources; (3) the numerous short will not improve This paper introduces Few-to-Many (FM) incremental...

10.1145/2694344.2694384 article EN 2015-03-03

Everest: scaling down peak loads through I/O off-loading

OPENALEX - Publications

Dushyanth Narayanan Austin Donnelly Eno Thereska Sameh Elnikety Antony Rowstron

Bursts in data center workloads are a real problem for storage subsystems. Data volumes can experience peak I/O request rates that over an order of magnitude higher than average load. This requires significant overprovisioning, and often still results latency during peaks.In to address this we propose Everest, which allows written overloaded volume be temporarily off-loaded into short-term virtual store. Everest creates the store by opportunistically pooling underutilized resources either on...

10.5555/1855741.1855743 article EN Operating Systems Design and Implementation 2008-12-08

Clock-SI: Snapshot Isolation for Partitioned Data Stores Using Loosely Synchronized Clocks

OPENALEX - Publications

Jiaqing Du Sameh Elnikety Willy Zwaenepoel

Clock-SI is a fully distributed protocol that implements snapshot isolation (SI) for partitioned data stores. It derives and commit timestamps from loosely synchronized clocks, rather than centralized timestamp authority as used in current systems. A transaction obtains its by reading the clock at originating partition provides corresponding consistent across all partitions. In contrast to using authority, has availability performance benefits: avoids single point of failure potential...

10.1109/srds.2013.26 article EN 2013-09-01

Zeta

OPENALEX - Publications

Yuxiong He Sameh Elnikety James R. Larus Chenyu Yan

This paper presents a scheduling model for class of interactive services in which requests are time bounded and lower result quality can be traded shorter execution time. These applications include web search engines, finance servers, other interactive, on-line services. We develop an efficient algorithm, Zeta, that allocates processor among service to maximize the minimize variance response.

10.1145/2391229.2391241 article EN 2012-10-14

Mercury: A memory-constrained spatio-temporal real-time search on microblogs

OPENALEX - Publications

Amr Magdy Mohamed F. Mokbel Sameh Elnikety Suman Nath Yuxiong He

This paper presents Mercury; a system for real-time support of top-k spatio-temporal queries on microblogs, where users are able to browse recent microblogs near their locations. With high arrival rates Mercury ensures query response within tight memory-constrained environment. bounds its search space include only those that have arrived certain spatial and temporal boundaries, in which the according ranking function, returned results. employs: (a) scalable dynamic in-memory index structure...

10.1109/icde.2014.6816649 article EN 2014-03-01

G-SPARQL

OPENALEX - Publications

Sherif Sakr Sameh Elnikety Yuxiong He

We propose a SPARQL-like language, G-SPARQL, for querying attributed graphs. The language expresses types of queries which large interest applications model their data as graphs such as: pattern matching, reachability and shortest path queries. Each query can combine both structural predicates value-based (on the attributes graph nodes edges). describe an algebraic compilation mechanism our proposed is extended from relational algebra based on basic construct building SPARQL queries, Triple...

10.1145/2396761.2396806 article EN 2012-10-29

Exploiting heterogeneity for tail latency and energy efficiency

OPENALEX - Publications

M. E. Haque Yuxiong He Sameh Elnikety Thu D. Nguyen Ricardo Bianchini and 1 more

Interactive service providers have strict requirements on high-percentile (tail) latency to meet user expectations. If tail targets with less energy, they increase profits, because energy is a significant operating expense. Unfortunately, optimizing and are typically conflicting goals. Our work resolves this conflict by exploiting servers per-core Dynamic Voltage Frequency Scaling (DVFS) Asymmetric Multicore Processors (AMPs). We introduce the Adaptive Slow-to-Fast scheduling framework,...

10.1145/3123939.3123956 article EN 2017-10-14

SmartHarvest

OPENALEX - Publications

Yawen Wang Kapil Arya Marios Kogias Manohar Vanga Aditya Bhandari and 5 more

We can increase the efficiency of public cloud datacenters by harvesting allocated but temporarily idling CPU cores from customer virtual machines (VMs) to run batch or analytics workloads. Even small gains translate into substantial savings, since provisioning and operating a datacenter costs hundreds millions dollars per year. The main challenge is harvest idle with little no impact on VMs, which could be running latency-sensitive services are essentially black-boxes provider.

10.1145/3447786.3456225 article EN 2021-04-21

Tashkent

OPENALEX - Publications

Sameh Elnikety Steven Dropsho Fernando Pedone

In stand-alone databases, the functions of ordering transaction commits and making effects transactions durable are performed in one single action, namely writing commit record to disk. For efficiency many these writes grouped into a disk operation. replicated databases which all replicas agree on order update transactions, two typically separated. Specifically, replication middleware determines global order, while database make durable.The contribution this paper is demonstrate that...

10.1145/1217935.1217947 article EN 2006-04-18

One-copy serializability with snapshot isolation under the hood

OPENALEX - Publications

Mihaela Bornea Orion Hodson Sameh Elnikety Alan Fekete

This paper presents a method that allows replicated database system to provide global isolation level stronger than the provided on each individual replica. We propose new multi-version concurrency control algorithm called, serializable generalized snapshot (SGSI), targets middleware systems. Each replica runs locally and replication guarantees one-copy serializability. introduce novel techniques level, namely readset extraction enhanced certification prevents read-write write-write...

10.1109/icde.2011.5767897 article EN 2011-04-01

Adaptive parallelism for web search

OPENALEX - Publications

Myeongjae Jeon Yuxiong He Sameh Elnikety Alan L. Cox Scott Rixner

A web search query made to Microsoft Bing is currently parallelized by distributing the processing across many servers. Within each of these servers, is, however, processed sequentially. Although server may be multiple queries concurrently, with modern multicore parallelizing an individual within nonetheless improve user's experience reducing response time. In this paper, we describe issues that make parallelization a challenging, and present approach effectively addresses challenges. Since...

10.1145/2465351.2465367 article EN 2013-04-15

Horton: Online Query Execution Engine for Large Distributed Graphs

OPENALEX - Publications

Mohamed Sarwat Sameh Elnikety Yuxiong He Gabriel Kliot

Graphs are used in many large-scale applications, such as social networking. The management of these graphs poses new challenges too large for a single server to manage efficiently. Current distributed techniques map-reduce and Pregel not well-suited processing interactive ad-hoc queries against graphs. In this paper we demonstrate Horton, query execution engine Horton defines language that allows the expression regular reach ability provides with optimizer on parallel. demo, show...

10.1109/icde.2012.129 article EN 2012-04-01

Horton+

OPENALEX - Publications

Mohamed Sarwat Sameh Elnikety Yuxiong He Mohamed F. Mokbel

Horton+ is a graph query processing system that executes declarative reachability queries on partitioned attributed multi-graph. It employs language, optimizer, and distributed execution engine. The language expresses queries, supports closures predicates node edge attributes to match paths. We introduce three algebraic operators, select, traverse, join, compiled into an plan containing these operators. As access the elements in random pattern, therefore maintained main memory of cluster...

10.14778/2556549.2556573 article EN Proceedings of the VLDB Endowment 2013-09-01

Delayed-Dynamic-Selective (DDS) Prediction for Reducing Extreme Tail Latency in Web Search

OPENALEX - Publications

Saehoon Kim Yuxiong He Seung-won Hwang Sameh Elnikety Seungjin Choi

A commercial web search engine shards its index among many servers, and therefore the response time of a query is dominated by slowest server that processes query. Prior approaches target improving responsiveness reducing tail latency an individual server. They predict execution time, if predicted to be long-running, it runs in parallel, otherwise sequentially. These are, however, not accurate enough for high when responses are aggregated from servers because this requires each reduce...

10.1145/2684822.2685289 article EN 2015-01-28

BitFunnel

OPENALEX - Publications

Bob Goodwin Michael Hopcroft Dan Luu Alex Clemmer Mihaela Curmei and 2 more

Since the mid-90s there has been a widely-held belief that signature files are inferior to inverted for text indexing. In recent years Bing search engine developed and deployed an index based on bit-sliced signatures. This index, known as BitFunnel, replaced existing production system index. The driving factor behind shift away from was operational cost savings. paper describes algorithmic innovations changes in cloud computing landscape led us reconsider eventually field technology once...

10.1145/3077136.3080789 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2017-07-28

Coming Soon ...