- Advanced Data Storage Technologies
- Parallel Computing and Optimization Techniques
- Cloud Computing and Resource Management
- Distributed Systems and Fault Tolerance
- Software System Performance and Reliability
- Algorithms and Data Compression
- Complexity and Algorithms in Graphs
- Advanced Database Systems and Queries
- Advanced Memory and Neural Computing
- Interconnection Networks and Systems
- Distributed and Parallel Computing Systems
- IoT and Edge/Fog Computing
- Data Management and Algorithms
- Caching and Content Delivery
- Machine Learning and Algorithms
- Data Quality and Management
- Advanced Graph Theory Research
- Green IT and Sustainability
- Network Packet Processing and Optimization
- Graph Theory and Algorithms
- DNA and Biological Computing
- Security and Verification in Computing
- Embedded Systems Design Techniques
- Modular Robots and Swarm Intelligence
- Simulation Techniques and Applications
Intel (United States)
2013-2024
Intel (United Kingdom)
2015-2018
Rice University
1985-2006
In-memory computing is gaining popularity as a means of sidestepping the performance bottlenecks of block storage operations. However, the volatile nature of DRAM makes these systems vulnerable to system crashes, while the need to continuously refresh massive amounts of passive memory-resident data increases power consumption. Emerging storage-class memory (SCM) technologies combine fast DRAM-like cache-line access granularity with the persistence of storage devices like disks or SSDs, resulting in potential 10x-100x gains,...
Non-volatile byte-addressable memory has the potential to revolutionize system architecture by providing instruction-grained direct access to vast amounts of persistent data. We describe a non-intrusive memory controller that uses backend operations for achieving lightweight failure atomicity. By moving synchronous persistent memory operations to the background, performance overheads are minimized. Our solution avoids costly software intervention by decoupling isolation and concurrency-driven atomicity from failure atomicity and durability, and does not require...
The identification of performance issues on specific computer architectures has a variety of important benefits, such as tuning software to improve performance, comparing the performance of various platforms, and assisting in the design of new platforms. In order to enable this analysis, most modern micro-processors provide access to hardware-based event counters. Unfortunately, features such as out-of-order execution, pre-fetching, and speculation complicate the interpretation of the raw data. Thus, the traditional approach of assigning a uniform...
Advances in memory technology are promising the availability of byte-addressable persistent memory as an integral component of future computing platforms. This change has significant implications for software that has traditionally made a sharp distinction between durable and volatile storage. In this paper we describe a software-hardware architecture, WrAP, that provides atomicity and durability while simultaneously ensuring that fast paths through the cache, DRAM, and persistent memory layers are not slowed down by burdensome buffering or...
In-memory database management systems (DBMSs) outperform disk-oriented systems for on-line transaction processing (OLTP) workloads. But this improved performance is only achievable when the database is smaller than the amount of physical memory available in the system. To overcome this limitation, some in-memory DBMSs can move cold data out of volatile DRAM to secondary storage. Such data appears as if it resides in memory with the rest of the database even though it does not.
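As a rough illustration of the cold-data idea in the abstract above, the following Python sketch keeps recently used tuples in a bounded in-memory tier and spills the least recently used ones to a stand-in for secondary storage. It is a toy under stated assumptions, not the mechanism of any particular DBMS: the class name `TieredStore`, the fields `capacity`, `hot`, and `cold`, and the LRU policy are all hypothetical choices made for the example.

```python
from collections import OrderedDict


class TieredStore:
    """Toy key-value store: hot tuples stay in memory, cold ones are spilled.

    Hypothetical illustration only. A plain dict stands in for secondary
    storage, and LRU is an assumed eviction policy.
    """

    def __init__(self, capacity):
        self.capacity = capacity      # max number of tuples kept "in DRAM"
        self.hot = OrderedDict()      # least recently used entry comes first
        self.cold = {}                # stand-in for disk/SSD

    def _evict_if_needed(self):
        # Move least recently used tuples out until the hot tier fits.
        while len(self.hot) > self.capacity:
            key, value = self.hot.popitem(last=False)
            self.cold[key] = value

    def put(self, key, value):
        self.cold.pop(key, None)      # drop any stale spilled copy
        self.hot[key] = value
        self.hot.move_to_end(key)     # mark as most recently used
        self._evict_if_needed()

    def get(self, key):
        if key in self.hot:
            self.hot.move_to_end(key)
            return self.hot[key]
        # Cold hit: fetch back transparently, so the caller sees no difference.
        value = self.cold.pop(key)    # raises KeyError if the key is unknown
        self.hot[key] = value
        self._evict_if_needed()
        return value


store = TieredStore(capacity=2)
store.put("a", 1)
store.put("b", 2)
store.put("c", 3)                     # "a" is now the coldest and is spilled
```

Calling `store.get("a")` afterwards returns the value through the same interface as a hot read, which mirrors the point that spilled data still appears to reside in memory.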
This paper addresses the challenges of coupling byte-addressable non-volatile memory (NVM) and hardware transactional memory (HTM) in high-performance transaction processing. We first show that HTM transactions can be ordered using existing processor instructions without any hardware changes. In contrast, existing solutions posit changes to HTM mechanisms in the form of special instructions or modified functionality. We exploit the ordering mechanism to design a novel persistence method that decouples HTM concurrency from back-end NVM operations. Failure atomicity is...
This paper proposes a collaborative approach in which applications can provide guidance to the operating system regarding the allocation and recycling of physical memory. The operating system incorporates this guidance to decide which physical page should be used to back a particular virtual page. The key intuition behind this approach is that application software, as the generator of memory accesses, is best equipped to inform the operating system about the relative access rates and overlapping patterns of usage of its own address space. It is also capable of steering its own algorithms in order to keep its dynamic memory footprint under...
As it becomes ever more pervasively engaged in data-driven commerce, a modern enterprise becomes increasingly dependent upon reliable and high-speed transaction services. At the same time, it aspires to capitalize upon large inflows of information to draw timely business insights and improve business results. These two imperatives are frequently in conflict because of the widely divergent strategies that must be pursued: the need to bolster on-line transactional processing generally drives a business towards a small cluster of high-end servers running...
Performance and energy efficiency in memory have become critically important for a wide range of computing domains. However, it is difficult to control and optimize memory power and performance because these effects depend upon activity across multiple layers of the vertical execution stack. To address this challenge, we construct a novel and collaborative framework that employs object placement, cross-layer communication, and page-level management to effectively distribute application objects in the DRAM hardware to achieve...
This paper introduces a software policy for memory management in heterogeneous memory systems in order to improve the trade-offs between performance and power consumption, while attempting to make the best use of the different characteristics of the underlying memory technologies. In this policy, the operating system and the application co-schedule pages and make informed decisions about allocation and migration. Software-Controlled 2-Level Memory (Soft2LM) is a hardware-agnostic approach to efficient memory usage that allows region-based allocations,...
The growing popularity of hosted storage services and shared storage infrastructure in data centers is driving the recent interest in resource management and QoS in storage systems. The bursty nature of storage workloads raises significant performance and provisioning challenges, leading to increased infrastructure, management, and energy costs. We present a novel dynamic workload shaping framework to handle bursty workloads, where the arrival stream is dynamically decomposed to isolate its bursts, and then rescheduled to exploit available slack. We show how...
Up-to-date business intelligence has become a critical differentiator for the modern data-driven and highly engaged enterprise. It requires rapid integration of new information on a continuous basis for subsequent analyses. ETL-based and traditionally batch-processing oriented methods of absorbing changes into a relational database schema take time, and are therefore incompatible with the very low-latency demands of realtime analytics. Instead, in-memory clustered stores that employ tunable consistency mechanisms...
Computer systems with multiple tiers of memory devices with different latency, bandwidth, and capacity characteristics are quickly becoming mainstream. Due to cost and physical limitations, device tiers that enable better performance typically include less capacity. Such heterogeneous memory systems require alternative data management strategies to utilize the capacity-constrained resources efficiently. However, current techniques are often limited because they rely on inflexible hardware caching or manual modifications to source...
Energy efficiency is an important factor in designing and configuring enterprise servers. In these servers, memory may consume 40% of the total system power. Different memory configurations (sizes, numbers of ranks, speeds, etc.) can have significant impacts on the performance and energy consumption of enterprise workloads. Many of these workloads, such as decision support systems (DSS), require large amounts of memory. This paper investigates the potential to save energy by making the memory configuration adaptive to workload behavior. We present a case...
Analysis of workload execution and identification of software and hardware performance barriers provide critical engineering benefits; these include guidance on software optimization, hardware design tradeoffs, configuration tuning, and comparative assessments for platform selection. This paper uses Model trees to build statistical regression models for the SPEC CPU2006 and SPEC OMP2001 suites. These models link performance to key microarchitectural events. The models provide detailed recipes for identifying the key performance factors for each suite and determining the contribution of each factor...
The growing popularity of hosted storage services and shared storage infrastructure in data centers is driving the recent interest in resource management and QoS in storage systems. The bursty nature of storage workloads raises significant performance and provisioning challenges, leading to increased requirements, costs, and energy consumption. We present a novel workload shaping framework to handle bursty workloads, where the arrival stream is dynamically decomposed to isolate its bursts, and then rescheduled to exploit available slack. We show how decomposition...
Emerging Persistent Memory technologies (also PM, Non-Volatile DIMMs, Storage Class Memory, or SCM) hold tremendous promise for accelerating popular data-management applications like in-memory databases. However, programmers now need to deal with ensuring the atomicity of transactions on Persistent Memory resident data and maintaining consistency between the order in which processors perform stores and the order in which the updated values become durable.
As scaling of conventional memory devices has stalled, many high-end computing systems have begun to incorporate alternative memory technologies to meet performance goals. Since these technologies present distinct advantages and tradeoffs compared to conventional DDR* SDRAM, such as higher bandwidth with lower capacity or vice versa, they are typically packaged alongside conventional SDRAM in a heterogeneous memory architecture. To utilize the different types of memory efficiently, new data management strategies are needed to match application usage to the best available...
Controlling the distribution and usage of memory power is often difficult, because these effects typically depend on activity across multiple layers of the vertical execution stack. To address this challenge, we construct a novel and collaborative framework that employs object placement, cross-layer communication, and page-level management to effectively distribute application objects in the DRAM hardware to achieve desired power/performance goals. This work describes the design and implementation of our framework, which...
As data centers and end users become increasingly reliant on virtualization technology, more efficient and accurate methods of profiling such systems are needed. However, under virtualization, the virtual machine and the OS each try to manage the same resources independently, the underlying hardware is now multiplexed between many streams of execution, and non-trivial interference can be caused by seemingly unrelated resources. While sampling techniques are effective at gathering average behaviors over long runs, understanding...
By providing instruction-grained access to vast amounts of persistent data with ordinary loads and stores, byte-addressable storage class memory (SCM) has the potential to revolutionize system architecture. We describe a non-intrusive SCM controller for achieving light-weight failure atomicity through back-end operations. Our solution avoids costly software intervention by decoupling isolation and concurrency-driven atomicity from failure atomicity and durability, and does not require changes to the front-end cache hierarchy. Two...