- Cloud Computing and Resource Management
- Advanced Data Storage Technologies
- Caching and Content Delivery
- Distributed systems and fault tolerance
- Software System Performance and Reliability
- Distributed and Parallel Computing Systems
- Parallel Computing and Optimization Techniques
- Service-Oriented Architecture and Web Services
- Peer-to-Peer Network Technologies
- IoT and Edge/Fog Computing
- Green IT and Sustainability
- Data Management and Algorithms
- Interconnection Networks and Systems
- Advanced Software Engineering Methodologies
- Data Stream Mining Techniques
- Energy Efficient Wireless Sensor Networks
- Mobile Agent-Based Network Management
- Teaching and Learning Programming
- Mobile Ad Hoc Networks
- Nuclear and radioactivity studies
- Business Process Modeling and Analysis
- Nuclear Materials and Properties
- Software Testing and Debugging Techniques
- Software-Defined Networks and 5G
- Software Engineering Research
Thai Nguyen University
2024
Rutgers Sexual and Reproductive Health and Rights
2006-2022
Victoria University of Wellington
2022
Hai Phong University of Management and Technology
2022
Rutgers, The State University of New Jersey
2009-2021
University of Central Oklahoma
2021
Office of the Director
2019
University of Houston
2018
Université de Technologie de Troyes
2016
Oregon State University
2007
Interest has been growing in powering datacenters (at least partially) with renewable or "green" sources of energy, such as solar or wind. However, it is challenging to use these sources because, unlike the "brown" (carbon-intensive) energy drawn from the electrical grid, they are not always available. This means that energy demand and supply must be matched, if we are to take full advantage of the green energy to minimize brown energy consumption. In this paper, we investigate how to manage a datacenter's computational workload to match the green energy supply....
In this paper, we propose GreenSlot, a parallel batch job scheduler for a datacenter powered by a photovoltaic solar array and the electrical grid (as a backup). GreenSlot predicts the amount of solar energy that will be available in the near future, and schedules the workload to maximize the green energy consumption while meeting the jobs' deadlines. If grid energy must be used to avoid deadline violations, the scheduler selects times when it is cheap. Our results for production scientific workloads demonstrate that GreenSlot can increase green energy consumption by up to 117% and decrease energy cost by up to 39%,...
Several companies have recently announced plans to build "green" datacenters, i.e. datacenters partially or completely powered by renewable energy. These datacenters will either generate their own renewable energy or draw it directly from an existing nearby plant. Besides reducing carbon footprints, renewable energy can potentially reduce energy costs, reduce peak power costs, or both. However, certain renewable fuels are intermittent, which requires approaches for tackling the energy supply variability. One approach is to use batteries and/or the electrical grid as a backup for the renewable energy. It...
We introduce PlanetP, a content addressable publish/subscribe service for unstructured peer-to-peer (P2P) communities. PlanetP supports content addressing by providing: (1) a gossiping layer used to globally replicate a membership directory and an extremely compact content index; and (2) a completely distributed content search and ranking algorithm that helps users find the most relevant information. PlanetP is a simple, yet powerful system for sharing information. PlanetP is simple because each peer must only perform a periodic, randomized, point-to-point message...
In this paper, we first study the impact of load placement policies on cooling and maximum data center temperatures in cloud service providers that operate multiple geographically distributed data centers. Based on this study, we then propose dynamic load distribution policies that consider all electricity-related costs as well as transient cooling effects. Our evaluation studies the ability of different cooling strategies to handle load spikes, compares the behaviors of our dynamic cost-aware policies to cost-unaware and static policies, and explores the effects of many parameter settings. Among...
We propose and evaluate a framework for creating and running approximation-enabled MapReduce programs. Specifically, we propose approximation mechanisms that fit naturally into the MapReduce paradigm, including input data sampling, task dropping, and accepting and running a precise and a user-defined approximate version of the MapReduce code. We then show how to leverage statistical theories to compute error bounds for popular classes of MapReduce programs when approximating with input data sampling and/or task dropping. We implement the proposed mechanisms and error bound estimations in a prototype system called...
The large amount of energy consumed by Internet services represents significant and fast-growing financial and environmental costs. Increasingly, services are exploring dynamic methods to minimize energy costs while respecting their service-level agreements (SLAs). Furthermore, it will soon be important for these services to manage their usage of "brown energy" (produced via carbon-intensive means) relative to renewable or "green" energy. This paper introduces a general, optimization-based framework for enabling multi-data-center services to manage their brown...
This paper presents the design and evaluation of a novel distributed shared hosting approach, DMap, for managing dynamic identifier to locator mappings in the global Internet. DMap is the foundation for a fast global name resolution service necessary to enable emerging Internet services such as seamless mobility support, content delivery and cloud computing. Our approach distributes identifier to locator mappings among Autonomous Systems (ASs) by directly applying K>1 consistent hash functions on the identifier to produce network addresses of the AS gateway routers at which...
Traditionally, network software has been structured in a monolithic fashion with all protocol stacks executing either within the kernel or in a single trusted user-level server. This organization is motivated by performance and security concerns. However, considerations of code maintenance, ease of debugging, customization, and the simultaneous existence of multiple protocols argue for separating the implementations into more manageable user-level libraries of protocols. This paper describes the design and implementation of transport protocols as...
We demonstrate a framework for improving the availability of cluster based Internet services. Our approach models Internet services as a collection of interconnected components, each possessing well defined interfaces and failure semantics. Such a decomposition allows designers to engineer high availability based on an understanding of the interconnections and isolated fault behavior of each component, as opposed to ad-hoc methods. In this work, we focus on using the entire commodity workstation as a component because it possesses natural, fault-isolated...
The large amount of energy consumed by Internet services represents significant and fast-growing financial and environmental costs. This paper introduces a general, optimization-based framework and several request distribution policies that enable multi-data-center services to manage their brown energy consumption and leverage green energy, while respecting their service-level agreements (SLAs) and minimizing energy cost. Our policies can be used to abide by caps on brown energy consumption that might arise from various scenarios such as government imposed Kyoto-style carbon...
Demand for clean products and services is increasing as society is becoming increasingly aware of climate change. In response, many enterprises are setting explicit sustainability goals and implementing initiatives to reduce carbon emissions. Quantification and disclosure of such goals and initiatives have become important marketing tools. As enterprises and individuals shift their workloads to the cloud, this drive toward quantification and disclosure will lead to demand for quantifiable green cloud services. Thus, we argue that cloud providers should offer a new class...
Interactive service providers have strict requirements on high-percentile (tail) latency to meet user expectations. If providers meet tail latency targets with less energy, they increase profits, because energy is a significant operating expense. Unfortunately, optimizing tail latency and energy are typically conflicting goals. Our work resolves this conflict by exploiting servers with per-core Dynamic Voltage and Frequency Scaling (DVFS) and Asymmetric Multicore Processors (AMPs). We introduce the Adaptive Slow-to-Fast scheduling framework,...
Traditionally, network software has been structured in a monolithic fashion with all protocol stacks executing either within the kernel or in a single trusted user-level server. This organization is motivated by performance and security concerns. However, considerations of code maintenance, ease of debugging, customization, and the simultaneous existence of multiple protocols argue for separating the implementations into more manageable user-level libraries of protocols. The present paper describes the design and implementation...
Recent research has found that operators frequently misconfigure Internet services, causing various availability and performance problems. In this paper, we propose a software infrastructure that eliminates several types of misconfiguration by automating the generation of configuration files in Internet services, even as the services evolve. The infrastructure comprises a custom scripting language, configuration file templates, communicating runtime monitors, and heuristic algorithms to detect dependencies between configuration parameters and select ideal configurations. To...
On-site renewable energy has the potential to reduce datacenters' carbon footprint and power costs. The authors built Parasol, a solar-powered datacenter, and GreenSwitch, a system for scheduling workloads, to explore this potential in a controlled research setting.
Interest in powering data centers at least partially using on-site renewable sources, e.g. solar or wind, has been growing. In fact, researchers have studied distributed services comprising networks of such "green" data centers, and load distribution approaches that "follow the renewables" to maximize their use. However, prior works have not considered where to site such a network for efficient production of renewable energy, while minimizing both data center and renewable plant building costs. Moreover, researchers have not built real load management systems...
Autograding systems are increasingly being deployed to meet the challenge of teaching programming at scale. We propose a methodology for extending autograders to provide meaningful feedback for incorrect programs. Our methodology starts with the instructor identifying the concepts and skills important to each programming assignment, and designing a comprehensive test suite. Tests are then applied to code submissions to learn classes of common errors and produce classifiers to automatically categorize errors in future submissions. The instructor maps the errors to concepts and skills and writes hints to help students...
We consider the problem of increasing the availability of shared data in peer-to-peer systems. In particular, we conservatively estimate the amount of excess storage required to achieve a practical availability of 99.9% by studying a decentralized algorithm that only depends on a modest amount of loosely synchronized global state. Our algorithm uses randomized decisions extensively together with a novel application of an erasure code to tolerate autonomous peer actions as well as staleness in the loosely synchronized global state. We study the behavior of this algorithm in three distinct environments modeled on previously...