NFDI4DS | UHH-SEMS - Publication Details

Jennifer M. Schopf

ORCID: 0000-0003-0726-3674

A5069008173

Research Areas

Distributed and Parallel Computing Systems
Scientific Computing and Data Management
Parallel Computing and Optimization Techniques
Advanced Data Storage Technologies
Software System Performance and Reliability
Cloud Computing and Resource Management
Distributed systems and fault tolerance
Peer-to-Peer Network Technologies
Research Data Management Practices
Network Security and Intrusion Detection
Data Quality and Management
Genetics, Bioinformatics, and Biomedical Research
Semantic Web and Ontologies
Advanced Database Systems and Queries
Interconnection Networks and Systems
Software-Defined Networks and 5G
Anomaly Detection Techniques and Applications
Big Data and Business Intelligence
Service-Oriented Architecture and Web Services
Network Traffic and Congestion Control
Advanced Data Compression Techniques
Plant Disease Management Techniques
Green IT and Sustainability
Internet Traffic Analysis and Secure E-voting
Real-Time Systems Scheduling

The University of Texas at Austin
2023

Indiana University
2017-2022

Indiana University Bloomington
2016-2021

Indiana State University
2021

Purdue University West Lafayette
2019

Woods Hole Oceanographic Institution
2009-2012

Energy Sciences Network
2010

Argonne National Laboratory
2001-2007

University of Chicago
2006-2007

University of Edinburgh
2006-2007

Adaptive computing on the grid using AppLeS

OPENALEX - Publications

Francine Berman Rich Wolski Henri Casanova Walfredo Cirne Holly Dail and 10 more

Ensembles of distributed, heterogeneous resources, also known as computational grids, have emerged as critical platforms for high-performance and resource-intensive applications. Such platforms provide the potential for applications to aggregate enormous bandwidth, computational power, memory, secondary storage, and other resources during a single execution. However, achieving this performance potential in dynamic, heterogeneous environments is challenging. Recent experience with distributed applications indicates that adaptivity is fundamental to achieving application performance in dynamic...

10.1109/tpds.2003.1195409 article EN IEEE Transactions on Parallel and Distributed Systems 2003-04-01

Application-level scheduling on distributed heterogeneous networks

Francine Berman Rich Wolski Silvia Figueira Jennifer M. Schopf Gary Shao

Heterogeneous networks are increasingly being used as platforms for resource-intensive distributed parallel applications. A critical contributor to the performance of such applications is the scheduling of constituent application tasks on the network. Since often the distributed resources cannot be brought under the control of a single global scheduler, the application must be scheduled by the user. To obtain the best performance, the user must take into account both application-specific and dynamic system information in developing a schedule which meets his or...

10.1145/369028.369109 article EN 1996-11-17

A performance study of monitoring and information services for distributed systems

X. Zhang Jeffrey Freschl Jennifer M. Schopf

Monitoring and information services form a key component of a distributed system, or Grid. A quantitative study of such services can aid in understanding the performance limitations, advise in the deployment of the monitoring system, and help evaluate future development work. To this end, we study the performance of three monitoring and information services for distributed systems: the Globus Toolkit® Monitoring and Discovery Service (MDS2), the European Data Grid Relational Grid Monitoring Architecture (R-GMA), and Hawkeye, part of the Condor project. We perform experiments to test their scalability with respect to the number of users, the number of resources, and the amount...

10.1109/hpdc.2003.1210036 article EN 2004-01-24

Conservative Scheduling

Lingyun Yang Jennifer M. Schopf Ian Foster

In heterogeneous and dynamic environments, efficient execution of parallel computations can require mappings of tasks to processors whose performance is both irregular (because of heterogeneity) and time-varying (because of dynamicity). While adaptive domain decomposition techniques have been used to address heterogeneous resource capabilities, temporal variations in those capabilities have seldom been considered. We propose a conservative scheduling policy that uses information about expected future variance in resource capabilities to produce more efficient data mapping...

10.1145/1048935.1050182 article EN 2003-11-15

Predicting the performance of wide area data transfers

S. Vazhkudai Jennifer M. Schopf Ian Foster

As Data Grids become more commonplace, large data sets are being replicated and distributed to multiple sites, leading to the problem of determining which replica can be accessed most efficiently. The answer to this question can depend on many factors, including physical characteristics of the resources and the load behavior of the CPUs, networks, and storage devices that are part of the end-to-end path linking possible sources and sinks. We develop a predictive framework that combines (1) integrated instrumentation that collects information about...

10.1109/ipdps.2002.1015510 article EN 2002-01-01

Homeostatic and tendency-based CPU load predictions

Lingyun Yang Ian Foster Jennifer M. Schopf

The dynamic nature of a resource-sharing environment means that applications must be able to adapt their behavior in response to changes in system status. Predictions of future system performance can be used to guide such adaptations. In this paper, we present and evaluate several new one-step-ahead and low-overhead time series prediction strategies that track recent trends by giving more weight to recent data. We present results that show that a tendency prediction model with different ascending and descending behavior performs best among all strategies studied. A comparative study...

10.1109/ipdps.2003.1213129 article EN 2004-03-22

Using Regression Techniques to Predict Large Data Transfers

Sudharshan S. Vazhkudai Jennifer M. Schopf

The recent proliferation of Data Grids and the increasingly common practice of using resources as distributed data stores provide a convenient environment for communities of researchers to share, replicate, and manage access to copies of large datasets. This has led to the question of which replica can be accessed most efficiently. In such environments, fetching data from one of the several replica locations requires accurate predictions of end-to-end transfer times. The answer to this question can depend on many factors, including physical characteristics...

10.1177/1094342003173004 article EN The International Journal of High Performance Computing Applications 2003-08-01

Performance prediction in production environments

Jennifer M. Schopf Francine Berman

Accurate performance predictions are difficult to achieve for parallel applications executing on production distributed systems. Conventional point-valued performance parameters and prediction models are often inaccurate since they can only represent one point in a range of possible behaviors. The authors address this problem by allowing characteristic application and system data to be represented by a set of possible values and their probabilities, which they call stochastic values. They give a practical methodology for using stochastic values as adaptable...

10.1109/ipps.1998.669995 article EN 2002-11-27

Performance analysis of the Globus Toolkit Monitoring and Discovery Service, MDS2

Xuehai Zhang Jennifer M. Schopf

Monitoring and information services form a key component of a distributed system, or grid. A quantitative study of such services can aid in understanding the performance limitations, advise in the deployment of the monitoring system, and help evaluate future development work. To this end, we examined the performance of the Globus Toolkit® Monitoring and Discovery Service (MDS2) by instrumenting its main services using NetLogger. Our study shows a strong advantage to caching or prefetching the data, as well as the need to have primary components at well-connected sites.

10.1109/pccc.2004.1395199 article EN IEEE International Conference on Performance, Computing, and Communications, 2004 2005-02-22

Monitoring the grid with the Globus Toolkit MDS4

Jennifer M. Schopf Laura Pearlman Neill Miller Carl Kesselman Ian Foster and 2 more

The Globus Toolkit Monitoring and Discovery System (MDS4) defines and implements mechanisms for service and resource discovery and monitoring in distributed environments. MDS4 is distinguished from previous similar systems by its extensive use of interfaces and behaviors defined in the WS-Resource Framework and WS-Notification specifications, and by its deep integration into essentially every component of the Globus Toolkit. We describe the MDS4 architecture and the Web service interfaces and behaviors that allow users to discover resources and services, monitor resource and service states, receive updates on...

10.1088/1742-6596/46/1/072 article EN Journal of Physics Conference Series 2006-09-01

Predicting sporadic grid data transfers

Sudharshan S. Vazhkudai Jennifer M. Schopf

The increasingly common practice of replicating datasets and using resources as distributed data stores in grid environments has led to the problem of determining which replica can be accessed most efficiently. Due to diverse performance characteristics and load variations of several components in the end-to-end path linking these various locations, selecting a replica from among many requires accurate prediction information of end-to-end data transfer times between the sources and sinks. In this paper we present a prediction system that is based on...

10.1109/hpdc.2002.1029918 article EN 2003-06-25

Stochastic scheduling

Jennifer M. Schopf Francine Berman

Jennifer M. Schopf (Computer Science Department, Northwestern University) and Francine Berman (Dept. of Computer Science and Engineering, University of California, San Diego). SC '99: Proceedings of the 1999 ACM/IEEE Conference on Supercomputing, January 1999, pages 48–es.

10.1145/331532.331580 article EN 1999-01-01

Grids: The Top Ten Questions

Jennifer M. Schopf Bill Nitzberg

The design and implementation of a national computing system and data grid has become a reachable goal from both the computer science and computational science point of view. A distributed infrastructure capable of sophisticated computational functions can bring many benefits to scientific work, but poses many challenges, both technical and socio‐political. Technical challenges include having basic software tools, higher‐level services, functioning and pervasive security, and standards, while socio‐political issues include building a user community, adding...

10.1155/2002/757015 article EN cc-by Scientific Programming 2002-01-01

Log summarization and anomaly detection for troubleshooting distributed systems

Dan Gunter Brian L. Tierney Aaron Brown Martin Swany John Bresnahan and 1 more

Today's system monitoring tools are capable of detecting system failures such as host failures, OS errors, and network partitions in near-real time. Unfortunately, the same cannot yet be said of the end-to-end distributed software stack. Any given action, for example, reliably transferring a directory of files, can involve a wide range of complex and interrelated actions across multiple pieces of software: checking user certificates and permissions, getting details for all files, performing third-party transfers, understanding re-try...

10.1109/grid.2007.4354137 article EN 2007-09-01

The Inca Test Harness and Reporting Framework

Shava Smallen Catherine Olschanowsky Klas Ericson Pete Beckman Jennifer M. Schopf

Virtual organizations (VOs), communities that enable coordinated resource sharing among multiple sites, are becoming more prevalent in the high-performance computing community. In order to promote cross-site usability, most VOs prepare service agreements that include a minimum set of common functionality, starting with a common software stack and evolving into more complicated interoperability agreements. VO service agreements are often difficult to verify and maintain, however, because the sites are dynamic and autonomous. Automated verification is...

10.1109/sc.2004.56 article EN 2005-04-01

Scalability analysis of three monitoring and information systems: MDS2, R-GMA, and Hawkeye

Xuehai Zhang Jeffrey Freschl Jennifer M. Schopf

10.1016/j.jpdc.2007.03.006 article EN Journal of Parallel and Distributed Computing 2007-04-12

Anomaly detection and diagnosis in grid environments

Lingyun Yang Chuang Liu Jennifer M. Schopf Ian Foster

Identifying and diagnosing anomalies in application behavior is critical to delivering reliable application-level performance. In this paper we introduce a strategy to detect anomalies and diagnose the possible reasons behind them. Our approach extends the traditional window-based strategy by using signal-processing techniques to filter out recurring, background fluctuations in resource behavior. In addition, we have developed a diagnosis technique that uses standard monitoring data to determine which related changes in behavior may cause anomalies....

10.1145/1362622.1362667 article EN 2007-11-10
