- Face and Expression Recognition
- Neural Networks and Applications
- Peer-to-Peer Network Technologies
- Caching and Content Delivery
- Cloud Computing and Resource Management
- Advanced Data Storage Technologies
- Topic Modeling
- Distributed Systems and Fault Tolerance
- Distributed and Parallel Computing Systems
- Domain Adaptation and Few-Shot Learning
- Image Retrieval and Classification Techniques
- Generative Adversarial Networks and Image Synthesis
- Sparse and Compressive Sensing Techniques
- Natural Language Processing Techniques
- Multimodal Machine Learning Applications
- Machine Learning and Data Classification
- Advanced Database Systems and Queries
- Scientific Computing and Data Management
- Advanced Neural Network Applications
- Blind Source Separation Techniques
- Machine Learning and ELM
- Gaussian Processes and Bayesian Inference
- Machine Learning and Algorithms
- Time Series Analysis and Forecasting
- Advanced Image and Video Retrieval Techniques
University of Waterloo
2016-2025
Aja University of Medical Sciences
2023
University of Alberta
2022
Huawei Technologies (Sweden)
2021-2022
Actua
2012-2022
University of Shahrood
2020-2021
Vector Institute
2021
University of California, Berkeley
2009-2020
Berkeley College
2011-2020
KTH Royal Institute of Technology
2004-2018
This open source computing framework unifies streaming, batch, and interactive big data workloads to unlock new applications.
We present Mesos, a platform for sharing commodity clusters between multiple diverse cluster computing frameworks, such as Hadoop and MPI. Sharing improves cluster utilization and avoids per-framework data replication. Mesos shares resources in a fine-grained manner, allowing frameworks to achieve data locality by taking turns reading data stored on each machine. To support the sophisticated schedulers of today's frameworks, Mesos introduces a distributed two-level scheduling mechanism called resource offers. Mesos decides how many resources to offer...
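The two-level idea is easy to illustrate: the master decides which framework receives each offer, and the framework decides which offered resources to accept. A minimal toy model in Python, with hypothetical `Offer`/`GreedyFramework` names rather than Mesos's actual API:

```python
# Toy model of two-level scheduling via resource offers (illustrative, not the Mesos API).
from dataclasses import dataclass

@dataclass
class Offer:
    slave: str
    cpus: float
    mem: float  # GB

class GreedyFramework:
    """Hypothetical framework scheduler: accepts offered resources while it has pending tasks."""
    def __init__(self, name, task_cpus, task_mem, pending_tasks):
        self.name, self.task_cpus, self.task_mem = name, task_cpus, task_mem
        self.pending = pending_tasks

    def resource_offer(self, offer):
        # Second level: the framework decides which resources to accept
        # and which tasks to launch on them.
        launched = []
        while (self.pending and offer.cpus >= self.task_cpus
               and offer.mem >= self.task_mem):
            offer.cpus -= self.task_cpus
            offer.mem -= self.task_mem
            self.pending -= 1
            launched.append((self.name, offer.slave))
        return launched

def master_allocate(offers, frameworks):
    # First level: the master decides which framework each offer goes to
    # (round-robin here for simplicity).
    tasks = []
    for i, offer in enumerate(offers):
        tasks += frameworks[i % len(frameworks)].resource_offer(offer)
    return tasks

offers = [Offer("slave-1", cpus=4, mem=8), Offer("slave-2", cpus=4, mem=8)]
frameworks = [GreedyFramework("hadoop", 1, 2, pending_tasks=3),
              GreedyFramework("mpi", 2, 4, pending_tasks=2)]
print(master_allocate(offers, frameworks))
```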
Spark SQL is a new module in Apache Spark that integrates relational processing with Spark's functional programming API. Built on our experience with Shark, Spark SQL lets programmers leverage the benefits of relational processing (e.g. declarative queries and optimized storage), and lets SQL users call complex analytics libraries (e.g. machine learning). Compared to previous systems, Spark SQL makes two main additions. First, it offers much tighter integration between relational and procedural processing, through a declarative DataFrame API that integrates with procedural Spark code. Second, it includes a highly extensible...
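A small PySpark sketch of that mix of declarative and procedural code; the column names and data are made up, and a local Spark installation is assumed:

```python
# Declarative DataFrame queries combined with ordinary Python post-processing.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("dataframe-sketch").getOrCreate()

df = spark.createDataFrame(
    [("alice", "ads", 3.0), ("bob", "ads", 5.0), ("carol", "ml", 7.0)],
    ["user", "team", "hours"],
)

# Relational part: filter and aggregate declaratively; the optimizer plans the execution.
per_team = (df.filter(F.col("hours") > 2)
              .groupBy("team")
              .agg(F.avg("hours").alias("avg_hours")))

# Procedural part: the result is a plain collection we can loop over in Python.
for row in per_team.collect():
    print(f"{row['team']}: {row['avg_hours']:.1f}h on average")

spark.stop()
```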
We consider the problem of fair resource allocation in a system containing different resource types, where each user may have different demands for each resource. To address this problem, we propose Dominant Resource Fairness (DRF), a generalization of max-min fairness to multiple resource types. We show that DRF, unlike other possible policies, satisfies several highly desirable properties. First, DRF incentivizes users to share resources, by ensuring that no user is better off if resources are equally partitioned among them. Second,...
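A minimal progressive-filling sketch of DRF in Python: repeatedly grant a task to the user with the smallest dominant share. This is a simplified illustration (it stops as soon as the poorest user no longer fits), with per-task demands assumed fixed per user:

```python
def drf_allocate(capacity, demands, max_rounds=1000):
    """capacity: {resource: total}; demands: {user: {resource: per-task demand}}."""
    used = {r: 0.0 for r in capacity}
    tasks = {u: 0 for u in demands}

    def dominant_share(user):
        # Largest fraction of any single resource this user currently holds.
        return max(tasks[user] * d / capacity[r] for r, d in demands[user].items())

    for _ in range(max_rounds):
        user = min(demands, key=dominant_share)   # poorest user by dominant share
        d = demands[user]
        if any(used[r] + d[r] > capacity[r] for r in d):
            break                                 # simplification: stop when they no longer fit
        for r in d:
            used[r] += d[r]
        tasks[user] += 1
    return tasks

# With 9 CPUs and 18 GB, per-task demands of <1 CPU, 4 GB> and <3 CPUs, 1 GB>,
# this yields 3 tasks for A and 2 for B, equalizing dominant shares at 2/3.
print(drf_allocate({"cpu": 9, "mem": 18},
                   {"A": {"cpu": 1, "mem": 4}, "B": {"cpu": 3, "mem": 1}}))
```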
There have been many recent papers on data-oriented or content-centric network architectures. Despite the voluminous literature, surprisingly little clarity is emerging, as most papers focus on what differentiates them from other proposals. We begin this paper by identifying the existing commonalities and important differences in these designs, and then discuss some remaining research issues. After our review, we emerge skeptical (but open-minded) about the value of this approach to networking.
Information-Centric Networking (ICN) has seen a significant resurgence in recent years. ICN promises benefits to users and service providers along several dimensions (e.g., performance, security, mobility). These benefits, however, come at a non-trivial cost, as many ICN proposals envision adding complexity to the network by having routers serve as content caches and support nearest-replica routing. This paper is driven by the simple question of whether this additional complexity is justified if we can achieve these benefits in an...
Tachyon is a distributed file system enabling reliable data sharing at memory speed across cluster computing frameworks. While caching today improves read workloads, writes are either network or disk bound, as replication is used for fault tolerance. Tachyon eliminates this bottleneck by pushing lineage, a well-known technique, into the storage layer. The key challenge in making a long-running lineage-based storage system work is timely data recovery in case of failures. Tachyon addresses this issue by introducing a checkpointing algorithm that...
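The core lineage idea can be sketched in a few lines: instead of replicating an output, record the recipe that produced it and recompute on loss. A toy Python illustration with made-up names, not Tachyon's actual interface:

```python
# Toy lineage-based recovery: keep one in-memory copy plus the recipe, recompute on loss.
class LineageStore:
    def __init__(self):
        self.data = {}      # file name -> in-memory contents
        self.lineage = {}   # file name -> (function, input file names)

    def write(self, name, fn, inputs):
        # Persist the lineage (cheap) rather than replicating the output (expensive).
        self.lineage[name] = (fn, inputs)
        self.data[name] = fn(*[self.read(i) for i in inputs])

    def read(self, name):
        if name not in self.data:            # lost, e.g. due to a machine failure
            fn, inputs = self.lineage[name]  # fall back to recomputation
            self.data[name] = fn(*[self.read(i) for i in inputs])
        return self.data[name]

store = LineageStore()
store.data["raw"] = [3, 1, 2]                          # ingested input, assumed durable elsewhere
store.write("sorted", lambda xs: sorted(xs), ["raw"])
del store.data["sorted"]                               # simulate losing the in-memory copy
print(store.read("sorted"))                            # recomputed from lineage: [1, 2, 3]
```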
We consider the problem of separating consistency-related safety properties from availability and durability in distributed data stores via the application of a "bolt-on" shim layer that upgrades the consistency of an underlying general-purpose store. This shim provides the same consistency guarantees atop a wide range of widely deployed but often inflexible stores. As causal consistency is one of the strongest models that remain available during system partitions, we develop a shim that upgrades eventually consistent stores to provide convergent causal consistency. Accordingly, we leverage...
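The shim idea can be illustrated with a small sketch: writes carry explicit dependency metadata, and the shim reveals a value to readers only once its dependencies are locally visible. The store, metadata layout, and version format below are simplified stand-ins, not the paper's exact design:

```python
# Minimal "bolt-on" causal shim over an eventually consistent key-value store.
class EventualStore:
    """Stand-in for an eventually consistent store: a plain dict."""
    def __init__(self):
        self.kv = {}
    def put(self, k, v):
        self.kv[k] = v
    def get(self, k):
        return self.kv.get(k)

class CausalShim:
    def __init__(self, store):
        self.store = store
        self.local = {}   # causally consistent local cut, exposed to readers

    def put(self, key, value, deps=()):
        # Each write records the (key, value) versions it causally depends on.
        self.store.put(key, {"value": value, "deps": list(deps)})
        return (key, value)

    def get(self, key):
        item = self.store.get(key)
        # Reveal a write only if all of its dependencies are already visible locally.
        if item is not None and all(self.local.get(k) == v for k, v in item["deps"]):
            self.local[key] = item["value"]
        return self.local.get(key)

shim = CausalShim(EventualStore())
v1 = shim.put("post", "hello")
shim.get("post")                          # make the post visible locally
shim.put("comment", "nice!", deps=[v1])
print(shim.get("comment"))                # shown only because its dependency is visible
```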
With the increasing commoditization of computer vision, speech recognition, and machine translation systems, and the widespread deployment of learning-based back-end technologies such as digital advertising and intelligent infrastructures, AI (Artificial Intelligence) has moved from research labs to production. These changes have been made possible by unprecedented levels of data and computation, methodological advances in machine learning, innovations in software architectures, and the broad accessibility of these technologies. The...
Feature vectors provided by pre-trained deep artificial neural networks have become a dominant source for image representation in recent literature. Their contribution to the performance of image analysis can be improved through fine-tuning. As an ultimate solution, one might even train a network from scratch with domain-relevant images, a highly desirable option which is generally impeded in pathology by a lack of labeled images and by computational expense. In this study, we propose a new network, namely KimiaNet,...
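For context, fine-tuning a pre-trained network for a new image domain typically means freezing the feature extractor and retraining a new classification head. A generic PyTorch sketch of that step (not KimiaNet's actual architecture or training recipe; the class count and input sizes are assumptions):

```python
# Generic fine-tuning sketch: ImageNet-pretrained DenseNet-121, new classifier head.
import torch
import torch.nn as nn
from torchvision import models

num_classes = 30  # hypothetical number of tissue/tumor classes

model = models.densenet121(weights="IMAGENET1K_V1")
for p in model.parameters():          # freeze the pretrained feature extractor
    p.requires_grad = False
model.classifier = nn.Linear(model.classifier.in_features, num_classes)  # new head

optimizer = torch.optim.Adam(model.classifier.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

def train_step(images, labels):
    """One optimization step on a batch of (images, labels)."""
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()

# Smoke test with random tensors standing in for 224x224 RGB patches.
print(train_step(torch.randn(4, 3, 224, 224), torch.randint(0, num_classes, (4,))))
```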
There have been several recent proposals for content-oriented network architectures whose underlying mechanisms are surprisingly similar in spirit, but which differ in many details. In this paper we step back from the mechanistic details and focus only on the area where these approaches have a fundamental difference: naming. In particular, some designs adopt hierarchical, human-readable names, whereas others use self-certifying names. When discussing architecture, three of the most important requirements...
To minimize network latency and remain online during server failures and partitions, many modern distributed data storage systems eschew transactional functionality, which provides strong semantic guarantees for groups of multiple operations over multiple items. In this work, we consider the problem of providing Highly Available Transactions (HATs): transactions that do not suffer unavailability during system partitions or incur high latency. We introduce a taxonomy of highly available systems and analyze existing ACID isolation and consistency...
Minimizing coordination, or blocking communication between concurrently executing operations, is key to maximizing scalability, availability, and high performance in database systems. However, uninhibited coordination-free execution can compromise application correctness, or consistency. When is coordination necessary for correctness? The classic use of serializable transactions is sufficient to maintain correctness but is not necessary for all applications, sacrificing potential scalability. In this paper, we develop a...
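The question "when is coordination necessary?" can be made concrete with a brute-force check in the spirit of invariant confluence: an invariant tolerates coordination-free execution for a set of operations if merging any two independently reachable valid states is still valid. This is an illustrative sketch with a toy state and merge function, not the paper's analysis:

```python
from itertools import product

def reachable(initial, ops, depth=2):
    """All states reachable from `initial` by applying at most `depth` operations."""
    states, frontier = {initial}, {initial}
    for _ in range(depth):
        frontier = {op(s) for s in frontier for op in ops}
        states |= frontier
    return states

# State: an account balance. Merge: sum the deltas both replicas applied since the initial state.
initial = 100
merge = lambda a, b: a + b - initial
non_negative = lambda b: b >= 0

# Deposits only: every merged pair of valid states stays valid -> no coordination needed.
valid = {s for s in reachable(initial, [lambda b: b + 10]) if non_negative(s)}
print(all(non_negative(merge(a, b)) for a, b in product(valid, repeat=2)))  # True

# Withdrawals: two replicas can each drain the balance, the merge goes negative -> coordinate.
valid = {s for s in reachable(initial, [lambda b: b - 100]) if non_negative(s)}
print(all(non_negative(merge(a, b)) for a, b in product(valid, repeat=2)))  # False
```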
Middleboxes are ubiquitous in today's networks and perform a variety of important functions, including IDS, VPN, firewalling, and WAN optimization. These functions differ vastly in their requirements for hardware resources (e.g., CPU cycles and memory bandwidth). Thus, depending on the functions they go through, different flows can consume different amounts of a middlebox's resources. While there is much literature on weighted fair sharing of link bandwidth to isolate flows, it is unclear how to schedule multiple middlebox resources to achieve similar...
Over the past decade a variety of network architectures have been proposed to address IP's limitations in terms of flexible forwarding, security, and data distribution. Meanwhile, fueled by the explosive growth of video traffic and HTTP infrastructure (e.g., CDNs, web caches), HTTP has become the de-facto protocol for deploying new services and applications. Given these developments, we argue that new architectures should be evaluated not only with respect to IP, but also to HTTP, which could be fertile ground (more so than IP) for new functionalities...
How can applications be built on eventually consistent infrastructure given no guarantee of safety?
Max-Min Fairness is a flexible resource allocation mechanism used in most datacenter schedulers. However, an increasing number of jobs have hard placement constraints, restricting the machines they can run on due to special hardware or software requirements. It is unclear how to define, and achieve, max-min fairness in the presence of such constraints. We propose Constrained Max-Min Fairness (CMMF), an extension that supports placement constraints, and show it is the only policy satisfying an important property that incentivizes users to pool resources. Optimally...
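To make the constrained setting concrete, here is a rough progressive-filling heuristic in Python: repeatedly give one unit of capacity to the currently poorest user, drawn only from machines that user is allowed to run on. It illustrates the problem shape but is not the optimal CMMF algorithm from the paper:

```python
def constrained_max_min(machine_capacity, allowed, unit=1.0):
    """machine_capacity: {machine: slots}; allowed: {user: set of permitted machines}."""
    free = dict(machine_capacity)
    alloc = {u: 0.0 for u in allowed}
    while True:
        # Users that still have spare capacity on some machine they are allowed to use.
        eligible = [u for u in allowed if any(free[m] >= unit for m in allowed[u])]
        if not eligible:
            return alloc
        user = min(eligible, key=lambda u: alloc[u])               # poorest eligible user
        machine = next(m for m in allowed[user] if free[m] >= unit)
        free[machine] -= unit
        alloc[user] += unit

# Two machines with 2 slots each; user A is constrained to m1, users B and C can use both.
print(constrained_max_min({"m1": 2, "m2": 2},
                          {"A": {"m1"}, "B": {"m1", "m2"}, "C": {"m1", "m2"}}))
```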