NFDI4DS | UHH-SEMS - Publication Details

Alexander Gray

ORCID: 0000-0003-0337-7359

About

Contact & Profiles

A5005255629

Research Areas

Machine Learning and Data Classification
Data Management and Algorithms
Algorithms and Data Compression
Machine Learning and Algorithms
Topic Modeling
Natural Language Processing Techniques
Neural Networks and Applications
Sparse and Compressive Sensing Techniques
Statistical Methods and Inference
Gaussian Processes and Bayesian Inference
Face and Expression Recognition
Explainable Artificial Intelligence (XAI)
Bayesian Modeling and Causal Inference
Bayesian Methods and Mixture Models
Scientific Research and Discoveries
Advanced Image and Video Retrieval Techniques
Astronomy and Astrophysical Research
Galaxies: Formation, Evolution, Phenomena
Stochastic Gradient Optimization Techniques
Advanced Bandit Algorithms Research
Logic, Reasoning, and Knowledge
Image Retrieval and Classification Techniques
Computational Physics and Python Applications
Astronomical Observations and Instrumentation
Text and Document Classification Technologies

Purdue University West Lafayette
2024

IBM Research - Thomas J. Watson Research Center
2020-2023

Queensland University of Technology
2023

University of Glasgow
2019-2022

IBM (United States)
2019-2021

IBM Research - Ireland
2020

University of North Carolina at Charlotte
2019

University of the West of Scotland
2018

Georgia Institute of Technology
2006-2017

GEI Consultants
2017

Efficient Photometric Selection of Quasars from the Sloan Digital Sky Survey. II. ∼1,000,000 Quasars from Data Release 6

OPENALEX - Publications

Gordon T. Richards Adam D. Myers Alexander Gray Ryan Riegel R. C. Nichol and 4 more

We present a catalog of 1,172,157 quasar candidates selected from the photometric imaging data of the Sloan Digital Sky Survey (SDSS). The objects are all point sources to a limiting magnitude of i = 21.3 from 8417 deg² of imaging from SDSS Data Release 6 (DR6). This sample extends our previous catalog by using the latest public data release and probing both ultraviolet (UV)-excess and high-redshift quasars. While the addition of high-redshift candidates reduces the overall efficiency (quasars:quasar candidates) of the catalog to ∼80%, it is expected to contain no fewer than 850,000 bona fide...

10.1088/0067-0049/180/1/67 article EN The Astrophysical Journal Supplement Series 2008-12-23

Human-AI Collaboration in Data Science

Dakuo Wang Justin D. Weisz Michael Müller Parikshit Ram Philipp Geyer and 4 more

The rapid advancement of artificial intelligence (AI) is changing our lives in many ways. One application domain is data science. New techniques in automating the creation of AI, known as AutoAI or AutoML, aim to automate the work practices of data scientists. AutoAI systems are capable of autonomously ingesting and pre-processing data, engineering new features, and creating and scoring models based on target objectives (e.g. accuracy or run-time efficiency). Though AutoAI is not yet widely adopted, we are interested in understanding how AutoAI will...

10.1145/3359313 article EN Proceedings of the ACM on Human-Computer Interaction 2019-11-07

Efficient Photometric Selection of Quasars from the Sloan Digital Sky Survey: 100,000 z < 3 Quasars from Data Release One

Gordon T. Richards Robert C. Nichol Alexander Gray Róbert Brunner Robert H. Lupton and 14 more

We present a catalog of 100,563 unresolved, UV-excess (UVX) quasar candidates to g = 21 from 2099 deg² of the Sloan Digital Sky Survey (SDSS) Data Release One (DR1) imaging data. Existing spectra of 22,737 sources reveal that 22,191 (97.6%) are quasars; accounting for the magnitude dependence of this efficiency, we estimate that 95,502 (95.0%) of the objects in the catalog are quasars. Such a high efficiency is unprecedented in broad-band surveys. This "proof-of-concept" sample is designed to be maximally efficient, but still has 94.7% completeness to g...

10.1086/425356 article EN The Astrophysical Journal Supplement Series 2004-12-01

First Measurement of the Clustering Evolution of Photometrically Classified Quasars

Adam D. Myers Robert J. Brunner Gordon T. Richards R. C. Nichol Donald P. Schneider and 4 more

We present new measurements of the quasar autocorrelation from a sample of ∼80,000 photometrically-classified quasars taken from SDSS DR1. We find a best-fit model of $\omega(\theta) = (0.066^{+0.026}_{-0.024})\,\theta^{-(0.98\pm0.15)}$ for the angular autocorrelation, consistent with estimates from spectroscopic surveys. We show that only models with little or no evolution in the clustering of quasars in comoving coordinates since z ∼ 1.4 can recover a scale-length consistent with local galaxies and Active Galactic Nuclei (AGNs). A model with little evolution is best explained in the current...

10.1086/499093 article EN The Astrophysical Journal 2006-02-15

Maximum inner-product search using cone trees

Parikshit Ram Alexander Gray

The problem of efficiently finding the best match for a query in a given set with respect to the Euclidean distance or the cosine similarity has been extensively studied. However, the closely related problem of efficiently finding the best match with respect to the inner product has never been explored in the general setting, to our knowledge. In this paper we consider this problem and contrast it with the previous problems considered. First, we propose a branch-and-bound algorithm based on a (single) tree data structure. Subsequently, we present a dual-tree algorithm for the case where there are multiple queries. Our proposed algorithms...

10.1145/2339530.2339677 preprint EN 2012-08-12

Logical Neural Networks

Ryan Riegel Alexander Gray Francois Luus Naweed Khan Ndivhuwo Makondo and 10 more

We propose a novel framework seamlessly providing key properties of both neural nets (learning) and symbolic logic (knowledge and reasoning). Every neuron has a meaning as a component of a formula in a weighted real-valued logic, yielding a highly interpretable disentangled representation. Inference is omnidirectional rather than focused on predefined target variables, and corresponds to logical reasoning, including classical first-order logic theorem proving as a special case. The model is end-to-end differentiable, and learning...

10.48550/arxiv.2006.13155 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Leveraging Abstract Meaning Representation for Knowledge Base Question Answering

Pavan Kapanipathi Ibrahim Abdelaziz Srinivas Ravishankar Salim Roukos Alexander Gray and 25 more

Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramón Fernandez Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Gangi Reddy, Ryan Riegel, Gaetano Rossiello, Udit Sharma, G P Shrivatsa Bhargav, Mo Yu. Findings...

10.18653/v1/2021.findings-acl.339 article EN cc-by 2021-01-01

Nonparametric Density Estimation: Toward Computational Tractability

Alexander Gray Andrew Moore

Density estimation is a core operation of virtually all probabilistic learning methods (as opposed to discriminative methods). Approaches to density estimation can be divided into two principal classes: parametric methods, such as Bayesian networks, and nonparametric methods, such as kernel density estimation and smoothing splines. While neither choice should be universally preferred for all situations, a well-known benefit of nonparametric methods is their ability to achieve estimation optimality for ANY input distribution as more data are observed, a property that no model with a parametric assumption can have, and one...

10.1137/1.9781611972733.19 article EN 2003-05-01

High redshift detection of the integrated Sachs-Wolfe effect

T. Giannantonio Robert Crittenden R. C. Nichol Ryan Scranton Gordon T. Richards and 5 more

We present evidence of a large angle correlation between the cosmic microwave background measured by WMAP and a catalog of photometrically detected quasars from the SDSS. The observed cross correlation is (0.30 ± 0.14) μK at zero lag, with a shape consistent with that expected for correlations arising from the integrated Sachs-Wolfe (ISW) effect. The photometric redshifts of the quasars are centered at z ∼ 1.5, making this the deepest survey in which such a correlation has been observed. Assuming this correlation is due to the ISW effect, this constitutes the earliest evidence yet for dark energy and it can be used...

10.1103/physrevd.74.063520 article EN Physical Review D: Particles, Fields, Gravitation, and Cosmology 2006-09-21

Ovarian cancer detection from metabolomic liquid chromatography/mass spectrometry data by support vector machines

Wei Guan Manshui Zhou Christina Y. Hampton Benedict B. Benigno L. DeEtte Walker and 3 more

Background: The majority of ovarian cancer biomarker discovery efforts focus on the identification of proteins that can improve the predictive power of presently available diagnostic tests. We here show that metabolomics, the study of metabolic changes in biological systems, can also provide characteristic small molecule fingerprints related to this disease. Results: In this work, new approaches to automatic classification of metabolomic data produced from sera of ovarian cancer patients and benign controls are investigated. The performance...

10.1186/1471-2105-10-259 article EN cc-by BMC Bioinformatics 2009-08-22

Fast Euclidean minimum spanning tree

William B. March Parikshit Ram Alexander Gray

The Euclidean Minimum Spanning Tree (EMST) problem has applications in a wide range of fields, and many efficient algorithms have been developed to solve it. We present a new, fast, general EMST algorithm, motivated by the clustering and analysis of astronomical data. Large-scale astronomical surveys, including the Sloan Digital Sky Survey, and large simulations of the early universe, such as the Millennium Simulation, can contain millions of points and fill terabytes of storage. Traditional EMST methods scale quadratically, and more advanced methods lack rigorous...

10.1145/1835804.1835882 article EN 2010-07-25

Rapid Mass Spectrometric Metabolic Profiling of Blood Sera Detects Ovarian Cancer with High Accuracy

Manshui Zhou Wei Guan L. DeEtte Walker Roman Mezencev Benedict B. Benigno and 3 more

Background: Ovarian cancer diagnosis is problematic because the disease is typically asymptomatic, especially at early stages of progression and/or recurrence. We report here the integration of a new mass spectrometric technology with a novel support vector machine computational method for use in disease diagnostics, and describe the application of the method to ovarian cancer. Methods: We coupled a high-throughput ambient ionization technique for mass spectrometry (direct analysis in real-time mass spectrometry) to profile relative metabolite...

10.1158/1055-9965.epi-10-0126 article EN Cancer Epidemiology Biomarkers & Prevention 2010-09-01

Density estimation trees

Parikshit Ram Alexander Gray

In this paper we develop density estimation trees (DETs), the natural analog of classification trees and regression trees, for the task of density estimation. We consider the estimation of a joint probability density function of a d-dimensional random vector X and define a piecewise constant estimator structured as a decision tree. The integrated squared error is minimized to learn the tree. We show that the method is nonparametric: under standard conditions of nonparametric density estimation, DETs are shown to be asymptotically consistent. In addition, being decision trees, DETs perform automatic feature...

10.1145/2020408.2020507 article EN 2011-08-21

MLPACK: A Scalable C++ Machine Learning Library

Ryan R. Curtin James R. Cline N. P. Slagle William B. March Parikshit Ram and 2 more

MLPACK is a state-of-the-art, scalable, multi-platform C++ machine learning library released in late 2011 offering both a simple, consistent API accessible to novice users and high performance and flexibility to expert users by leveraging modern features of C++. MLPACK provides cutting-edge algorithms whose benchmarks exhibit far better performance than other leading machine learning libraries. MLPACK version 1.0.3, licensed under the LGPL, is available at http://www.mlpack.org.

10.48550/arxiv.1210.6293 preprint EN other-oa arXiv (Cornell University) 2012-01-01

An ADMM Based Framework for AutoML Pipeline Configuration

Sijia Liu Parikshit Ram Deepak Vijaykeerthy Djallel Bouneffouf Gregory Bramble and 4 more

We study the AutoML problem of automatically configuring machine learning pipelines by jointly selecting algorithms and their appropriate hyper-parameters for all steps in supervised learning pipelines. This black-box (gradient-free) optimization with mixed integer & continuous variables is a challenging problem. We propose a novel AutoML scheme by leveraging the alternating direction method of multipliers (ADMM). The proposed framework is able to (i) decompose the optimization problem into easier sub-problems that have a reduced number of variables and circumvent...

10.1609/aaai.v34i04.5926 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Foundations of reasoning with uncertainty via real-valued logics

Ronald Fagin Ryan Riegel Alexander Gray

Interest in logics with some notion of real-valued truths has existed since at least Boole and has been increasing in AI due to the emergence of neuro-symbolic approaches, though often their logical inference capabilities are characterized only qualitatively. We provide foundations for establishing the correctness and power of such systems. We introduce a rich class of multidimensional sentences, with a sound and complete axiomatization that can be parameterized to cover many real-valued logics, including all the common fuzzy logics, and extend these to weighted...

10.1073/pnas.2309905121 article EN cc-by-nc-nd Proceedings of the National Academy of Sciences 2024-05-16
