- Data Analysis with R
- Natural Language Processing Techniques
- Video Analysis and Summarization
- Image Retrieval and Classification Techniques
- Computational and Text Analysis Methods
- Advanced Image and Video Retrieval Techniques
- Topic Modeling
- Data Visualization and Analytics
- 3D Surveying and Cultural Heritage
- Cinema and Media Studies
- Advanced Text Analysis Techniques
- Geographic Information Systems Studies
- Statistical Methods and Inference
- Digital Humanities and Scholarship
- Data-Driven Disease Surveillance
- Scientific Computing and Data Management
- Phonetics and Phonology Research
- Media Studies and Communication
- Speech and dialogue systems
- Complex Network Analysis Techniques
- Data Management and Algorithms
- Data Mining Algorithms and Applications
- COVID-19 epidemiological studies
- Advanced Database Systems and Queries
- Multimodal Machine Learning Applications
University of Richmond
2017-2024
Yale University
2010-2017
AT&T (United States)
2015
Keras is a high-level neural networks API, originall written in Python, and capable of running on top either TensorFlow or Theano.It was developed with focus enabling fast experimentation.This package provides an interface to from within R. All the returned objects functions this are native R raw pointers python objects, making it possible for users access entire keras API.The main benefits (1) correct, manual parsing inputs python, (2) R-sided documentation, (3) examples using API.It...
Abstract In this article we establish a methodological and theoretical framework for the study of large collections visual materials. Our framework, distant viewing, is distinguished from other approaches by making explicit interpretive nature extracting semantic metadata images. words, one must ‘view’ materials before studying them. We illustrate need process viewing simultaneously drawing on theories semiotics, photography, computer vision. Two illustrative applications to our own research...
We consider efficient implementations of the generalized lasso dual path algorithm given by Tibshirani and Taylor in Citation2011. first describe a generic approach that covers any penalty matrix D (full column rank) X predictor variables. then fast for special cases trend filtering problems, fused sparse both with = I general X. These specialized offer considerable improvement over implementation, terms numerical stability efficiency solution computation. algorithms are all available use...
Significance To study the COVID-19 pandemic, its effects on society, and measures for reducing spread, researchers need detailed data course of pandemic. Standard public health streams suffer inconsistent reporting frequent, unexpected revisions. They also miss other aspects a population’s behavior that are worthy consideration. We present an open database COVID signals in United States, measured at county level updated daily. This includes traditionally reported cases deaths, many others:...
Purpose.: Tocharacterize the 24-hour pattern of intraocular pressure (IOP) in untreated ocular hypertensive (OHTN) patients. Methods.: IOP measurements were taken every 2 hours during a period from 15 OHTN patients (ages 41–77 years). Measurements both sitting and supine (diurnal) only (nocturnal). Mean diurnal nocturnal IOPs group compared to previously reported values age-matched healthy glaucomatous eyes. Post hoc analysis who converted glaucoma those did not with that same Results.:...
Recent advances in natural language processing have produced libraries that extract lowlevel features from a collection of raw texts.These features, known as annotations, are usually stored internally hierarchical, tree-based data structures.This paper proposes model to represent annotations normalized relational tables optimized for exploratory analysis and predictive modeling.The R package cleanNLP, which calls one two state the art NLP (CoreNLP or spaCy), is presented an implementation...
The Distant Viewing Toolkit is a Python package for the computational analysis of visual culture.It addresses challenges working with moving images through automated extraction and visualization metadata summarizing content (e.g., people/actors, dialogue, scenes, objects) style shot angle, length, lighting, framing, sound) time-based media.This toolkit optimized two purposes: (1) scholarly inquiry culture from humanities social sciences, (2) search discovery collections within libraries,...
This article examines the international networks of communication among journals concerned with security studies. It uses Web Knowledge database on which cited articles in other over decade 1999—2008, and overall impact each journal field as a whole. We discover complex set networks, different central exerting influence both within subnetworks, well peripheral linked weakly to only few others. Some subnetworks can be distinguished by methodology or theoretical schools. Subnetworks frequently...
This paper was published twice due to an administrative error. The correct version can be found at https://doi.org/10.1093/llc/fqz013
Extensive scholarship in media studies has established how formal elements of moving images—such as camera angles, sound, and framing—reflect, establish, challenge cultural norms. Prior computational analyses attempting to ana- lyze these have primarily relied on summarizing relatively low-level fea- tures.
Abstract The COVID-19 pandemic presented enormous data challenges in the United States. Policy makers, epidemiological modelers, and health researchers all require up-to-date on relevant public behavior, ideally at fine spatial temporal resolution. COVIDcast API is our attempt to fill this need: operational since April 2020, it provides open access both traditional surveillance signals (cases, deaths, hospitalizations) many auxiliary indicators of activity, such as extracted from...
Abstract Analysis of figure skating scoring is notoriously difficult under the new Code Points (CoP) system, created following judging scandal 2002 Olympic Winter Games. The CoP involves selection a random subpanel judges; scores from other judges are reported but not used. An attempt to repeat methods previous studies establishing presence nationalistic bias in failed recreate competition raw sheets. This raised concern that different subpanels were being selected for each skater (breaking...
The way materials are archived and organized shapes knowledge production (Derrida, J. Archive Fever: A Freudian Impression. Vancouver: University of Chicago Press, 1996; Foucault, M. L'archéologie du savoir. Paris, France: Éditions Gallimard, 1969; Kramer, Going meta on metadata. Journal Digital Humanities, 3(2), 2014; Hart, T. How do you archive the sky? Journal, 5, 2015; Taylor, D. Save As. e-misférica, 9, 2012). We argue that recommender systems offer an opportunity to discover new...
This paper analyses the contribution of language metrics and, potentially, linguistic structures, to classify French learners English according levels Common European Framework Reference for Languages (CEFRL). The purpose is build a model prediction learner as function complexity features. We used EFCAMDAT corpus, database one million written assignments by learners. After applying on texts, we built representation matching texts their assigned CEFRL levels. Lexical and syntactic were...