Ben Welsh

ORCID: 0000-0002-5200-7269
Research Areas
  • Data Quality and Management
  • Web Data Mining and Analysis
  • Computational Physics and Python Applications
  • Topic Modeling
  • Complex Network Analysis Techniques
  • Data Visualization and Analytics
  • Data Analysis with R
  • Caching and Content Delivery
  • Advanced Computational Techniques and Applications
  • Advanced Text Analysis Techniques
  • Computational and Text Analysis Methods

New York Times
2018

Altair is a declarative statistical visualization library for Python. Statistical visualization is a constrained subset of data visualization focused on the creation of visualizations that are helpful in statistical modeling. The constrained model of statistical visualization is usually expressed in terms of a visualization grammar (Wilkinson, 2005) that specifies how input data is transformed and mapped to visual properties (position, color, size, etc.).

10.21105/joss.01057 article EN cc-by The Journal of Open Source Software 2018-12-08

Information prioritization plays an important role in how humans perceive and understand the world. Homepage layouts serve as a tangible proxy for this prioritization. In this work, we present NewsHomepages, a large dataset of over 3,000 news website homepages (including local, national, and topic-specific outlets) captured twice daily over a three-year period. We develop models to perform pairwise comparisons between news items to infer their relative significance. To illustrate that modeling organizational...

10.32388/x9l7as preprint EN cc-by 2025-01-30

0.4.0
  • Optional iconCreateFunction for MarkerCluster to customize the icons (odovad #701)
  • Added HeatMapWithTime (Padarn #567)
  • Added MeasureControl (ocefpaf #669)
  • Added VideoOverlay plugin (#665)
  • Added TimestampedWmsTileLayers (acrosby #644 and #660)
  • Vega-Lite features support via altair (njwilson23 #643)
  • Experimental support for a static png output (#634)
  • Added subdomains options in TileLayer (damselem #623)
  • Updated to leaflet 1.2.0 (#693)
  • Added FastMarkerCluster (James Gardiner #585 (proposed by...

10.5281/zenodo.4447642 article EN 2017-01-01

Information prioritization plays an important role in how humans perceive and understand the world. Homepage layouts serve as a tangible proxy for this prioritization. In this work, we present NewsHomepages, a large dataset of over 3,000 news website homepages (including local, national, and topic-specific outlets) captured twice daily over a three-year period. We develop models to perform pairwise comparisons between news items to infer their relative significance. To illustrate that modeling organizational...

10.48550/arxiv.2501.00004 preprint EN arXiv (Cornell University) 2024-11-20

Journalists must find stories in huge amounts of textual data (e.g. leaks, bills, press releases) as part of their jobs: determining when and why text becomes news can help us understand coverage patterns and build assistive tools. Yet, this is challenging because very few labelled links exist, language use between corpora is very different, and text may be covered for a variety of reasons. In this work we focus on news coverage of local public policy in the San Francisco Bay Area by the San Francisco Chronicle. First, we gather news articles, public policy documents and meeting...

10.48550/arxiv.2311.09734 preprint EN cc-by arXiv (Cornell University) 2023-01-01