NFDI4DS | UHH-SEMS - Publication Details

Wei‐Hung Weng

ORCID: 0000-0003-2232-0390

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5034420885

Research Areas

Topic Modeling
Natural Language Processing Techniques
Biomedical Text Mining and Ontologies
Machine Learning in Healthcare
AI in cancer detection
Advanced Fluorescence Microscopy Techniques
Speech Recognition and Synthesis
Radiomics and Machine Learning in Medical Imaging
Artificial Intelligence in Healthcare and Education
Explainable Artificial Intelligence (XAI)
Artificial Intelligence in Healthcare
Advanced Neural Network Applications
Heart Rate Variability and Autonomic Control
melanin and skin pigmentation
Brain Metastases and Treatment
Human-Automation Interaction and Safety
Optical Coherence Tomography Applications
Brain Tumor Detection and Classification
Medical Image Segmentation Techniques
Non-Invasive Vital Sign Monitoring
Neuroscience and Neural Engineering
ECG Monitoring and Analysis
Sleep and Work-Related Fatigue
Healthcare professionals’ stress and burnout
Cutaneous Melanoma Detection and Management

Google (United States)
2024

Massachusetts Institute of Technology
2018-2023

IIT@MIT
2021

National Taiwan University
2015-2021

Harvard University
2016-2020

Massachusetts General Hospital
2020

University of California, San Francisco
2020

IBM (United States)
2020

Vassar College
2019

Chang Gung University
2011-2016

Publicly Available Clinical

OPENALEX - Publications

Emily Alsentzer John R. Murphy William Boag Wei‐Hung Weng Di Jindi and 2 more

Contextual word embedding models such as ELMo and BERT have dramatically improved performance for many natural language processing (NLP) tasks in recent months. However, these been minimally explored on specialty corpora, clinical text; moreover, the domain, no publicly-available pre-trained yet exist. In this work, we address need by exploring releasing text: one generic text another discharge summaries specifically. We demonstrate that using a domain-specific model yields improvements 3/5...

10.18653/v1/w19-1909 article EN 2019-01-01

Publicly Available Clinical BERT Embeddings

OPENALEX - Publications

Emily Alsentzer John R. Murphy Willie Boag Wei‐Hung Weng Di Jin and 2 more

Contextual word embedding models such as ELMo (Peters et al., 2018) and BERT (Devlin have dramatically improved performance for many natural language processing (NLP) tasks in recent months. However, these been minimally explored on specialty corpora, clinical text; moreover, the domain, no publicly-available pre-trained yet exist. In this work, we address need by exploring releasing text: one generic text another discharge summaries specifically. We demonstrate that using a domain-specific...

10.48550/arxiv.1904.03323 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images

OPENALEX - Publications

Richard J. Chen Ming Lu Wei‐Hung Weng Tiffany Chen Drew F. K. Williamson and 3 more

Survival outcome prediction is a challenging weakly-supervised and ordinal regression task in computational pathology that involves modeling complex interactions within the tumor microenvironment gigapixel whole slide images (WSIs). Despite recent progress formulating WSIs as bags for multiple instance learning (MIL), representation of entire remains an open problem, especially overcoming: 1) complexity feature aggregation large bags, 2) data heterogeneity gap incorporating biological priors...

10.1109/iccv48922.2021.00398 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

What Disease Does This Patient Have? A Large-Scale Open Domain Question Answering Dataset from Medical Exams

OPENALEX - Publications

Di Jin Eileen Pan Nassim Oufattole Wei‐Hung Weng Hanyi Fang and 1 more

Open domain question answering (OpenQA) tasks have been recently attracting more and attention from the natural language processing (NLP) community. In this work, we present first free-form multiple-choice OpenQA dataset for solving medical problems, MedQA, collected professional board exams. It covers three languages: English, simplified Chinese, traditional contains 12,723, 34,251, 14,123 questions languages, respectively. We implement both rule-based popular neural methods by sequentially...

10.3390/app11146421 article EN cc-by Applied Sciences 2021-07-12

Integrated biosensor platform based on graphene transistor arrays for real-time high-accuracy ion sensing

OPENALEX - Publications

Mantian Xue Charles Mackin Wei‐Hung Weng Jiadi Zhu Yiyue Luo and 6 more

Abstract Two-dimensional materials such as graphene have shown great promise biosensors, but suffer from large device-to-device variation due to non-uniform material synthesis and device fabrication technologies. Here, we develop a robust bioelectronic sensing platform composed of more than 200 integrated units, custom-built high-speed readout electronics, machine learning inference that overcomes these challenges achieve rapid, portable, reliable measurements. The demonstrates...

10.1038/s41467-022-32749-4 article EN cc-by Nature Communications 2022-08-27

Capabilities of Gemini Models in Medicine

OPENALEX - Publications

Khaled Saab Tao Tu Wei‐Hung Weng Ryutaro Tanno David Stutz and 61 more

Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date knowledge and understanding complex multimodal data. Gemini models, with strong general capabilities long-context offer exciting possibilities medicine. Building on these core strengths Gemini, we introduce Med-Gemini, family highly capable models that are specialized medicine the ability seamlessly use web search, can be efficiently tailored novel...

10.48550/arxiv.2404.18416 preprint EN arXiv (Cornell University) 2024-04-29

Merlin: A Vision Language Foundation Model for 3D Computed Tomography

OPENALEX - Publications

Louis Blankemeier Joseph Cohen Ashwin Kumar Dave Van Veen Syed Jamal Safdar Gardezi and 26 more

Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on abdomen. Given current shortage both general and specialized radiologists, there is a large impetus to use artificial intelligence alleviate burden interpreting these complex imaging studies while simultaneously using images extract novel physiological insights. Prior state-of-the-art approaches for automated medical image interpretation leverage vision language models...

10.21203/rs.3.rs-4546309/v1 preprint EN Research Square (Research Square) 2024-06-28

Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach

OPENALEX - Publications

Wei‐Hung Weng Kavishwar B. Wagholikar Alexa T. McCray Peter Szolovits Henry C. Chueh

The medical subdomain of a clinical note, such as cardiology or neurology, is useful content-derived metadata for developing machine learning downstream applications. To classify the note accurately, we have constructed learning-based natural language processing (NLP) pipeline and developed classifiers based on content note. We using NLP system, Text Analysis Knowledge Extraction System (cTAKES), Unified Medical Language (UMLS) Metathesaurus, Semantic Network, algorithms to extract features...

10.1186/s12911-017-0556-8 article EN cc-by BMC Medical Informatics and Decision Making 2017-12-01

Clinically Accurate Chest X-Ray Report Generation

OPENALEX - Publications

Guanxiong Liu Tzu-Ming Harry Hsu Matthew B. A. McDermott Willie Boag Wei‐Hung Weng and 2 more

The automatic generation of radiology reports given medical radiographs has significant potential to operationally and improve clinical patient care. A number prior works have focused on this problem, employing advanced methods from computer vision natural language produce readable reports. However, these often fail account for the particular nuances domain, and, in particular, critical importance accuracy resulting generated In work, we present a domain-aware chest X-ray report system which...

10.48550/arxiv.1904.02633 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Learning Deep Representations of Medical Images using Siamese CNNs with Application to Content-Based Image Retrieval

OPENALEX - Publications

Yu-An Chung Wei‐Hung Weng

Deep neural networks have been investigated in learning latent representations of medical images, yet most the studies limit their approach a single supervised convolutional network (CNN), which usually rely heavily on large scale annotated dataset for training. To learn image with less supervision involved, we propose deep Siamese CNN (SCNN) architecture that can be trained only binary pair information. We evaluated learned task content-based retrieval using publicly available multiclass...

10.48550/arxiv.1711.08490 preprint EN other-oa arXiv (Cornell University) 2017-01-01

What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams

OPENALEX - Publications

Di Jin Eileen Pan Nassim Oufattole Wei‐Hung Weng Hanyi Fang and 1 more

10.20944/preprints202105.0498.v1 preprint EN 2021-05-21

Advancing Multimodal Medical Capabilities of Gemini

OPENALEX - Publications

Lin Yang Shawn Xu Andrew Sellergren Timo Kohlberger Yuchen Zhou and 42 more

Many clinical tasks require an understanding of specialized data, such as medical images and genomics, which is not typically found in general-purpose large multimodal models. Building upon Gemini's models, we develop several models within the new Med-Gemini family that inherit core capabilities Gemini are optimized for use via fine-tuning with 2D 3D radiology, histopathology, ophthalmology, dermatology genomic data. Med-Gemini-2D sets a standard AI-based chest X-ray (CXR) report generation...

10.48550/arxiv.2405.03162 preprint EN arXiv (Cornell University) 2024-05-06

Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces

OPENALEX - Publications

Yu-An Chung Wei‐Hung Weng Schrasing Tong James Glass

Recent research has shown that word embedding spaces learned from text corpora of different languages can be aligned without any parallel data supervision. Inspired by the success in unsupervised cross-lingual embeddings, this paper we target learning a cross-modal alignment between speech and their respective modalities an fashion. The proposed framework learns individual spaces, attempts to align two via adversarial training, followed refinement procedure. We show how our could used...

10.48550/arxiv.1805.07467 preprint EN other-oa arXiv (Cornell University) 2018-01-01

AI mirrors experimental science to uncover a novel mechanism of gene transfer crucial to bacterial evolution

OPENALEX - Publications

José R. Penadés Juraj Gottweis Lingchen He Jonasz B. Patkowski Alexander Shurick and 8 more

AI models have been proposed for hypothesis generation, but testing their ability to drive high-impact research is challenging, since an AI-generated can take decades validate. Here, we challenge the of a recently developed LLM-based platform, co-scientist, generate high-level hypotheses by posing question that took years resolve experimentally remained unpublished: How could capsid-forming phage-inducible chromosomal islands (cf-PICIs) spread across bacterial species? Remarkably,...

10.1101/2025.02.19.639094 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2025-02-19

SAND: Enhancing Open-Set Neuron Descriptions through Spatial Awareness

OPENALEX - Publications

Azmeera Srinivas Tuomas Oikarinen Divyansh Srivastava Wei‐Hung Weng Tsui-Wei Weng

10.1109/wacv61041.2025.00296 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

Representation and Reinforcement Learning for Personalized Glycemic Control in Septic Patients

OPENALEX - Publications

Wei‐Hung Weng Mingwu Gao Ze He Susu Yan Peter Szolovits

Glycemic control is essential for critical care. However, it a challenging task because there has been no study on personalized optimal strategies glycemic control. This work aims to learn trajectories severely ill septic patients by learning data-driven policies identify targeted blood glucose levels as reference clinicians. We encoded patient states using sparse autoencoder and adopted reinforcement paradigm policy iteration the from data. also estimated expected return following learned...

10.48550/arxiv.1712.00654 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Towards Unsupervised Speech-to-text Translation

OPENALEX - Publications

Yu-An Chung Wei‐Hung Weng Schrasing Tong James Glass

We present a framework for building speech-to-text translation (ST) systems using only monolingual speech and text corpora, in other words, utterances from source language independent target language. As opposed to traditional cascaded end-to-end architectures, our system does not require any labeled data (i.e., transcribed audio or parallel corpora) during training, making it especially applicable pairs with very few even zero bilingual resources. The initializes the ST cross-modal...

10.1109/icassp.2019.8683550 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019-04-17

ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders

OPENALEX - Publications

Shawn Xu Lin Yang Christopher Kelly Marcin Sieniek Timo Kohlberger and 23 more

In this work, we present an approach, which call Embeddings for Language/Image-aligned X-Rays, or ELIXR, that leverages a language-aligned image encoder combined grafted onto fixed LLM, PaLM 2, to perform broad range of chest X-ray tasks. We train lightweight adapter architecture using images paired with corresponding free-text radiology reports from the MIMIC-CXR dataset. ELIXR achieved state-of-the-art performance on zero-shot (CXR) classification (mean AUC 0.850 across 13 findings),...

10.48550/arxiv.2308.01317 preprint EN other-oa arXiv (Cornell University) 2023-01-01

HeAR -- Health Acoustic Representations

OPENALEX - Publications

Sebastien Baur Zaid Nabulsi Wei‐Hung Weng Jake Garrison Louis Blankemeier and 13 more

Health acoustic sounds such as coughs and breaths are known to contain useful health signals with significant potential for monitoring disease, yet underexplored in the medical machine learning community. The existing deep systems acoustics often narrowly trained evaluated on a single task, which is limited by data may hinder generalization other tasks. To mitigate these gaps, we develop HeAR, scalable self-supervised learning-based system using masked autoencoders large dataset of 313...

10.48550/arxiv.2403.02522 preprint EN arXiv (Cornell University) 2024-03-04

The process of coping with stress by Taiwanese medical interns: a qualitative study

OPENALEX - Publications

Chun-Hao Liu Woung‐Ru Tang Wei‐Hung Weng Yu‐Hsuan Lin Ching-Yen Chen

Internship, the transition period from medical student to junior doctor, is highly stressful for interns in West; however, little known about experience of coping with stress Taiwan. This study aimed develop a model among Taiwanese and examine relationship between learning outcomes. For this qualitative study, we used grounded theory methodology theoretical sampling. We collected data through in-depth interviews participant observations. employed constant comparative method analyse until...

10.1186/s12909-016-0534-3 article EN cc-by BMC Medical Education 2016-01-12

Coming Soon ...