Wei‐Hung Weng

ORCID: 0000-0003-2232-0390
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Biomedical Text Mining and Ontologies
  • Machine Learning in Healthcare
  • AI in cancer detection
  • Advanced Fluorescence Microscopy Techniques
  • Speech Recognition and Synthesis
  • Radiomics and Machine Learning in Medical Imaging
  • Artificial Intelligence in Healthcare and Education
  • Explainable Artificial Intelligence (XAI)
  • Artificial Intelligence in Healthcare
  • Advanced Neural Network Applications
  • Heart Rate Variability and Autonomic Control
  • melanin and skin pigmentation
  • Brain Metastases and Treatment
  • Human-Automation Interaction and Safety
  • Optical Coherence Tomography Applications
  • Brain Tumor Detection and Classification
  • Medical Image Segmentation Techniques
  • Non-Invasive Vital Sign Monitoring
  • Neuroscience and Neural Engineering
  • ECG Monitoring and Analysis
  • Sleep and Work-Related Fatigue
  • Healthcare professionals’ stress and burnout
  • Cutaneous Melanoma Detection and Management

Google (United States)
2024

Massachusetts Institute of Technology
2018-2023

IIT@MIT
2021

National Taiwan University
2015-2021

Harvard University
2016-2020

Massachusetts General Hospital
2020

University of California, San Francisco
2020

IBM (United States)
2020

Vassar College
2019

Chang Gung University
2011-2016

Contextual word embedding models such as ELMo and BERT have dramatically improved performance for many natural language processing (NLP) tasks in recent months. However, these been minimally explored on specialty corpora, clinical text; moreover, the domain, no publicly-available pre-trained yet exist. In this work, we address need by exploring releasing text: one generic text another discharge summaries specifically. We demonstrate that using a domain-specific model yields improvements 3/5...

10.18653/v1/w19-1909 article EN 2019-01-01

Contextual word embedding models such as ELMo (Peters et al., 2018) and BERT (Devlin have dramatically improved performance for many natural language processing (NLP) tasks in recent months. However, these been minimally explored on specialty corpora, clinical text; moreover, the domain, no publicly-available pre-trained yet exist. In this work, we address need by exploring releasing text: one generic text another discharge summaries specifically. We demonstrate that using a domain-specific...

10.48550/arxiv.1904.03323 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Survival outcome prediction is a challenging weakly-supervised and ordinal regression task in computational pathology that involves modeling complex interactions within the tumor microenvironment gigapixel whole slide images (WSIs). Despite recent progress formulating WSIs as bags for multiple instance learning (MIL), representation of entire remains an open problem, especially overcoming: 1) complexity feature aggregation large bags, 2) data heterogeneity gap incorporating biological priors...

10.1109/iccv48922.2021.00398 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Open domain question answering (OpenQA) tasks have been recently attracting more and attention from the natural language processing (NLP) community. In this work, we present first free-form multiple-choice OpenQA dataset for solving medical problems, MedQA, collected professional board exams. It covers three languages: English, simplified Chinese, traditional contains 12,723, 34,251, 14,123 questions languages, respectively. We implement both rule-based popular neural methods by sequentially...

10.3390/app11146421 article EN cc-by Applied Sciences 2021-07-12

Abstract Two-dimensional materials such as graphene have shown great promise biosensors, but suffer from large device-to-device variation due to non-uniform material synthesis and device fabrication technologies. Here, we develop a robust bioelectronic sensing platform composed of more than 200 integrated units, custom-built high-speed readout electronics, machine learning inference that overcomes these challenges achieve rapid, portable, reliable measurements. The demonstrates...

10.1038/s41467-022-32749-4 article EN cc-by Nature Communications 2022-08-27

Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date knowledge and understanding complex multimodal data. Gemini models, with strong general capabilities long-context offer exciting possibilities medicine. Building on these core strengths Gemini, we introduce Med-Gemini, family highly capable models that are specialized medicine the ability seamlessly use web search, can be efficiently tailored novel...

10.48550/arxiv.2404.18416 preprint EN arXiv (Cornell University) 2024-04-29

Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on abdomen. Given current shortage both general and specialized radiologists, there is a large impetus to use artificial intelligence alleviate burden interpreting these complex imaging studies while simultaneously using images extract novel physiological insights. Prior state-of-the-art approaches for automated medical image interpretation leverage vision language models...

10.21203/rs.3.rs-4546309/v1 preprint EN Research Square (Research Square) 2024-06-28

The medical subdomain of a clinical note, such as cardiology or neurology, is useful content-derived metadata for developing machine learning downstream applications. To classify the note accurately, we have constructed learning-based natural language processing (NLP) pipeline and developed classifiers based on content note. We using NLP system, Text Analysis Knowledge Extraction System (cTAKES), Unified Medical Language (UMLS) Metathesaurus, Semantic Network, algorithms to extract features...

10.1186/s12911-017-0556-8 article EN cc-by BMC Medical Informatics and Decision Making 2017-12-01

The automatic generation of radiology reports given medical radiographs has significant potential to operationally and improve clinical patient care. A number prior works have focused on this problem, employing advanced methods from computer vision natural language produce readable reports. However, these often fail account for the particular nuances domain, and, in particular, critical importance accuracy resulting generated In work, we present a domain-aware chest X-ray report system which...

10.48550/arxiv.1904.02633 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Deep neural networks have been investigated in learning latent representations of medical images, yet most the studies limit their approach a single supervised convolutional network (CNN), which usually rely heavily on large scale annotated dataset for training. To learn image with less supervision involved, we propose deep Siamese CNN (SCNN) architecture that can be trained only binary pair information. We evaluated learned task content-based retrieval using publicly available multiclass...

10.48550/arxiv.1711.08490 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Open domain question answering (OpenQA) tasks have been recently attracting more and attention from the natural language processing (NLP) community. In this work, we present first free-form multiple-choice OpenQA dataset for solving medical problems, MedQA, collected professional board exams. It covers three languages: English, simplified Chinese, traditional contains 12,723, 34,251, 14,123 questions languages, respectively. We implement both rule-based popular neural methods by sequentially...

10.20944/preprints202105.0498.v1 preprint EN 2021-05-21

Many clinical tasks require an understanding of specialized data, such as medical images and genomics, which is not typically found in general-purpose large multimodal models. Building upon Gemini's models, we develop several models within the new Med-Gemini family that inherit core capabilities Gemini are optimized for use via fine-tuning with 2D 3D radiology, histopathology, ophthalmology, dermatology genomic data. Med-Gemini-2D sets a standard AI-based chest X-ray (CXR) report generation...

10.48550/arxiv.2405.03162 preprint EN arXiv (Cornell University) 2024-05-06

Recent research has shown that word embedding spaces learned from text corpora of different languages can be aligned without any parallel data supervision. Inspired by the success in unsupervised cross-lingual embeddings, this paper we target learning a cross-modal alignment between speech and their respective modalities an fashion. The proposed framework learns individual spaces, attempts to align two via adversarial training, followed refinement procedure. We show how our could used...

10.48550/arxiv.1805.07467 preprint EN other-oa arXiv (Cornell University) 2018-01-01

AI models have been proposed for hypothesis generation, but testing their ability to drive high-impact research is challenging, since an AI-generated can take decades validate. Here, we challenge the of a recently developed LLM-based platform, co-scientist, generate high-level hypotheses by posing question that took years resolve experimentally remained unpublished: How could capsid-forming phage-inducible chromosomal islands (cf-PICIs) spread across bacterial species? Remarkably,...

10.1101/2025.02.19.639094 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2025-02-19

10.1109/wacv61041.2025.00296 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

Glycemic control is essential for critical care. However, it a challenging task because there has been no study on personalized optimal strategies glycemic control. This work aims to learn trajectories severely ill septic patients by learning data-driven policies identify targeted blood glucose levels as reference clinicians. We encoded patient states using sparse autoencoder and adopted reinforcement paradigm policy iteration the from data. also estimated expected return following learned...

10.48550/arxiv.1712.00654 preprint EN other-oa arXiv (Cornell University) 2017-01-01

We present a framework for building speech-to-text translation (ST) systems using only monolingual speech and text corpora, in other words, utterances from source language independent target language. As opposed to traditional cascaded end-to-end architectures, our system does not require any labeled data (i.e., transcribed audio or parallel corpora) during training, making it especially applicable pairs with very few even zero bilingual resources. The initializes the ST cross-modal...

10.1109/icassp.2019.8683550 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019-04-17

In this work, we present an approach, which call Embeddings for Language/Image-aligned X-Rays, or ELIXR, that leverages a language-aligned image encoder combined grafted onto fixed LLM, PaLM 2, to perform broad range of chest X-ray tasks. We train lightweight adapter architecture using images paired with corresponding free-text radiology reports from the MIMIC-CXR dataset. ELIXR achieved state-of-the-art performance on zero-shot (CXR) classification (mean AUC 0.850 across 13 findings),...

10.48550/arxiv.2308.01317 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Health acoustic sounds such as coughs and breaths are known to contain useful health signals with significant potential for monitoring disease, yet underexplored in the medical machine learning community. The existing deep systems acoustics often narrowly trained evaluated on a single task, which is limited by data may hinder generalization other tasks. To mitigate these gaps, we develop HeAR, scalable self-supervised learning-based system using masked autoencoders large dataset of 313...

10.48550/arxiv.2403.02522 preprint EN arXiv (Cornell University) 2024-03-04

Internship, the transition period from medical student to junior doctor, is highly stressful for interns in West; however, little known about experience of coping with stress Taiwan. This study aimed develop a model among Taiwanese and examine relationship between learning outcomes. For this qualitative study, we used grounded theory methodology theoretical sampling. We collected data through in-depth interviews participant observations. employed constant comparative method analyse until...

10.1186/s12909-016-0534-3 article EN cc-by BMC Medical Education 2016-01-12
Coming Soon ...