- Topic Modeling
- Natural Language Processing Techniques
- Machine Learning in Healthcare
- Artificial Intelligence in Healthcare
- Speech Recognition and Synthesis
- Speech and dialogue systems
- Multimodal Machine Learning Applications
- Biomedical Text Mining and Ontologies
- Context-Aware Activity Recognition Systems
- Text and Document Classification Technologies
- Chronic Disease Management Strategies
- Explainable Artificial Intelligence (XAI)
Amazon (Germany)
2025
Florida State University
2018-2021
Abstract: Background: In recent years, deep learning methods have been applied to many natural language processing tasks and achieve state-of-the-art performance. However, in the biomedical domain, they have not out-performed supervised word sense disambiguation (WSD) based on support vector machines or random forests, possibly due to inherent similarities of medical word senses. Results: In this paper, we propose two deep-learning-based models for supervised WSD: a model based on bi-directional long short-term memory (BiLSTM)...
The success of language models based on the Transformer architecture appears to be inconsistent with the observed anisotropic properties of representations learned by such models. We resolve this by showing, contrary to previous studies, that the representations do not occupy a narrow cone, but rather drift in common directions. At any training step, all embeddings except for the ground-truth target embedding are updated with a gradient in the same direction. Compounded over the training set, the embeddings drift and share common components, manifested in their shape we have...
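The claimed common-drift effect can be illustrated with a toy simulation (synthetic vectors, not the paper's models): embeddings that share a common mean component show high average pairwise cosine similarity (anisotropy), and subtracting that shared component restores near-isotropy. All values below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "embeddings": isotropic Gaussian noise plus a shared drift direction,
# mimicking the compounded common-direction updates described in the abstract.
drift = np.array([3.0, 0.0, 0.0, 0.0])
emb = rng.normal(size=(100, 4)) + drift

def mean_cosine(X):
    """Average cosine similarity over all distinct pairs of rows."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    sims = Xn @ Xn.T
    n = len(X)
    return (sims.sum() - n) / (n * (n - 1))  # drop the diagonal of ones

aniso_before = mean_cosine(emb)                     # high: vectors share a direction
aniso_after = mean_cosine(emb - emb.mean(axis=0))   # near zero: drift removed
```

Removing the mean vector is a standard diagnostic here; it shows the apparent "narrow cone" is explained by a shared drift component rather than genuinely collapsed representations.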
To improve the generalization of representations for natural language processing tasks, words are commonly represented using vectors, where distances among vectors are related to the similarity of words. While word2vec, a state-of-the-art implementation of the skip-gram model, is widely used and improves the performance of many tasks, its mechanism is not yet well understood. In this work, we derive the learning rules of the skip-gram model and establish their close relationship to competitive learning. In addition, we provide the global optimal solution...
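The learning rules in question can be sketched with a minimal skip-gram-with-negative-sampling update in numpy (toy vocabulary and hyperparameters, not the paper's derivation): the true context's output vector is pulled toward the center word's vector while sampled negatives are pushed away, which is where the resemblance to competitive learning comes from.

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, dim = 10, 4
W_in = rng.normal(scale=0.1, size=(vocab_size, dim))   # center-word embeddings
W_out = rng.normal(scale=0.1, size=(vocab_size, dim))  # context-word embeddings

def sgns_step(center, context, negatives, lr=0.1):
    """One skip-gram negative-sampling update (toy setup)."""
    v = W_in[center]
    grad_v = np.zeros_like(v)
    # label 1 for the observed context word, 0 for sampled negatives
    for word, label in [(context, 1.0)] + [(n, 0.0) for n in negatives]:
        u = W_out[word]
        score = 1.0 / (1.0 + np.exp(-v @ u))  # sigmoid(v . u)
        g = score - label                     # logistic-loss gradient
        grad_v += g * u
        W_out[word] -= lr * g * v             # pull context up, push negatives down
    W_in[center] -= lr * grad_v

before = W_in[3] @ W_out[5]
for _ in range(50):
    sgns_step(center=3, context=5, negatives=[1, 7])
after = W_in[3] @ W_out[5]   # score of the true pair rises with training
```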
We explore the idea of using pre-trained BERT as a source of factual knowledge, analyze which components of the model are responsible for its ability to answer questions requiring such knowledge, and study the transferability of this knowledge to downstream tasks. Our experiments show that the Language Modeling Head is indispensable for predicting facts, implying that any knowledge captured elsewhere in the model is limited. While the dominant approach to researching how knowledge is stored in language models focuses on tailoring the question formulation to optimize retrieval quality, we find patterns...
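The role of the LM head can be sketched in isolation (hand-made toy vocabulary and embedding values, not BERT's actual weights): a BERT-style head scores every vocabulary item against the encoder's hidden state at the masked position, typically with weights tied to the input embedding matrix, and the fact prediction is the argmax over those logits.

```python
import numpy as np

# Toy vocabulary and a hand-made embedding table (hypothetical values).
vocab = ["paris", "london", "france", "england"]
E = np.array([[1.0, 0.0,  0.5],
              [0.0, 1.0,  0.5],
              [1.0, 0.0, -0.5],
              [0.0, 1.0, -0.5]])
bias = np.zeros(len(vocab))

def lm_head(hidden):
    """BERT-style LM head: vocabulary logits from a hidden state,
    with weights tied to the embedding matrix."""
    return hidden @ E.T + bias

# Suppose the encoder's hidden state at the [MASK] position for
# "The capital of France is [MASK]." lands near the embedding of "paris".
hidden = np.array([0.9, 0.1, 0.4])
prediction = vocab[int(np.argmax(lm_head(hidden)))]
```

Dropping the head (e.g., probing hidden states directly) removes this vocabulary projection, which is consistent with the abstract's finding that the head is indispensable for fact prediction.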
Predicting the risk of mortality for patients with acute myocardial infarction (AMI) using electronic health records (EHR) data can help identify risky patients who might need more tailored care. In our previous work, we built computational models to predict one-year mortality of patients admitted to an intensive care unit (ICU) with AMI or post-AMI syndrome. Our prior work only used structured clinical data from MIMIC-III, a publicly available ICU database. In this study, we enhanced our models by adding word embedding features from free-text discharge...
In this paper, we propose a novel deep neural network architecture for supervised medical word sense disambiguation. Our architecture is based on a layered bidirectional LSTM network, upon which max-pooling along multiple time steps is performed so that a dense representation of the context is created. In addition, we introduced four different adjustments to the output in order to find the most suitable input form for that layer. Results show the best model outperforms the current state-of-the-art on the MSH WSD dataset. Moreover, we also train an...
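The pooling step can be sketched as follows (random numpy arrays stand in for real BiLSTM hidden states; shapes are hypothetical): max-pooling along the time axis collapses a variable-length sequence of concatenated forward and backward states into one fixed-size, dense context vector for the downstream classifier.

```python
import numpy as np

# T time steps, each a 2*H-dim vector (forward + backward hidden states
# concatenated). The random matrix is a stand-in for real BiLSTM outputs.
T, H = 5, 3
rng = np.random.default_rng(42)
bilstm_outputs = rng.normal(size=(T, 2 * H))

# Max-pool over time: keep, per dimension, the strongest activation seen
# anywhere in the sequence, yielding a fixed-size context representation.
context = bilstm_outputs.max(axis=0)
```

The appeal of this design is length invariance: the classifier's input dimensionality (`2 * H`) is independent of the sentence length `T`.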
Natural language processing has improved substantially in the last few years due to increased computational power and availability of text data. Bidirectional Encoder Representations from Transformers (BERT) have further improved performance by using an auto-encoding model that incorporates larger bidirectional contexts. However, the underlying mechanisms of BERT responsible for its effectiveness are not well understood. In this paper we investigate how the architecture and pretraining protocol affect the geometry of embeddings, features...
Unexpected responses or repeated clarification questions from conversational agents detract from the users’ experience with technology meant to streamline their daily tasks. To reduce these frictions, Query Rewriting (QR) techniques replace transcripts of faulty queries with alternatives that lead to responses that satisfy the users’ needs. Despite successes, existing QR approaches are limited in their ability to fix queries that require considering personal preferences. We improve upon them by proposing a Personalized Adaptive Interactions Graph Encoder...