- Topic Modeling
- Natural Language Processing Techniques
- Multimodal Machine Learning Applications
- Neural Networks and Applications
- Domain Adaptation and Few-Shot Learning
- Speech Recognition and Synthesis
- Radiomics and Machine Learning in Medical Imaging
- AI in Cancer Detection
- Text Readability and Simplification
- Machine Learning in Bioinformatics
- Speech and Dialogue Systems
- Music and Audio Processing
- Total Knee Arthroplasty Outcomes
- Machine Learning and Data Classification
- Protein Structure and Dynamics
- Reinforcement Learning in Robotics
- Human Pose and Action Recognition
- Advanced Neural Network Applications
- Computational Drug Discovery Methods
- Machine Learning in Materials Science
- Generative Adversarial Networks and Image Synthesis
- Machine Learning and Algorithms
- Advanced Image and Video Retrieval Techniques
- Stochastic Gradient Optimization Techniques
- Speech and Audio Processing
New York University
2016-2025
Courant Institute of Mathematical Sciences
2016-2025
Mercer University
2024
Canadian Institute for Advanced Research
2013-2023
University of Washington
2020-2023
Carnegie Mellon University
2020-2023
Korea Advanced Institute of Science and Technology
2023
Johns Hopkins University
2023
Shanghai Jiao Tong University
2023
Massachusetts Institute of Technology
2023
Neural machine translation is a relatively new approach to statistical machine translation based purely on neural networks. The models often consist of an encoder and a decoder. The encoder extracts a fixed-length representation from a variable-length input sentence, and the decoder generates a correct translation from this representation. In this paper, we focus on analyzing the properties of neural machine translation using two models; RNN Encoder-Decoder and a newly proposed gated recursive convolutional network. We show that neural machine translation performs relatively well on short sentences without unknown words, but its...
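The encoder-decoder pipeline described in that abstract can be sketched in plain NumPy: compress a variable-length input into one fixed-length vector, then unroll a decoder from it. This is a minimal illustration only; the vanilla-RNN cell, the dimensions, and all weight names here are assumptions for the sketch, not the paper's trained model.

```python
import numpy as np

rng = np.random.default_rng(0)

def rnn_step(x, h, Wx, Wh, b):
    # One vanilla RNN step: new hidden state from input x and previous state h.
    return np.tanh(Wx @ x + Wh @ h + b)

def encode(src, Wx, Wh, b, hidden=8):
    # Compress a variable-length sequence of vectors into one fixed-length state.
    h = np.zeros(hidden)
    for x in src:
        h = rnn_step(x, h, Wx, Wh, b)
    return h

def decode(h, steps, Wh, Wy, b):
    # Unroll the decoder from the fixed-length summary, emitting one vector per step.
    out = []
    for _ in range(steps):
        h = np.tanh(Wh @ h + b)
        out.append(Wy @ h)
    return np.stack(out)

d_in, d_h, d_out = 4, 8, 5
Wx = rng.normal(size=(d_h, d_in)) * 0.1
Wh = rng.normal(size=(d_h, d_h)) * 0.1
b = np.zeros(d_h)
Wy = rng.normal(size=(d_out, d_h)) * 0.1

src = rng.normal(size=(6, d_in))     # a "sentence" of 6 token vectors
summary = encode(src, Wx, Wh, b)     # fixed-length regardless of input length
outputs = decode(summary, steps=3, Wh=Wh, Wy=Wy, b=b)
print(summary.shape, outputs.shape)  # (8,) (3, 5)
```

The fixed-length `summary` is exactly the representation whose capacity the abstract questions: longer inputs still map to the same 8 numbers.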
Recurrent sequence generators conditioned on input data through an attention mechanism have recently shown very good performance on a range of tasks including machine translation, handwriting synthesis and image caption generation. We extend the attention mechanism with features needed for speech recognition. We show that while an adaptation of the model used for machine translation reaches a competitive 18.7% phoneme error rate (PER) on the TIMIT phoneme recognition task, it can only be applied to utterances which are roughly...
Inspired by recent work in machine translation and object detection, we introduce an attention-based model that automatically learns to describe the content of images. We describe how we can train this model in a deterministic manner using standard backpropagation techniques and stochastically by maximizing a variational lower bound. We also show through visualization how the model is able to learn to fix its gaze on salient objects while generating the corresponding words in the output sequence. We validate the use of attention with state-of-the-art performance on three...
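The "gaze" in that abstract is a soft attention step: score every image location against the decoder state, softmax the scores, and take the weighted average of the location features. The bilinear scoring function and the sizes below are illustrative assumptions, not the scoring network used in the paper.

```python
import numpy as np

def soft_attention(features, query, W):
    # features: (L, D) annotation vectors, one per image location.
    # query:    (H,) current decoder state.
    # Scores each location, softmax-normalizes, returns the expected feature.
    scores = features @ (W @ query)           # (L,) relevance of each location
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                  # attention distribution over locations
    context = weights @ features              # (D,) weighted average: the "gaze"
    return context, weights

rng = np.random.default_rng(1)
L, D, H = 10, 6, 4
features = rng.normal(size=(L, D))
query = rng.normal(size=H)
W = rng.normal(size=(D, H))
context, weights = soft_attention(features, query, W)
print(context.shape, round(weights.sum(), 6))   # (6,) 1.0
```

Because `weights` is a proper distribution over locations, visualizing it directly gives the attention maps the abstract refers to.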
Theano is a Python library that allows one to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers, especially in the machine learning community, and has shown steady performance improvements. It has been actively and continuously developed since 2008, and multiple frameworks have been built on top of it to produce many state-of-the-art models. The present article is structured as follows. Section I provides an...
Recent progress in using recurrent neural networks (RNNs) for image description has motivated the exploration of their application to video description. However, while images are static, working with videos requires modeling their dynamic temporal structure and then properly integrating that information into a natural language model. In this context, we propose an approach that successfully takes into account both the local and global temporal structure of videos to produce descriptions. First, our approach incorporates a spatio-temporal 3-D convolutional network...
Sébastien Jean, Kyunghyun Cho, Roland Memisevic, Yoshua Bengio. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2015.
The rapid increase in the number of proteins in sequence databases and the diversity of their functions challenge computational approaches for automated function prediction. Here, we introduce DeepFRI, a Graph Convolutional Network for predicting protein functions by leveraging sequence features extracted from a protein language model and protein structures. It outperforms current leading methods and sequence-based Convolutional Neural Networks and scales to the size of current sequence repositories. Augmenting the training set of experimental structures with homology models allows us...
Recent work on end-to-end neural network-based architectures for machine translation has shown promising results for En-Fr and En-De translation. Arguably, one of the major factors behind this success has been the availability of high-quality parallel corpora. In this work, we investigate how to leverage abundant monolingual corpora for neural machine translation. Compared to a phrase-based and hierarchical baseline, we obtain up to 1.96 BLEU improvement on the low-resource language pair Turkish-English, and 1.59 BLEU on the focused domain task of Chinese-English chat...
We present a deep convolutional neural network for breast cancer screening exam classification, trained and evaluated on over 200,000 exams (over 1,000,000 images). Our network achieves an AUC of 0.895 in predicting the presence of cancer in the breast, when tested on the screening population. We attribute the high accuracy to a few technical advances. 1) Our network's novel two-stage architecture and training procedure, which allows us to use a high-capacity patch-level network to learn from pixel-level labels alongside a network learning from macroscopic breast-level labels. 2) A...
We propose multi-way, multilingual neural machine translation. The proposed approach enables a single translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. In particular, we observe that the proposed model significantly improves the translation quality...
Unsupervised methods for learning distributed representations of words are ubiquitous in today's NLP research, but far less is known about the best ways to learn distributed phrase or sentence representations from unlabelled data. This paper is a systematic comparison of models that learn such representations. We find that the optimal approach depends critically on the intended application. Deeper, more complex models are preferable for representations to be used in supervised systems, but shallow log-linear models work best for building representation spaces that can be decoded with simple spatial distance...
We study the complexity of functions computable by deep feedforward neural networks with piecewise linear activations in terms of the symmetries and the number of linear regions that they have. Deep networks are able to sequentially map portions of each layer's input space to the same output. In this way, deep models compute functions that react equally to complicated patterns of different inputs. The compositional structure of these functions enables them to re-use pieces of computation exponentially often in terms of the network's depth. This paper investigates the complexity of such compositional maps and contributes new...
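The linear regions in that abstract can be probed empirically: each region of a ReLU network corresponds to a distinct on/off pattern of its units, so sweeping a line through input space and counting distinct activation patterns lower-bounds the region count. This probe, and the tiny random network below, are illustrative assumptions, not the paper's own construction.

```python
import numpy as np

rng = np.random.default_rng(2)

def activation_pattern(x, weights):
    # The on/off pattern of ReLU units across layers identifies the
    # linear region that input x falls into.
    pattern, h = [], x
    for W, b in weights:
        h = W @ h + b
        pattern.append(tuple(h > 0))
        h = np.maximum(h, 0)
    return tuple(pattern)

def count_regions_on_line(weights, direction, n=2000):
    # Sweep the line t * direction and count distinct activation patterns.
    ts = np.linspace(-5, 5, n)
    return len({activation_pattern(t * direction, weights) for t in ts})

d_in, width, depth = 2, 8, 3
weights = [(rng.normal(size=(width, d_in if i == 0 else width)),
            rng.normal(size=width)) for i in range(depth)]
direction = rng.normal(size=d_in)

shallow = count_regions_on_line(weights[:1], direction)  # first layer only
deep = count_regions_on_line(weights, direction)         # full depth
print(shallow, deep)  # the deeper net crosses at least as many region boundaries
```

Because the full-depth pattern contains the first-layer pattern as a prefix, `deep` can never be smaller than `shallow` on the same line, matching the abstract's point that depth only refines the partition of input space.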
Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, neural machine translation aims at building a single neural network that can be jointly tuned to maximize translation performance. The models proposed for neural machine translation often belong to a family of encoder-decoders and consist of an encoder that encodes a source sentence into a fixed-length vector from which a decoder generates a translation. In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and propose to extend it by...
We introduce a convolutional recurrent neural network (CRNN) for music tagging. CRNNs take advantage of convolutional neural networks (CNNs) for local feature extraction and recurrent neural networks for temporal summarisation of the extracted features. We compare CRNN with three CNN structures that have been used for music tagging, while controlling the number of parameters with respect to their performance and training time per sample. Overall, we found that CRNNs show a strong performance with respect to the number of parameters and training time, indicating the effectiveness of its hybrid structure in music feature extraction and feature summarisation.
Whereas deep neural networks were first mostly used for classification tasks, they are rapidly expanding in the realm of structured output problems, where the observed target is composed of multiple random variables that have a rich joint distribution, given the input. We focus in this paper on the case where the input also has a rich structure and the input and output structures are somehow related. We describe systems that learn to attend to different places in the input, for each element of the output, for a variety of tasks: machine translation, image caption generation, video clip...
We propose a conditional non-autoregressive neural sequence model based on iterative refinement. The proposed model is designed based on the principles of latent variable models and denoising autoencoders, and is generally applicable to any sequence generation task. We extensively evaluate the proposed model on machine translation (En-De and En-Ro) and image caption generation, and observe that it significantly speeds up decoding while maintaining quality comparable to its autoregressive counterpart.
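The decoding loop behind that abstract is easy to sketch: instead of emitting tokens one at a time, produce a full draft and repeatedly re-predict every position in parallel until it stops changing. The "denoiser" below is a toy stand-in that pulls the draft toward a fixed target; a real model would condition on the source sentence, so everything here is an assumption used only to show the loop structure.

```python
import numpy as np

def refine(decode_step, init, iters=10):
    # Non-autoregressive decoding: start from a full draft, then repeatedly
    # re-predict all positions at once (iterative refinement), unlike the
    # left-to-right token-by-token loop of an autoregressive decoder.
    y = init
    for _ in range(iters):
        y = decode_step(y)
    return y

# Toy denoising step: moves the whole draft halfway toward a fixed target.
target = np.array([1.0, -2.0, 3.0, 0.5])
decode_step = lambda y: y + 0.5 * (target - y)

draft = np.zeros(4)                       # initial guess for every position
final = refine(decode_step, draft, iters=20)
print(np.round(final, 3))                 # converges onto the target sequence
```

Each pass updates all four positions simultaneously, which is why a fixed, small number of refinement steps can be much faster than one decoder step per output token.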
Jonas Pfeiffer, Andreas Rücklé, Clifton Poth, Aishwarya Kamath, Ivan Vulić, Sebastian Ruder, Kyunghyun Cho, Iryna Gurevych. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. 2020.