- Topic Modeling
- Natural Language Processing Techniques
- Speech and Dialogue Systems
- Multimodal Machine Learning Applications
- South Asian Studies and Conflicts
- Domain Adaptation and Few-Shot Learning
- Speech Recognition and Synthesis
- Indian and Buddhist Studies
- Eurasian Exchange Networks
- Adversarial Robustness in Machine Learning
- Anthropological Studies and Insights
- Text Readability and Simplification
- Historical Geography and Cartography
- Language and Cultural Evolution
- Explainable Artificial Intelligence (XAI)
- Computational and Text Analysis Methods
- Machine Learning and Algorithms
- Software Engineering Research
- Neural Networks and Applications
- Neural Dynamics and Brain Function
- Robotics and Automated Systems
- Model-Driven Software Engineering Techniques
- Ancient Near East History
- Land Rights and Reforms
- Anomaly Detection Techniques and Applications
Stanford University (2019-2024)
RIKEN Center for Advanced Intelligence Project (2023)
Mongolia International University (2023)
Bar-Ilan University (2021)
University of Helsinki (2021)
Tel Aviv University (2021)
Technical University of Darmstadt (2021)
University of Copenhagen (2021)
Edinburgh Napier University (2021)
Universitat Pompeu Fabra (2021)
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) to their technical principles (e.g., model architectures, training procedures, data, systems, ...)
John Hewitt, Christopher D. Manning. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2019.
John Hewitt, Percy Liang. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.
This paper explores the knowledge of linguistic structure learned by large artificial neural networks, trained via self-supervision, whereby the model simply tries to predict a masked word in a given context. Human language communication is via sequences of words, but language understanding requires constructing rich hierarchical structures that are never observed explicitly. The mechanisms for this have been a prime mystery of human language acquisition, while engineering work has mainly proceeded by supervised learning on...
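As a rough illustration of the masked-word objective described above, here is a minimal sketch using the Hugging Face transformers library; the model name and example sentence are illustrative choices, not drawn from the paper.

```python
# Minimal sketch: predict a masked word from context with a pretrained
# masked language model (self-supervised objective described above).
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

text = "The chef who ran to the store [MASK] out of food."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Find the [MASK] position and take the highest-scoring vocabulary item.
mask_idx = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
predicted_id = logits[0, mask_idx].argmax(dim=-1)
print(tokenizer.decode(predicted_id))
```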
Abstract While recent language models have the ability to take long contexts as input, relatively little is known about how well they use longer context. We analyze performance of on two tasks that require identifying relevant information in their input contexts: multi-document question answering and key-value retrieval. find can degrade significantly when changing position information, indicating current do not robustly make contexts. In particular, we observe often highest occurs at...
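A sketch of how a key-value retrieval probe of this kind can be constructed; the prompt format, sizes, and helper function below are hypothetical rather than the paper's exact setup.

```python
# Hypothetical construction of a key-value retrieval prompt: a JSON
# dictionary of random UUID keys and values, with the queried pair
# placed at a controllable position in the context.
import json
import random
import uuid

def make_kv_prompt(n_pairs: int, gold_position: int):
    """Build a prompt asking a model to retrieve one value by key."""
    pairs = [(str(uuid.uuid4()), str(uuid.uuid4())) for _ in range(n_pairs)]
    gold_key, gold_value = pairs[0]
    random.shuffle(pairs)
    # Move the gold pair to the desired position in the context.
    pairs.remove((gold_key, gold_value))
    pairs.insert(gold_position, (gold_key, gold_value))
    context = json.dumps(dict(pairs), indent=0)
    prompt = f'{context}\n\nWhat is the value for key "{gold_key}"?'
    return prompt, gold_key, gold_value

prompt, key, value = make_kv_prompt(n_pairs=75, gold_position=37)
```

Sweeping `gold_position` across the context is what exposes position-dependent degradation.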
Recent work has found evidence that Multilingual BERT (mBERT), a transformer-based multilingual masked language model, is capable of zero-shot cross-lingual transfer, suggesting that some aspects of its representations are shared cross-lingually. To better understand this overlap, we extend recent work on finding syntactic trees in neural networks' internal representations to the multilingual setting. We show that subspaces of mBERT representations recover syntactic tree distances in languages other than English, and that these subspaces are approximately shared across languages. Motivated by...
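The underlying structural-probe idea (from the English-only work this paper extends) can be sketched as follows: learn a linear map B such that squared L2 distances between transformed word vectors approximate distances between words in the parse tree. Dimensions, data, and training details below are illustrative stand-ins.

```python
# Sketch of a structural probe: a learned linear map whose induced
# squared L2 distances approximate parse-tree distances between words.
import torch

hidden_dim, probe_rank = 768, 128
B = torch.nn.Parameter(torch.randn(probe_rank, hidden_dim) * 0.01)
optimizer = torch.optim.Adam([B], lr=1e-3)

def predicted_distances(H: torch.Tensor) -> torch.Tensor:
    """H: (seq_len, hidden_dim) word vectors -> (seq_len, seq_len) distances."""
    T = H @ B.T                           # project into the probe subspace
    diff = T.unsqueeze(0) - T.unsqueeze(1)
    return (diff ** 2).sum(dim=-1)        # squared L2 distance per word pair

def probe_loss(H: torch.Tensor, tree_dist: torch.Tensor) -> torch.Tensor:
    """L1 loss between predicted and gold parse-tree distances."""
    return (predicted_distances(H) - tree_dist).abs().mean()

H = torch.randn(12, hidden_dim)                    # stand-in for mBERT vectors
tree_dist = torch.randint(0, 6, (12, 12)).float()  # stand-in gold distances
loss = probe_loss(H, tree_dist)
loss.backward()
optimizer.step()
```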
John Hewitt, Daphne Ippolito, Brendan Callahan, Reno Kriz, Derry Tanti Wijaya, Chris Callison-Burch. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2018.
Benjamin Newman, Kai-Siang Ang, Julia Gong, John Hewitt. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2021.
This position paper argues that, in order to understand AI, we cannot rely on our existing vocabulary of human words. Instead, we should strive to develop neologisms: new words that represent precise human concepts that we want to teach machines, or machine concepts that we need to learn. We start from the premise that humans and machines have differing concepts. This means interpretability can be framed as a communication problem: humans must be able to reference and control machine concepts, and communicate human concepts to machines. Creating a shared human-machine language through...
This paper describes XNMT, the eXtensible Neural Machine Translation toolkit. XNMT distinguishes itself from other open-source NMT toolkits by its focus on modular code design, with the purpose of enabling fast iteration in research and replicable, reliable results. In this paper we describe the design of XNMT and its experiment configuration system, and demonstrate its utility on the tasks of machine translation, speech recognition, and multi-tasked machine translation/parsing. XNMT is available open-source at https://github.com/neulab/xnmt
With the advent of conversational assistants, like Amazon Alexa, Google Now, etc., dialogue systems are gaining a lot of traction, especially in the industrial setting. These systems typically consist of a Spoken Language Understanding component which, in turn, consists of two tasks - Intent Classification (IC) and Slot Labeling (SL). Generally, these two tasks are modeled together jointly to achieve the best performance. However, this joint modeling adds to model obfuscation. In this work, we first design a framework for modularization of joint IC-SL...
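A minimal sketch of what joint IC-SL modeling looks like: a shared encoder with an utterance-level intent head and a token-level slot head. The architecture below is a generic illustration, not the paper's system.

```python
# Generic joint intent-classification (IC) and slot-labeling (SL) model:
# one shared encoder, two task heads, trained with a summed loss.
import torch
import torch.nn as nn

class JointICSL(nn.Module):
    def __init__(self, vocab_size, hidden, n_intents, n_slot_tags):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.encoder = nn.LSTM(hidden, hidden, batch_first=True,
                               bidirectional=True)
        self.intent_head = nn.Linear(2 * hidden, n_intents)  # utterance level
        self.slot_head = nn.Linear(2 * hidden, n_slot_tags)  # token level

    def forward(self, token_ids):
        states, _ = self.encoder(self.embed(token_ids))
        intent_logits = self.intent_head(states.mean(dim=1))  # pooled
        slot_logits = self.slot_head(states)                  # per token
        return intent_logits, slot_logits

model = JointICSL(vocab_size=10_000, hidden=128, n_intents=7, n_slot_tags=15)
intent_logits, slot_logits = model(torch.randint(0, 10_000, (2, 12)))
# Joint training typically sums an intent loss and a slot loss.
```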
Recurrent neural networks empirically generate natural language with high syntactic fidelity. However, their success is not well-understood theoretically. We provide theoretical insight into this success, proving in a finite-precision setting that RNNs can efficiently generate bounded hierarchical languages that reflect the scaffolding of natural language syntax. We introduce Dyck-(k,m), the language of well-nested brackets (of k types) with m-bounded nesting depth, reflecting the bounded memory needs and long-distance dependencies of natural language syntax. The best known results...
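Dyck-(k, m) is concrete enough to sample directly; below is an illustrative generator (the branching probabilities are arbitrary choices, not from the paper).

```python
# Sampler for Dyck-(k, m): well-nested brackets of k types with nesting
# depth bounded by m (assumed m >= 1).
import random

def sample_dyck(k: int, m: int, max_len: int = 40) -> str:
    brackets = [(f"<{i}", f"{i}>") for i in range(k)]  # k bracket types
    stack, out = [], []
    while len(out) < max_len:
        can_open = len(stack) < m       # enforce the depth bound
        can_close = bool(stack)
        if can_open and (not can_close or random.random() < 0.5):
            opener, closer = random.choice(brackets)
            stack.append(closer)
            out.append(opener)
        else:
            out.append(stack.pop())
        if not stack and random.random() < 0.2:
            break
    out.extend(reversed(stack))         # close any still-open brackets
    return " ".join(out)

print(sample_dyck(k=2, m=3))
```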
Probing experiments investigate the extent to which neural representations make properties (like part-of-speech) predictable. One suggests that a representation encodes a property if probing that representation produces higher accuracy than probing a baseline representation like non-contextual word embeddings. Instead of using baselines as a point of comparison, we're interested in measuring the information that is contained in the representation but not in the baseline. For example, current methods can detect when a representation is more useful than the word identity (a baseline) for predicting part-of-speech;...
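The intuition can be sketched as follows: probe the baseline alone, then the baseline concatenated with the contextual representation, and look at the accuracy gap. This is only the intuition; the paper's formal treatment differs, and the data below is synthetic.

```python
# Sketch: compare a probe on a baseline alone against a probe on the
# baseline concatenated with a contextual representation.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n, d_base, d_repr = 2000, 50, 100
baseline = rng.normal(size=(n, d_base))    # e.g., non-contextual embeddings
contextual = rng.normal(size=(n, d_repr))  # e.g., a model's hidden states
labels = rng.integers(0, 5, size=n)        # e.g., part-of-speech tags

def probe_accuracy(X, y):
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
    return LogisticRegression(max_iter=1000).fit(X_tr, y_tr).score(X_te, y_te)

acc_baseline = probe_accuracy(baseline, labels)
acc_joint = probe_accuracy(np.hstack([baseline, contextual]), labels)
# The gap (acc_joint - acc_baseline) reflects information usable beyond
# the baseline; with random data here it should be near zero.
print(acc_baseline, acc_joint)
```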
Derry Tanti Wijaya, Brendan Callahan, John Hewitt, Jie Gao, Xiao Ling, Marianna Apidianaki, Chris Callison-Burch. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2017.
Extrapolation to unseen sequence lengths is a challenge for neural generative models of language. In this work, we characterize the effect on length extrapolation of a modeling decision often overlooked: predicting the end of the generative process through the use of a special end-of-sequence (EOS) vocabulary item. We study an oracle setting (forcing models to generate to the correct sequence length at test time) to compare the length-extrapolative behavior of networks trained to predict EOS (+EOS) with networks not trained to (-EOS). We find that -EOS substantially outperforms +EOS, for example...
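The oracle setting is straightforward to emulate in a greedy decoder: ban the EOS token and stop only at the gold length. The sketch below assumes a hypothetical autoregressive `model` with a Hugging Face-style logits output and a known `eos_id`.

```python
# Sketch of oracle-length decoding: mask out EOS so a +EOS-trained model
# cannot terminate early, and stop exactly at the gold length.
import torch

def decode_to_oracle_length(model, input_ids, oracle_len, eos_id):
    """Greedy decoding that bans EOS until the oracle length is reached."""
    generated = input_ids
    for _ in range(oracle_len):
        logits = model(generated).logits[:, -1, :]  # next-token scores
        logits[:, eos_id] = float("-inf")           # forbid stopping early
        next_token = logits.argmax(dim=-1, keepdim=True)
        generated = torch.cat([generated, next_token], dim=-1)
    return generated
```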
No one can fully appreciate the great value of this work to all students of ethnology until they realize the historical importance of an accurate classification of the characteristic differences which divide the social strata, known as castes, living in a country occupying the geographical position of Bengal. Bengal is practically the Deltas of the Ganges and the Brahmaputra, and of the Western rivers, which rise in the Vindhyan range, called by Hindu geographers the Sukti mountains, and flow down thence to the Bay. It has always been one of the highways by which Southern tribes moved northward...
Probes, supervised models trained to predict properties (like parts-of-speech) from representations (like those of ELMo), have achieved high accuracy on a range of linguistic tasks. But does this mean that the representations encode linguistic structure, or just that the probe has learned the linguistic task? In this paper, we propose control tasks, which associate word types with random outputs, to complement linguistic tasks. By construction, these tasks can only be learned by the probe itself. So a good probe (one that reflects the representation) should be selective, achieving high linguistic task accuracy and low control task accuracy. The...
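Constructing a control task is simple; the sketch below assigns each word type a fixed random tag (the tag-inventory size is an illustrative stand-in for, e.g., a part-of-speech tag count).

```python
# Sketch of a control task: map every word type to a fixed random output,
# so only probe memorization (not the representation) can solve it.
import random

def make_control_task(vocab, n_tags: int = 45, seed: int = 0):
    """Assign each word type a fixed random tag."""
    rng = random.Random(seed)
    return {word: rng.randrange(n_tags) for word in vocab}

vocab = {"the", "cat", "sat", "on", "mat"}
control = make_control_task(vocab)
sentence = ["the", "cat", "sat", "on", "the", "mat"]
control_labels = [control[w] for w in sentence]
# Selectivity = (linguistic task accuracy) - (control task accuracy);
# a selective probe scores high on the former and low on the latter.
```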
Modeling derivational morphology to generate words with particular semantics is useful in many text generation tasks, such as machine translation or abstractive question answering. In this work, we tackle the task of derived word generation. That is, we attempt to generate the word "runner" for "someone who runs." We identify two key problems in generating derived words from roots and transformations. We contribute a novel aggregation model that learns transformations both as orthographic functions using sequence-to-sequence models...
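One way such a sequence-to-sequence setup can be framed, sketched with a hypothetical tag format: the source sequence is the root's characters plus a transformation token, and the target is the derived word's characters.

```python
# Hypothetical framing of derived word generation for a character-level
# sequence-to-sequence model; the "<AGENT>" tag format is illustrative.
def make_seq2seq_example(root: str, transformation: str, derived: str):
    source = list(root) + [f"<{transformation}>"]  # characters + tag token
    target = list(derived)
    return source, target

src, tgt = make_seq2seq_example("run", "AGENT", "runner")
# src = ['r', 'u', 'n', '<AGENT>'], tgt = ['r', 'u', 'n', 'n', 'e', 'r']
```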
In the previous papers of this series I have tried to trace in outline a truthful sketch of the general course of early Indian History. The evidence consulted and set forth has led me to believe that the government, social institutions, and fundamental principles of religion of the country all originated among tribes, for the most part of the Dravidian race, who came into India from the Euphrates valley. In dealing with the races which successively or simultaneously ruled India, their origins, the races to which they belonged, and the religious beliefs they held, I have also adduced...
A major challenge in both neuroscience and machine learning is the development of useful tools for understanding complex information processing systems. One such tool is probes, i.e., supervised models that relate features of interest to activation patterns arising in biological or artificial neural networks. Neuroscience has paved the way in using such probes through numerous studies conducted in recent decades. In this work, we draw insights from neuroscience to help guide probing research in machine learning. We highlight two important design choices...
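In machine learning terms, a probe in this sense can be as simple as a linear classifier from activations to the feature of interest; the sketch below uses synthetic data purely for illustration.

```python
# Minimal probe: a supervised model predicting a feature of interest
# from (synthetic) network activations.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
activations = rng.normal(size=(500, 64))       # stand-in activation patterns
feature = (activations[:, 0] > 0).astype(int)  # feature of interest

probe = LogisticRegression(max_iter=1000).fit(activations, feature)
print(probe.score(activations, feature))  # high score: feature is decodable
```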