- Ethics and Social Impacts of AI
- Explainable Artificial Intelligence (XAI)
- Adversarial Robustness in Machine Learning
- Media Influence and Politics
- Artificial Intelligence in Healthcare and Education
- Topic Modeling
- Domain Adaptation and Few-Shot Learning
- Cognitive Science and Mapping
- Multimodal Machine Learning Applications
- Complex Systems and Decision Making
- Online Learning and Analytics
- Hate Speech and Cyberbullying Detection
- Computational and Text Analysis Methods
- Natural Language Processing Techniques
- Big Data and Business Intelligence
- Artificial Intelligence in Law
- Social Media and Politics
- Advanced Image and Video Retrieval Techniques
- Privacy-Preserving Technologies in Data
- Software Engineering Research
Stanford University
2018-2024
IIT@MIT
2017
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and adaptable to wide range downstream tasks. We call these foundation underscore their critically central yet incomplete character. This report provides thorough account opportunities risks models, ranging from capabilities language, vision, robotics, reasoning, human interaction) technical principles(e.g., model architectures, training procedures, data, systems,...
Machine learning currently exerts an outsized influence on the world, increasingly affecting institutional practices and impacted communities. It is therefore critical that we question vague conceptions of field as value-neutral or universally beneficial, investigate what specific values advancing. In this paper, first introduce a method annotation scheme for studying encoded in documents such research papers. Applying scheme, analyze 100 highly cited machine papers published at premier...
Machine learning models that convert user-written text descriptions into images are now widely available online and used by millions of users to generate a day. We investigate the potential for these amplify dangerous complex stereotypes. find broad range ordinary prompts produce stereotypes, including simply mentioning traits, descriptors, occupations, or objects. For example, we cases prompting basic traits social roles resulting in reinforcing whiteness as ideal, occupations amplification...
Hundreds of millions people now interact with language models, uses ranging from help writing
Hundreds of millions people now interact with language models, uses ranging from serving as a writing aid to informing hiring decisions. Yet these models are known perpetuate systematic racial prejudices, making their judgments biased in problematic ways about groups like African Americans. While prior research has focused on overt racism social scientists have argued that more subtle character developed over time. It is unknown whether this covert manifests models. Here, we demonstrate...
A rapidly growing number of voices argue that AI research, and computer vision in particular, is powering mass surveillance. Yet the direct path from research to surveillance has remained obscured difficult assess. Here, we reveal Surveillance pipeline by analyzing three decades papers downstream patents, more than 40,000 documents. We find large majority annotated patents self-report their technology enables extracting data about humans. Moreover, these technologies specifically enable...
Despite the success of large vision and language models (VLMs) in many downstream applications, it is unclear how well they encode compositional information. Here, we create Attribution, Relation, Order (ARO) benchmark to systematically evaluate ability VLMs understand different types relationships, attributes, order. ARO consists Visual Genome test understanding objects' properties; for relational understanding; COCO & Flickr30k-Order, order sensitivity. orders magnitude larger than...
Learning data representations that are transferable and fair with respect to certain protected attributes is crucial reducing unfair decisions while preserving the utility of data. We propose an information-theoretically motivated objective for learning maximally expressive subject fairness constraints. demonstrate a range existing approaches optimize approximations Lagrangian dual our objective. In contrast these approaches, allows user control by specifying limits on unfairness. Exploiting...
Are members of marginalized communities silenced on social media when they share personal experiences racism? Here, we investigate the role algorithms, humans, and platform guidelines in suppressing disclosures racial discrimination. In a field study actual posts from neighborhood-based platform, find that users talk about their as targets racism, are disproportionately flagged for removal toxic by five widely used moderation algorithms major online platforms, including most recent large...
Machine learning models that convert user-written text descriptions into images are now widely available online and used by millions of users to generate a day. We investigate the potential for these amplify dangerous complex stereotypes. find broad range ordinary prompts produce stereotypes, including simply mentioning traits, descriptors, occupations, or objects. For example, we cases prompting basic traits social roles resulting in reinforcing whiteness as ideal, occupations amplification...
Machine learning currently exerts an outsized influence on the world, increasingly affecting institutional practices and impacted communities. It is therefore critical that we question vague conceptions of field as value-neutral or universally beneficial, investigate what specific values advancing. In this paper, first introduce a method annotation scheme for studying encoded in documents such research papers. Applying scheme, analyze 100 highly cited machine papers published at premier...