Bill Lin

ORCID: 0000-0003-0965-7247
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • VLSI and FPGA Design Techniques
  • Natural Language Processing Techniques
  • Parallel Computing and Optimization Techniques
  • VLSI and Analog Circuit Testing
  • Privacy-Preserving Technologies in Data
  • Interconnection Networks and Systems
  • Advancements in Photolithography Techniques
  • Machine Learning and Data Classification
  • Photonic and Optical Devices
  • Advanced Memory and Neural Computing
  • Adversarial Robustness in Machine Learning
  • Low-power high-performance VLSI design
  • Multimodal Machine Learning Applications
  • Advancements in Semiconductor Devices and Circuit Design
  • Explainable Artificial Intelligence (XAI)
  • Imbalanced Data Classification Techniques
  • Neural Networks and Reservoir Computing
  • Semiconductor materials and devices
  • Advanced Surface Polishing Techniques
  • Sparse and Compressive Sensing Techniques
  • 3D IC and TSV technologies
  • Stochastic Gradient Optimization Techniques
  • Speech and dialogue systems
  • Advanced Data Storage Technologies

University of California, San Diego
2016-2025

University of California, Berkeley
1992-2024

Allen Institute
2023-2024

Umbo Computer Vision (United Kingdom)
2023

University of Southern California
2022-2023

Southern California University for Professional Studies
2022-2023

University of Hong Kong
2007-2023

Cornell University
2023

Google (United States)
2023

Stanford University
2023

Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these are as yet poorly characterized. In order to inform future research, prepare for disruptive model capabilities, ameliorate socially harmful effects, it is vital that we understand the present near-future limitations of language models. To address this challenge, introduce Beyond Imitation Game benchmark (BIG-bench). BIG-bench...

10.48550/arxiv.2206.04615 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Bill Yuchen Lin, Chaoyang He, Zihang Ze, Hulin Wang, Yufen Hua, Christophe Dupuy, Rahul Gupta, Mahdi Soltanolkotabi, Xiang Ren, Salman Avestimehr. Findings of the Association for Computational Linguistics: NAACL 2022.

10.18653/v1/2022.findings-naacl.13 article EN cc-by Findings of the Association for Computational Linguistics: NAACL 2022 2022-01-01

We present LLM-Blender, an ensembling framework designed to attain consistently superior performance by leveraging the diverse strengths of multiple open-source large language models (LLMs). Our consists two modules: PairRanker and GenFuser, addressing observation that optimal LLMs for different examples can significantly vary. employs a specialized pairwise comparison method distinguish subtle differences between candidate outputs. It jointly encodes input text pair candidates, using...

10.18653/v1/2023.acl-long.792 article EN cc-by 2023-01-01

Conventional federated learning (FL) approaches are ineffective in scenarios where clients have significant differences the distributions of their local data. The Non-IID data distribution client causes a drift model updates from global optima, which significantly impacts performance trained models. In this paper, we present new algorithm called FLIS that aims to address problem by grouping into clusters jointly trainable distributions. This is achieved comparing <italic...

10.1109/ojcs.2023.3262203 article EN cc-by IEEE Open Journal of the Computer Society 2023-01-01

Clustered federated learning (FL) has been shown to produce promising results by grouping clients into clusters. This is especially effective in scenarios where separate groups of have significant differences the distributions their local data. Existing clustered FL algorithms are essentially trying group together with similar so that same cluster can leverage each other's data better perform learning. However, prior attempt learn these distribution similarities indirectly during training,...

10.1609/aaai.v37i8.26197 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

The traditional approach in FL tries to learn a single global model collaboratively with the help of many clients under orchestration central server. However, learning might not work well for all participating data heterogeneity. Therefore, personalization becomes crucial handling challenges that arise statistical heterogeneity and non-IID distribution data. Unlike prior works, this we propose new obtaining personalized from client-level objective. This further motivates participate...

10.1109/icdcsw53096.2021.00012 article EN 2021-07-01

Though successful, federated learning (FL) presents new challenges for machine learning, especially when the issue of data heterogeneity, also known as Non-IID data, arises.To cope with statistical previous works incorporated a proximal term in local optimization or modified model aggregation scheme at server side advocated clustered approaches where central groups agent population into clusters jointly trainable distributions to take advantage certain level personalization.While effective,...

10.1109/tai.2023.3293068 article EN cc-by IEEE Transactions on Artificial Intelligence 2023-07-06

10.18653/v1/2024.findings-acl.762 article EN Findings of the Association for Computational Linguistics: ACL 2022 2024-01-01

With the relentless scaling of technology nodes, design co-optimization (DTCO) for conventional (Conv.) cell structure is starting to reach its limitations due limited routing resources, lateral p-n separations, and performance requirements. As a result, system (STCO) has been proposed exploit benefits 3-D architectures. Complementary-FET (CFET) technology, which stacks p-FET on n-FET or vice versa, can release restriction separation reduce in-cell congestion by enabling direct connections....

10.1109/tvlsi.2021.3065639 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 2021-03-30

This paper proposes a new paradigm for learning set of independent logical rules in disjunctive normal form as an interpretable model classification. We consider the problem decision rule training neural network specific, yet very simple two-layer architecture. Each neuron first layer directly maps to if-then after training, and output second disjunction set. Our representation neurons this enables us encode both positive negative association features rule. State-of-the-art net approaches...

10.1609/aaai.v35i5.16555 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Humans can perform unseen tasks by recalling relevant skills acquired previously and then generalizing them to the target tasks, even if there is no supervision at all. In this paper, we aim improve kind of cross-task generalization ability massive multi-task language models, such as T0 FLAN, in an unsupervised setting. We propose a retrieval-augmentation method named ReCross that takes few unlabelled examples queries retrieve small subset upstream data uses update model for better...

10.48550/arxiv.2204.07937 preprint EN cc-by arXiv (Cornell University) 2022-01-01

10.1145/3658617.3703146 article EN Proceedings of the 28th Asia and South Pacific Design Automation Conference 2025-01-20

In this article, we propose an automated standard cell synthesis framework, SP&R, which simultaneously solves P&R without deploying any sequential/separate operations, by a novel dynamic pin allocation scheme. The proposed SP&R utilizes the multiobjective optimization feature of satisfiability modulo theories (SMT) to obtain optimal layouts. To achieve practical scalability develop various search-space reduction techniques, including breaking symmetry, conditional assignment/localization,...

10.1109/tcad.2020.3037885 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2020-11-13

Language models (LMs) have demonstrated their capability in possessing commonsense knowledge of the physical world, a crucial aspect performing tasks everyday life. However, it remains unclear whether they capacity to generate grounded, executable plans for embodied tasks. This is challenging task as LMs lack ability perceive environment through vision and feedback from environment. In this paper, we address important research question present first investigation into topic. Our novel...

10.1609/aaai.v37i11.26549 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Proprietary LMs such as GPT-4 are often employed to assess the quality of responses from various LMs. However, concerns including transparency, controllability, and affordability strongly motivate development open-source specialized in evaluations. On other hand, existing open evaluator exhibit critical shortcomings: 1) they issue scores that significantly diverge those assigned by humans, 2) lack flexibility perform both direct assessment pairwise ranking, two most prevalent forms...

10.48550/arxiv.2405.01535 preprint EN arXiv (Cornell University) 2024-05-02

Current asynchronous tools are focussed mainly on the design of a single interface module. In many applications, one must interacting modules that potentially communicate in complex and intricate ways. When designing communicating modules, several difficult problems arise. First, even if each individual module can be synthesized correctly, according to environmental assumptions specified for module, composition may not work properly. Thus, needs have way model how interact with other, verify...

10.1145/196244.196272 article EN 1994-01-01

Crossbar architectures are widely used to implement high-performance network switches and routers. A crossbar switch can transfer cells between multiple ports simultaneously by closing cross points. This configuration must be determined an intelligent centralized scheduler that ensure fairness high utilization. In this paper, we describe the design implementation of two symmetric scheduling algorithms for configuring crossbars in input-queued support virtual output queueing. Our target is a...

10.1109/icc.1999.765457 article EN 2003-01-20

Syntax directed translation based compilation from high-level concurrent programs has matured significantly over the past few years. They have been applied to significant designs in domains of digital signal processing and microprocessor designs. For data-path dominated designs, like those found applications, syntax approaches shown generate efficient asynchronous implementations. However for control-dominated where data parts play a relatively minor role, we believe solutions produced by...

10.1109/async.1996.494454 article EN 2002-12-23

Pin accessibility encounters nontrivial challenges due to the smaller number of routing tracks, higher pin density, and more complex design rules. Consequently, securing rule-correct routability has become a critical bottleneck for sub-10-nm IC designs (particularly in detailed stage) costing days runtime. To reduce turnaround time, designers demand new methodologies analyze feasibility given layout architecture (e.g., conditional rules, assignment patterns, etc). There are several...

10.1109/tcad.2020.2977066 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2020-02-28
Coming Soon ...