Yifan Sun

ORCID: 0000-0003-3532-6521
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Parallel Computing and Optimization Techniques
  • Advanced Neural Network Applications
  • Advanced Data Storage Technologies
  • Cloud Computing and Resource Management
  • Distributed and Parallel Computing Systems
  • Data Visualization and Analytics
  • Stochastic Gradient Optimization Techniques
  • Computer Graphics and Visualization Techniques
  • Brain Tumor Detection and Classification
  • Domain Adaptation and Few-Shot Learning
  • Vacuum and Plasma Arcs
  • Machine Learning and ELM
  • UAV Applications and Optimization
  • Machine Learning and Algorithms
  • Robotic Path Planning Algorithms
  • Embedded Systems Design Techniques
  • GNSS positioning and interference
  • Adversarial Robustness in Machine Learning
  • Autonomous Vehicle Technology and Safety
  • Ferroelectric and Negative Capacitance Devices
  • Indoor and Outdoor Localization Technologies
  • Radio Wave Propagation Studies
  • Electrostatic Discharge in Electronics
  • Complex Systems and Decision Making
  • Data Stream Mining Techniques

Williams (United States)
2020-2025

William & Mary
2020-2025

Dalian University of Technology
2022-2025

Baidu (China)
2022-2025

Northwest A&F University
2024-2025

Shanghai Electric (China)
2025

Second Affiliated Hospital of Nanjing Medical University
2025

Ningbo University
2024

Beijing Jiaotong University
2024

Fujian University of Traditional Chinese Medicine
2023-2024

Deep neural networks have demonstrated state-of-the-art performance in a variety of real-world applications. In order to obtain gains, these grown larger and deeper, containing millions or even billions parameters over thousand layers. The tradeoff is that large architectures require an enormous amount memory, storage, computation, thus limiting their usability. Inspired by the recent tensor ring factorization, we introduce Tensor Ring Networks (TR-Nets), which significantly compress both...

10.1109/cvpr.2018.00972 article EN 2018-06-01

Graphics Processing Units (GPUs) can easily outperform CPUs in processing large-scale data parallel workloads, but are considered weak serialized tasks and communicating with other devices. Pursuing a CPU-GPU collaborative computing model which takes advantage of both devices could provide an important breakthrough realizing the full performance potential heterogeneous computing. In recent years platform vendors runtime systems have added new features such as unified memory space dynamic...

10.1109/iiswc.2016.7581262 article EN 2016-09-01

In response to COVID-19, a vast number of visualizations have been created communicate information the public. Information exposure in public health crisis can impact people's attitudes towards and responses risks, ultimately trajectory pandemic. As such, there is need for work that documents, organizes, investigates what COVID-19 presented We address this gap through an analysis 668 visualizations. present our findings conceptual framework derived from analysis, examines who, (uses) data,...

10.1145/3411764.3445381 preprint EN 2021-05-06

The integration of AI assistants, especially through the development Large Language Models (LLMs), into computer science education has sparked significant debate, highlighting both their potential to augment student learning and risks associated with misuse. An emerging body work looked using LLMs in education, primarily focusing on evaluating performance existing models or conducting short-term human subject studies. However, very little examined impacts LLM-powered assistants students...

10.1145/3657604.3662036 article EN 2024-07-09

The rapidly growing popularity and scale of data-parallel workloads demand a corresponding increase in raw computational power Graphics Processing Units (GPUs). As single-GPU platforms struggle to satisfy these performance demands, multi-GPU have started dominate the high-performance computing world. advent such systems raises number design challenges, including GPU microarchitecture, interconnect fabric, runtime libraries, associated programming models. research community currently lacks...

10.1145/3307650.3322230 article EN 2019-06-14

Moore's Law and Dennard Scaling have guided the semiconductor industry for past few decades. Recently, both laws faced validity challenges as transistor sizes approach practical limits of physics. We are interested in testing these reflect on reasons responsible. In this work, we collect data more than 4000 publicly-available CPU GPU products. find that scaling remains critical keeping valid. However, architectural solutions become increasingly important will play a larger role future....

10.48550/arxiv.1911.11313 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Cross-domain weakly supervised object detection (CD-WSOD) aims to adapt the model a novel target domain with easily acquired image-level annotations. How align source and domains is critical CDWSOD accuracy. Existing methods usually focus on partial components for alignment. In contrast, this paper considers that all are important proposes Holistic Hier-archical Feature Alignment (H <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> FA)...

10.1109/cvpr52688.2022.01393 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

During crises like COVID-19, individuals are inundated with conflicting and time-sensitive information that drives a need for rapid assessment of the trustworthiness reliability sources platforms. This parallels evolutions in infrastructures, ranging from social media to government data Distinct current literature, which presumes static relationship between presence or absence trust people's behaviors, our mixed-methods research focuses on situated trust, is shaped by information-seeking...

10.1145/3491102.3501889 article EN CHI Conference on Human Factors in Computing Systems 2022-04-27

High performance multi-GPU systems are widely used to accelerate training of deep neural networks (DNNs) by exploiting the inherently massive parallel nature process. Typically, DNNs in leverages a data-parallel model which DNN is replicated on every GPU, and each GPU performs Forward Propagation (FP), Backward (BP) and, Weight Update (WU). We analyze WU stage that composed collective communication (e.g., allReduce, broadcast), demands very efficient among GPUs avoid diminishing returns when...

10.1109/iiswc.2018.8573521 article EN 2018-09-01

During the COVID-19 pandemic, a number of data visualizations were created to inform public about rapidly evolving crisis. Data dashboards, form information dissemination used during have facilitated this process by visualizing statistics regarding cases over time. Prior work on has primarily focused design and evaluation specific visualization systems from technology-centered perspectives. However, little is known what occurs behind scenes creation processes, given complex sociotechnical...

10.1109/tvcg.2022.3209493 article EN IEEE Transactions on Visualization and Computer Graphics 2022-01-01

This paper presents a DETR-based method for cross-domain weakly supervised object detection (CDWSOD), aiming at adapting the detector from source to target domain through weak supervision. We think DETR has strong potential CDWSOD due an insight: encoder and decoder in are both based on attention mechanism thus capable of aggregating semantics across entire image. The aggregation results, i.e., image-level predictions, can naturally exploit supervision alignment. Such motivated, we propose...

10.1109/cvpr52729.2023.01099 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Unexpected disasters, both naturally occurring and those caused through human actions, result in severe damage to communication infrastructure. Additionally, such events are accompanied by sharp spikes the usage of commercially licensed spectrum, when affected victims tragedy attempt transmit information about themselves capture high bandwidth data form pictures videos. We envisage cognitive radio as a candidate solution situations, where devices can identify alternate frequency bands,...

10.1109/mcom.2014.6917404 article EN IEEE Communications Magazine 2014-10-01

Heterogeneous systems, that marry CPUs and GPUs together in a range of configurations, are quickly becoming the design paradigm for today's platforms because their impressive parallel processing capabilities. However, many existing heterogeneous GPU is only treated as an accelerator by CPU, working slave to CPU master. But recently we starting see introduction new class devices changes system runtime model, which enable accelerators be first-class computing devices. To support...

10.1109/ispass.2016.7482093 article EN 2016-04-01

As transistor scaling becomes increasingly more difficult to achieve, the core count on a single GPU chip has also become extremely challenging. volume of data process in today's parallel workloads continues grow unbounded, we need find scalable solutions that can keep up with this increasing demand. To meet modern-day applications, multi-GPU systems offer promising path deliver high performance and large memory capacity. However, suffer from issues associated GPU-to-GPU communication...

10.1109/hpca47549.2020.00055 article EN 2020-02-01

Graph Neural Networks (GNNs) have emerged as a promising class of Machine Learning algorithms to train on non-euclidean data. GNNs are widely used in recommender systems, drug discovery, text understanding, and traffic forecasting. Due the energy efficiency high-performance capabilities GPUs, GPUs natural choice for accelerating training GNNs. Thus, we want better understand architectural system-level implications GPUs. Presently, there is no benchmark suite available designed study GNN...

10.1109/ispass51385.2021.00013 article EN 2021-03-01

The dynamic inference, which adaptively allocates computational budgets for different samples, is a prevalent approach achieving efficient action recognition. Current studies primarily focus on data-efficient regime that reduces spatial or temporal redundancy, their combination, by selecting partial video data, such as clips, frames, patches. However, these approaches often utilize fixed and computationally expensive networks. From perspective, this article introduces novel model-efficient...

10.1109/tnnls.2024.3525187 article EN IEEE Transactions on Neural Networks and Learning Systems 2025-01-01

We investigate a family of approximate multi-step proximal point methods, framed as implicit linear discretizations gradient flow. The resulting methods are with similar computational cost in each update the method. explore several optimization where applying an multistep points method results improved convergence behavior. also include analysis for proposed problem settings: quadratic problems, general problems that strongly or weakly (non)convex, and accelerated alternating projections.

10.48550/arxiv.2501.08146 preprint EN arXiv (Cornell University) 2025-01-14

The conduction delay time is one of the main technical parameters surface flashover triggered vacuum switch (STVS). Detailed research and optimization characteristics are necessary for development high-performance STVS. In this paper, performance STVS with different electrode structures operating investigated in detail. Based on analysis process, factors affecting discussed. experimental platform built by using a detachable chamber, compared under parameters. This analyzes discusses reasons...

10.1063/5.0249811 article EN cc-by AIP Advances 2025-01-01

Recent advancements in large language models (LLMs) have spurred growing interest automatic theorem proving using Lean4, where effective tree search methods are crucial for navigating proof spaces. While the existing approaches primarily rely on value functions and Monte Carlo Tree Search (MCTS), potential of simpler like Best-First (BFS) remains underexplored. This paper investigates whether BFS can achieve competitive performance large-scale tasks. We present \texttt{BFS-Prover}, a...

10.48550/arxiv.2502.03438 preprint EN arXiv (Cornell University) 2025-02-05

Generative AI (GenAI) has brought opportunities and challenges for higher education as it integrates into teaching learning environments. As instructors navigate this new landscape, understanding their engagement with attitudes toward GenAI is crucial. We surveyed 178 from a single U.S. university to examine current practices, perceptions, trust, distrust of in March 2024. While most reported moderate high familiarity GenAI-related concepts, actual use tools direct instructional tasks...

10.48550/arxiv.2502.05770 preprint EN arXiv (Cornell University) 2025-02-08
Coming Soon ...