- Privacy-Preserving Technologies in Data
- Human Pose and Action Recognition
- Advanced Image and Video Retrieval Techniques
- Topic Modeling
- Internet Traffic Analysis and Secure E-voting
- Cryptography and Data Security
- COVID-19 Diagnosis Using AI
- Multimodal Machine Learning Applications
- Video Surveillance and Tracking Methods
- Mobile Crowdsensing and Crowdsourcing
- Stochastic Gradient Optimization Techniques
- IoT and Edge/Fog Computing
- Artificial Intelligence in Healthcare and Education
- Domain Adaptation and Few-Shot Learning
- Image Retrieval and Classification Techniques
- Recommender Systems and Techniques
- Green IT and Sustainability
- Emotion and Mood Recognition
- Personal Information Management and User Behavior
- Gait Recognition and Analysis
- Team Dynamics and Performance
- Adversarial Robustness in Machine Learning
- Parallel Computing and Optimization Techniques
- Speech Recognition and Synthesis
- Natural Language Processing Techniques
University of Cambridge, 2025
Huawei Technologies (China), 2024
Beijing University of Posts and Telecommunications, 2010-2024
Tsinghua University, 2024
Shanghai Jiao Tong University, 2024
Park Plaza Hospital, 2024
Beijing Academy of Artificial Intelligence, 2022
University of Virginia, 2022
Dalian University, 2022
Dalian University of Technology, 2022
State-of-the-art approaches in the previous Emotion Recognition in the Wild challenges are usually built on prevailing Convolutional Neural Networks (CNNs). Although there is clear evidence that CNNs with increased depth or width can bring improved prediction accuracy, existing top performers provide supervision only at the output feature layer, resulting in insufficient training of deep CNN models. In this paper, we present a new learning method named Supervised Scoring Ensemble (SSE) to advance this challenge with CNNs...
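The contrast drawn above, output-only supervision versus supervising intermediate layers as well, can be made concrete with auxiliary classifier heads. The following is a minimal PyTorch sketch of that general idea; the layer sizes, head placement, and the 0.3 loss weight are illustrative assumptions, not the paper's exact SSE design.

```python
# Minimal sketch of intermediate (auxiliary) supervision in PyTorch.
# Dimensions and the auxiliary loss weight are illustrative assumptions,
# not the exact SSE architecture from the paper.
import torch
import torch.nn as nn

class AuxSupervisedCNN(nn.Module):
    def __init__(self, num_classes=7):
        super().__init__()
        self.stage1 = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.stage2 = nn.Sequential(
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        # Auxiliary head gives direct supervision to an intermediate layer.
        self.aux_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, num_classes))
        self.main_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, num_classes))

    def forward(self, x):
        f1 = self.stage1(x)
        f2 = self.stage2(f1)
        return self.aux_head(f1), self.main_head(f2)

model = AuxSupervisedCNN()
criterion = nn.CrossEntropyLoss()
x = torch.randn(8, 3, 48, 48)          # a batch of face crops
y = torch.randint(0, 7, (8,))          # emotion labels
aux_logits, main_logits = model(x)
# Supervision at both depths; 0.3 is an assumed auxiliary weight.
loss = criterion(main_logits, y) + 0.3 * criterion(aux_logits, y)
loss.backward()
```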
In this paper, we present HoloNet, a well-designed Convolutional Neural Network (CNN) architecture underlying our submissions to the video-based sub-challenge of the Emotion Recognition in the Wild (EmotiW) 2016 challenge. In contrast to previous related methods, which usually adopt relatively simple and shallow neural network architectures to address the emotion recognition task, HoloNet rests on three critical design considerations. (1) To reduce redundant filters and enhance non-saturated non-linearity in the lower...
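The first design point is truncated here. Concatenated ReLU (CReLU) is one published technique aimed at exactly this goal, reducing filter redundancy in lower convolutional layers while preserving non-saturated non-linearity, so a minimal sketch is given below; treating CReLU as HoloNet's actual choice is an assumption based on this excerpt.

```python
# Sketch of a Concatenated ReLU (CReLU) block, one known way to cut
# filter redundancy in lower conv layers. Whether this matches HoloNet's
# exact design is an assumption; the abstract is truncated.
import torch
import torch.nn as nn

class CReLUConv(nn.Module):
    def __init__(self, in_ch, out_ch):
        super().__init__()
        # Only half the filters are learned; negations supply the rest.
        self.conv = nn.Conv2d(in_ch, out_ch // 2, 3, padding=1)

    def forward(self, x):
        y = self.conv(x)
        # Concatenating y and -y before ReLU keeps both phases of the
        # response while halving the learned parameters.
        return torch.relu(torch.cat([y, -y], dim=1))

block = CReLUConv(3, 64)
print(block(torch.randn(1, 3, 32, 32)).shape)  # torch.Size([1, 64, 32, 32])
```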
Large foundation models, including large language models (LLMs), vision transformers (ViTs), diffusion models, and LLM-based multimodal models, are revolutionizing the entire machine learning lifecycle, from training to deployment. However, the substantial advances in versatility and performance these models offer come at a significant cost in hardware resources. To support the growth of these models in a scalable and environmentally sustainable way, there has been considerable focus on developing resource-efficient strategies. This survey...
In the current AI era, mobile devices such as smartphones are tasked with executing a myriad of deep neural networks (DNNs) locally. This presents a complex landscape, as these models are highly fragmented in terms of architecture, operators, and implementations. Such fragmentation poses significant challenges to the co-optimization of hardware, systems, and algorithms for efficient and scalable mobile AI.
Visual question answering (VQA) requires joint comprehension of images and natural language questions, where many questions cannot be directly or clearly answered from the visual content alone but require reasoning over structured human knowledge with confirmation from the visual content. This paper proposes a Visual Knowledge Memory Network (VKMN) to address this issue, which seamlessly incorporates structured human knowledge and deep visual features into memory networks in an end-to-end learning framework. Compared with existing methods for leveraging external knowledge to support VQA, VKMN stresses...
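The generic mechanism behind knowledge-augmented memory networks of this kind is a soft attention read over stored key-value pairs. The sketch below illustrates that read step only; VKMN's exact formulation, and the names used here, are assumptions of this illustration.

```python
# Minimal sketch of a key-value memory read, the generic mechanism used by
# knowledge-augmented memory networks; VKMN's exact design may differ.
import torch
import torch.nn.functional as F

def memory_read(query, keys, values):
    """query: (d,); keys/values: (n, d). Returns attention-weighted value."""
    scores = keys @ query                  # similarity of query to each key
    weights = F.softmax(scores, dim=0)     # soft address over memory slots
    return weights @ values                # weighted sum of stored values

d, n = 128, 50
query = torch.randn(d)                     # fused image+question feature
keys = torch.randn(n, d)                   # embedded knowledge-entry keys
values = torch.randn(n, d)                 # corresponding answer evidence
print(memory_read(query, keys, values).shape)  # torch.Size([128])
```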
Transformer-based pre-trained models have revolutionized NLP with their superior performance and generality. Fine-tuning them for downstream tasks often requires private data, for which federated learning is the de-facto approach (i.e., FedNLP). However, our measurements show that FedNLP is prohibitively slow due to large model sizes and the resultant high network/computation cost. Towards practical FedNLP, we identify adapters, small bottleneck modules inserted at a variety of model layers, as the key building blocks. A challenge...
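A bottleneck adapter of the kind named above is a small down-project/up-project module with a residual connection; with the backbone frozen, only these few parameters need to be trained and exchanged in FL. The hidden and bottleneck dimensions below are illustrative assumptions.

```python
# Sketch of a bottleneck adapter: a small module inserted into a
# transformer layer; only the adapter is trained and exchanged in FL.
# Dimensions (768 hidden, 32 bottleneck) are illustrative assumptions.
import torch
import torch.nn as nn

class Adapter(nn.Module):
    def __init__(self, hidden=768, bottleneck=32):
        super().__init__()
        self.down = nn.Linear(hidden, bottleneck)  # project down
        self.up = nn.Linear(bottleneck, hidden)    # project back up

    def forward(self, x):
        # Residual connection keeps the frozen backbone's signal intact.
        return x + self.up(torch.relu(self.down(x)))

# Training only adapters shrinks the FL payload dramatically:
adapter = Adapter()
trainable = sum(p.numel() for p in adapter.parameters())
print(f"{trainable:,} trainable parameters per adapter")  # ~50k, vs ~110M
                                                          # for a full BERT-base
```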
Natural language processing (NLP) sees rich mobile applications. To support various language understanding tasks, a foundation NLP model is often fine-tuned in a federated, privacy-preserving setting (FL). This process currently relies on at least hundreds of thousands of labeled training samples from clients; yet mobile users often lack the willingness or knowledge to label their data. Such an inadequacy of data labels is known as the few-shot scenario, and it becomes the key blocker for practical FedNLP. For the first time, this work investigates...
Large Language Models (LLMs) are transforming the landscape of mobile intelligence. Federated Learning (FL), a method to preserve user data privacy, is often employed in fine-tuning LLMs for downstream tasks, an approach known as FedLLM. Though recent efforts have addressed the network issue induced by vast model size, they have not practically mitigated vital challenges concerning its integration with mobile devices, such as significant memory consumption and sluggish convergence. In response to these challenges, this...
Forgetting is inevitable in human memory. Recently, multimodal embedding models have been proposed to vectorize reality into a unified embedding space. The generated embeddings can be easily retrieved to help mobile users remember and recall information when needed. However, as the model's capacity increases, its resource consumption also rises. The resulting slow throughput and significant computational requirements hinder its deployment on mobile devices. In this paper, we present Reminisce, the first...
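The retrieval step described above, finding stored embeddings closest to a query embedding, is typically a nearest-neighbor search; a minimal sketch follows. This is purely illustrative, since the excerpt does not describe Reminisce's actual pipeline.

```python
# Sketch of embedding-based recall: store unified multimodal embeddings
# and retrieve the closest memories for a query. Purely illustrative;
# Reminisce's actual pipeline is not described in the excerpt.
import numpy as np

def recall(query_emb, memory_embs, top_k=3):
    """Cosine-similarity search over stored embeddings."""
    q = query_emb / np.linalg.norm(query_emb)
    m = memory_embs / np.linalg.norm(memory_embs, axis=1, keepdims=True)
    scores = m @ q
    return np.argsort(-scores)[:top_k]     # indices of best-matching memories

memories = np.random.randn(1000, 512)      # embeddings of photos, notes, etc.
query = np.random.randn(512)               # embedding of a user's question
print(recall(query, memories))
```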
Highly efficient catalysts for both the oxygen reduction reaction (ORR) and the oxygen evolution reaction (OER) are key to the commercialization of rechargeable zinc-air batteries (ZABs). In this work, a catalyst with uniform nanospherical morphology was prepared from cobalt nitrate, acetylacetone, and hydrazine hydrate. The final catalyst exhibits high ORR and OER performance, with an ORR half-wave potential of 0.911 V [vs reversible hydrogen electrode (RHE)] and a low OER potential of 1.57 V (vs RHE) at 10 mA cm-2 in 0.1 M KOH solution. Notably, the ZAB based on...
Communication overhead is a significant bottleneck in federated learning (FL), and it has been exacerbated by the increasing size of AI models. In this paper, we propose FedRDMA, a communication-efficient cross-silo FL system that integrates RDMA into the FL communication protocol. To overcome the limitations of RDMA in wide-area networks (WANs), FedRDMA divides the updated model into chunks and designs a series of optimization techniques to improve the efficiency and robustness of RDMA-based communication. We implement FedRDMA atop an industrial...
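The chunking idea itself is simple to state in code: serialize the model update and split it into fixed-size pieces before transmission. The sketch below shows only that step; the chunk size is an assumption, and FedRDMA's actual RDMA transport and optimizations are not shown.

```python
# Sketch of the chunking idea: split a serialized model update into
# fixed-size chunks before transmission. The 1 MiB chunk size is an
# assumption; FedRDMA's actual RDMA plumbing is not shown.
import io
import torch

def chunk_update(state_dict, chunk_bytes=1 << 20):
    buf = io.BytesIO()
    torch.save(state_dict, buf)            # serialize the model update
    blob = buf.getvalue()
    return [blob[i:i + chunk_bytes] for i in range(0, len(blob), chunk_bytes)]

update = {"layer.weight": torch.randn(1024, 1024)}   # ~4 MiB of weights
chunks = chunk_update(update)
print(len(chunks), "chunks of <= 1 MiB each")
```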
Local feature-based symmetry detection algorithms can simultaneously consider symmetries over all locations, scales, and orientations, and achieve state-of-the-art performance. This paper demonstrates the limitations of these algorithms when dealing with background clutter, low contrast, and smooth surfaces, and presents an adaptive feature point detection algorithm to overcome those limitations. Quantitative evaluations and subjective comparisons against reflection symmetry detection baselines on the image dataset released by the "Symmetry Detection from Real...
Transformer-based pre-trained models have emerged as the predominant solution for natural language processing (NLP). Fine-tuning such models for downstream tasks often requires a considerable amount of labeled private data. In practice, the data is distributed across heterogeneous mobile devices and may be prohibited from being uploaded. Moreover, well-curated labeled data is scarce, presenting an additional challenge. To address these challenges, we first introduce a data generator for federated few-shot learning tasks, which...
Recent advancements in integrating large language models (LLMs) with application programming interfaces (APIs) have gained significant interest in both academia and industry. These API-based agents, leveraging the strong autonomy and planning capabilities of LLMs, can efficiently solve problems requiring multi-step actions. However, their ability to handle multi-dimensional difficulty levels, diverse task types, and real-world demands through APIs remains unknown. In this paper, we introduce...
We are witnessing the emergence of ubiquitous learning, where each device (smartphones, wearables, IoT devices, etc.) can learn from its environment either alone or collaboratively. Such a new paradigm is enabled by deep learning techniques, more specifically, on-device training. Despite its popularity in the machine learning community, unfortunately, there is no systematic understanding of a critical question: how much does it cost to train typical deep learning models on commodity end devices? Therefore, this work performs...
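The cost question above is, at its core, a measurement problem: per-step wall-clock time, energy, and memory. The sketch below shows the simplest version of such a measurement; the model and batch are placeholders standing in for "typical models on commodity end devices", not the study's actual workloads.

```python
# Sketch of a per-iteration training cost measurement: wall-clock time for
# one forward/backward/update step. The tiny model and batch are
# placeholders, not the study's actual workloads.
import time
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.01)
x, y = torch.randn(32, 512), torch.randint(0, 10, (32,))

start = time.perf_counter()
loss = nn.functional.cross_entropy(model(x), y)
opt.zero_grad()
loss.backward()
opt.step()
elapsed = time.perf_counter() - start
print(f"one training step: {elapsed * 1e3:.1f} ms")
```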
Large language models (LLMs) have achieved remarkable performance in various downstream tasks. However, training LLMs is computationally expensive and requires a large amount of memory. To address this issue, backpropagation-free (BP-free) training has been proposed as a promising approach to reduce the computational and memory costs of training LLMs. In this survey, we provide a comprehensive overview of BP-free training for LLMs. We first outline three mainstream BP-free methods. Subsequently, we introduce their optimizations for LLMs. The goal of this survey...
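One mainstream BP-free family is zeroth-order (ZO) optimization, which estimates a gradient from paired forward passes under a random perturbation instead of backpropagating. The sketch below shows a basic two-point ZO-SGD step on a toy quadratic; the loss, scales, and loop are illustrative assumptions, and which three methods the survey covers is not specified in the excerpt.

```python
# Sketch of zeroth-order (ZO) training, one mainstream BP-free approach:
# estimate the gradient from two forward passes under a random perturbation.
# The toy quadratic loss and hyperparameters are illustrative assumptions.
import torch

def zo_step(params, loss_fn, lr=1e-2, eps=1e-3):
    """Perturb parameters, estimate the gradient from two losses, update."""
    z = [torch.randn_like(p) for p in params]
    with torch.no_grad():
        for p, zi in zip(params, z): p += eps * zi
        loss_plus = loss_fn()                       # forward pass 1
        for p, zi in zip(params, z): p -= 2 * eps * zi
        loss_minus = loss_fn()                      # forward pass 2
        g = (loss_plus - loss_minus) / (2 * eps)    # projected gradient
        for p, zi in zip(params, z):
            p += eps * zi                           # restore parameters
            p -= lr * g * zi                        # ZO-SGD update
    return loss_plus

w = torch.randn(10)
target = torch.ones(10)
for _ in range(500):
    zo_step([w], lambda: ((w - target) ** 2).sum())
print(((w - target) ** 2).sum())  # loss shrinks with no backward pass
```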
Deploying large language model (LLM) inference on mobile devices is cost-efficient for companies and well addresses the privacy concerns of users. However, limited computation capacity and memory constraints hinder practical deployment. Prior work strives to expand model size for better accuracy, while there is a lack of systematic understanding of the "small" sub-10-billion-parameter LLMs that are already feasible on current commodity devices. To reveal the landscape of LLMs on mobile devices, we conducted a comprehensive...
In today's landscape, smartphones have evolved into hubs hosting a multitude of deep learning models intended for local execution. A key realization driving this work is the notable fragmentation among these models, characterized by varied architectures, operators, and implementations. This fragmentation imposes a significant burden on the comprehensive optimization of hardware, system settings, and algorithms. Buoyed by recent strides in large foundation models, this work introduces a pioneering paradigm for mobile AI: collaborative management...
There has been increasing attention on electroencephalograph (EEG) based personal identification over the last decade. Most existing methods address this problem by Euclidean-metric nearest neighbor (NN) search. However, under various recording conditions, a simple Euclidean distance cannot model the similarity relations between EEG signals precisely. To overcome this drawback, a local Large Margin Nearest Neighbor learning method (L-LMNN) for EEG-based identification is proposed in this paper. For each sample, a separate metric is learned, making intra-class samples...
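A learned metric of the kind LMNN produces is a Mahalanobis distance, d_M(x, y) = sqrt((x - y)^T M (x - y)) with M positive semi-definite, which replaces the plain Euclidean distance in NN search. The sketch below shows a single learned metric for illustration; in the local variant described, each sample would get its own M, and the values here are random stand-ins rather than a trained result.

```python
# Sketch of a learned (Mahalanobis) metric of the kind LMNN produces.
# A single metric is shown for illustration; the paper's local variant
# learns one per sample. L here is a random stand-in, not a trained result.
import numpy as np

def mahalanobis(x, y, M):
    d = x - y
    return np.sqrt(d @ M @ d)

rng = np.random.default_rng(0)
L = rng.normal(size=(16, 16))       # learned linear transform (stand-in)
M = L.T @ L                         # positive semi-definite by construction
x, y = rng.normal(size=16), rng.normal(size=16)
print(mahalanobis(x, y, M))         # learned distance used for NN search
print(np.linalg.norm(x - y))        # plain Euclidean baseline
```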
Electrochemical surface-enhanced Raman scattering (EC-SERS) spectroscopy is an ultrasensitive spectro-electrochemistry technique that provides mechanistic and dynamic information on electrochemical interfaces at the molecular level. However, plasmon-mediated photocatalysis hinders observation of the intrinsic behavior of molecules at interfaces. This work aimed to develop a facile method for constructing a reliable EC-SERS substrate that can be used to study interfacial dynamics. Herein, a novel Ag-WO3−x electrochromic heterostructure...