Yuan Cheng

ORCID: 0000-0003-2502-9101
Research Areas
  • Topic Modeling
  • Multimodal Machine Learning Applications
  • Advanced Image and Video Retrieval Techniques
  • Recommender Systems and Techniques
  • Domain Adaptation and Few-Shot Learning
  • Natural Language Processing Techniques
  • Human Pose and Action Recognition
  • Speech Recognition and Synthesis
  • Advanced Neural Network Applications
  • Video Analysis and Summarization
  • Advanced Memory and Neural Computing
  • Neural Networks and Reservoir Computing
  • Speech and dialogue systems
  • Advanced Text Analysis Techniques
  • Machine Learning and Data Classification
  • Data Quality and Management
  • Advanced Image Fusion Techniques
  • Ferroelectric and Negative Capacitance Devices
  • Emotion and Mood Recognition
  • Macrophage Migration Inhibitory Factor
  • Seismic Imaging and Inversion Techniques
  • Image Processing and 3D Reconstruction
  • Traffic Prediction and Management Techniques
  • Advanced Clustering Algorithms Research
  • Multi-Agent Systems and Negotiation

Fudan University
2023-2025

Shanghai University
2024

Tsinghua University
2024

Peking University First Hospital
2024

Peking University
2024

Guangzhou Academy of Special Equipment Inspection and Testing
2023

Beijing University of Posts and Telecommunications
2022

Zhejiang Financial College
2022

Renmin University of China
2021

Heilongjiang University of Science and Technology
2020

Abstract: A scalable, high-capacity, and low-power computing architecture is the primary assurance for increasingly manifold and large-scale machine learning tasks. Traditional electronic artificial agents built on conventional power-hungry processors have faced energy and scaling walls, hindering sustainable performance improvement and iterative multi-task learning. Turning to another modality, light, photonic computing has been progressively applied in highly efficient neuromorphic systems. Here, we...

10.1038/s41377-024-01395-4 article EN cc-by Light Science & Applications 2024-02-26

Kun Zhou, Xiaolei Wang, Yuanhang Zhou, Chenzhan Shang, Yuan Cheng, Wayne Xin Zhao, Yaliang Li, Ji-Rong Wen. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations. 2021.

10.18653/v1/2021.acl-demo.22 article EN cc-by 2021-01-01

Magnetic tunneling junctions (MTJs) lie at the core of magnetic random access memory, holding promise for integrating memory and computing to reduce hardware complexity, transition latency, and power consumption. However, traditional MTJs are insensitive to light, limiting their functionality in in-memory sensing, a crucial component for machine vision systems in artificial intelligence applications. Herein, convergence of MTJs with optical sensing capabilities is achieved in an all-two-dimensional (2D) junction Fe

10.1021/acsnano.4c09735 article EN ACS Nano 2024-09-17

Positron Emission Tomography (PET) imaging plays a crucial role in modern medical diagnostics by revealing the metabolic processes within a patient's body, which is essential for the quantification of therapy response and the monitoring of treatment progress. However, the segmentation of PET images presents unique challenges due to their lower contrast and less distinct boundaries compared to other structural modalities. Recent developments in foundation models have shown superior versatility across diverse natural image...

10.48550/arxiv.2502.14351 preprint EN arXiv (Cornell University) 2025-02-20

With the explosive growth of web videos in recent years, large-scale Content-Based Video Retrieval (CBVR) has become increasingly essential in video filtering, recommendation, and copyright protection. Segment-level CBVR (S-CBVR) locates the start and end times of similar segments at a finer granularity, which benefits user browsing efficiency and infringement detection, especially in long-video scenarios. The challenge of the S-CBVR task is how to achieve high temporal alignment accuracy with efficient computation and low storage...

10.1145/3474085.3475301 article EN Proceedings of the 29th ACM International Conference on Multimedia 2021-10-17

In this paper, we introduce VCSL (Video Copy Segment Localization), a new comprehensive segment-level annotated video copy dataset. Compared with existing copy detection datasets, which are restricted by either video-level annotation or small scale, VCSL not only has two orders of magnitude more labelled data, with 160k realistic video copy pairs containing more than 280k localized copied segment pairs, but also covers a variety of video categories and a wide range of video durations. All the copied segments inside each collected video pair are manually extracted...

10.1109/cvpr52688.2022.02041 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

In recent years, the explosion of web videos has made text-video retrieval increasingly essential and popular for video filtering, recommendation, and search. Text-video retrieval aims to rank relevant text/video higher than irrelevant ones. The core of this task is to precisely measure the cross-modal similarity between texts and videos. Recently, contrastive learning methods have shown promising results for text-video retrieval, most of which focus on the construction of positive and negative pairs to learn text and video representations. Nevertheless, they do...

10.1145/3581783.3612006 preprint EN 2023-10-26

Recently, with the emergence of retrieval requirements for certain individuals in the same superclass, e.g., birds, persons, cars, the fine-grained recognition task has attracted a significant amount of attention from academia and industry. In the fine-grained recognition scenario, the inter-class differences are quite diverse and subtle, which makes it challenging to extract all the discriminative cues. The traditional training mechanism optimizes the overall discriminativeness of the whole feature. It may stop early when some feature elements have been trained...

10.1109/cvpr46437.2021.00087 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Influencer marketing is emerging as a new marketing method, profoundly changing the marketing strategies of brands. To help brands find suitable micro-influencers as marketing partners, micro-influencer recommendation is regarded as an indispensable part of influencer marketing. However, previous works only focus on modeling the individual image of brands/micro-influencers, which is insufficient to represent the characteristics...

10.1109/tmm.2022.3151029 article EN IEEE Transactions on Multimedia 2022-02-14

Product image search in E-commerce systems is a challenging task because of the huge number of product classes, low intra-class similarity, and high inter-class similarity. Deep metric learning, based on paired distances independent of the number of classes, aims to minimize variances in the feature embedding space. Most existing approaches strictly restrict the distance between samples with fixed values to distinguish different classes of samples. However, the distance has various magnitudes during training stages. Therefore, it is difficult to directly...

10.1145/3366423.3380094 article EN 2020-04-20

We illustrate how one can use basic combinatorial theory and computer programming techniques (Python) to analyze the game of Mahjong. The results confirm some folklore concerning the game and expose some unexpected results. Related results and possible future research in connection with artificial intelligence are mentioned. Readers interested in the subject may further develop the techniques to deepen the study of the game or of other games.

10.48550/arxiv.1707.07345 preprint EN other-oa arXiv (Cornell University) 2017-01-01

As a huge number of parameters is required when a model is exposed to high-dimensional inputs in video detection and classification, it is a grand challenge to develop compact yet accurate video comprehension on terminal devices. Current works focus on optimizing video detection and classification in a separated fashion. In this paper, we introduce a video comprehension (object detection and action recognition) system for terminal devices, namely DEEPEYE. Based on You Only Look Once (YOLO), we have developed an 8-bit quantization method for training YOLO; we have also developed a tensorized-compression...

10.48550/arxiv.1805.07935 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Uncertain data may exist in many application fields, due to inaccurate raw data, the use of coarse-grained data sets, privacy protection, data integration, etc. The original features of the data may be changed or ignored if the uncertainties are mishandled. Therefore, effective management and analysis of uncertain objects should rely on an appropriate model depicting the characteristics of the uncertainties. For uncertain values of attributes, this paper proposes a model construction method based on nonparametric estimation, which can represent...

10.1109/iceict.2016.7879722 article EN 2016-08-01

A novel method is proposed for filtering Gaussian noise effectively by using the variable step and time matrix of a simplified pulse coupled neural network (PCNN). First, the time matrix of the PCNN, which is related to the grayscale and spatial information of an image, is calculated to identify noise-polluted pixels. Subsequently, a variable step, long for strong noise and short for weak noise, based on the time matrix is applied to modify the noised pixels in a sliding window. Then a Wiener filter is used to filter the image further. Experiments show that the proposed method can remove noise better than other...

10.4028/www.scientific.net/amm.48-49.551 article EN Applied Mechanics and Materials 2011-02-01

As the aging population continues to grow, fall detection has become a key issue in public health and healthcare. To address the low accuracy and poor real-time performance of fall detection algorithms in real scenarios, an improved fall detection model, FALLNET, is proposed. First, the YOLOv7-X-pose algorithm is used to quickly extract multiple human body keypoints in the multi-person keypoint extraction module. Second, the classic CNN_Attention_LSTM model is improved in the dangerous action recognition module by adding an LSTM layer to better capture important...

10.1109/iccsi58851.2023.10303929 article EN 2023 International Conference on Cyber-Physical Social Intelligence (ICCSI) 2023-10-20

10.1145/3626772.3657920 article EN Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval 2024-07-10

Recently, significant advancements have been made in Large Language Models (LLMs) through the implementation of various alignment techniques. These techniques enable LLMs to generate highly tailored content in response to diverse user instructions. Consequently, LLMs have the potential to serve as robust, customizable recommendation systems in the field of news recommendation. However, using LLMs with individual user information and online exploration remains a challenge, and both are important perspectives in developing personalized news...

10.1145/3637528.3671638 article EN Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2024-08-24

Abstract. Stone cultural heritage, encompassing a broad spectrum of artifacts such as stone artworks, buildings, tools, and utensils, represents one of the most significant categories of cultural heritage. However, the conservation of this heritage faces challenges from the process of deterioration. This degradation not only compromises structural integrity but also results in the loss of invaluable historical information. Thus, there emerges a critical demand for effective methods to detect and assess the condition of stone heritage, enabling timely...

10.5194/isprs-archives-xlviii-4-2024-541-2024 article EN cc-by The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2024-10-21

Effective collaboration in multi-agent systems requires communicating goals and intentions between agents. Current agent frameworks often suffer from dependencies on single-agent execution and lack robust inter-module communication, frequently leading to suboptimal multi-agent reinforcement learning (MARL) policies and inadequate task coordination. To address these challenges, we present a framework for training large language models (LLMs) as collaborative agents to enable coordinated behaviors in cooperative MARL....

10.48550/arxiv.2407.12532 preprint EN arXiv (Cornell University) 2024-07-17

Stent migration is one of the common complications after tracheal stent implantation. The causes include size mismatch between the stent and the trachea, physiological movement of the trachea, and so on. To solve these problems, this study designed a tracheal stent with a non-uniform Poisson's ratio, based on the structure of the trachea, to improve the stent's resistance to migration while ensuring its support. In this study, the part corresponding to the cartilage was constructed with a negative Poisson's ratio, and the part corresponding to the circular connective tissue and muscular membrane with a positive Poisson's ratio. And four kinds of stents...

10.7507/1001-5515.202402014 article EN PubMed 2024-10-25

In recent years, conversational recommender systems (CRSs) have received much attention in the research community. However, existing studies on CRSs vary in scenarios, goals, and techniques, lacking unified, standardized implementation or comparison. To tackle this challenge, we propose an open-source CRS toolkit, CRSLab, which provides a unified and extensible framework with highly decoupled modules to develop CRSs. Based on this framework, we collect 6 commonly used human-annotated CRS datasets and implement 18 models that...

10.48550/arxiv.2101.00939 preprint EN other-oa arXiv (Cornell University) 2021-01-01