Chengru Song

ORCID: 0009-0009-3826-8436
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Recommender Systems and Techniques
  • Advanced Data Compression Techniques
  • Image Retrieval and Classification Techniques
  • Image and Video Quality Assessment
  • AI in cancer detection
  • Image and Object Detection Techniques
  • Network Traffic and Congestion Control
  • Machine Learning and Data Classification
  • Ferroelectric and Negative Capacitance Devices
  • Wireless Communication Networks Research
  • Caching and Content Delivery
  • Parallel Computing and Optimization Techniques
  • Mathematics, Computing, and Information Processing
  • Neural dynamics and brain function
  • Distributed and Parallel Computing Systems
  • Advanced Computational Techniques and Applications
  • Technology and Data Analysis
  • IoT and Edge/Fog Computing
  • Natural Language Processing Techniques
  • Library Science and Information Systems
  • Text and Document Classification Technologies
  • IPv6, Mobility, Handover, Networks, Security
  • Medical Image Segmentation Techniques
  • Advanced Memory and Neural Computing

Korea Advanced Institute of Science and Technology
2025

Kuaishou (China)
2023

Beijing University of Posts and Telecommunications
2019

Click-through rate prediction is an essential task in industrial applications, such as online advertising. Recently deep learning based models have been proposed, which follow a similar Embedding\&MLP paradigm. In these methods large scale sparse input features are first mapped into low dimensional embedding vectors, and then transformed fixed-length vectors group-wise manner, finally concatenated together to fed multilayer perceptron (MLP) learn the nonlinear relations among features....

10.48550/arxiv.1706.06978 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Increasing the size of embedding layers has shown to be effective in improving performance recommendation models, yet gradually causing their sizes exceed terabytes industrial recommender systems, and hence increase computing storage costs. To save resources while maintaining model performances, we propose SHARK, compression practice have summarized system scenarios. SHARK consists two main components. First, use novel first-order component Taylor expansion as importance scores prune number...

10.1145/3583780.3615499 article EN 2023-10-21

Abstract The cloud network has the advantages in efficiently offloading large‐scale Internet traffic, which is considered as a promising architecture to provide satisfactory multimedia services for mobile users. However, most current studies lack joint consideration of economic and security hybrid networks. In this paper, novel service optimization mechanism proposed hereby meet user's requirements mentioned above while guaranteeing reliability service. Firstly, credible scheme designed help...

10.1002/ett.3779 article EN Transactions on Emerging Telecommunications Technologies 2019-11-20

Brain-inspired Spiking Neural Networks (SNNs) have the characteristics of event-driven and high energy-efficient, which are different from traditional Artificial (ANNs) when deployed on edge devices such as neuromorphic chips. Most previous work focuses SNNs training strategies to improve model performance brings larger deeper network architectures. It's difficult deploy these complex networks resource-limited directly. To meet demand, people compress very cautiously balance computation...

10.1145/3581783.3611838 article EN 2023-10-26

Large language models have demonstrated exceptional capability in natural understanding and generation. However, their generation speed is limited by the inherently sequential nature of decoding process, posing challenges for real-time applications. This paper introduces Lexical Unit Decoding (LUD), a novel methodology implemented data-driven manner, accelerating process without sacrificing output quality. The core our approach observation that pre-trained model can confidently predict...

10.48550/arxiv.2405.15208 preprint EN arXiv (Cornell University) 2024-05-24

Increasing the size of embedding layers has shown to be effective in improving performance recommendation models, yet gradually causing their sizes exceed terabytes industrial recommender systems, and hence increase computing storage costs. To save resources while maintaining model performances, we propose SHARK, compression practice have summarized system scenarios. SHARK consists two main components. First, use novel first-order component Taylor expansion as importance scores prune number...

10.48550/arxiv.2308.09395 preprint EN other-oa arXiv (Cornell University) 2023-01-01

The vBNS (very-high-performance Backbone Network Service) is a high-speed Internet Protocol (IP) over an ATM national backbone network. A "reserved bandwidth" service designed based on the network traffic characteristics as initial quality-of-service (QoS) offering for user community. This paper outlines improved configuration in layer and required new router mechanisms to support this service.

10.1109/iwqos.1998.675219 article EN 2002-11-27
Coming Soon ...