Pengyue Jia

ORCID: 0000-0003-4712-3676
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Recommender Systems and Techniques
  • Embedded Systems Design Techniques
  • Machine Learning and Algorithms
  • Parallel Computing and Optimization Techniques
  • Real-Time Systems Scheduling
  • Speech and dialogue systems
  • AI-based Problem Solving and Planning
  • Machine Learning and Data Classification
  • Machine Learning and ELM
  • Handwritten Text Recognition Techniques

City University of Hong Kong
2024-2025

10.1109/icassp49660.2025.10888677 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

To enhance the efficacy of multi-scenario services in industrial recommendation systems, emergence multi-domain has become prominent, which entails simultaneous modeling all domains through a unified model, effectively capturing commonalities and differences among them. However, current methods rely on manual domain partitioning, overlook intricate relationships heterogeneity different during joint optimization, hindering integration differences. address these challenges, this paper proposes...

10.1609/aaai.v38i8.28699 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Deep Recommender Systems (DRS) are increasingly dependent on a large number of feature fields for more precise recommendations. Effective selection methods consequently becoming critical further enhancing the accuracy and optimizing storage efficiencies to align with deployment demands. This research area, particularly in context DRS, is nascent faces three core challenges. Firstly, variant experimental setups across papers often yield unfair comparisons, obscuring practical insights....

10.1145/3637528.3671571 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2024-08-24

The reranker and generator are two critical components in the Retrieval-Augmented Generation (i.e., RAG) pipeline, responsible for ranking relevant documents generating responses. However, due to differences pre-training data objectives, there is an inevitable gap between ranked as by those required support answering query. To address this gap, we propose RADIO, a novel practical preference alignment framework with RAtionale DIstillatiOn. Specifically, We first rationale extraction method...

10.48550/arxiv.2412.08519 preprint EN arXiv (Cornell University) 2024-12-11

The performance of Dense retrieval (DR) is significantly influenced by the quality negative sampling. Traditional DR methods primarily depend on naive sampling techniques or mining hard negatives through external retriever and meticulously crafted strategies. However, often fails to adequately capture accurate boundaries between positive samples, whereas existing are prone false negatives, resulting in degradation training instability. Recent advancements large language models (LLMs) offer...

10.48550/arxiv.2412.17250 preprint EN arXiv (Cornell University) 2024-12-22
Coming Soon ...