NFDI4DS | UHH-SEMS - Publication Details

Haoyue Bai

ORCID: 0009-0009-1328-9230

About

Contact & Profiles

A5064223000

Research Areas

Recommender Systems and Techniques
Advanced Bandit Algorithms Research
Text and Document Classification Technologies
Data Stream Mining Techniques
Music and Audio Processing
Advanced Graph Neural Networks
Ovarian cancer diagnosis and treatment
Machine Learning in Healthcare
Topic Modeling
Multimodal Machine Learning Applications
Endometrial and Cervical Cancer Treatments
Caching and Content Delivery
Image Retrieval and Classification Techniques
Cervical Cancer and HPV Research

Hefei University of Technology
2023-2025

Institute of Art
2024

China Academy of Information and Communications Technology
2024

A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective

OPENALEX - Publications

Wangyang Ying, Cong Wei, Nanxu Gong, Xinyuan Wang, Haoyue Bai, and 5 more

Tabular data is one of the most widely used data formats across various domains such as bioinformatics, healthcare, and marketing. As artificial intelligence moves towards a data-centric perspective, improving data quality is essential for enhancing model performance in tabular data-driven applications. This survey focuses on tabular data optimization, specifically exploring reinforcement learning (RL) and generative approaches for feature selection and feature generation as fundamental techniques for refining data spaces. Feature selection aims to identify...

10.48550/arxiv.2502.08828 preprint EN arXiv (Cornell University) 2025-02-12

Fair Personalized Learner Modeling Without Sensitive Attributes

Hefei Xu, Min Hou, Le Wu, Fei Liu, Yonghui Yang, and 3 more

10.1145/3696410.3714787 article EN 2025-04-22

Popularity-Aware Alignment and Contrast for Mitigating Popularity Bias

Miaomiao Cai, Lei Chen, Yifan Wang, Haoyue Bai, Peijie Sun, and 3 more

Collaborative Filtering (CF) typically suffers from the significant challenge of popularity bias due to the uneven distribution of items in real-world datasets. This bias leads to a significant accuracy gap between popular and unpopular items. It not only hinders accurate user preference understanding but also exacerbates the Matthew effect in recommendation systems. To alleviate popularity bias, existing efforts focus on emphasizing unpopular items or separating the correlation between item representations and their popularity. Despite the effectiveness, existing works still...

10.1145/3637528.3671824 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2024-08-24

Double Correction Framework for Denoising Recommendation

Zhuangzhuang He, Yifan Wang, Yonghui Yang, Peijie Sun, Le Wu, and 4 more

Owing to its availability and generality in online services, implicit feedback is more commonly used in recommender systems. However, implicit feedback usually presents noisy samples in real-world recommendation scenarios (such as misclicks or non-preferential behaviors), which will affect precise user preference learning. To overcome the noisy samples problem, a popular solution is based on dropping noisy samples in the model training phase, which follows the observation that noisy samples have higher training losses than clean samples. Despite the effectiveness, we argue that this solution still has...

10.1145/3637528.3671692 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2024-08-24

Multimodality Invariant Learning for Multimedia-Based New Item Recommendation

Haoyue Bai, Le Wu, Min Hou, Miaomiao Cai, Zhuangzhuang He, and 3 more

Multimedia-based recommendation provides personalized item suggestions by learning the content preferences of users. With the proliferation of digital devices and apps, a huge number of new items are created rapidly over time. How to quickly provide recommendations for new items at inference time is challenging. What's worse, real-world items exhibit varying degrees of modality missing (e.g., many short videos are uploaded without text descriptions). Though many efforts have been devoted to multimedia-based recommendations, they either...

10.1145/3626772.3658596 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2024-07-10

Unified Representation Learning for Discrete Attribute Enhanced Completely Cold-Start Recommendation

Haoyue Bai, Min Hou, Le Wu, Yonghui Yang, Kun Zhang, and 2 more

Recommender systems face a daunting challenge when entities (users or items) have no historical interactions, known as the "Completely Cold-Start Problem". Due to the absence of collaborative signals, the Collaborative Filtering (CF) schema fails to deduce user preferences or item characteristics for such cold entities. A common solution is incorporating auxiliary discrete attributes as a bridge to spread...

10.1109/tbdata.2024.3387276 article EN IEEE Transactions on Big Data 2024-01-01

GoRec: A Generative Cold-start Recommendation Framework

Haoyue Bai, Min Hou, Le Wu, Yonghui Yang, Kun Zhang, and 2 more

Multimedia-based recommendation models learn user and item preference representation by fusing both the user-item collaborative signals and the multimedia content signals. In real scenarios, cold items appear in the test stage without any user interaction record. How to perform cold item recommendation is challenging, as the training items and test cold items have different data distributions. These hybrid preference representations contained auxiliary collaborative signals, so current solutions designed alignment functions to transfer the learned hybrid representations to cold items. Despite the effectiveness, we argue that...

10.1145/3581783.3612238 article EN 2023-10-26

Mitigating Recommendation Biases via Group-Alignment and Global-Uniformity in Representation Learning

Miaomiao Cai, Min Hou, Lei Chen, Le Wu, Haoyue Bai, and 2 more

Collaborative Filtering (CF) plays a crucial role in modern recommender systems, leveraging historical user-item interactions to provide personalized suggestions. However, CF-based methods often encounter biases due to imbalances in training data. This phenomenon makes CF-based methods tend to prioritize recommending popular items while performing unsatisfactorily on inactive users. Existing works address this issue by rebalancing training samples, reranking recommendation results, or making the modeling process robust to the bias....

10.1145/3664931 article EN ACM Transactions on Intelligent Systems and Technology 2024-05-14

Boosting Multimedia Recommendation via Separate Generic and Unique Awareness

Zhuangzhuang He, Zihan Wang, Yonghui Yang, Haoyue Bai, Le Wu

Multimedia recommendation, which incorporates various modalities (e.g., images, texts, etc.) into user or item representation to improve recommendation quality, has received widespread attention. Recent methods mainly focus on cross-modal alignment with self-supervised learning to obtain higher quality representation. Despite remarkable performance, we argue that there is still a limitation: completely aligning representation undermines modality-unique information. We consider that cross-modal alignment is right, but it should not be the...

10.48550/arxiv.2406.08270 preprint EN arXiv (Cornell University) 2024-06-12