NFDI4DS | UHH-SEMS - Publication Details

Ziniu Hu

ORCID: 0009-0007-8818-739X

About

Contact & Profiles

OpenAlex ID: A5060639952

Research Areas

Topic Modeling
Natural Language Processing Techniques
Software Engineering Research
Advanced Malware Detection Techniques
Machine Learning in Materials Science
Multimodal Machine Learning Applications
Green IT and Sustainability
Software Testing and Debugging Techniques
Web Data Mining and Analysis
Manufacturing Process and Optimization
Caching and Content Delivery
Information Retrieval and Search Behavior
Autonomous Vehicle Technology and Safety
Traffic control and management
Sentiment Analysis and Opinion Mining
Advanced Image and Video Retrieval Techniques
Reinforcement Learning in Robotics
Model-Driven Software Engineering Techniques
Sports Analytics and Performance
Optimization and Search Problems
Vehicle Dynamics and Control Systems
Personal Information Management and User Behavior
Open Source Software Innovations
Speech Recognition and Synthesis
Domain Adaptation and Few-Shot Learning

University of California, Los Angeles
2018-2024

California Institute of Technology
2024

Hunan University
2022-2024

Peking University
2017-2020

Listening to Chaotic Whispers

OPENALEX - Publications

Ziniu Hu Weiqing Liu Jiang Bian Xuanzhe Liu Tie‐Yan Liu

Stock trend prediction plays a critical role in seeking maximized profit from stock investment. However, precise trend prediction is very difficult because of the highly volatile and non-stationary nature of the stock market. Exploding information on the Internet, together with the advancing development of natural language processing and text mining techniques, has enabled investors to unveil market trends and volatility from online content. Unfortunately, the quality, trustworthiness, and comprehensiveness of online content related to the stock market vary drastically, and a large portion...

10.1145/3159652.3159690 article EN 2018-02-02

Unbiased LambdaMART: An Unbiased Pairwise Learning-to-Rank Algorithm

Ziniu Hu Yang Wang Peng Qu Hang Li

Recently a number of algorithms under the theme of 'unbiased learning-to-rank' have been proposed, which can reduce position bias, the major type of bias in click data, and train a high-performance ranker with click data. Most of the existing algorithms, based on the inverse propensity weighting (IPW) principle, first estimate the click bias at each position, and then train an unbiased ranker with the estimated biases using a learning-to-rank algorithm. However, there has not been a method for pairwise learning-to-rank that can simultaneously conduct debiasing of click data and training of a ranker using a pairwise loss function...

10.1145/3308558.3313447 article EN 2019-05-13

Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification

Zhenpeng Chen Sheng Shen Ziniu Hu Xuan Lü Qiaozhu Mei and 1 more

Sentiment classification typically relies on a large amount of labeled data. In practice, the availability of labels is highly imbalanced among different languages, e.g., more English texts are labeled than texts in any other languages, which creates a considerable inequality in the quality of related information services received by users speaking different languages. To tackle this problem, cross-lingual sentiment classification approaches aim to transfer knowledge learned from one language that has abundant labeled examples (i.e., the source language, usually...

10.1145/3308558.3313600 article EN 2019-05-13

Few-Shot Representation Learning for Out-Of-Vocabulary Words

Ziniu Hu Ting Chen Kai-Wei Chang Yizhou Sun

Existing approaches for learning word embeddings often assume there are sufficient occurrences for each word in the corpus, such that the representation of words can be accurately estimated from their contexts. However, in real-world scenarios, out-of-vocabulary (a.k.a. OOV) words that do not appear in the training corpus emerge frequently. How to learn accurate representations of these words to augment a pre-trained embedding by only a few observations is a challenging research problem. In this paper, we formulate the learning of OOV embeddings as a few-shot regression problem...

10.18653/v1/p19-1402 preprint EN cc-by 2019-01-01

Fast Adaptation for Cold-Start Collaborative Filtering with Meta-Learning

Tianxin Wei Ziwei Wu Ruirui Li Ziniu Hu Fuli Feng and 3 more

Collaborative Filtering (CF), as one of the most popular approaches, is widely employed in recommender systems but suffers from the cold-start problem, where interactions are very limited for new users in the system. To deal with this issue, previous work has largely focused on utilizing various auxiliary information, such as user profiles and social relationships, to infer user preferences. However, such auxiliary information is not always available due to reasons such as privacy concerns, making CF approaches have to count on interactions. Moreover,...

10.1109/icdm50108.2020.00075 article EN 2020 IEEE International Conference on Data Mining (ICDM) 2020-11-01

Empowering Language Models with Knowledge Graph Reasoning for Open-Domain Question Answering

Ziniu Hu Xu Yi‐chong Wenhao Yu Shuohang Wang Ziyi Yang and 3 more

Answering open-domain questions requires world knowledge about in-context entities. As pre-trained Language Models (LMs) lack the power to store all required knowledge, external knowledge sources, such as knowledge graphs, are often used to augment LMs. In this work, we propose knOwledge REasOning empowered Language Model (OREO-LM), which consists of a novel Knowledge Interaction Layer that can be flexibly plugged into existing Transformer-based LMs to interact with a differentiable Knowledge Graph Reasoning module collaboratively. In this way,...

10.18653/v1/2022.emnlp-main.650 article EN cc-by Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing 2022-01-01

AVIS: Autonomous Visual Information Seeking with Large Language Model Agent

Ziniu Hu Ahmet İşcen Chen Sun Kai-Wei Chang Yizhou Sun and 3 more

In this paper, we propose an autonomous information seeking visual question answering framework, AVIS. Our method leverages a Large Language Model (LLM) to dynamically strategize the utilization of external tools and to investigate their outputs, thereby acquiring the indispensable knowledge needed to provide answers to the posed questions. Responding to visual questions that necessitate external knowledge, such as "What event is commemorated by the building depicted in this image?", is a complex task. This task presents a combinatorial...

10.48550/arxiv.2306.08129 preprint EN cc-by arXiv (Cornell University) 2023-01-01

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Zongyu Lin Yao Tang Xingcheng Yao Da Yin Ziniu Hu and 2 more

Language agents have become a promising solution to complex interactive tasks. One of the key ingredients to the success of language agents is the reward model on the trajectory of the agentic workflow, which provides valuable guidance during training or inference. However, due to the lack of annotations of intermediate interactions, most existing works use an outcome reward model to optimize policies across entire trajectories. This may lead to sub-optimal policies and hinder the overall performance. To address this, we propose QLASS (Q-guided Language Agent Stepwise...

10.48550/arxiv.2502.02584 preprint EN arXiv (Cornell University) 2025-02-04

DataSciBench: An LLM Agent Benchmark for Data Science

Dan Zhang Sining Zhoubian Cai Min Fanghu Li Likun Yang and 5 more

This paper presents DataSciBench, a comprehensive benchmark for evaluating Large Language Model (LLM) capabilities in data science. Recent related benchmarks have primarily focused on single tasks, easily obtainable ground truth, and straightforward evaluation metrics, which limits the scope of tasks that can be evaluated. In contrast, DataSciBench is constructed based on a more comprehensive and curated collection of natural and challenging prompts for uncertain ground truth and evaluation metrics. We develop a semi-automated pipeline for generating...

10.48550/arxiv.2502.13897 preprint EN arXiv (Cornell University) 2025-02-19

Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion

Yujia Huang Adishree Ghatare Y Liu Ziniu Hu Qinsheng Zhang and 4 more

We study the problem of symbolic music generation (e.g., generating piano rolls), with a technical focus on non-differentiable rule guidance. Musical rules are often expressed in symbolic form on note characteristics, such as note density or chord progression, many of which are non-differentiable, which poses a challenge when using them for guided diffusion. We propose Stochastic Control Guidance (SCG), a novel guidance method that only requires forward evaluation of rule functions and that can work with pre-trained diffusion models in a plug-and-play way, thus achieving...

10.48550/arxiv.2402.14285 preprint EN arXiv (Cornell University) 2024-02-21

Paladin

Yun Ma Yangyang Huang Ziniu Hu Xusheng Xiao Xuanzhe Liu

Automated-test-generation tools generate test cases to enable dynamic analysis of Android apps, such as functional testing. These tools build a GUI model to describe the app states during the app execution, and generate a script that performs actions on UI widgets to form a test case. However, when the test cases are re-executed, the apps under analysis often do not behave consistently. The major reasons for such limited reproducibility are (1) backend-service dependencies that cause non-determinism in app behaviors and (2) the severe fragmentation of the Android platform (i.e., the alarming...

10.1145/3301293.3302363 article EN 2019-02-22

Aladdin

Yun Ma Ziniu Hu Yunxin Liu Tao Xie Xuanzhe Liu

Compared to the Web, where each web page has a global URL for external access, a specific 'page' inside a mobile app cannot be easily accessed unless the user performs several steps from the landing page of this app. Recently, the concept of 'deep link' is expected to be a promising solution and has been advocated by major service providers to enable targeting and opening a specific page of an app externally with an accessible uniform resource identifier. In this paper, we present a large-scale empirical study to investigate how deep links are really adopted, over...

10.1145/3178876.3186059 article EN 2018-01-01

Towards Release Strategy Optimization for Apps in Google Play

Sheng Shen Xuan Lü Ziniu Hu Xuanzhe Liu

In the appstore-centric ecosystem, app developers have an urgent requirement to optimize their release strategy to maximize user adoption of their apps. To address this problem, we introduce an approach to assisting developers to select the proper release opportunity based on the purpose of the update and current condition of the app. Before that, we propose the update interval to characterize release patterns of apps, and find the significance of updates through empirical analysis. We mined the release-history data of 17,820 apps from 33 categories in Google Play, over a period of 105 days. With...

10.1145/3131704.3131710 article EN 2017-09-23

FaceOff

Shuyu Zheng Ziniu Hu Yun Ma

Designing a desirable and aesthetical manifestation of web graphic user interfaces (GUI) is a challenging task for web developers. After determining a web page's content, developers usually refer to existing pages, and adapt the styles from desired pages into the target one. However, it is not only difficult to find appropriate pages to exhibit the target page's content, but also tedious to incorporate styles from different pages harmoniously in the target page. To tackle these two issues, we propose FaceOff, a data-driven automation system that assists the design of web GUI. FaceOff constructs...

10.1145/3289600.3290610 article EN 2019-01-30

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

Dan Zhang Ziniu Hu Sining Zhoubian Zhengxiao Du Kaiyu Yang and 4 more

Large Language Models (LLMs) have shown promise in assisting scientific discovery. However, such applications are currently limited by LLMs' deficiencies in understanding intricate scientific concepts, deriving symbolic equations, and solving advanced numerical calculations. To bridge these gaps, we introduce SciGLM, a suite of scientific language models able to conduct college-level scientific reasoning. Central to our approach is a novel self-reflective instruction annotation framework to address the data scarcity challenge in the science...

10.48550/arxiv.2401.07950 preprint EN cc-by arXiv (Cornell University) 2024-01-01

Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis

Zongyue Qin Yunsheng Bai Atefeh Sohrabizadeh Zijian Ding Ziniu Hu and 2 more

10.1145/3670474.3685952 article EN 2024-09-03

PlayBest: Professional Basketball Player Behavior Synthesis via Planning with Diffusion

Xiusi Chen Wei‐Yao Wang Ziniu Hu David Reynoso Kun Jin and 3 more

10.1145/3627673.3680092 article EN 2024-10-20

Spatial-Dependent Robust Control Strategy for On-Ramp Merging

Tianchuang Meng Jin Huang Ziniu Hu Zeyu Yang Ye‐Hwa Chen and 2 more

A spatial-dependent robust control strategy is proposed for the on-ramp merging problem based on the coordination of connected and automated vehicles. In the proposed strategy, the planning stage is weakened while the control stage is strengthened. More specifically, the planning stage mainly forms a virtual platoon containing all vehicles inside the communication zone. In the control stage, the time-varying parameter uncertainties in the model are considered. A robust controller with uniform boundedness, ultimate boundedness, and robustness is delicately designed for each vehicle to analytically...

10.1109/tvt.2023.3326821 article EN IEEE Transactions on Vehicular Technology 2023-10-26

Professional Basketball Player Behavior Synthesis via Planning with Diffusion

Xiusi Chen Wei‐Yao Wang Ziniu Hu Curtis Chou Lam Hoang and 4 more

Dynamically planning in multi-agent systems has been explored to improve decision-making in various domains. Professional basketball serves as a compelling example of a dynamic spatio-temporal game, encompassing both concealed strategic policies and decision-making. However, processing the diverse on-court signals and navigating the vast space of potential actions and outcomes makes it difficult for existing approaches to swiftly identify optimal strategies in response to evolving circumstances. In this study, we first...

10.48550/arxiv.2306.04090 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Towards Release Strategy Optimization for Apps in Google Play

Sheng Shen Xuan Lü Ziniu Hu

In the appstore-centric ecosystem, app developers have an urgent requirement to optimize their release strategy to maximize the success opportunity of their apps. To address this problem, we introduce an approach to assisting developers to select the proper release opportunity based on the purpose of the update and current condition of the app. Before that, we propose the interval of an update to its previous update to characterize release patterns, and find the significance of the updates through empirical analysis. We mined the update-history data of 17,820 apps from 33 categories in Google Play, over a period of 105 days. With 41,028...

10.48550/arxiv.1707.06022 preprint EN other-oa arXiv (Cornell University) 2017-01-01

DroidWalker: Generating Reproducible Test Cases via Automatic Exploration of Android Apps

Ziniu Hu Yun Ma Yangyang Huang

Generating test cases through automatic app exploration is very useful for analyzing and testing Android apps. However, test cases generated by current app-exploration tools are not reproducible, i.e., when the generated test case is re-executed, the app cannot reach the same state as the explored one. As a result, app developers are not able to reproduce the failure or crash reported during the exploration, to conduct regression testing after fixing the bug, or to execute the same test in different environments. In this paper, we present DroidWalker, a dynamic-analysis tool to generate...

10.48550/arxiv.1710.08562 preprint EN other-oa arXiv (Cornell University) 2017-01-01

DroidLink: Automated Generation of Deep Links for Android Apps

Yun Ma Xuanzhe Liu Ruogu Du Ziniu Hu Yi Liu and 2 more

The mobile application (app) has become the main entrance to access the Internet on handheld devices. Unlike the Web, where each webpage has a global URL to reach directly, a specific "content page" of an app can be opened only by exploring the app with several operations from the landing page. The interoperability between apps is quite fixed and thus limits the value-added "linked data" between apps. Recently, deep link has been proposed to enable targeting and opening a specific page of an app externally with an accessible uniform resource identifier (URI). However,...

10.48550/arxiv.1605.06928 preprint EN cc-by arXiv (Cornell University) 2016-01-01

Rendering bounded error in adaptive robust path tracking control for autonomous vehicles

Ziniu Hu Ziyun Yu Zeyu Yang Zhanyi Hu Yougang Bian

For the sake of safety, vehicle path tracking control should not only ensure the stability of the tracking error containing the lateral offset and the orientation error but also guarantee that both the transient and steady states of the tracking error are within a specified safe boundary. However, the time-varying uncertainties of the system make the control design a tough task. This paper develops an adaptive robust control (ARC) which guarantees the bounded error property for autonomous vehicles. First, to handle the bounded error requirement, a barrier function based state transformation converts the constrained error into...

10.1049/cth2.12303 article EN cc-by-nc-nd IET Control Theory and Applications 2022-05-19
