NFDI4DS | UHH-SEMS - Publication Details

Hongyuan Zha

ORCID: 0000-0001-7493-0911

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5046703129

Research Areas

Complex Network Analysis Techniques
Point processes and geometric inequalities
Recommender Systems and Techniques
Advanced Graph Neural Networks
Topic Modeling
Text and Document Classification Technologies
Domain Adaptation and Few-Shot Learning
Advanced Image and Video Retrieval Techniques
Face and Expression Recognition
Matrix Theory and Algorithms
Reinforcement Learning in Robotics
Opinion Dynamics and Social Influence
Information Retrieval and Search Behavior
Diffusion and Search Dynamics
Sparse and Compressive Sensing Techniques
Model Reduction and Neural Networks
Web Data Mining and Analysis
Human Mobility and Location-Based Analysis
Bayesian Methods and Mixture Models
Generative Adversarial Networks and Image Synthesis
Image Retrieval and Classification Techniques
Advanced Bandit Algorithms Research
Advanced Clustering Algorithms Research
Neural Networks and Applications
Graph Theory and Algorithms

Chinese University of Hong Kong, Shenzhen
2019-2025

Georgia Institute of Technology
2013-2023

Southwest Jiaotong University
2022-2023

East China Normal University
2014-2022

Shenzhen Research Institute of Big Data
2020-2022

Chinese University of Hong Kong
2019-2021

Shanghai Jiao Tong University
2019-2020

Institute of Art
2020

University of California, Los Angeles
2019

Amazon (United States)
2018

Principal Manifolds and Nonlinear Dimensionality Reduction via Tangent Space Alignment

OPENALEX - Publications

Zhenyue Zhang Hongyuan Zha

We present a new algorithm for manifold learning and nonlinear dimensionality reduction. Based on set of unorganized data points sampled with noise from parameterized manifold, the local geometry is learned by constructing an approximation tangent space at each point, those spaces are then aligned to give global coordinates respect underlying manifold. also error analysis our showing that reconstruction errors can be quite small in some cases. illustrate using curves surfaces both...

10.1137/s1064827502419154 article EN SIAM Journal on Scientific Computing 2004-01-01

A min-max cut algorithm for graph partitioning and data clustering

OPENALEX - Publications

Chris Ding Xiaofeng He Hongyuan Zha Ming Gu Horst D. Simon

An important application of graph partitioning is data clustering using a model - the pairwise similarities between all objects form weighted adjacency matrix that contains necessary information for clustering. In this paper, we propose new algorithm with an objective function follows min-max principle. The relaxed version optimization cut leads to Fiedler vector in spectral partitioning. Theoretical analyses indicate it balanced partitions, and lower bounds are derived. tested on newsgroup...

10.1109/icdm.2001.989507 article EN 2002-11-14

R1-PCA

OPENALEX - Publications

Chris Ding Ding Zhou Xiaofeng He Hongyuan Zha

Principal component analysis (PCA) minimizes the sum of squared errors (L2-norm) and is sensitive to presence outliers. We propose a rotational invariant L1-norm PCA (R1-PCA). R1-PCA similar in that (1) it has unique global solution, (2) solution are principal eigenvectors robust covariance matrix (re-weighted soften effects outliers), (3) invariant. These properties not shared by PCA. A new subspace iteration algorithm given compute efficiently. Experiments on several real-life datasets...

10.1145/1143844.1143880 article EN 2006-01-01

Sensor positioning in wireless ad-hoc sensor networks using multidimensional scaling

OPENALEX - Publications

Xiang Ji Hongyuan Zha

Sensor Positioning is a fundamental and crucial issue for sensor network operation management. In the paper, we first study some situations where most existing positioning methods tend to fail perform well, an example being when topology of anisotropic. Then, explore idea using dimensionality reduction techniques estimate sensors coordinates in two (or three) dimensional space, propose distributed method based on multidimensional scaling technique deal with these challenging conditions....

10.1109/infcom.2004.1354684 article EN 2005-02-22

Principal Manifolds and Nonlinear Dimension Reduction via Local Tangent Space Alignment

OPENALEX - Publications

Zhenyue Zhang Hongyuan Zha

Nonlinear manifold learning from unorganized data points is a very challenging unsupervised and visualization problem with great variety of applications. In this paper we present new algorithm for nonlinear dimension reduction. Based on set sampled noise the manifold, represent local geometry using tangent spaces learned by fitting an affine subspace in neighborhood each point. Those are aligned to give internal global coordinates respect underlying way partial eigendecomposition connection...

10.48550/arxiv.cs/0212008 preprint EN other-oa arXiv (Cornell University) 2002-01-01

Sequential Recommendation with User Memory Networks

OPENALEX - Publications

Xu Chen Hongteng Xu Yongfeng Zhang Jiaxi Tang Yixin Cao and 2 more

User preferences are usually dynamic in real-world recommender systems, and a user»s historical behavior records may not be equally important when predicting his/her future interests. Existing recommendation algorithms -- including both shallow deep approaches embed into single latent vector/representation, which have lost the per item- or feature-level correlations between In this paper, we aim to express, store, manipulate users» more explicit, dynamic, effective manner. To do so,...

10.1145/3159652.3159668 article EN 2018-02-02

Like like alike

OPENALEX - Publications

Shuang-Hong Yang Bo Long Alex Smola Narayanan Sadagopan Zhaohui Zheng and 1 more

Targeting interest to match a user with services (e.g. news, products, games, advertisements) and predicting friendship build connections among users are two fundamental tasks for social network systems. In this paper, we show that the information contained in networks (i.e. user-service interactions) user-user connections) is highly correlated mutually helpful. We propose framework exploits homophily establish an integrated linking interested connecting different common interests, upon...

10.1145/1963405.1963481 article EN 2011-03-28

Two supervised learning approaches for name disambiguation in author citations

OPENALEX - Publications

Hui Han C. Lee Giles Hongyuan Zha Cheng Li Kostas Tsioutsiouliklis

Due to name abbreviations, identical names, misspellings, and pseudonyms inpublications or bibliographies (citations), an author may have multiple names authors share the same name. Such ambiguity affects performance of document retrieval, web search, database integration, cause improper attribution authors. This paper investigates two supervised learning approaches disambiguate in citations. One approach uses naive Bayes probability model, a generative model; other Support Vector...

10.1145/996350.996419 article EN 2004-06-07

Iterative Learning with Open-set Noisy Labels

OPENALEX - Publications

Yisen Wang Weiyang Liu Xingjun Ma James Bailey Hongyuan Zha and 2 more

Large-scale datasets possessing clean label annotations are crucial for training Convolutional Neural Networks (CNNs). However, labeling large-scale data can be very costly and error-prone, even high-quality likely to contain noisy (incorrect) labels. Existing works usually employ a closed-set assumption, whereby the samples associated with labels possess true class contained within set of known classes in data. such an assumption is too restrictive many applications, since might fact that...

10.1109/cvpr.2018.00906 preprint EN 2018-06-01

Unsupervised Deep Learning for Optical Flow Estimation

OPENALEX - Publications

Zhe Ren Junchi Yan Bingbing Ni Bin Liu Xiaokang Yang and 1 more

Recent work has shown that optical flow estimation can be formulated as a supervised learning problem. Moreover, convolutional networks have been successfully applied to this task. However, is obfuscated by the shortage of labeled training data. As consequence, existing methods turn large synthetic datasets for easily computer generated ground truth. In work, we explore if deep network trained without supervision. Using image warping estimated flow, devise simple yet effective unsupervised...

10.1609/aaai.v31i1.10723 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2017-02-12

Finding the right facts in the crowd

OPENALEX - Publications

Jiang Bian Yandong Liu Eugene Agichtein Hongyuan Zha

Community Question Answering has emerged as a popular and effective paradigm for wide range of information needs. For example, to find out an obscure piece trivia, it is now possible even very post question on community QA site such Yahoo! Answers, rely other users provide answers, often within minutes. The importance sites magnified they create archives millions questions hundreds many which are invaluable the needs searchers. However, make this immense body knowledge accessible, answer...

10.1145/1367497.1367561 article EN 2008-04-21

Functional matrix factorizations for cold-start recommendation

OPENALEX - Publications

Ke Zhou Shuang-Hong Yang Hongyuan Zha

A key challenge in recommender system research is how to effectively profile new users, a problem generally known as cold-start recommendation. Recently the idea of progressively querying user responses through an initial interview process has been proposed useful preference elicitation strategy. In this paper, we present functional matrix factorization (fMF), novel recommendation method that solves construction within context learning and item profiles. Specifically, fMF constructs decision...

10.1145/2009916.2009961 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2011-07-24

Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation

OPENALEX - Publications

Lu Wang Wei Zhang Xiaofeng He Hongyuan Zha

Dynamic treatment recommendation systems based on large-scale electronic health records (EHRs) become a key to successfully improve practical clinical outcomes. Prior relevant studies recommend treatments either use supervised learning (e.g. matching the indicator signal which denotes doctor prescriptions), or reinforcement maximizing evaluation indicates cumulative reward from survival rates). However, none of these have considered combine benefits and learning. In this paper, we propose...

10.1145/3219819.3219961 article EN 2018-07-19

Personalized Fashion Recommendation with Visual Explanations based on Multimodal Attention Network

OPENALEX - Publications

Xu Chen Hanxiong Chen Hongteng Xu Yongfeng Zhang Yixin Cao and 2 more

Fashion recommendation has attracted increasing attention from both industry and academic communities. This paper proposes a novel neural architecture for fashion based on image region-level features user review information. Our basic intuition is that: image, not all the regions are equally important users, i.e., people usually care about few parts of image. To model such human sense, we learn an over many pre-segmented regions, which can understand where really interested in...

10.1145/3331184.3331254 article EN 2019-07-18

Modeling the Intensity Function of Point Process Via Recurrent Neural Networks

OPENALEX - Publications

Shuai Xiao Junchi Yan Xiaokang Yang Hongyuan Zha Stephen M. Chu

Event sequence, asynchronously generated with random timestamp, is ubiquitous among applications. The precise and arbitrary timestamp can carry important clues about the underlying dynamics, has lent event data fundamentally different from time-series whereby series indexed fixed equal time interval. One expressive mathematical tool for modeling point process. intensity functions of many processes involve two components: background effect by history. Due to its inherent spontaneousness, be...

10.1609/aaai.v31i1.10724 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2017-02-12

Bipartite graph partitioning and data clustering

OPENALEX - Publications

Hongyuan Zha Xiaofeng He Chris Ding Horst D. Simon Ming Gu

Many data types arising from mining applications can be modeled as bipartite graphs, examples include terms and documents in a text corpus, customers purchasing items market basket analysis reviewers movies movie recommender system. In this paper, we propose new clustering method based on partitioning the underlying graph. The partition is constructed by minimizing normalized sum of edge weights between unmatched pairs vertices We show that an approximate solution to minimization problem...

10.1145/502585.502591 article EN 2001-10-05

Name disambiguation in author citations using a K-way spectral clustering method

OPENALEX - Publications

Hui Han Hongyuan Zha C. Lee Giles

An author may have multiple names and authors share the same name simply due to abbreviations, identical names, or misspellings in publications bibliographies 1. This can produce ambiguity which affect performance of document retrieval, web search, database integration, cause improper attribution credit. Proposed here is an unsupervised learning approach using K-way spectral clustering that disambiguates citations. The utilizes three types citation attributes: co-author paper titles,...

10.1145/1065385.1065462 article EN 2005-06-07

Automatic document metadata extraction using support vector machines

OPENALEX - Publications

Hui Han C. Lee Giles Eren Manavoglu Hongyuan Zha Zhenyue Zhang and 1 more

Automatic metadata generation provides scalability and usability for digital libraries their collections. Machine learning methods offer robust adaptable automatic extraction. We describe a support vector machine classification-based method extraction from header part of research papers show that it outperforms other on the same task. The first classifies each line into one or more 15 classes. An iterative convergence procedure is then used to improve classification by using predicted class...

10.5555/827140.827146 article EN ACM/IEEE Joint Conference on Digital Libraries 2003-05-27

Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering

OPENALEX - Publications

Hongyuan Zha

A novel method for simultaneous keyphrase extraction and generic text summarization is proposed by modeling documents as weighted undirected bipartite graphs. Spectral graph clustering algorithms are useed partitioning sentences of the into topical groups with sentence link priors being exploited to enhance quality. Within each group, saliency scores keyphrases generated based on a mutual reinforcement principle. The then ranked according their selected inclusion in top list summaries...

10.1145/564376.564398 article EN 2002-08-11

Co-ranking Authors and Documents in a Heterogeneous Network

OPENALEX - Publications

Ding Zhou Sergey A. Orshanskiy Hongyuan Zha C. Lee Giles

Recent graph-theoretic approaches have demonstrated remarkable successes for ranking networked entities, but most of their applications are limited to homogeneous networks such as the network citations between publications. This paper proposes a novel method co-ranking authors and publications using several networks: social connecting authors, citation publications, well authorship that ties previous two together. The new framework is based on coupling random walks, separately rank documents...

10.1109/icdm.2007.57 article EN 2007-10-01

Contour regression: A general approach to dimension reduction

OPENALEX - Publications

Bing Li Hongyuan Zha Francesca Chiaromonte

We propose a novel approach to sufficient dimension reduction in regression, based on estimating contour directions of small variation the response. These span orthogonal complement minimal space relevant for regression and can be extracted according two measures response, leading simple general (SCR GCR) methodology. In comparison with existing techniques, this contour-based methodology guarantees exhaustive estimation central subspace under ellipticity predictor distribution mild...

10.1214/009053605000000192 article EN The Annals of Statistics 2005-08-01

Coming Soon ...