Jian Yin

ORCID: 0000-0002-1214-5384
Research Areas
  • Topic Modeling
  • Advanced Image and Video Retrieval Techniques
  • Natural Language Processing Techniques
  • Image Retrieval and Classification Techniques
  • Multimodal Machine Learning Applications
  • Recommender Systems and Techniques
  • Data Management and Algorithms
  • Text and Document Classification Technologies
  • Face and Expression Recognition
  • Video Surveillance and Tracking Methods
  • Generative Adversarial Networks and Image Synthesis
  • Metaheuristic Optimization Algorithms Research
  • Advanced Graph Neural Networks
  • Data Mining Algorithms and Applications
  • Advanced Text Analysis Techniques
  • Advanced Clustering Algorithms Research
  • Sentiment Analysis and Opinion Mining
  • Human Mobility and Location-Based Analysis
  • Domain Adaptation and Few-Shot Learning
  • Neural Networks and Applications
  • Advanced Computational Techniques and Applications
  • Advanced Vision and Imaging
  • Anomaly Detection Techniques and Applications
  • Plant Water Relations and Carbon Dynamics
  • Rough Sets and Fuzzy Logic

Sun Yat-sen University
2016-2025

China Tourism Academy
2023-2025

China Guangzhou Analysis and Testing Center
2018-2024

University of Nottingham Malaysia Campus
2023

Guangzhou Experimental Station
2021

Microsoft Research Asia (China)
2020

Guangdong Food and Drug Vocational College
2020

Beijing Institute of Big Data Research
2018-2019

Northeast Agricultural University
2017

Anqing Normal University
2014-2017

Multi-view clustering, which seeks a partition of the data in multiple views that often provide complementary information to each other, has received considerable attention in recent years. In real-life clustering problems, each view may have considerable noise. However, existing methods blindly combine information from multiple views with possibly considerable noise, which degrades their performance. In this paper, we propose a novel Markov chain method for Robust Multi-view Spectral Clustering (RMSC). Our flavor of low-rank...

10.1609/aaai.v28i1.8950 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2014-06-21
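
As a rough illustration of the Markov chain view mentioned in the RMSC abstract above, the sketch below turns one view's pairwise similarities into a row-stochastic transition matrix. The Gaussian kernel, the `sigma` parameter, and the name `transition_matrix` are assumptions made for illustration. The abstract is truncated before it describes how the per-view matrices are combined, so that step is not shown.

```python
import math


def transition_matrix(points, sigma=1.0):
    """Build P = D^-1 W for one view (illustrative, not the paper's code).

    W holds Gaussian similarities between points; dividing each row by its
    sum gives the transition probabilities of a random walk on the graph.
    """
    n = len(points)
    W = [
        [
            math.exp(
                -sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
                / (2 * sigma ** 2)
            )
            for j in range(n)
        ]
        for i in range(n)
    ]
    P = []
    for row in W:
        total = sum(row)
        P.append([w / total for w in row])
    return P


# Hypothetical toy view: two nearby points and one distant point.
view = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0)]
P = transition_matrix(view)
```

In this toy view, the walk starting at the first point is far more likely to step to its close neighbour than to the distant point, which is the structure a spectral method would later exploit.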

Pre-trained models for programming languages have recently demonstrated great success on code intelligence. To support both code-related understanding and generation tasks, recent works attempt to pre-train unified encoder-decoder models. However, such a framework is sub-optimal for auto-regressive tasks, especially code completion, which requires a decoder-only manner for efficient inference. In this paper, we present UniXcoder, a unified cross-modal pre-trained model for programming language. The model utilizes mask attention matrices with...

10.18653/v1/2022.acl-long.499 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01
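
The UniXcoder abstract above mentions mask attention matrices but is cut off before explaining them. The sketch below shows, under that caveat, how a 0/1 mask can make one attention layer behave bidirectionally, causally, or as a prefix (source-then-target) model. The function name `attention_mask`, the mode names, and the `prefix_len` parameter are hypothetical and are not taken from the paper or its code.

```python
def attention_mask(n, mode, prefix_len=0):
    """Return an n x n mask where mask[i][j] == 1 lets token i attend to j.

    Illustrative only; the mode names are assumptions.
    """
    if mode == "bidirectional":
        # Every token sees every token (understanding-style tasks).
        return [[1] * n for _ in range(n)]
    if mode == "causal":
        # Token i sees only tokens 0..i (decoder-only generation).
        return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]
    if mode == "prefix":
        # The first prefix_len tokens are visible to everyone;
        # the remaining tokens are generated causally.
        return [
            [1 if (j < prefix_len or j <= i) else 0 for j in range(n)]
            for i in range(n)
        ]
    raise ValueError(f"unknown mode: {mode}")


causal = attention_mask(3, "causal")
prefix = attention_mask(4, "prefix", prefix_len=2)
```

Switching the mask rather than the network is what would allow a single set of weights to serve several task styles.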

Image content analysis is an important surround perception modality of intelligent vehicles. In order to efficiently recognize the on-road environment based on image content analysis from a large-scale scene database, relevant image retrieval becomes one of the fundamental problems. To improve the efficiency of calculating similarities between images, hashing techniques have received increasing attention. For most existing hash methods, suboptimal binary codes are generated, as the hand-crafted feature representation is not...

10.1109/tits.2017.2749965 article EN IEEE Transactions on Intelligent Transportation Systems 2017-10-04
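
To make the efficiency argument in the hashing abstract above concrete, here is a minimal sketch of retrieval over binary codes: features are thresholded into bits and database items are ranked by Hamming distance to the query. The sign-threshold binarization, the feature values, and the scene names are all invented for illustration and do not reproduce the paper's learned hash functions.

```python
def binarize(features):
    """Threshold real-valued features at zero to get a binary code."""
    return tuple(1 if x > 0 else 0 for x in features)


def hamming(a, b):
    """Count the bit positions where two codes differ."""
    return sum(x != y for x, y in zip(a, b))


def retrieve(query_code, database, k=1):
    """Return the names of the k database items closest in Hamming distance."""
    ranked = sorted(
        database.items(), key=lambda item: hamming(query_code, item[1])
    )
    return [name for name, _ in ranked[:k]]


# Hypothetical scene database with made-up feature vectors.
database = {
    "highway": binarize([0.9, -0.2, 0.4, -0.7]),
    "tunnel": binarize([-0.5, 0.3, -0.1, 0.8]),
    "crossing": binarize([0.6, 0.1, 0.2, -0.3]),
}
query = binarize([0.8, -0.1, 0.5, -0.9])
top = retrieve(query, database, k=2)
```

Comparing short bit strings is much cheaper than comparing dense feature vectors, which is why code quality (how well the bits preserve similarity) matters so much.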

Virtual try-on systems under arbitrary human poses have significant application potential, yet also raise extensive challenges, such as self-occlusions, heavy misalignment among different poses, and complex clothes textures. Existing virtual try-on methods can only transfer clothes given a fixed human pose, and still show unsatisfactory performances, often failing to preserve person identity or texture details, with limited pose diversity. This paper makes the first attempt towards a multi-pose guided virtual try-on system, which...

10.1109/iccv.2019.00912 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Fact checking is a challenging task because verifying the truthfulness of a claim requires reasoning about multiple retrievable pieces of evidence. In this work, we present a method suitable for reasoning about the semantic-level structure of evidence. Unlike most previous works, which typically represent evidence sentences with either string concatenation or fusing the features of isolated evidence sentences, our approach operates on rich semantic structures of evidence obtained by semantic role labeling. We propose two mechanisms to exploit the structure of evidence while leveraging the advances...

10.18653/v1/2020.acl-main.549 article EN cc-by 2020-01-01

Recently, a few systems for automatically solving math word problems have reported promising results. However, the datasets used for evaluation have limitations in both scale and diversity. In this paper, we build a large-scale dataset which is more than 9 times the size of previous ones and contains many more problem types. Problems in the dataset are semi-automatically obtained from community question-answering (CQA) web pages. A ranking SVM model is trained to extract problem answers from the answer text provided by CQA users, which significantly...

10.18653/v1/p16-1084 article EN Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2016-01-01

Beyond current image-based virtual try-on systems that have attracted increasing attention, we move a step forward to developing a video virtual try-on system that precisely transfers clothes onto the person and generates visually realistic videos conditioned on arbitrary poses. Besides the challenges in image-based virtual try-on (e.g., clothes fidelity, image synthesis), video virtual try-on further requires spatiotemporal consistency. Directly adopting existing image-based approaches often fails to generate coherent video with natural textures. In this work, we propose Flow-navigated Warping...

10.1109/iccv.2019.00125 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

This paper presents a novel template-based method to solve math word problems. This method learns the mappings between math concept phrases in problems and their math expressions from training data. For each equation template, we automatically construct a rich template sketch by aggregating information from various problems with the same template. Our approach is implemented in a two-stage system. It first retrieves a few relevant equation system templates and aligns numbers in the problem to those templates for candidate generation. It then does fine-grained inference to obtain the final...

10.18653/v1/d17-1084 article EN cc-by Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing 2017-01-01
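
The candidate-generation stage described in the template-based abstract above can be pictured with a small sketch: pull the numbers out of a problem, try every alignment of those numbers to the slots of an equation template, and pass the candidates to a later selection step. The regular expression, the subtraction template, the example problem, and the non-negativity filter standing in for "fine-grained inference" are all assumptions for illustration, not the paper's method.

```python
import re
from itertools import permutations


def extract_numbers(text):
    """Find integer or decimal numbers in the problem text."""
    return [float(tok) for tok in re.findall(r"\d+(?:\.\d+)?", text)]


def generate_candidates(text, template, n_slots):
    """Try every ordered alignment of problem numbers to template slots."""
    numbers = extract_numbers(text)
    return [
        (slots, template(*slots)) for slots in permutations(numbers, n_slots)
    ]


# Hypothetical template with two slots: x = n1 - n2.
def remaining(n1, n2):
    return n1 - n2


problem = "Tom had 9 apples and gave away 4. How many are left?"
candidates = generate_candidates(problem, remaining, n_slots=2)

# Crude stand-in for the second stage: discard negative counts.
plausible = [answer for _, answer in candidates if answer >= 0]
```

The two alignments give 5 and -5, which shows why a second, finer-grained stage is needed to choose among candidates that the template alone cannot distinguish.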

Interactive fashion image manipulation, which enables users to edit images with sketches and color strokes, is an interesting research problem with great application value. Existing works often treat it as a general inpainting task and do not fully leverage the semantic structural information in fashion images. Moreover, they directly utilize conventional convolution and normalization layers to restore the incomplete image, which tends to wash away the sketch information. In this paper, we propose a novel Fashion Editing Generative...

10.1109/cvpr42600.2020.00814 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Deep hashing is an appealing approach for large-scale image retrieval. Most existing supervised deep hashing methods learn hash functions using pairwise or triple image similarities in randomly sampled mini-batches. They suffer from low training efficiency, insufficient coverage of the data distribution, and pair imbalance problems. Recently, central similarity quantization (CSQ) attacks the above problems by using "hash centers" as a global similarity metric, which encourages the hash codes of similar images to approach their common center...

10.1109/cvpr52729.2023.02246 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Retrieval plays an important role in knowledge-based visual question answering (KB-VQA), which relies on external knowledge to answer questions related to an image. However, not all information in the knowledge is beneficial to retrieval, e.g., knowledge that is only semantically similar to the query but not useful for answering. To improve the effectiveness and efficiency of retrieval, in this paper, we propose efficient multimodal knowledge selection to filter out irrelevant knowledge and increase retriever performance in KB-VQA. First, to exclude most irrelevant knowledge from the large knowledge, it uses a...

10.1109/tcsvt.2025.3527032 article EN IEEE Transactions on Circuits and Systems for Video Technology 2025-01-01

Learning a user’s preference from check-in data is important for POI recommendation. Yet, a user usually has visited some POIs while most POIs are unvisited (i.e., negative samples). To leverage these “no-behavior” POIs, a typical approach is pairwise ranking, which constructs ranking pairs from the visited and unvisited POIs. Although this approach is generally effective, the negative samples in ranking pairs are obtained randomly, which may fail to leverage “critical” negative samples in model training. On the other hand, previous studies also utilized geographical features to improve recommendation...

10.24963/ijcai.2019/250 article EN 2019-07-28
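
The pairwise-ranking baseline that the POI abstract above criticizes can be sketched as follows: each visited POI is paired with a randomly drawn unvisited POI, and a loss rewards scoring the visited one higher. The logistic loss used here is a common Bayesian Personalized Ranking-style choice and is an assumption, since the abstract does not name the objective; the POI names and function names are also hypothetical.

```python
import math
import random


def pairwise_loss(score_visited, score_unvisited):
    """-log(sigmoid(s_visited - s_unvisited)), written in a stable form."""
    diff = score_visited - score_unvisited
    return math.log(1.0 + math.exp(-diff))


def sample_pairs(visited, all_pois, rng):
    """Pair each visited POI with one uniformly random unvisited POI."""
    unvisited = sorted(set(all_pois) - set(visited))
    return [(poi, rng.choice(unvisited)) for poi in visited]


# Hypothetical check-in data for one user.
rng = random.Random(0)
all_pois = ["cafe", "museum", "park", "station", "mall"]
visited = ["cafe", "park"]
pairs = sample_pairs(visited, all_pois, rng)
```

Because the negatives are drawn uniformly, an easy negative (one the model already ranks low) contributes almost no loss, which is one way to read the abstract's remark that random sampling may miss the "critical" negatives.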

Destination prediction is very important in location-based services such as recommendation of targeted advertising location. Most current approaches predict the destination of an existing trip based on history trajectories. However, no work has considered the difference between the effects of passing-by locations and destinations in history trajectories, which seriously impacts the accuracy of predicted results, as the destination can indicate the purpose of traveling. Meanwhile, the temporal information of trajectories plays an important role. On one hand,...

10.1109/tits.2016.2518685 article EN IEEE Transactions on Intelligent Transportation Systems 2016-03-16

Sentence similarity modeling lies at the core of many natural language processing applications, and thus has received much attention. Owing to the success of word embeddings, popular neural network methods have recently achieved sentence embedding. Most of them focused on learning semantic information and modeling it as a continuous vector, yet the syntactic information of sentences has not been fully exploited. On the other hand, prior works have shown the benefits of structured trees that include syntactic information, while few methods in this branch utilized...

10.1109/taslp.2019.2899494 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2019-02-14

Wanjun Zhong, Duyu Tang, Zhangyin Feng, Nan Duan, Ming Zhou, Ming Gong, Linjun Shou, Daxin Jiang, Jiahai Wang, Jian Yin. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020.

10.18653/v1/2020.acl-main.539 article EN cc-by 2020-01-01

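In this paper, we study how to learn a semantic parser of state-of-the-art accuracy with less supervised training data. We conduct our study on WikiSQL, the largest hand-annotated semantic parsing dataset to date. First, we demonstrate that question generation is an effective method that empowers us to learn a state-of-the-art neural network based semantic parser with thirty percent of the supervised training data. Second, we show that applying question generation to the full supervised training data further improves the state-of-the-art model. In addition, we observe that there is a logarithmic relationship between the accuracy of a semantic parser and the amount of training data.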

10.18653/v1/d18-1188 article EN cc-by Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018-01-01
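
The logarithmic relationship noted in the semantic parsing abstract above, read here as parser accuracy against the amount of training data, can be checked on any set of measurements with a one-variable least-squares fit of accuracy = a + b * ln(n). The data points below are synthetic and generated from an exact log curve purely to show the arithmetic; they are not results from the paper, and `fit_log_curve` is a hypothetical helper.

```python
import math


def fit_log_curve(sizes, accuracies):
    """Least-squares fit of accuracy = a + b * ln(size)."""
    xs = [math.log(n) for n in sizes]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(accuracies) / len(accuracies)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, accuracies))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


# SYNTHETIC data: accuracies built from a known curve, not measured values.
sizes = [1000, 2000, 4000, 8000]
accuracies = [0.30 + 0.05 * math.log(n) for n in sizes]
a, b = fit_log_curve(sizes, accuracies)

# Under a log curve, each doubling of the data adds a constant b * ln(2).
gain_per_doubling = b * math.log(2)
```

A constant gain per doubling is the practical meaning of such a curve: each additional point of accuracy costs exponentially more labeled data, which is the motivation for generating training questions automatically.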

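We propose a novel end-to-end deep architecture for face landmark detection, based on a deep convolutional and deconvolutional network followed by carefully designed recurrent network structures. The pipeline of this architecture consists of three parts. Through the first part, we encode an input face image to resolution-preserved deconvolutional feature maps via a deep network with stacked convolutional and deconvolutional layers. Then, in the second part, we estimate the initial coordinates of the facial key points by an additional convolutional layer on top of these deconvolutional feature maps. In the last part, by using the deconvolutional feature maps and the initial facial key points as input, we refine the coordinates of the facial key points by a recurrent network that consists of multiple long short-term...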

10.1109/tcsvt.2016.2645723 article EN IEEE Transactions on Circuits and Systems for Video Technology 2016-12-28

Social emotion classification aims to predict the aggregation of emotional responses embedded in online comments contributed by various users. Such a task is inherently challenging because extracting relevant semantics from free texts is a classical research problem. Moreover, online comments are typically characterized by a sparse feature space, which makes the corresponding emotion classification task very difficult. On the other hand, though deep neural networks have been shown to be effective for speech recognition and image analysis tasks, their...

10.1109/taffc.2017.2716930 article EN IEEE Transactions on Affective Computing 2017-06-19