NFDI4DS | UHH-SEMS - Publication Details

Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification

OPENALEX - Publications

Hao Guo Kang Zheng Xiaochuan Fan Hongkai Yu Song Wang

Human visual perception shows good consistency for many multi-label image classification tasks under certain spatial transforms, such as scaling, rotation, flipping and translation. This has motivated the data augmentation strategy widely used in CNN classifier training -- transformed images are included by assuming same class labels their original images. In this paper, we further propose assumption of perceptual attention regions i.e., region a follows transform if input is spatially...

10.1109/cvpr.2019.00082 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Combining local appearance and holistic view: Dual-Source Deep Neural Networks for human pose estimation

OPENALEX - Publications

Xiaochuan Fan Kang Zheng Yuewei Lin Song Wang

We propose a new learning-based method for estimating 2D human pose from single image, using Dual-Source Deep Convolutional Neural Networks (DS-CNN). Recently, many methods have been developed to estimate by priors that are estimated physiologically inspired graphical models or learned holistic perspective. In this paper, we integrate both the local (body) part appearance and view of each more accurate estimation. Specifically, proposed DS-CNN takes set image patches (category-independent...

10.1109/cvpr.2015.7298740 article EN 2015-06-01

Combining Local Appearance and Holistic View: Dual-Source Deep Neural Networks for Human Pose Estimation

OPENALEX - Publications

Xiaochuan Fan Kang Zheng Yuewei Lin Song Wang

We propose a new learning-based method for estimating 2D human pose from single image, using Dual-Source Deep Convolutional Neural Networks (DS-CNN). Recently, many methods have been developed to estimate by priors that are estimated physiologically inspired graphical models or learned holistic perspective. In this paper, we integrate both the local (body) part appearance and view of each more accurate estimation. Specifically, proposed DS-CNN takes set image patches (category-independent...

10.48550/arxiv.1504.07159 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Detecting Small Signs from Large Images

OPENALEX - Publications

Zibo Meng Xiaochuan Fan Xin Chen Min Chen Yan Tong

In the past decade, Convolutional Neural Networks (CNNs) have been demonstrated successful for object detections. However, size of network input is limited by amount memory available on GPUs. Moreover, performance degrades when detecting small objects. To alleviate usage and improve traffic signs, we proposed an approach signs from large images under real world conditions. particular, are broken into patches as to a Small-Object-Sensitive-CNN (SOS-CNN) modified Single Shot Multibox Detector...

10.1109/iri.2017.57 preprint EN 2017-08-01

Human attribute recognition by refining attention heat map

OPENALEX - Publications

Hao Guo Xiaochuan Fan Song Wang

10.1016/j.patrec.2017.05.012 article EN publisher-specific-oa Pattern Recognition Letters 2017-05-13

Structural and functional characterization of MBS301, an afucosylated bispecific anti-HER2 antibody

OPENALEX - Publications

Sijia Huang Feng Li Huifang Liu Pei Ye Xiaochuan Fan and 7 more

MBS301, a glyco-engineered bispecific anti-human epidermal growth factor receptor 2 (HER2) antibody with typical IgG1 monoclonal structure, was developed through dual-cell expression and in vitro assembling process. MBS301 consists of two half antibodies engineered from trastuzumab pertuzumab, respectively. Integrity purity profiles MB301 indicated that the heterodimerization successful. The high similar melting temperatures (Tm1,72.0°C Tm2, 84.8°C) compared those its parental pertuzumab...

10.1080/19420862.2018.1486946 article EN mAbs 2018-08-06

Groupwise Tracking of Crowded Similar-Appearance Targets from Low-Continuity Image Sequences

OPENALEX - Publications

Hongkai Yu Youjie Zhou Jeff Simmons Craig Przybyla Yuewei Lin and 3 more

Automatic tracking of large-scale crowded targets are particular importance in many applications, such as people/vehicle video surveillance, fiber materials science, and cell biomedical imaging. This problem becomes very challenging when the show similar appearance interslice/ inter-frame continuity is low due to sparse sampling, camera motion target occlusion. The main challenge comes from step association which aims at matching predictions observations multiple targets. In this paper we...

10.1109/cvpr.2016.109 article EN 2016-06-01

A trehalose biosynthetic enzyme doubles as an osmotic stress sensor to regulate bacterial morphogenesis

OPENALEX - Publications

Ximing Chen Lizhe An Xiaochuan Fan Furong Ju Binglin Zhang and 9 more

The dissacharide trehalose is an important intracellular osmoprotectant and the OtsA/B pathway principal for biosynthesis in a wide range of bacterial species. Scaffolding proteins other cytoskeletal elements play essential role morphogenetic processes bacteria. Here we describe how OtsA, addition to its biosynthesis, functions as osmotic stress sensor regulate cell morphology Arthrobacter strain A3. In response stress, this species undergo transition from bacillary myceloid growth. An otsA...

10.1371/journal.pgen.1007062 article EN cc-by PLoS Genetics 2017-10-30

Learning View-Invariant Features for Person Identification in Temporally Synchronized Videos Taken by Wearable Cameras

OPENALEX - Publications

Kang Zheng Xiaochuan Fan Yuewei Lin Hao Guo Hongkai Yu and 2 more

In this paper, we study the problem of Cross-View Person Identification (CVPI), which aims at identifying same person from temporally synchronized videos taken by different wearable cameras. Our basic idea is to utilize human motion consistency for CVPI, where can be computed optical flow. However, flow view-variant - person's in very due view angle change. attempt 3D human-skeleton sequences learn a model that extract view-invariant features flows views. For purpose, use Mocap database...

10.1109/iccv.2017.311 article EN 2017-10-01

Visual Attention Consistency for Human Attribute Recognition

OPENALEX - Publications

Hao Guo Xiaochuan Fan Song Wang

10.1007/s11263-022-01591-y article EN International Journal of Computer Vision 2022-03-05

Co-Interest Person Detection from Multiple Wearable Camera Videos

OPENALEX - Publications

Yuewei Lin Kareem Abdelfatah Youjie Zhou Xiaochuan Fan Hongkai Yu and 2 more

Wearable cameras, such as Google Glass and Go Pro, enable video data collection over larger areas from different views. In this paper, we tackle a new problem of locating the co-interest person (CIP), i.e., one who draws attention most camera wearers, temporally synchronized videos taken by multiple wearable cameras. Our basic idea is to exploit motion patterns people use them correlate persons across videos, instead performing appearance-based matching in traditional...

10.1109/iccv.2015.503 article EN 2015-12-01

DSGPT: Domain-Specific Generative Pre-Training of Transformers for Text Generation in E-commerce Title and Review Summarization

OPENALEX - Publications

Xueying Zhang Yunjiang Jiang Yue Shang Zhaomeng Cheng Chi Zhang and 3 more

We propose a novel domain-specific generative pre-training (DS-GPT) method for text generation and apply it to the product titleand review summarization problems on E-commerce mobile display.First, we adopt decoder-only transformer architecture, which fitswell fine-tuning tasks by combining input output all to-gether. Second, demonstrate utilizing only small amount of data in related domains is powerful. Pre-training languagemodel from general corpus such as Wikipedia or CommonCrawl requires...

10.1145/3404835.3463037 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2021-07-11

A fuzzy edge-weighted centroidal Voronoi tessellation model for image segmentation

OPENALEX - Publications

Xiaochuan Fan Lili Ju Xiaoqiang Wang Song Wang

10.1016/j.camwa.2015.11.003 article EN Computers & Mathematics with Applications 2015-11-28

Identifying Same Persons from Temporally Synchronized Videos Taken by Multiple Wearable Cameras

OPENALEX - Publications

Kang Zheng Hao Guo Xiaochuan Fan Hongkai Yu Song Wang

Video-based human action recognition benefits from multiple cameras which can provide temporally synchronized, multi-view videos. Cross-video person identification, i.e., determining whether at a given time, persons tracked in different videos are the same or not, is key step to integrate such information for collaborative recognition. For fixed cameras, this relatively easy since be precalibrated. In paper, we study cross-video identification wearable constantly moving with wearers....

10.1109/cvprw.2016.106 article EN 2016-06-01

Detecting Small Signs from Large Images

OPENALEX - Publications

Zibo Meng Xiaochuan Fan Xin Chen Min Chen Yan Tong

In the past decade, Convolutional Neural Networks (CNNs) have been demonstrated successful for object detections. However, size of network input is limited by amount memory available on GPUs. Moreover, performance degrades when detecting small objects. To alleviate usage and improve traffic signs, we proposed an approach signs from large images under real world conditions. particular, are broken into patches as to a Small-Object-Sensitive-CNN (SOS-CNN) modified Single Shot Multibox Detector...

10.48550/arxiv.1706.08574 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Scenario-based Multi-product Advertising Copywriting Generation for E-Commerce

OPENALEX - Publications

Xueying Zhang Kai Shen Chi Zhang Xiaochuan Fan Yun Xiao and 3 more

In this paper, we proposed an automatic Scenario-based Multi-product Advertising Copywriting Generation system (SMPACG) for E-Commerce, which has been deployed on a leading Chinese e-commerce platform. The SMPACG consists of two main components: 1) multi-product combination selection module, itself is consisted topic prediction model, pattern and attribute-based model arbitrator model; 2) advertising copywriting generation combines our domain-specific pretrained language knowledge-based data...

10.48550/arxiv.2205.10530 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Automatic Generation of Product-Image Sequence in E-commerce

OPENALEX - Publications

Xiaochuan Fan Chi Zhang Yong Yang Yue Shang Xueying Zhang and 4 more

Product images are essential for providing desirable user experience in an e-commerce platform. For a platform with billions of products, it is extremely time-costly and labor-expensive to manually pick organize qualified images. Furthermore, there the numerous complicated image rules that product needs comply order be generated/selected. To address these challenges, this paper, we present new learning framework achieve Automatic Generation Product-Image Sequence (AGPIS) e-commerce. end,...

10.1145/3534678.3539149 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022-08-12

Object Detection with Mask-based Feature Encoding

OPENALEX - Publications

Xiaochuan Fan Hao Guo Kang Zheng Wei Feng Song Wang

Region-based Convolutional Neural Networks (R-CNNs) have achieved great success in the field of object detection. The existing R-CNNs usually divide a Region-of-Interest (ROI) into grids, and then localize objects by utilizing spatial information reflected relative position each grid ROI. In this paper, we propose novel feature-encoding approach, where is represented through distributions visual patterns. particular, design Mask Weight Network (MWN) to learn set masks apply channel-wise...

10.48550/arxiv.1802.03934 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Co-interest Person Detection from Multiple Wearable Camera Videos

OPENALEX - Publications

Yuewei Lin Kareem Ezzeldeen Youjie Zhou Xiaochuan Fan Hongkai Yu and 2 more

Wearable cameras, such as Google Glass and Go Pro, enable video data collection over larger areas from different views. In this paper, we tackle a new problem of locating the co-interest person (CIP), i.e., one who draws attention most camera wearers, temporally synchronized videos taken by multiple wearable cameras. Our basic idea is to exploit motion patterns people use them correlate persons across videos, instead performing appearance-based matching in traditional...

10.48550/arxiv.1509.01654 preprint EN other-oa arXiv (Cornell University) 2015-01-01