NFDI4DS | UHH-SEMS - Publication Details

Hao Tian

ORCID: 0000-0001-8219-9743

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100771541

Research Areas

Generative Adversarial Networks and Image Synthesis
Advanced Image and Video Retrieval Techniques
Advanced Neural Network Applications
Multimodal Machine Learning Applications
Domain Adaptation and Few-Shot Learning
Advanced Image Processing Techniques
Topic Modeling
Computer Graphics and Visualization Techniques
Natural Language Processing Techniques
3D Shape Modeling and Analysis
Video Analysis and Summarization
Robotic Path Planning Algorithms
Image Processing Techniques and Applications
Plant Stress Responses and Tolerance
Software Engineering Research
Image and Signal Denoising Methods
Rice Cultivation and Yield Improvement
Advanced Vision and Imaging
Video Surveillance and Tracking Methods
Plant Micronutrient Interactions and Effects
Modular Robots and Swarm Intelligence
Plant responses to water stress
Face recognition and analysis
Heavy metals in environment
Digital Media Forensic Detection

Shihezi University
2022-2024

Ministry of Agriculture and Rural Affairs
2024

Jilin Agricultural University
2023-2024

China University of Geosciences
2024

Jinan University
2024

Northeast Agricultural University
2024

Hubei University Of Economics
2024

Zhejiang University
2024

Sichuan Agricultural University
2024

Ministry of Ecology and Environment
2024

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision

OPENALEX - Publications

Chenyu Yang Yuntao Chen Hao Tian Chenxin Tao Xizhou Zhu and 7 more

We present a novel bird's-eye-view (BEV) detector with perspective supervision, which converges faster and bet-suits modern image backbones. Existing state-of-the-art BEV detectors are often tied to certain depth pretrained backbones like Vo Vn et, hindering the synergy between booming detectors. To address this limitation, we prioritize easing optimization of by introducing view supervision. end, propose two-stage detector; where proposals from head fed into bird' s-eye-view for final...

10.1109/cvpr52729.2023.01710 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts

OPENALEX - Publications

Zhida Feng Zhenyu Zhang Xintong Yu Yewei Fang Lanxin Li and 10 more

Recent progress in diffusion models has revolutionized the popular technology of text-to-image generation. While existing approaches could produce photorealistic high-resolution images with text conditions, there are still several open problems to be solved, which limits further improvement image fidelity and relevancy. In this paper, we propose ERNIE-ViLG 2.0, a large-scale Chinese model, progressively upgrade quality generated by: (1) incorporating fine-grained textual visual knowledge key...

10.1109/cvpr52729.2023.00977 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Spatiotemporal differentiation and attribution of land surface temperature in China in 2001–2020

OPENALEX - Publications

Hao Tian Lin Liu Zhengyong Zhang Hongjin Chen Xueying Zhang and 2 more

10.1007/s11442-024-2209-z article EN Journal of Geographical Sciences 2024-01-26

Zinc regulation of chlorophyll fluorescence and carbohydrate metabolism in saline-sodic stressed rice seedlings

OPENALEX - Publications

Kun Dang Jinmeng Mu Hao Tian Dapeng Gao Hongxiang Zhou and 4 more

Abstract Saline-sodic stress can limit the absorption of available zinc in rice, subsequently impacting normal photosynthesis and carbohydrate metabolism rice plants. To investigate impact exogenous application on grown saline-sodic soil, this study simulated conditions using two varieties, 'Changbai 9' 'Tonghe 899', as experimental materials. Rice seedlings at 4 weeks age underwent various treatments including control (CT), 2 μmol·L −1 treatment alone (Z), 50 mmol·L (S), with (Z + S). We...

10.1186/s12870-024-05170-w article EN cc-by BMC Plant Biology 2024-05-27

A Multi-Scale Gate Network for High-Quality Image Deblurring

OPENALEX - Publications

Shu Tang Y. Lin Xinbo Gao Shuli Yang Jiaxu Leng and 2 more

10.2139/ssrn.5124823 preprint EN 2025-01-01

Achieving Superior Strength and Ductility in TiAl/Ti2AlNb Dissimilar Brazed Joints by Controlling the Brittle Zr(Ni,Cu)3 Intermetallic Compound

OPENALEX - Publications

Hao Tian Jie Xiong Lei Zhao Jun Mei Qi Yan and 2 more

10.1016/j.msea.2025.148225 article EN Materials Science and Engineering A 2025-03-01

Emotional Landscapes in Urban Design: Analyzing Color Emotional Responses of the Elderly to Community Outdoor Spaces in Yi Jie Qu

OPENALEX - Publications

Chengyan Zhang Y. Chen Bart Dewancker Chaojie Shentu Hao Tian and 4 more

Addressing the emotional needs of elderly in urban space design has increasingly become a vital concern. This study innovatively integrates theories with community outdoor spaces, thereby expanding research on categorization spaces. At 8 sites Yi Jie Qu, China, 330 residents were randomly recruited to assess their color responses (CER) landscapes these Based Affective Circumplex Model and Japanese Color Image Theory, Emotion was constructed visually represent overall tendencies significant...

10.3390/buildings14030793 article EN cc-by Buildings 2024-03-14

Geogenic Phosphorus Enrichment in Groundwater due to Anaerobic Methane Oxidation-Coupled Fe(III) Oxide Reduction

OPENALEX - Publications

Yao Du Yaojin Xiong Yamin Deng Yanqiu Tao Hao Tian and 4 more

Accumulation of geogenic phosphorus (P) in groundwater is an emerging environmental concern, which closely linked to coupled processes involving FeOOH and organic matter under methanogenic conditions. However, it remains unclear how P enrichment associated with methane cycling, particularly the anaerobic oxidation (AMO). This study conducted a comprehensive investigation carbon isotopes dissolved inorganic (DIC), CO2, CH4, alongside Fe isotopes, microbial communities, functions quaternary...

10.1021/acs.est.4c00267 article EN Environmental Science & Technology 2024-04-26

Impact of ZnO NPs on photosynthesis in rice leaves plants grown in saline-sodic soil

OPENALEX - Publications

Kun Dang Yuxin Wang Hao Tian Jingjing Bai Xiyuan Cheng and 4 more

Saline-sodic stress restricts the absorption of zinc by rice, consequently impacting photosynthesis process rice plants. In this experiment, Landrace 9 was selected as test material and potting method employed to investigate influence ZnO nanoparticles (ZnO NPs) on chlorophyll fluorescence in grown saline-sodic land. The research findings demonstrate that application NPs proves be more advantageous for growth soil. Notably, significantly decreases levels Na

10.1038/s41598-024-66935-9 article EN cc-by Scientific Reports 2024-07-14

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

OPENALEX - Publications

Han Zhang Weichong Yin Yewei Fang Lanxin Li Boqiang Duan and 5 more

Conventional methods for the image-text generation tasks mainly tackle naturally bidirectional separately, focusing on designing task-specific frameworks to improve quality and fidelity of generated samples. Recently, Vision-Language Pre-training models have greatly improved performance image-to-text tasks, but large-scale pre-training text-to-image synthesis task are still under-developed. In this paper, we propose ERNIE-ViLG, a unified generative framework with transformer model. Based...

10.48550/arxiv.2112.15283 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Spatio-temporal correlation between human activity intensity and land surface temperature on the north slope of Tianshan Mountains

OPENALEX - Publications

Hongjin Chen Lin Liu Zhengyong Zhang Ya Liu Hao Tian and 3 more

10.1007/s11442-022-2030-5 article EN Journal of Geographical Sciences 2022-10-01

Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation

OPENALEX - Publications

Zhihong Pan Xin Zhou Hao Tian

Diffusion-based text-to-image generation models like GLIDE and DALLE-2 have gained wide success recently for their superior performance in turning complex text inputs into images of high quality diversity. In particular, they are proven to be very powerful creating graphic arts various formats styles. Although current supported specifying style oil painting or pencil drawing, fine-grained features color distributions brush strokes hard specify as randomly picked from a conditional...

10.1109/wacv56688.2023.00444 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023-01-01

Attention-Aware Anime Line Drawing Colorization

OPENALEX - Publications

Yu Cao Hao Tian P.Y. Mok

Automatic colorization of anime line drawing has attracted much attention in recent years since it can substantially benefit the animation industry. User-hint based methods are mainstream approach for colorization, while reference-based offer a more intuitive approach. Nevertheless, although improve feature aggregation reference image and drawing, results not compelling terms color consistency or semantic correspondence. In this paper, we introduce an attention-based model which channel-wise...

10.1109/icme55011.2023.00282 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2023-07-01

Combined Effects of Straw Return with Nitrogen Fertilizer on Leaf Ion Balance, Photosynthetic Capacity, and Rice Yield in Saline-Sodic Paddy Fields

OPENALEX - Publications

Kun Dang Cheng Ran Hao Tian Dapeng Gao Jinmeng Mu and 5 more

Soil salinization is a prevalent global environmental issue that significantly hampers crop growth and yield. However, there has been limited research on the impact of nitrogen fertilization various management practices in alleviating saline-sodic stress crops. In order to examine combined straw fertilizer application physiological photosynthetic characteristics rice paddy fields, three-year field experiment was conducted Jilin Province, China. The as split-zone trial, where main zone...

10.3390/agronomy13092274 article EN cc-by Agronomy 2023-08-29

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages

OPENALEX - Publications

Yekun Chai Shuohuan Wang Chao Pang Yu Sun Hao Tian and 1 more

Software engineers working with the same programming language (PL) may speak different natural languages (NLs) and vice versa, erecting huge barriers to communication efficiency. Recent studies have demonstrated effectiveness of generative pre-training in computer programs, yet they are always English-centric. In this work, we step towards bridging gap between multilingual NLs PLs for large models (LLMs). We release ERNIE-Code, a unified pre-trained model 116 6 PLs. employ two methods...

10.18653/v1/2023.findings-acl.676 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2023-01-01

DETR-based Layered Clothing Segmentation and Fine-Grained Attribute Recognition

OPENALEX - Publications

Hao Tian Yu Cao P.Y. Mok

Clothing segmentation and fine-grained attribute recognition are challenging tasks at the crossing of computer vision fashion, which segment entire ensemble clothing instances as well recognize detailed attributes products from any input human images. Many new models have been developed for in recent years, nevertheless accuracy is less than satisfactory case layered or fashion different scales. In this paper, a DEtection TRansformer (DETR) based method proposed to with high accuracy. model,...

10.1109/cvprw59228.2023.00360 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

Research on Image Segmentation Algorithm and Performance of Power Insulator Based on Adaptive Region Growing

OPENALEX - Publications

Xingmou Liu Hao Tian Yan Wang Fan Jiang Chenyang Zhang

10.1007/s42835-022-01118-y article EN Journal of Electrical Engineering and Technology 2022-06-02

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

OPENALEX - Publications

Pengfei Zhu Chao Pang Shuohuan Wang Yekun Chai Yu Sun and 2 more

In recent years, the burgeoning interest in diffusion models has led to significant advances image and speech generation. Nevertheless, direct synthesis of music waveforms from unrestricted textual prompts remains a relatively underexplored domain. response this lacuna, paper introduces pioneering contribution form text-to-waveform generation model, underpinned by utilization models. Our methodology hinges on innovative incorporation free-form as conditional factors guide waveform process...

10.48550/arxiv.2302.04456 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Asymmetric convolution Swin transformer for medical image super-resolution

OPENALEX - Publications

LU Wei-jia Jiehui Jiang Hao Tian Jun Gu Yuhong Lu and 5 more

Medical Image Super-Resolution plays a pivotal role in enhancing diagnostic accuracy. Transformer-based methods, such as Restoration Using Swin Transformer (SwinIR) and transformer for fast Magnetic Resonance Imaging (SwinMR), have shown prowess this area but also exhibit limitations. Specifically, LayerNorm channel normalization diminishes high-frequency detail, while the Multilayer Perceptron prioritizes global information over local information. Moreover, low-resolution inputs contain...

10.1016/j.aej.2023.11.044 article EN cc-by-nc-nd Alexandria Engineering Journal 2023-11-20

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training

OPENALEX - Publications

Bin Shan Weichong Yin Yu Sun Hao Tian Hua Wu and 1 more

Recent Vision-Language Pre-trained (VLP) models based on dual encoder have attracted extensive attention from academia and industry due to their superior performance various cross-modal tasks high computational efficiency. They attempt learn representation using contrastive learning image-text pairs, however, the built inter-modal correlations only rely a single view for each modality. Actually, an image or text contains potential views, just as humans could capture real-world scene via...

10.48550/arxiv.2209.15270 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Screening and Identification of Cold-Tolerant Phosphorus and Potassium Solubilizing Bacteria and Their Growth-Promoting Effects on Soybean in Cold Regions

OPENALEX - Publications

Hao Yan Tianyi Wang Han Wang Nan Sun Xuebing Wang and 6 more

In this study, we collected soybean inter-root soil (clay soil) from the cold region of Heilongjiang Province, China, screening for cold-tolerant phosphorus- and potassium-solubilizing bacteria by gradient-cooling-directed design mixed bacterial agents. This study screened phosphorus-solubilizing constructed We analyzed strain’s phosphorus/potassium solubilizing capacity, as well its organic acid secretion ability, to reveal mechanism detoxification phosphorus potassium. Clay Heilongjiang,...

10.3390/agronomy15010040 article EN cc-by Agronomy 2024-12-27

Dynamic manipulation of a large object using a leader–assistant mobile robot system

OPENALEX - Publications

Hao Tian Jin-Kyu Choi Heon-Hui Kim

10.5916/jamet.2024.48.6.472 article EN Han-guk marin enjinieoring hakoeji 2024-12-31

Application of Convolution Neural Network Algorithm Based on Intelligent Sensor Network in Target Recognition of Corn Weeder at Seedling Stage

OPENALEX - Publications

Zhiyi Zhang Shiyuan Li Jiakai Jia Zhiqi Zheng Hao Tian

Grass damage in the seedling corn field has always been an important factor affecting growth and development of crops. The existence grass not only compresses living space seedlings but also easily causes insect damage. Therefore, it is essential for weeding field. existing methods usually use manual or chemical herbicide spraying, which time-consuming laborious inefficient. With artificial intelligence modern agricultural technology, robots become effective means, attracted more attention...

10.1155/2022/2748862 article EN cc-by Journal of Sensors 2022-10-03

InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation

OPENALEX - Publications

Rongyao Fang Shilin Yan Zhaoyang Huang Jingqiu Zhou Hao Tian and 2 more

Empowering models to dynamically accomplish tasks specified through natural language instructions represents a promising path toward more capable and general artificial intelligence. In this work, we introduce InstructSeq, an instruction-conditioned multi-modal modeling framework that unifies diverse vision flexible control handling of both visual textual data. InstructSeq employs multimodal transformer architecture encompassing visual, language, sequential modeling. We utilize encoder...

10.48550/arxiv.2311.18835 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Coming Soon ...