NFDI4DS | UHH-SEMS - Publication Details

Jinhua Zhu

ORCID: 0000-0003-2157-9077

About

Contact & Profiles

A5028284977

Research Areas

Natural Language Processing Techniques
Topic Modeling
Computational Drug Discovery Methods
Radiation Effects in Electronics
Protein Structure and Dynamics
Multimodal Machine Learning Applications
Machine Learning in Materials Science
Fusion materials and technologies
Nuclear Materials and Properties
Machine Learning in Bioinformatics
Reinforcement Learning in Robotics
Satellite Communication Systems
Software Engineering Research
Genomics and Phylogenetic Studies
Interconnection Networks and Systems
Vaccines and immunoinformatics approaches
E-commerce and Technology Innovations
Biomedical Text Mining and Ontologies
Advanced Bandit Algorithms Research
Speech and Audio Processing
Ion-surface interactions and analysis
Indoor and Outdoor Localization Technologies
Low-power high-performance VLSI design
Adaptive Dynamic Programming Control
Cryptographic Implementations and Security

University of Science and Technology of China
2018-2025

Science and Technology on Surface Physics and Chemistry Laboratory
2023-2024

SP Technology (South Korea)
2023

Tianjin University
2018-2022

State Key Laboratory of Nuclear Physics and Technology
2017-2019

Peking University
2015-2019

Microsoft Research Asia (China)
2019

Xi'an University of Science and Technology
2019

Zhejiang Yuexiu University
2015

Sun Yat-sen University
2009

Incorporating BERT into Neural Machine Translation

OPENALEX - Publications

Jinhua Zhu Yingce Xia Lijun Wu Di He Tao Qin and 3 more

The recently proposed BERT has shown great power on a variety of natural language understanding tasks, such as text classification, reading comprehension, etc. However, how to effectively apply BERT to neural machine translation (NMT) lacks enough exploration. While BERT is more commonly used as fine-tuning instead of contextual embedding for downstream language understanding tasks, in NMT, our preliminary exploration of using BERT as contextual embedding is better than using it for fine-tuning. This motivates us to think about how to better leverage BERT for NMT along this direction. We propose a new algorithm named...

10.48550/arxiv.2002.06823 preprint EN public-domain arXiv (Cornell University) 2020-01-01

Soft Contextual Data Augmentation for Neural Machine Translation

Fei Gao Jinhua Zhu Lijun Wu Yingce Xia Tao Qin and 3 more

While data augmentation is an important trick to boost the accuracy of deep learning methods in computer vision tasks, its study in natural language tasks is still very limited. In this paper, we present a novel data augmentation method for neural machine translation. Different from previous augmentation methods that randomly drop, swap or replace words with other words in a sentence, we softly augment a randomly chosen word in a sentence by its contextual mixture of multiple related words. More accurately, we replace the one-hot representation of a word by a distribution (provided by a language model) over...

10.18653/v1/p19-1555 preprint EN cc-by 2019-01-01

Multimodal Sentiment Analysis With Two-Phase Multi-Task Learning

Bo Yang Lijun Wu Jinhua Zhu Bo Shao Xiaola Lin and 1 more

Multimodal Sentiment Analysis (MSA) is a challenging research area that studies sentiment expressed from multiple heterogeneous modalities. Given that pre-trained language models such as BERT have shown state-of-the-art (SOTA) performance in NLP disciplines, existing models tend to integrate these modalities into BERT and treat MSA as a single prediction task. However, we find that simply fusing the multimodal features into BERT cannot well establish the power of a strong pre-trained model. Besides, the classification ability of each modality...

10.1109/taslp.2022.3178204 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2022-01-01

Soft Contextual Data Augmentation for Neural Machine Translation

Fei Gao Jinhua Zhu Lijun Wu Yingce Xia Tao Qin and 3 more

While data augmentation is an important trick to boost the accuracy of deep learning methods in computer vision tasks, its study in natural language tasks is still very limited. In this paper, we present a novel data augmentation method for neural machine translation. Different from previous augmentation methods that randomly drop, swap or replace words with other words in a sentence, we softly augment a randomly chosen word in a sentence by its contextual mixture of multiple related words. More accurately, we replace the one-hot representation of a word by a distribution (provided by a language model) over...

10.48550/arxiv.1905.10523 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Unified 2D and 3D Pre-Training of Molecular Representations

Jinhua Zhu Yingce Xia Lijun Wu Shufang Xie Tao Qin and 3 more

Molecular representation learning has attracted much attention recently. A molecule can be viewed as a 2D graph with nodes/atoms connected by edges/bonds, and can also be represented by a 3D conformation with 3-dimensional coordinates of all atoms. We note that most previous work handles 2D and 3D information separately, while jointly leveraging these two sources may foster a more informative representation. In this work, we explore this appealing idea and propose a new representation learning method based on unified 2D and 3D pre-training. Atom coordinates and interatomic...

10.1145/3534678.3539368 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022-08-12

Masked Contrastive Representation Learning for Reinforcement Learning

Jinhua Zhu Yingce Xia Lijun Wu Jiajun Deng Wengang Zhou and 3 more

In pixel-based reinforcement learning (RL), the states are raw video frames, which are mapped into a hidden representation before feeding to a policy network. To improve the sample efficiency of state representation learning, recently, the most prominent work is based on contrastive unsupervised representation. Witnessing that consecutive video frames in a game are highly correlated, to further improve data efficiency, we propose a new algorithm, i.e., masked contrastive representation learning for RL (M-CURL), which takes the correlation among consecutive inputs into consideration. In our architecture,...

10.1109/tpami.2022.3176413 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-01-01

Dual-view Molecular Pre-training

Jinhua Zhu Yingce Xia Lijun Wu Shufang Xie Wengang Zhou and 3 more

Molecular pre-training, which is about learning an effective representation for molecules on a large amount of data, has attracted substantial attention in cheminformatics and bioinformatics. A molecule can be viewed as either a graph (where atoms are connected by bonds) or a SMILES sequence (where depth-first-search is applied to the molecular graph with specific rules). The Transformer and graph neural networks (GNN) are two representative methods to deal with sequential data and graphic data respectively, which can globally and locally model the molecules respectively and are supposed...

10.1145/3580305.3599317 article EN Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2023-08-04

Breaking the barriers of data scarcity in drug–target affinity prediction

Qizhi Pei Lijun Wu Jinhua Zhu Yingce Xia Shufang Xie and 4 more

Accurate prediction of drug-target affinity (DTA) is of vital importance in early-stage drug discovery, facilitating the identification of drugs that can effectively interact with specific targets and regulate their activities. While wet experiments remain the most reliable method, they are time-consuming and resource-intensive, resulting in limited data availability that poses challenges for deep learning approaches. Existing methods have primarily focused on developing techniques based on the available DTA data,...

10.1093/bib/bbad386 article EN Briefings in Bioinformatics 2023-09-22

A study of BERT for context-aware neural machine translation

Xueqing Wu Yingce Xia Jinhua Zhu Lijun Wu Shufang Xie and 1 more

10.1007/s10994-021-06070-y article EN Machine Learning 2022-01-09

FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation

K. Y. Gao Qizhi Pei G.L. Zhang Jinhua Zhu Kun He and 1 more

10.1145/3690624.3709253 article EN 2025-04-04

Influence of helium bubbles location on hydrogen isotope retention and exchange behavior in plasma-facing materials: A numerical simulation investigation

Yue Huang Hao Chen Qing Liu Jinhua Zhu Fei Sun and 2 more

Tritium (T) is a costly radioactive element that, when retained in plasma-facing materials (PFMs), not only results in fuel loss but also raises issues of contamination. Hydrogen isotope exchange is a potential method for T removal in future fusion devices. However, in the nuclear environment, PFMs will be subjected to low-energy and high-flux helium (He) plasma irradiation, forming a He bubble layer near the material surface. This greatly impacts the diffusion and retention behavior of hydrogen isotopes in PFMs. In this...

10.1016/j.nme.2024.101596 article EN cc-by-nc-nd Nuclear Materials and Energy 2024-01-23

BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning

Qizhi Pei Lijun Wu Kaiyuan Gao Xiaozhuan Liang Yin Fang and 4 more

10.18653/v1/2024.findings-acl.71 article EN Findings of the Association for Computational Linguistics: ACL 2024 2024-01-01

Microsoft Research Asia’s Systems for WMT19

Yingce Xia Xu Tan Fei Tian Fei Gao Di He and 10 more

Yingce Xia, Xu Tan, Fei Tian, Fei Gao, Di He, Weicong Chen, Yang Fan, Linyuan Gong, Yichong Leng, Renqian Luo, Yiren Wang, Lijun Wu, Jinhua Zhu, Tao Qin, Tie-Yan Liu. Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1). 2019.

10.18653/v1/w19-5348 article EN cc-by 2019-01-01

Competitive barrier and trapping effects of helium bubbles on hydrogen isotopes migration behavior in tungsten

Fei Sun D.Y. Chen Qing Liu Jinhua Zhu Xiao-Chun Li and 4 more

10.1016/j.jnucmat.2024.155197 article EN Journal of Nuclear Materials 2024-05-28

BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning

Qizhi Pei Lijun Wu Kaiyuan Gao Xiaozhuan Liang Yin Fang and 4 more

Recent research trends in computational biology have increasingly focused on integrating text and bio-entity modeling, especially in the context of molecules and proteins. However, previous efforts like BioT5 faced challenges in generalizing across diverse tasks and lacked a nuanced understanding of molecular structures, particularly in their textual representations (e.g., IUPAC). This paper introduces BioT5+, an extension of the BioT5 framework, tailored to enhance biological research and drug discovery. BioT5+ incorporates several...

10.48550/arxiv.2402.17810 preprint EN arXiv (Cornell University) 2024-02-27

R2-DDI: relation-aware feature refinement for drug–drug interaction prediction

Jiacheng Lin Lijun Wu Jinhua Zhu Xiaobo Liang Yingce Xia and 3 more

Precisely predicting the drug-drug interaction (DDI) is an important application and hot research topic in drug discovery, especially for avoiding adverse effects when using drug combination treatment for patients. Nowadays, machine learning and deep learning methods have achieved great success in DDI prediction. However, we notice that most of the works ignore the importance of the relation type when building the DDI prediction models. In this work, we propose a novel R$^2$-DDI framework, which introduces a relation-aware feature refinement module...

10.1093/bib/bbac576 article EN Briefings in Bioinformatics 2022-11-28

Discovering drug–target interaction knowledge from biomedical literature

Yutai Hou Yingce Xia Lijun Wu Shufang Xie Fan Yang and 3 more

The interaction between drugs and targets (DTI) in the human body plays a crucial role in biomedical science and applications. As millions of papers come out every year in the biomedical domain, automatically discovering DTI knowledge from the literature, usually in the form of triplets about drugs, targets and their interaction, becomes an urgent demand in the industry. Existing methods of discovering biological knowledge are mainly extractive approaches that often require detailed annotations (e.g. all mentions of entities, relations between every two entity mentions, etc.). However, it...

10.1093/bioinformatics/btac648 article EN Bioinformatics 2022-10-04

Uni-Fold MuSSe: De Novo Protein Complex Prediction with Protein Language Models

Jinhua Zhu Zhen‐Yu He Ziyao Li Guolin Ke Linfeng Zhang

Accurately solving the structures of protein complexes is crucial for understanding and further modifying biological activities. Recent success of AlphaFold and its variants shows that deep learning models are capable of accurately predicting protein complex structures, yet with the painstaking effort of homology search and pairing. To bypass this need, we present Uni-Fold MuSSe (Multimer with Single Sequence inputs), which predicts protein complex structures from their primary sequences with the aid of pre-trained protein language models. Specifically, we built...

10.1101/2023.02.14.528571 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-02-15

Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design

K. Y. Gao Lijun Wu Jinhua Zhu Tianbo Peng Yingce Xia and 6 more

Antibodies are proteins that effectively protect the human body by binding to pathogens. Recently, deep learning-based computational antibody design has attracted popular attention since it automatically mines antibody patterns from data that could be complementary to human experiences. However, the computational methods heavily rely on high-quality antibody structure data, which is quite limited. Besides, the complementarity-determining region (CDR), which is the key component of an antibody that determines the specificity and binding affinity, is highly variable and hard to predict....

10.1145/3580305.3599468 article EN Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2023-08-04

Design and Implementation of Configuration Memory SEU-Tolerant Viterbi Decoders in SRAM-Based FPGAs

Zhen Gao Jinhua Zhu Ruishi Han Zhan Xu Anees Ullah and 1 more

A Viterbi decoder is used in many communication receivers to efficiently decode the received signal that has been convolutionally encoded in the transmitter. This decoding corrects errors that occur due to noise and other imperfections in the channel and is key to achieving a low bit error rate. If implemented on an SRAM-based field-programmable gate array (SRAM-FPGA), radiation-induced soft errors can affect the operation of the decoder by corrupting the configuration memory, which can change the circuit functionality and will not be corrected unless the FPGA...

10.1109/tnano.2019.2925872 article EN IEEE Transactions on Nanotechnology 2019-01-01

Dual-view Molecule Pre-training

Jinhua Zhu Yingce Xia Tao Qin Wengang Zhou Houqiang Li and 1 more

Inspired by its success in natural language processing and computer vision, pre-training has attracted substantial attention in cheminformatics and bioinformatics, especially for molecule based tasks. A molecule can be represented by either a graph (where atoms are connected by bonds) or a SMILES sequence (where depth-first-search is applied to the molecular graph with specific rules). Existing works on molecule pre-training use either graph representations only or SMILES representations only. In this work, we propose to leverage both representations and design a new pre-training algorithm, dual-view molecule pre-training (briefly, DMP), that...

10.48550/arxiv.2106.10234 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Estimation of different calculation models for evaluating heavy ion-induced damage in plasma facing materials

Fei Sun D.Y. Chen Hao Chen Yasuhisa Oya Jinhua Zhu and 3 more

10.1016/j.fusengdes.2023.113910 article EN publisher-specific-oa Fusion Engineering and Design 2023-07-05

Return-Based Contrastive Representation Learning for Reinforcement Learning

Guoqing Liu Chuheng Zhang Li Zhao Tao Qin Jinhua Zhu and 3 more

Recently, various auxiliary tasks have been proposed to accelerate representation learning and improve sample efficiency in deep reinforcement learning (RL). However, existing auxiliary tasks do not take the characteristics of RL problems into consideration and are unsupervised. By leveraging returns, the most important feedback signals in RL, we propose a novel auxiliary task that forces the learnt representations to discriminate state-action pairs with different returns. Our auxiliary loss is theoretically justified to learn representations that capture the structure of a new form...

10.48550/arxiv.2102.10960 preprint EN other-oa arXiv (Cornell University) 2021-01-01