Yuqi Zhang

ORCID: 0000-0002-7310-253X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Multimodal Machine Learning Applications
  • Text Readability and Simplification
  • Recommender Systems and Techniques
  • Advanced Text Analysis Techniques
  • Speech and dialogue systems
  • Translation Studies and Practices
  • Advanced Graph Neural Networks
  • Lexicography and Language Studies
  • Speech Recognition and Synthesis
  • AI in cancer detection
  • Sentiment Analysis and Opinion Mining
  • Educational Technology and Pedagogy
  • Epigenetics and DNA Methylation
  • Text and Document Classification Technologies
  • Algorithms and Data Compression
  • Ideological and Political Education
  • COVID-19 diagnosis using AI
  • Hate Speech and Cyberbullying Detection
  • Foreign Language Teaching Methods
  • Complex Network Analysis Techniques
  • Genomics and Rare Diseases
  • Stock Market Forecasting Methods
  • Video Analysis and Summarization

Shanghai Ninth People's Hospital
2024-2025

Shanghai Jiao Tong University
2013-2025

Auckland University of Technology
2022-2024

Chongqing University of Posts and Telecommunications
2023-2024

Shanghai University of Electric Power
2024

Alibaba Group (United States)
2023

Guangdong University of Finance
2023

Renmin University of China
2023

Beihang University
2016-2022

East China Jiaotong University
2022

In this paper, we describe a source-side reordering method based on syntactic chunks for phrase-based statistical machine translation. First, shallow parse the source language sentences. Then, rules are automatically learned from and word alignments. During translation, used to generate lattice each sentence. Experimental results reported Chinese-to-English task, showing an improvement of 0.5%--1.8% BLEU score absolute various test sets better computational efficiency than during decoding....

10.3115/1626281.1626282 article EN 2007-01-01

Abstract The precise mechanisms behind early embryonic arrest due to sperm‐related factors and the most effective strategies are not yet fully understood. Here, we present two cases of male infertility linked novel TDRD6 variants, associated with oligoasthenoteratozoospermia (OAT) arrest. To investigate underlying promising therapeutic approaches, Tdrd6 knock‐in knock‐out mice were generated. variant demonstrated OAT arrest, mirroring clinical observations our patients. Sperm from both...

10.1002/mco2.70038 article EN cc-by MedComm 2025-01-01

Aligning large language models (LLMs) with human values, particularly in the face of stealthy and complex jailbreak attacks, presents a formidable challenge. In this study, we present simple yet highly effective defense strategy, i.e., Intention Analysis ($\mathbb{IA}$). The principle behind is to trigger LLMs' inherent self-correct improve ability through two-stage process: 1) essential intention analysis, 2) policy-aligned response. Notably, $\mathbb{IA}$ an inference-only method, thus...

10.48550/arxiv.2401.06561 preprint EN other-oa arXiv (Cornell University) 2024-01-01

This paper introduces new definitions of Chinese base phrases and presents a hybrid model to combine Memory-Based Learning method disambiguation proposal based on lexical information grammar rules populated from large corpus for 9 types chunking. Our experiment achieves an accuracy (F-measure) 93.4%. The significance the research lies in fact that it provides solid foundation parser.

10.3115/1118824.1118842 article EN 2002-01-01

In this paper, we present the KIT systems participating in Shared Translation Task translating between English↔German and English↔French.All translations are generated using phrase-based translation systems, different kinds of word-based, part-ofspeech-based cluster-based language models trained on provided data.Additional include bilingual models, reordering based part-of-speech tags syntactic parse trees, as well a lexicalized model.In order to make use noisy web-crawled data, apply...

10.3115/v1/w14-3313 article EN cc-by 2014-01-01

With the in-depth development of China's education reform, combination educational public opinion and emotion analysis is conducive to discovery evolution law sentiment related factors, provides ideas for guidance relevant government departments. This paper proposes an model on policy network based text mining, obtains comment contents through Octopus crawler, preprocesses data, introduces calculation formula heat, which process divided into incubation period, outbreak spreading period...

10.1109/iciet51873.2021.9419585 article EN 2021-03-27

Improving neural machine translation (NMT) systems with prompting has achieved significant progress in recent years. In this work, we focus on how to integrate multi-knowledge, multiple types of knowledge, into NMT models enhance the performance prompting. We propose a unified framework, which can effectively knowledge including sentences, terminologies/phrases and templates models. utilize as prefix-prompts input for encoder decoder guide process. The approach requires no changes model...

10.18653/v1/2023.findings-emnlp.333 article EN cc-by 2023-01-01

Abstract Neural Machine Translation (NMT) has achieved great developments in recent years, but we still have to face two challenges: establishing a high-quality corpus and exploring optimal parameters of models for long text translation. In this paper, first attempt set up paragraph-parallel based on English Chinese versions the novels then design hierarchical attention model it handle these challenges. Our encoder decoder take segmented clauses as input process words, clauses, paragraphs at...

10.1088/1742-6596/1453/1/012006 article EN Journal of Physics Conference Series 2020-01-01

In this paper, the KIT systems submitted to Shared Translation Task are presented.We participated in two translation directions: from German English and German.Both translations generated using phrase-based systems.The performance of was boosted by language models built based on different tokens such as word, part-of-speech, automacally word clusters.The difference order between is addressed part-of-speech syntactic tree-based reordering models.In addition a discriminative lexicon, we used...

10.18653/v1/w15-3008 article EN cc-by 2015-01-01

Background: The histone deacetylase family of proteins, which includes the sirtuins, participates in a wide range cellular processes, and is intimately involved neurodegenerative illnesses. research on sirtuins has garnered lot interest. However, there are currently no effective therapeutic drugs. Methods: In order to explore potential inhibitors SIRTs, we first screened four lead compounds SIRT2 Traditional Chinese Medicine (TCM) for nervous disease using Auto- Dock Vina method. Then, with...

10.2174/0115701808288696240308052948 article EN Letters in Drug Design & Discovery 2024-03-15

<title>Abstract</title> The prediction of stock market price trends has always been a challenging issue, attracting widespread attention from both economists and computer scientists. Recently, integrating prices with news data shown to be an effective strategy for enhancing the accuracy task. Yet, many current methods fail fully leverage intricate inter-stock relationships inherent in news.Applying deep learning, especially Graph Convolutional Networks (GCNs), predict demonstrated advanced...

10.21203/rs.3.rs-5242175/v1 preprint EN cc-by Research Square (Research Square) 2024-10-28

In many challenging breast cancer pathology images, the proportion of truly informative tumor regions is extremely limited. The disparity between essential information required for clinical diagnosis (Tumor area less than 10) and vast amount data within Whole Slide Images (WSIs) makes it exceedingly difficult pathologists to identify subtle lesions. To address labor-intensive task imposed by this gap, paper proposes a dynamic sparse token based multi-instance learning framework. This...

10.1109/jbhi.2024.3509213 article EN IEEE Journal of Biomedical and Health Informatics 2024-12-18
Coming Soon ...