Yu Zhou

ORCID: 0000-0003-4188-9953
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Fluid Dynamics and Vibration Analysis
  • Fluid Dynamics and Turbulent Flows
  • Wind and Air Flow Studies
  • Handwritten Text Recognition Techniques
  • Speech and dialogue systems
  • Aerodynamics and Fluid Dynamics Research
  • Multimodal Machine Learning Applications
  • Advanced Image and Video Retrieval Techniques
  • Domain Adaptation and Few-Shot Learning
  • Vibration and Dynamic Analysis
  • Image Retrieval and Classification Techniques
  • Human Pose and Action Recognition
  • Image Processing and 3D Reconstruction
  • Plasma and Flow Control in Aerodynamics
  • Sentiment Analysis and Opinion Mining
  • Advanced Text Analysis Techniques
  • AI in Service Interactions
  • Speech Recognition and Synthesis
  • Video Analysis and Summarization
  • Plant Water Relations and Carbon Dynamics
  • AI in cancer detection
  • Heat Transfer Mechanisms
  • Hand Gesture Recognition Systems

Nankai University
2024-2025

CRRC (China)
2025

University of Chinese Academy of Sciences
2011-2024

Harbin Institute of Technology
2015-2024

Xiamen University
2014-2024

Institute of Information Engineering
2014-2024

Chinese Academy of Sciences
2014-2024

Shenzhen University
2024

Columbia University
2021-2024

Taiyuan University of Technology
2023-2024

Scene text recognition is a hot research topic in computer vision. Recently, many methods based on the encoder-decoder framework have been proposed, and they can handle scene texts with perspective distortion and curved shapes. Nevertheless, they still face many challenges, such as image blur, uneven illumination, and incomplete characters. We argue that most encoder-decoder methods rely on local visual features without explicit global semantic information. In this work, we propose a semantics-enhanced encoder-decoder framework to robustly recognize low-quality...

10.1109/cvpr42600.2020.01354 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

In self-supervised spatio-temporal representation learning, the temporal resolution and long-short term characteristics are not yet fully explored, which limits the capabilities of learned models. In this paper, we propose a novel method, referred to as video Playback Rate Perception (PRP), to learn spatio-temporal representations in a simple yet effective way. PRP is rooted in a dilated sampling strategy, which produces self-supervision signals about video playback rates for model learning. PRP is implemented with a feature encoder, a classification module,...

10.1109/cvpr42600.2020.00658 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01
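
The dilated sampling idea in the PRP abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the candidate rates, and the use of the rate index as a classification label are assumptions made here for illustration only.

```python
import random
from typing import List, Sequence, Tuple


def dilated_sample(num_frames: int, clip_len: int, rate: int, start: int = 0) -> List[int]:
    """Return indices of `clip_len` frames taken every `rate` frames from `start`."""
    span = (clip_len - 1) * rate + 1  # number of source frames the clip covers
    if rate < 1 or start < 0 or start + span > num_frames:
        raise ValueError("clip does not fit in the video")
    return [start + i * rate for i in range(clip_len)]


def make_example(num_frames: int, clip_len: int, rates: Sequence[int],
                 rng: random.Random) -> Tuple[List[int], int]:
    """Pick a playback rate at random; the rate's index serves as the label (assumed)."""
    label = rng.randrange(len(rates))
    rate = rates[label]
    span = (clip_len - 1) * rate + 1
    start = rng.randint(0, num_frames - span)  # random valid starting frame
    return dilated_sample(num_frames, clip_len, rate, start), label
```

A larger `rate` skips more frames, so the resulting clip looks like faster playback; a model trained to predict `label` from the clip has to perceive that rate.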

We propose a novel self-supervised method, referred to as Video Cloze Procedure (VCP), to learn rich spatial-temporal representations. VCP first generates “blanks” by withholding video clips and then creates “options” by applying spatio-temporal operations on the withheld clips. Finally, it fills the blanks with “options” and learns representations by predicting the categories of the applied operations. VCP can act as either a proxy task or a target task in self-supervised learning. As a proxy task, it converts rich self-supervised representations into video clip operations (options), which enhances flexibility and reduces complexity...

10.1609/aaai.v34i07.6840 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Large language models (LLMs), such as ChatGPT, are able to generate human-like, fluent responses for many downstream tasks, e.g., task-oriented dialog and question answering. However, applying LLMs to real-world, mission-critical applications remains challenging, mainly due to their tendency to generate hallucinations and their inability to use external knowledge. This paper proposes an LLM-Augmenter system, which augments a black-box LLM with a set of plug-and-play modules. Our system makes the LLM generate responses grounded in external knowledge, e.g., stored...

10.48550/arxiv.2302.12813 preprint EN cc-by arXiv (Cornell University) 2023-01-01

In this paper, we present our work in emotion cause extraction. Since there is no open dataset available, the lack of annotated resources has limited the research in this area. Thus, we first present a dataset we built using SINA city news. The annotation is based on the scheme of the W3C Emotion Markup Language. Second, we propose a 7-tuple definition to describe emotion cause events. Based on this general definition, we propose a new event-driven emotion cause extraction method using multi-kernel SVMs, where a syntactical tree based approach is used to represent events in text. A convolution kernel based multi-kernel SVM are...

10.18653/v1/d16-1170 article EN cc-by Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing 2016-01-01

This work investigates the aerodynamics of a NACA 0012 airfoil at chord-based Reynolds numbers ($Re_c$) from $5.3\times 10^{3}$ to $2.0\times 10^{4}$. The lift and drag coefficients, $C_L$ and $C_D$, of the airfoil, along with the flow structure, were measured as the turbulent intensity $T_u$ of the oncoming flow varies from 0.6% to 6.0%. The analysis of the present data and those in the literature unveils a total of eight distinct flow structures around the suction side of the airfoil. Four $Re_c$ regimes, i.e., the ultra-low ($<1.0\times 10^{4}$), low ($1.0\times 10^{4}$–$3.0\times 10^{5}$), moderate ($3.0\times 10^{5}$–$5.0\times 10^{6}$), high...

10.1063/1.4901969 article EN Physics of Fluids 2014-11-01

Junnan Zhu, Qian Wang, Yining Wang, Yu Zhou, Jiajun Zhang, Shaonan Wang, Chengqing Zong. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2019.

10.18653/v1/d19-1302 article EN cc-by 2019-01-01

This paper presents a systematic study of the cross-flow-induced vibration on a spring-supported circular cylinder of diameter $D$ placed in the wake of a fixed smaller cylinder of diameter $d$. The ratios $d/D$ and $L/d$ are varied from 0.2 to 1.0 and from 1.0 to 5.5, respectively, where $L$ is the distance between the centre of the upstream cylinder and the forward stagnation point of the downstream cylinder. Extensive measurements are conducted to capture the frequency responses, surface pressure, shedding frequencies and flow fields using laser vibrometer, hot-wire, pressure scanner...

10.1017/jfm.2017.510 article EN Journal of Fluid Mechanics 2017-09-22

This work aims to provide a systematic experimental study on the wake of two tandem cylinders of unequal diameters. The fluid dynamics around a circular cylinder of diameter $D$ placed in the wake of another circular cylinder with a smaller diameter $d$ is investigated, including the time-mean drag coefficient ($C_{D}$), the fluctuating drag and lift coefficients ($C_{D}^{\prime}$ and $C_{L}^{\prime}$), the Strouhal number ($St$) and the flow structures. The Reynolds number based on $D$ is kept constant at $4.27\times 10^{4}$. The ratios $d/D$ and $L/d$ vary from 0.2 to 1.0 and from 1.0 to 8.0, respectively, where...

10.1017/jfm.2017.735 article EN Journal of Fluid Mechanics 2017-12-11

Multimodal summarization with multimodal output (MSMO) aims to generate a multimodal summary for a multimodal news report, which has been proven to effectively improve users' satisfaction. The existing MSMO methods are trained by the target of the text modality, leading to a modality-bias problem that ignores the quality of the model-selected image during training. To alleviate this problem, we propose a multimodal objective function with the guidance of a multimodal reference to use the loss from both summary generation and image selection. Due to the lack of multimodal reference data, we present two strategies, i.e., ROUGE-ranking...

10.1609/aaai.v34i05.6525 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Nowadays, scene text recognition has attracted more and more attention due to its various applications. Most state-of-the-art methods adopt an encoder-decoder framework with an attention mechanism, which generates text autoregressively from left to right. Despite the convincing performance, the speed is limited because of the one-by-one decoding strategy. As opposed to autoregressive models, non-autoregressive models predict the results in parallel with a much shorter inference time, but their accuracy falls behind the autoregressive counterpart considerably....

10.1145/3474085.3475238 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

The combination of optogenetics and electrophysiological recording enables high-precision bidirectional interactions between neural interfaces and neural circuits, which provides a promising approach for the study of progressive neurophysiological phenomena. Opto-electrophysiological neural probes with sufficient flexibility and biocompatibility are desirable to match the low mechanical stiffness of brain tissue for chronic reliable performance. However, the lack of rigidity poses challenges for the accurate implantation of flexible probes with less...

10.1038/s41378-022-00461-4 article EN cc-by Microsystems & Nanoengineering 2022-11-08

With the continuous advancement of socio-economic levels and relentless innovation in modern medical technologies, there has been a significant increase in the importance people place on their physiological health, particularly in the context of colorectal cancer, a prevalent malignant tumor whose prevention and treatment have attracted widespread attention within the medical community. Notably, colorectal polyps, identified as precursors to colorectal cancer, are crucial for early diagnosis and precise detection, serving as fundamental elements...

10.55524/ijircst.2024.12.2.14 article EN International Journal of Innovative Research in Computer Science & Technology 2024-03-01

10.1007/s00348-014-1790-9 article EN Experiments in Fluids 2014-07-17

Traffic classification, a mapping of traffic to network applications, is important for a variety of networking and security issues, such as network measurement and network monitoring, as well as the detection of malware activities. In this paper, we propose Securitas, a network trace-based protocol identification system, which exploits the semantic information in protocol message formats. Securitas requires no prior knowledge of protocol specifications. Deeming a protocol as a language between two processes, our approach is based upon the new insight that the n-grams of protocol traces, just like...

10.1109/tnet.2014.2381230 article EN IEEE/ACM Transactions on Networking 2015-01-08
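
The Securitas abstract above builds on n-grams of protocol traces. The sketch below only shows what extracting byte-level n-grams from a single message could look like; it is not the Securitas system, and the function name and the choice of byte-level (rather than token-level) n-grams are assumptions made for illustration.

```python
from collections import Counter


def byte_ngrams(message: bytes, n: int) -> Counter:
    """Count every contiguous run of `n` bytes in `message`."""
    if n <= 0:
        raise ValueError("n must be positive")
    # A message shorter than n yields an empty range and thus an empty Counter.
    return Counter(message[i:i + n] for i in range(len(message) - n + 1))
```

For example, `byte_ngrams(b"abab", 2)` counts `b"ab"` twice and `b"ba"` once. Counts like these, gathered over many messages, are the kind of "words" that a language-style analysis of traffic could start from.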

Active control of a turbulent boundary layer has been experimentally investigated with a view to reducing the skin-friction drag and gaining some insight into the mechanism that leads to drag reduction. A spanwise-aligned array of piezo-ceramic actuators was employed to generate a transverse travelling wave along the wall surface, with a specified phase shift between adjacent actuators. Local skin-friction drag exhibits a strong dependence on the control parameters, including the wavelength, amplitude and frequency of the oscillation. A maximum drag reduction of 50 %...

10.1017/jfm.2014.261 article EN Journal of Fluid Mechanics 2014-06-09

Cross-language multidocument summarization is the task of generating a summary in a target language (e.g., Chinese) from a collection of documents in a different source language (e.g., English). Previous methods such as extractive and compressive algorithms focus only on single sentence selection and compression, which cannot make full use of similar sentences containing complementary information. Furthermore, the translation model knowledge is not fully explored in previous approaches. To address these two problems, we propose in this...

10.1109/taslp.2016.2586608 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2016-06-30