Yifei Duan

ORCID: 0009-0003-6336-0513
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Automated Road and Building Extraction
  • Image Processing and 3D Reconstruction
  • Remote Sensing and LiDAR Applications
  • Remote-Sensing Image Classification
  • Advanced Algorithms and Applications
  • Advanced Sensor and Control Systems
  • Advanced Image and Video Retrieval Techniques
  • Radiomics and Machine Learning in Medical Imaging
  • Machine Learning in Materials Science
  • Cell Image Analysis Techniques
  • Advanced Image Fusion Techniques
  • Advanced Neural Network Applications
  • Generative Adversarial Networks and Image Synthesis

Xi’an University of Posts and Telecommunications
2024-2025

California University of Pennsylvania
2024

We introduce Baichuan-Omni-1.5, an omni-modal model that not only has understanding capabilities but also provides end-to-end audio generation capabilities. To achieve fluent and high-quality interaction across modalities without compromising the of any modality, we prioritized optimizing three key aspects. First, establish a comprehensive data cleaning synthesis pipeline for multimodal data, obtaining about 500B (text, audio, vision). Second, audio-tokenizer (Baichuan-Audio-Tokenizer) been...

10.48550/arxiv.2501.15368 preprint EN arXiv (Cornell University) 2025-01-25

The current generation of large language models (LLMs) is typically designed for broad, general-purpose applications, while domain-specific LLMs, especially in vertical fields like medicine, remain relatively scarce. In particular, the development highly efficient and practical LLMs medical domain challenging due to complexity knowledge limited availability high-quality data. To bridge this gap, we introduce Baichuan-M1, a series specifically optimized applications. Unlike traditional...

10.48550/arxiv.2502.12671 preprint EN arXiv (Cornell University) 2025-02-18

Nowadays, object detection algorithms are widely used in various scenarios. However, there further small requirements some special Due to the problems related objects, such as their less available features, unbalanced samples, higher positioning accuracy requirements, and fewer data sets, a algorithm is more complex than general algorithm. The effect of model for objects not ideal. Therefore, this paper takes YOLOXs benchmark network enhances feature information on by improving network’s...

10.3390/rs17050913 article EN cc-by Remote Sensing 2025-03-05

This paper reports the application of deep learning techniques in bright-field transmission electron microscopy image segmentation and tracking, order to understand dynamic changes nanoparticles during high-temperature sintering. Four state-of-the-art models, YOLO v8n-seg, Swin- UNet, VMamba , EfficientSAM -tiny, were used their performances compared. The results show that -tiny performs best task, achieving highest accuracy ( IoU 0.99672, Dice Coefficient 0.99836). In tracking combining...

10.69761/hdta8338 article EN cc-by 2024-10-03
Coming Soon ...