Tan Yu

ORCID: 0000-0003-2870-2314
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Multimodal Machine Learning Applications
  • Image Retrieval and Classification Techniques
  • Advanced Data Storage Technologies
  • Advanced Neural Network Applications
  • Distributed and Parallel Computing Systems
  • MicroRNA in disease regulation
  • Video Analysis and Summarization
  • Cancer-related molecular mechanisms research
  • Advanced Vision and Imaging
  • Neural Networks and Applications
  • Molecular Biology Techniques and Applications
  • Robotics and Sensor-Based Localization
  • Video Surveillance and Tracking Methods
  • Cloud Computing and Remote Desktop Technologies
  • Lipid metabolism and disorders
  • Text and Document Classification Technologies
  • Genomic variations and chromosomal abnormalities
  • Educational Robotics and Engineering
  • Ferroelectric and Piezoelectric Materials
  • Forensic and Genetic Research
  • Domain Adaptation and Few-Shot Learning
  • Prenatal Screening and Diagnostics
  • Advanced Malware Detection Techniques
  • Organic Light-Emitting Diodes Research

University of Hong Kong
2025

Beijing National Laboratory for Molecular Sciences
2025

Peking University
2025

State Key Laboratory of Rare Earth Materials Chemistry and Application
2025

Sichuan University
2021-2023

West China Second University Hospital of Sichuan University
2022-2023

Guilin University of Electronic Technology
2022

Baidu (China)
2021-2022

Bellevue Hospital Center
2021-2022

University of Electronic Science and Technology of China
2014-2021

Recently, MLP-based vision backbones emerge. architectures with less inductive bias achieve competitive performance in image recognition compared CNNs and Transformers. Among them, spatial-shift MLP (S$^2$-MLP), adopting the straightforward operation, achieves better than pioneering works including MLP-mixer ResMLP. More recently, using smaller patches a pyramid structure, Vision Permutator (ViP) Global Filter Network (GFNet) S$^2$-MLP. In this paper, we improve S$^2$-MLP backbone. We expand...

10.48550/arxiv.2108.01072 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Colloidal quantum dots (QDs) have illuminated computer monitors and television screens due to their fascinating color-tunable properties depending on the size. Here, electroluminescence (EL) wavelength of perovskite LEDs was tuned via atomic layer number (ALN) nanoplates (NPs) instead "size" in conventional QDs. We demonstrated efficient with controllably tailored emission from n = 3, 4, 5, ≥7 ALN NPs specific discrete major peaks at 607, 638, 669, 728 nanometers. These peak external...

10.1126/sciadv.adp9595 article EN cc-by-nc Science Advances 2025-02-14

Most of current visual search systems focus on image-to-image (point-to-point) such as image and object retrieval. Nevertheless, fast image-to-video (point-to-set) is much less exploited. This paper tackles instance in videos, where efficient point-to-set matching essential. Through jointly optimizing vector quantization hashing, we propose compressive method to compressM proposals extracted from each video into only k binary codes, ≪ M. Then the similarity between query whole can be...

10.1109/iccv.2017.85 article EN 2017-10-01

By exploiting the cross-modal attention, cross-BERT methods have achieved state-of-the-art accuracy in retrieval. Nevertheless, heavy text-image interactions model are prohibitively slow for large-scale Late-interaction trade off retrieval and efficiency by interaction only late stage, attaining a satisfactory speed. In this work, we propose an inflating shrinking approach to further boost of late-interaction methods. The operation plugs several codes input encoder exploit more thoroughly...

10.18653/v1/2021.emnlp-main.772 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021-01-01

Bilinear pooling has achieved an excellent performance in many computer vision tasks such as fine-grained classification, scene recognition and texture recognition. However, the high-dimension features from bilinear can sometimes be inefficient prone to over-fitting. Random Maclaurin (RM) is a widely used GPU-friendly approximation method reduce dimensionality of features. achieve good performance, large projection matrices are usually required practice, making it costly computation memory....

10.1609/aaai.v35i4.16435 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Abstract Background Patients with biallelic variants in the lanosterol synthase ( LSS ) gene has been reported to exhibit phenotypes as follows: non‐syndromic form of hypotrichosis, congenital cataracts, and alopecia intellectual disability or growth retardation. However, genotype–phenotype correlations are still not completely clear. Methods In this study, we a Chinese girl who had cataracts hypotrichosis. The trio exome sequencing was performed elucidate genetic cause patient. Results We...

10.1002/mgg3.2320 article EN cc-by-nc-nd Molecular Genetics & Genomic Medicine 2023-11-10

This paper tackles the problem of efficient and effective object instance search in videos. To effectively capture relevance between a query video frames precisely localize particular object, we leverage proposals to improve quality However, hundreds obtained from each frame could result unaffordable memory computational cost. this end, present simple yet hierarchical prototype encoding (HOPE) model accelerate without sacrificing accuracy, which exploits both spatial temporal self-similarity...

10.1109/cvpr.2017.340 article EN 2017-07-01

Since an image can be perceived by customers in few seconds, it is effective medium for advertising and adored advertisers. Baidu, as one of the lead search companies world, receives billions text queries per day. How to feed attractive images capture customers' attentions core task Baidu advertising. Traditionally, query-to-image tackled matching query with title. Nevertheless, title-based relies on high-quality titles, which are not easy obtained or unavailable some cases. A more reliable...

10.1109/icde51399.2021.00225 article EN 2022 IEEE 38th International Conference on Data Engineering (ICDE) 2021-04-01

Video advertisements may grasp customers' attention instantly and are often adored by advertisers. Since the corpus is vast, achieving an efficient query-to-video search can be challenging. Because traditional approximate nearest neighborhood (ANN) methods based simple similarities (e.g., cosine or inner products) on embedding vectors. They not sufficient for bridging modal gap between a text query video typically only achieve sub-optimal performance in search. Tree-based deep model (TDM)...

10.1145/3534678.3539061 article EN Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022-08-12

In order to make more accurate and scientific predictions of the industrial economy, we integrated method BP artificial neural network with grey relational analysis together as an Industrial economic forecasting model in this study, applied it predict gross output Huainan City China. This paper examines two most recent patents between May 2012 January 2013 area forecasting. The efficiency effectiveness new is tested by comparing predicted results stepwise regression GM (1, 1) models....

10.2174/2213275908666150831194125 article EN Recent Patents on Computer Science 2016-03-04

Due to their low storage capacity and fast retrieval speed, hashing techniques have received much attention in cross-modal retrieval. However, there are some issues that need be further explored. First, existing methods use the labels construct semantic similarity matrix between pairwise data, ignoring potential manifold structure heterogeneous data. Second, underestimate importance of multi-label gaps different class labels, making learned hash codes less discriminative. Third, few them...

10.1109/tits.2022.3213320 article EN IEEE Transactions on Intelligent Transportation Systems 2022-11-03

Automatically discovering common visual patterns from images and videos is a useful but challenging task. On the one hand, definition of rather ambiguous, it refers to spatial composition frequently occurring primitives which correspond local features, semantic parts or objects. For example, wheels body car could be seen as different primitives, while whole can also an individual primitive. other there exhibit large variations in appearance structures even within same kind pattern, makes...

10.1109/apsipa.2017.8282178 article EN 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2017-12-01

In this paper, we implement a real-time method for incrementally stitching aerial images over large area. To collect high-quality images, the camera is mounted on gimbal. Being isolated from vibration of drone, can provide relatively stable and less blurry images. achieve fast mapping, instead using traditional feature extraction matching steps, algorithm only relies position orientation blending. The mosaic fused updated based adaptive weight multi-band blending algorithm. matrix Laplacian...

10.1109/icarcv.2018.8581078 article EN 2022 17th International Conference on Control, Automation, Robotics and Vision (ICARCV) 2018-11-01

The discriminant cut algorithm is used to detect coastlines in synthetic aperture radar (SAR) images. proposed approach a region-based one, which able capture and utilize spatial information the image. real SAR images, e.g. ALOS-1 PALSAR COSMO-SkyMed together with in-situ GPS data were collected validate performance of for coastline detection accuracy better than 4 times image resolution. efficiency also tested.

10.1088/1755-1315/46/1/012035 article EN IOP Conference Series Earth and Environmental Science 2016-11-01

10.1007/s11265-021-01675-x article EN Journal of Signal Processing Systems 2021-06-30

Type III Bartter syndrome (BS), often known as classic is caused by variants in CLCNKB gene, which encoding the basolateral chloride channel protein ClC-Kb, and characterized renal salt wasting, hypokalemia, metabolic alkalosis, increased renin, aldosterone levels.A 2-year-old boy presented severe malnutrition, alkalosis hypokalemia was clinically diagnosed with BS. The trio exome sequencing (ES) performed to discover genetic cause of this patient, followed validation using Sanger...

10.1002/mgg3.2027 article EN Molecular Genetics & Genomic Medicine 2022-08-01

Abstract Background Short‐rib thoracic dysplasia (SRTD) and Joubert syndrome (JS) are rare genetic ciliopathies, individuals with either can manifest cerebellar malformation variable developmental delays. However, neither of these conditions is easily diagnosed during pregnancy due to a limited fetal phenotype. Here, we investigated fetus that was initially observed have short limbs polydactyly discovered compound heterozygous pathogenesis through exome sequencing (ES). Methods Simultaneous...

10.1002/mgg3.2124 article EN cc-by-nc-nd Molecular Genetics & Genomic Medicine 2022-12-20

In recent years, China has developed modern agriculture energetically. An effective information framework is an important way to provide farms with agricultural services and improve farmer's production technology their income. The mountain areas in central are dominated by agriculture, such as Jiangxi province, Anhui province. These area exist many problems on striking regional difference, unbalanced economic development, low cultural level of most the farmers requirement national policy,...

10.13031/2013.42127 article EN 2012 Dallas, Texas, July 29 - August 1, 2012 2012-01-01
Coming Soon ...