Xiaodan Song

ORCID: 0000-0002-8049-1828
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Data Compression Techniques
  • Domain Adaptation and Few-Shot Learning
  • Sparse and Compressive Sensing Techniques
  • Wireless Signal Modulation Classification
  • Advanced Data Storage Technologies
  • Advanced Steganography and Watermarking Techniques
  • Caching and Content Delivery
  • Advanced Vision and Imaging
  • Machine Learning and ELM
  • Energy Efficient Wireless Sensor Networks
  • Vehicle License Plate Recognition
  • Advanced MIMO Systems Optimization
  • Context-Aware Activity Recognition Systems
  • Image and Video Stabilization
  • Advanced Neural Network Applications
  • Chaos-based Image/Signal Encryption
  • Advanced Measurement and Detection Methods
  • Error Correcting Code Techniques
  • Animal Vocal Communication and Behavior
  • Advanced Image Processing Techniques
  • Service-Oriented Architecture and Web Services
  • Image Enhancement Techniques
  • Video Coding and Compression Technologies
  • Image Retrieval and Classification Techniques
  • Network Security and Intrusion Detection

Xidian University
2014-2025

Wireless communication has achieved great success in the past several decades. The challenge is of improving bandwidth with limited spectrum and power consumption, which however gradually become a bottleneck evolution going on. intrinsic problem that modeled as message transportation from sender to receiver pursues for an exact replication Shannon's information theory, certainly leads large requirements data explosion. However, goal among intelligent agents, entities intelligence including...

10.48550/arxiv.2101.12649 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Loop filters are used in video coding to remove artifacts or improve performance. Recent advances deploying convolutional neural network (CNN) replace traditional loop show large gains but with problems for practical application. First, different model is frames encoded quantization parameter (QP), respectively. It expensive hardware. Second, float points operation CNN leads inconsistency between encoding and decoding across platforms. Third, redundancy within consumes precious computational...

10.1109/icip.2018.8451589 article EN 2018-09-07

10.1109/icassp49660.2025.10890347 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Large pre-trained vision-language models (VLMs) like CLIP have shown great potential for solving the unsupervised domain adaptation (UDA) problem. Existing prompt learning UDA based on unsupervised-trained VLMs requires distribution alignment between source and target domains in common space both vision language branches. However, it is difficult rough cross-domain to maintain discriminative semantic structure of domains. Besides, coarse features with non-informative noises due ignoring...

10.1145/3725735 article EN ACM Transactions on Multimedia Computing Communications and Applications 2025-03-21

We consider efficient image transmission via time-varying channels. To improve the performance, we propose a new distributed compressive sensing (CS) scheme that can leverage similar images in cloud. It is featured by channel SNR and bandwidth scalability, high efficiency, low encoding complexity. For each image, compressed thumbnail first transmitted after forward error correction (FEC) modulation to retrieve generate side information (SI) The residual subtracting decompressed then coded CS...

10.1109/tmm.2017.2654123 article EN IEEE Transactions on Multimedia 2017-01-16

With multimedia flourishing on the Web, it is easy to find similar images for a query, especially landmark images. Traditional image coding, such as JPEG, cannot exploit correlations with external Existing vision-based approaches are able by reconstructing from local descriptors but ensure pixel-level fidelity of reconstruction. In this paper, cloud-based distributed coding (Cloud-DIC) scheme proposed mobile photo uploading. For each input image, thumbnail transmitted retrieve correlated and...

10.1109/tcsvt.2015.2416562 article EN IEEE Transactions on Circuits and Systems for Video Technology 2015-03-25

In recent years, semantic communication based on deep learning for source-channel joint encoding has garnered significant attention. It utilizes network models trained end-to-end to represent signals as embedding vectors and demonstrated superior performance compared traditional methods. However, due the disparity between human language, it can be challenging succinctly capture abstract semantics such scenes. this paper, we introduce Scene Graph-based Generative Semantic Communication...

10.1109/icassp48485.2024.10446594 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

This paper proposes a cloud-based distributed image coding scheme (Cloud-DIC) to exploit the strong correlations with external partial-duplicate images in cloud. It features both high efficiency and low encoder complexity, which makes it suitable for photo sharing on mobile devices. To get side information cloud, thumbnail of current is transmitted retrieve highly correlated reconstruct through geometrical registration adaptive patched-based stitching. The then compressed by transform-domain...

10.1109/icip.2014.7025973 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2014-10-01

Redundancy is necessary for a storage system to recover from errors. The frequent errors in large-scale systems, e.g. cloud, make it desired reduce the recovery cost. Among all kinds of data stored video takes large portion due its volume. other characteristic that certain distortion can be tolerated. This paper investigates using scalable representation and unequal error protection scheme costs cloud. By introducing more base layer less on enhancement layers, achieve better tradeoff between...

10.1109/icme.2015.7177458 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2015-06-01

This paper proposes a distributed compressive sensing (CS) scheme for robust image transmission over unknown or time-varying channels with highly correlated images at the decoder. A compressed thumbnail is first transmitted after digital forward error correction (FEC) and modulation to retrieve generate side information (SI) The current residual subtracting decompressed then coded by CS through very dense constellation without FEC. linear representation of signal measurements rateless...

10.1109/vcip.2015.7457886 article EN 2015-12-01

Loop filters are used in video coding to remove artifacts or improve performance. Recent advances deploying convolutional neural network (CNN) replace traditional loop show large gains but with problems for practical application. First, different model is frames encoded quantization parameter (QP), respectively. It expensive hardware. Second, float points operation CNN leads inconsistency between encoding and decoding across platforms. Third, redundancy within consumes precious computational...

10.48550/arxiv.1805.06121 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Redundancy is necessary for a storage system to achieve reliability. Frequent errors in large-scale systems, example, cloud, make it desirable reduce the cost of recovery. Among all types data cloud storage, videos generally occupy significant amounts space due high volumes and rapid development video sharing video-on-demand services. Unlike general data, can tolerate certain level quality degradation. This paper investigates multilayer representations, such as scalable simulcast streaming,...

10.1109/tmm.2017.2751147 article EN IEEE Transactions on Multimedia 2017-09-11

Monitoring multimodal signals provides a more comprehensive understanding of health conditions compared to singlemode monitoring. In the face significant volumes signals, existing IoT monitoring systems primarily focus on high-fidelity signal transmission by encoding separately. However, due lack consideration for downstream applications and correlation between portion bandwidth resources is wasted task-irrelevant information intermodal redundancy. To address this issue, we propose...

10.1109/icassp48485.2024.10446763 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Abstract Since residual learning was proposed, identity mapping has been widely utilized in various neural networks. The method enables information transfer without any attenuation, which plays a significant role training deeper However, interference with unhindered transmission also affects the network’s performance. Accordingly, we propose generalized architecture called reverse attention (RA), applies high-level semantic features to supervise low-level branch. It means that higher...

10.1038/s41598-024-63623-6 article EN cc-by Scientific Reports 2024-06-04

In order to achieve multiple video images fusion processing, domestic and foreign technology commonly used DSP image algorithm, limited serial instruction execution speed, it was difficult meet the requirements of modern high-speed digital signal processing.An based on decoder chip ADV7180 FPGA hardware platform is presented, with analog input sampled by ADV7180, utilization parallel processing capabilities, can be implemented.The test results show that this method simple flexible, able...

10.2991/cisia-15.2015.118 article EN cc-by-nc Advances in computer science research 2015-01-01
Coming Soon ...