NFDI4DS | UHH-SEMS - Publication Details

Jing Dong

ORCID: 0000-0003-3489-6661

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5101739941

Research Areas

Advanced Neural Network Applications
Advanced Image and Video Retrieval Techniques
Generative Adversarial Networks and Image Synthesis
Digital Media Forensic Detection
Emotion and Mood Recognition
Image Retrieval and Classification Techniques
Speech Recognition and Synthesis
Medical Image Segmentation Techniques
Human Pose and Action Recognition
Advanced Steganography and Watermarking Techniques
Smart Agriculture and AI
Speech and Audio Processing
Video Surveillance and Tracking Methods
Robotics and Sensor-Based Localization
Visual Attention and Saliency Detection
3D Surveying and Cultural Heritage
Multimodal Machine Learning Applications
Cell Image Analysis Techniques
Gait Recognition and Analysis
IoT and Edge/Fog Computing
Face recognition and analysis
Handwritten Text Recognition Techniques
Gaze Tracking and Assistive Technology
Technology and Security Systems
Image Processing Techniques and Applications

Chinese Academy of Sciences
2014-2025

Institute of Automation
2010-2025

Dalian University
2020-2024

Dalian University of Technology
2019-2022

Beijing Jiaotong University
2013-2021

Beijing Automation Control Equipment Institute
2021

National Engineering Research Center for Information Technology in Agriculture
2014-2020

Ministry of Agriculture and Rural Affairs
2020

Tianjin University of Finance and Economics
2008

Attention Gate ResU-Net for Automatic MRI Brain Tumor Segmentation

OPENALEX - Publications

Jianxin Zhang Zongkang Jiang Jing Dong Yaqing Hou Bin Liu

Brain tumor segmentation technology plays a pivotal role in the process of diagnosis and treatment MRI brain tumors. It helps doctors to locate measure tumors, as well develop rehabilitation strategies. Recently, methods based on U-Net architecture have become popular they largely improve accuracy by applying skip connection combine high-level feature information low-level information. Meanwhile, researchers demonstrated that introducing attention mechanism into can enhance local expression...

10.1109/access.2020.2983075 article EN cc-by IEEE Access 2020-01-01

A hybrid CNN-GRU model for predicting soil moisture in maize root zone

OPENALEX - Publications

Jingxin Yu Xin Zhang Linlin Xu Jing Dong Lili Zhangzhong

10.1016/j.agwat.2020.106649 article EN Agricultural Water Management 2020-11-27

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

OPENALEX - Publications

Kristen Grauman Andrew Westbury Lorenzo Torresani Kris Kitani Jitendra Malik and 95 more

10.1109/cvpr52733.2024.01834 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Latent Watermark: Inject and Detect Watermarks in Latent Diffusion Space

OPENALEX - Publications

Zheling Meng Bo Peng Jing Dong

10.1109/tmm.2025.3535300 article EN IEEE Transactions on Multimedia 2025-01-01

Exploring DCT Coefficient Quantization Effects for Local Tampering Detection

OPENALEX - Publications

Wei Wang Jing Dong Tieniu Tan

In this paper, we focus on local image tampering detection. For a JPEG image, the probability distributions of its DCT coefficients will be disturbed by operation. The tampered region and unchanged have different distributions, which is an important clue for locating tampering. Based assumption Laplacian distribution unquantized ac coefficients, these two as well size can estimated so that each block being obtained. More accurate localization results could got when consider prior knowledge...

10.1109/tifs.2014.2345479 article EN IEEE Transactions on Information Forensics and Security 2014-08-05

Class activation map guided level sets for weakly supervised semantic segmentation

OPENALEX - Publications

Yifan Wang Gerald Schaefer Xiyao Liu Jing Dong Linglin Jing and 3 more

10.1016/j.patcog.2025.111566 article EN Pattern Recognition 2025-03-01

Leveraging Large Vision-Language Model as User Intent-Aware Encoder for Composed Image Retrieval

OPENALEX - Publications

Zelong Sun Jing Dong Guoxing Yang Nanyi Fei Zhiwu Lu

Composed Image Retrieval (CIR) aims to retrieve target images from candidate set using a hybrid-modality query consisting of reference image and relative caption that describes the user intent. Recent studies attempt utilize Vision-Language Pre-training Models (VLPMs) with various fusion strategies for addressing task. However, these methods typically fail simultaneously meet two key requirements CIR: comprehensively extracting visual information faithfully following In this work, we propose...

10.1609/aaai.v39i7.32768 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Improved 3D lighting environment estimation for image forgery detection

OPENALEX - Publications

Bo Peng Wei Wang Jing Dong Tieniu Tan

3D lighting environment is an important clue in image that can be used for forgery detection. Existing forensic methods exploring consistency are based on many assumptions, among which convexity and constant reflectance of the surface two critical ones. In this paper, we propose improved estimation method a more general reflection model. We relax assumptions by incorporating local geometry texture information into our position dependent The proposed model realistic objects like human faces...

10.1109/wifs.2015.7368587 article EN 2015-11-01

A Novel Adaptively Binarizing Magnitude Vector Method in Local Binary Pattern Based Framework for Texture Classification

OPENALEX - Publications

Shiqi Hu Zhibin Pan Jing Dong Xincheng Ren

Local Binary Pattern (LBP) based framework only uses a scalar threshold to binarize all magnitude vectors in <i>P</i> different directions around each center pixel of texture image. Hence, the original LBP-based framework, fact, can not precisely extract features pixel. Furthermore, value have dramatic changes from coarse areas flat same Therefore, using calculated whole image and simultaneously. To overcome these two drawbacks, we propose novel adaptively binarizing vector (ABMV) method....

10.1109/lsp.2022.3158199 article EN IEEE Signal Processing Letters 2022-01-01

Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection

OPENALEX - Publications

Jing Dong Shuo Zhang Runmin Cong Youfang Lin

Existing light field based works utilize either views or focal stacks for saliency detection. However, since depth information exists implicitly in adjacent different slices, it is difficult to exploit scene from both. By comparison, Epipolar Plane Images (EPIs) provide explicit accurate and occlusion by projected pixel lines. Due the fact that of an object often continuous, distribution edges concentrates more on boundaries compared with traditional color edges, which beneficial improving...

10.1145/3474085.3475312 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

Breast Cancer Histopathological Image Classification Based on Deep Second-order Pooling Network

OPENALEX - Publications

Jiasen Li Jianxin Zhang Qiule Sun Hengbo Zhang Jing Dong and 2 more

With the breakthrough performance in a variety of computer vision and medical image analysis problems, convolutional neural networks (CNNs) have been successfully introduced for classification task breast cancer histopathological images recent years. Nevertheless, existing mainly utilize first-order statistic information deep features to represent images, failing characterize complex global feature distribution images. To address problem, this work makes first attempt explore second-order...

10.1109/ijcnn48605.2020.9207604 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2020-07-01

End-to-End Speech Emotion Recognition Based on One-Dimensional Convolutional Neural Network

OPENALEX - Publications

Mengna Gao Jing Dong Dongsheng Zhou Qiang Zhang Deyun Yang

Real-time speech emotion recognition has always been a problem. To this end, we proposed an end-to-end model based on one-dimensional convolutional neural network, which contains only three convolution layers, two pooling layers and one full-connected layer. Through Adam optimization algorithm back propagation mechanism, more discriminative features can be extracted continuously. Our is quite simple in structure easy to quickly complete the emotional classification task. Compared with...

10.1145/3319921.3319963 article EN 2019-03-15

Artifact feature purification for cross-domain detection of AI-generated images

OPENALEX - Publications

Zheling Meng Bo Peng Jing Dong Tieniu Tan Haonan Cheng

10.1016/j.cviu.2024.104078 article EN Computer Vision and Image Understanding 2024-07-14

MFC: A multi-scale fully convolutional approach for visual instance retrieval

OPENALEX - Publications

Jiedong Hao Wei Wang Jing Dong Tieniu Tan

Previous work has shown that feature maps of deep convolutional neural networks (CNNs) can be interpreted as representation an image. Image features aggregated from these have achieved steady progress in terms performances on visual instance retrieval tasks recent years. The key to the success such methods is representation. In this paper, we study how represent image using discriminative features. We demonstrate first size important factor which affects performance but not been thoroughly...

10.1109/icmew.2017.8026302 article EN 2017-07-01

High-order local connection network for 3D human pose estimation based on GCN

OPENALEX - Publications

Wei Wu Dongsheng Zhou Qiang Zhang Jing Dong Xiaopeng Wei

10.1007/s10489-022-03312-x article EN Applied Intelligence 2022-03-17

Speech Emotion Recognition Based on Convolutional Neural Network and Feature Fusion

OPENALEX - Publications

Gao Mengna Jing Dong Dongsheng Zhou Xiaopeng Wei Qiang Zhang

In view of the remarkable achievements convolutional neural network in field computer vision, We propose a speech emotion recognition algorithm based on convolution and feature fusion, Which extracts features from original signal its spectrogram for recognition. From point enhancement, extracted 1D-CNN 2D-CNN tivo models are fused by dimension splicing this algorithm, then sent to model again train. This Way fusion makes better use emotional information time domain frequency domain, gives...

10.1109/iske47853.2019.9170369 article EN 2019-11-01

Lightweight Real-Time Image Semantic Segmentation Network Based on Multi-Resolution Hybrid Attention Mechanism

OPENALEX - Publications

Wang Xi-zhong Rui Liu Jing Dong Qiang Zhang Dongsheng Zhou

Effective perception of the surrounding environment and balance between accuracy processing speed are crucial for successful application real-time semantic segmentation algorithm in fields autonomous driving, drones, smart security. In this paper, a lightweight feature reuse network MHANet is proposed. The main novelties our method improved ResNet attention-based fusion mechanism. And effectiveness verified by large number experiments. Without any pre-training process, performance using deep...

10.1155/2022/3215083 article EN Wireless Communications and Mobile Computing 2022-09-17

Artifact Feature Purification for Cross-domain Detection of AI-generated Images

OPENALEX - Publications

Zheling Meng Bo Peng Jing Dong Tieniu Tan

In the era of AIGC, fast development visual content generation technologies, such as diffusion models, bring potential security risks to our society. Existing generated image detection methods suffer from performance drop when faced with out-of-domain generators and scenes. To relieve this problem, we propose Artifact Purification Network (APN) facilitate artifact extraction images through explicit implicit purification processes. For one, a suspicious frequency-band proposal method spatial...

10.48550/arxiv.2403.11172 preprint EN arXiv (Cornell University) 2024-03-17

A Vision-based Remote Assistance Method and it's Application in Object Transfer

OPENALEX - Publications

Mingkai Cheng Pengfei Yi Yujie Guo Rui Liu Jing Dong and 1 more

10.1145/3663976.3663983 article EN 2024-04-26

Research on Model-Free 6D Object Pose Estimation Based on Vision 3D Matching

OPENALEX - Publications

Yan Chen Pengfei Yi Yujie Guo Rui Liu Jing Dong and 1 more

10.1145/3663976.3663984 article EN 2024-04-26

MCFNet: Multi-scale Cross Fusion Network for 3D Human Pose Estimation

OPENALEX - Publications

Dazhong Wang Rui Liu Pengfei Yi Jing Dong Dongsheng Zhou

10.1109/icsip61881.2024.10671503 article EN 2022 7th International Conference on Signal and Image Processing (ICSIP) 2024-07-12

Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval

OPENALEX - Publications

Zelong Sun Jing Dong Guoxing Yang Nanyi Fei Zhiwu Lu

Composed Image Retrieval (CIR) aims to retrieve target images from candidate set using a hybrid-modality query consisting of reference image and relative caption that describes the user intent. Recent studies attempt utilize Vision-Language Pre-training Models (VLPMs) with various fusion strategies for addressing task.However, these methods typically fail simultaneously meet two key requirements CIR: comprehensively extracting visual information faithfully following In this work, we propose...

10.48550/arxiv.2412.11087 preprint EN arXiv (Cornell University) 2024-12-15

Coming Soon ...