Joohee Kim

ORCID: 0000-0001-8833-0319
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Advanced Neural Network Applications
  • Video Coding and Compression Technologies
  • Advanced Data Compression Techniques
  • Video Surveillance and Tracking Methods
  • Advanced Image and Video Retrieval Techniques
  • Wireless Communication Security Techniques
  • Image and Video Quality Assessment
  • Advanced Image Processing Techniques
  • Sparse and Compressive Sensing Techniques
  • Robotics and Sensor-Based Localization
  • Chaos-based Image/Signal Encryption
  • Domain Adaptation and Few-Shot Learning
  • Image Processing Techniques and Applications
  • Image Enhancement Techniques
  • Educational Systems and Policies
  • Cooperative Communication and Network Coding
  • Optical measurement and interference techniques
  • Music and Audio Processing
  • Parallel Computing and Optimization Techniques
  • Radiation Detection and Scintillator Technologies
  • Image and Signal Denoising Methods
  • Advanced Data Storage Technologies
  • Advanced Steganography and Watermarking Techniques
  • Speech and Audio Processing

Illinois Institute of Technology
2015-2025

Sungkyunkwan University
2016

Kyonggi University
2013-2014

Inha University
2005-2008

Georgia Institute of Technology
2003-2005

University of Michigan
2002

This paper presents a new bit-plane-wise unequal error protection algorithm for progressive bitstreams transmitted over lossy networks. The proposed protects compressed embedded bitstream generated by 3-D SPIHT assigning an amount of forward correction (FEC) to each bit-plane. reduces the side information needed send size code decoder limiting number quality levels bit-planes be sent while providing graceful degradation picture as packet losses increase. We also apply our transmission JPEG...

10.1109/tip.2003.809006 article EN IEEE Transactions on Image Processing 2003-02-01

10.1016/j.cviu.2024.103956 article EN cc-by-nc Computer Vision and Image Understanding 2024-02-08

Multi-class and multi-scale object detection for autonomous driving is challenging because of the high variation in scales cluttered background complex street scenes. Context information high-resolution features are keys to achieve a good performance detection. However, context typically unevenly distributed, feature map also contains distractive low-level features. In this paper, we propose location-aware deformable convolution backward attention filtering improve performance. The extracts...

10.1109/cvpr.2019.00968 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

This paper presents a distributed video streaming framework using unbalanced multiple description coding (MDC) and unequal error protection. In the proposed framework, two senders simultaneously stream complementary descriptions to single receiver over different paths. To minimize overall distortion exploit benefits of multipath transport when characteristics each path are different, an MDC method for wavelet-based coders combined with TCP-friendly rate allocation algorithm is proposed. The...

10.1109/tip.2005.849335 article EN IEEE Transactions on Image Processing 2005-06-15

Universal image segmentation aims to handle all tasks within a single model architecture and ideally requires only one training phase. To achieve task-conditioned joint training, task token needs be used in the multi-task condition for specific tasks. Existing approaches generate from text input (e.g., "the is panoptic"). However, such text-based inputs merely serve as labels fail capture inherent differences between tasks, potentially misleading model. In addition, discrepancy visual...

10.3390/s25020359 article EN cc-by Sensors 2025-01-09

With the introduction of Microsoft Kinect into gaming industry and release Kinect-based application development kits, a whole new has evolved around with applications for gaming, gesture recognition controlling devices, 3-D communication, etc. coming to fore. The popularity sensor can be attributed its low cost real-time depth map generation capability. However, owing Kinect's lack sophistication, maps generated have lot artifacts like poorly object boundaries missing values misalignment...

10.1109/isocc.2012.6407114 article EN 2012-11-01

Recent advancements in image segmentation have been notably driven by Vision Transformers. These transformer-based models offer one versatile network structure capable of handling a variety tasks. Despite their effectiveness, the pursuit enhanced capabilities often leads to more intricate architectures and greater computational demands. OneFormer has responded these challenges introducing query-text contrastive learning strategy active during training only. However, this approach not...

10.3390/s24061879 article EN cc-by Sensors 2024-03-14

Generating an accurate and dense disparity image is one of the important requirements for many applications such as 3D video stereo vision-based advanced driver assistance systems (ADAS). Depth estimation process obtaining a depth map based on two or more reference images. Recently, several techniques that use semi-global optimization estimating maps have been suggested. Although robustness against illumination changes vital factor in like ADAS, matching (SGM) mutual information achieves...

10.1109/iccve.2013.6799860 article EN 2013-12-01

Video object detection enhances the performance of still-image based by exploiting temporal context information from neighboring frames. Most state-of-the-art video detectors are non-causal and require lots preceding succeeding frames, which makes them impractical for real-time online where frames not available. In this paper, we propose a causal recurrent flow-based method detection. The proposed reads only current frame one memory buffer at each time step. Two types utilized. short-term is...

10.1109/icip.2019.8802920 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2019-08-26

10.1016/j.asoc.2020.106209 article EN Applied Soft Computing 2020-03-06

Transformer-based semantic segmentation methods have achieved excellent performance in recent years. Mask2Former is one of the well-known transformer-based which unifies common image into a universal model. However, it performs relatively poorly obtaining local features and segmenting small objects due to relying heavily on transformers. To this end, we propose simple yet effective architecture that introduces auxiliary branches during training capture dense encoder side. The obtained help...

10.3390/s23020581 article EN cc-by Sensors 2023-01-04

This paper presents a TCP-friendly rate allocation algorithm for multiple description coding combined with path diversity. A 3-D SPIHT-based is used to generate two independent substreams and each substream protected an FEC-based unequal error protection algorithm. In the proposed video streaming framework, senders simultaneously stream complementary receiver over different paths. The coordinates distributed from single minimize overall distortion. Simulation results show that increases...

10.1109/icme.2003.1221701 article EN 2003-01-01

In this paper we introduce Slowee, a smart eating-speed guide system with light and vibration. Slowee aims to improve the user's eating habits by delivering right feedback in real time user while eating. We designed implemented our system, conducted pilot study investigate usability of obtain feedbacks from users. Although number participants is rather small (n=10), gave positive on potentials Slowee. expect that device can help maintaining appropriate speed chewing numbers for patients...

10.1145/2851581.2892323 article EN 2016-05-06

Quality‐of‐service‐guaranteed video communication in wireless sensor networks (WVSNs) is extremely challenging because of the unique constraints WVSN (e.g. limited resources sensors and high error rates networks) characteristics traffic huge data rate tight delay bounds). Distributed coding (DVC) has emerged as a new paradigm for applications with resource‐limited encoders. However, current DVC architectures still have several technical limitations that prevent their widespread use...

10.1049/iet-wss.2012.0115 article EN IET Wireless Sensor Systems 2013-08-19

10.1007/s11042-013-1747-7 article EN Multimedia Tools and Applications 2013-11-07

10.1007/s11042-018-6240-x article EN Multimedia Tools and Applications 2018-06-29

One of the major challenges in video object detection is drastic scale changes objects due to camera motion.In this paper, we propose a two-path Convolutional Long Short-Term Memory (convLSTM) pyramid network designed extract and convey multi-scale temporal contextual information order handle efficiently.The proposed convLSTM consists stack multi-input modules.It updated top-down bottom-up pathways so that for small-to-large large-to-small exploited.The module uses two input feature maps...

10.1109/access.2020.3017411 article EN cc-by IEEE Access 2020-01-01

In this paper, an adaptive motion vector smoothing scheme based on weighted median filtering is proposed in order to eliminate the outliers more effectively for improving quality of side information frame-based distributed video coding. We use a simple outlier reliability measure each block compensated interpolated frame and apply only blocks with unreliable vectors. Simulation results show that algorithm improves significantly while maintaining low complexity at encoder

10.3745/jips.2011.7.1.103 article EN Journal of Information Processing Systems 2011-03-31

Compressed sensing (CS) is the theory and practice of sub-Nyquist sampling sparse signals interest. Exact reconstruction may then be possible with much fewer than Nyquist-required number data. In this paper, we consider a multiview video system in which multiple cameras at different locations perform independent CS to simultaneously capture views scene. At decoder, propose disparity- motion-compensated total variation minimization algorithm jointly reconstruct sequence. The experimental...

10.1109/tcsvt.2017.2656920 article EN publisher-specific-oa IEEE Transactions on Circuits and Systems for Video Technology 2017-01-23

10.1016/j.jvcir.2013.12.006 article EN Journal of Visual Communication and Image Representation 2013-12-11

In this paper we introduce a feedback system for the prevention of forward head posture in sedentary work environments. Our aims to promote user maintain good by delivering real-time when poses postures particularly sitting at desk. We designed and developed visual using 3D camera simple pop-up notification window. The results pilot study with 14 participants show that window made positive influence on preventing his or her posture.

10.1145/2908805.2909414 article EN 2016-06-04
Coming Soon ...