- Video Coding and Compression Technologies
- Advanced Data Compression Techniques
- Image Retrieval and Classification Techniques
- Advanced Image and Video Retrieval Techniques
- Image and Video Quality Assessment
- Advanced Vision and Imaging
- Medical Image Segmentation Techniques
- Peer-to-Peer Network Technologies
- Data Management and Algorithms
- Advanced Steganography and Watermarking Techniques
- IoT-based Smart Home Systems
- Cooperative Communication and Network Coding
- Video Analysis and Summarization
- Advanced Wireless Network Optimization
- Multimedia Communication and Technology
- Cloud Computing and Resource Management
- Caching and Content Delivery
- Wireless Communication Networks Research
- Network Traffic and Congestion Control
- Advanced Wireless Communication Techniques
- Gaze Tracking and Assistive Technology
- Digital Filter Design and Implementation
- Wireless Communication Security Techniques
- Mobile Agent-Based Network Management
- Distributed and Parallel Computing Systems
National Taiwan University of Science and Technology
2013-2024
National Dong Hwa University
2001-2008
National Taiwan University
2006
National Yang Ming Chiao Tung University
1997-2002
ITRI International
2002
Industrial Technology Research Institute
2000-2001
A source model describing the relationship between bits, distortion, and quantization step sizes of a large class of block-transform video coders is proposed. This model is initially derived from rate-distortion theory and then modified to match practical real image data. Realistic constraints such as the quantizer dead-zone and threshold coefficient selection are included in our formulation. The most attractive feature of this model is its simplicity in its final form. It enables us to predict the bits needed to encode a picture at a given...
Processing images for specific targets on a large scale has to handle various kinds of contents with regular processing steps. To segment objects in one image, we utilized dual multiScalE Graylevel mOrphological open and close recoNstructions (SEGON) to build a background (BG) gray-level variation mesh, which can help identify BG and object regions. It was developed from a macroscopic perspective of image gray levels and implemented using standard procedures, thus robustly dealing with large-scale database images....
We consider optimal encoding of video sequences for ATM networks. Two cases are investigated. In one, the units are coded independently (e.g., motion JPEG), while in the other, the coding quality of a later picture may depend on that of an earlier picture (e.g., H.26x and MPEGx). The aggregate distortion-rate relationship of the latter case exhibits a tree structure, and its solution commands a higher degree of complexity than the former. For independent coding, we develop an algorithm which employs multiple Lagrange multipliers to find the constrained...
We investigate how to make good use of radio frequency identification techniques for real-time location systems (RTLS). A new control method, locating and tracking a tag through reader power and candidate region intersection (LOCTREC), has been proposed to improve the RTLS estimation accuracy by eliminating trivial information from the procedure. Multiple readers were deployed and operated at multiple power levels to progressively narrow down the target for accurate estimation. To enhance LOCTREC, visual interpolation algorithms were adopted...
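The candidate region intersection idea can be illustrated with a minimal sketch. It rests on assumptions that are not stated in the abstract: each reader power level is assumed to map to a known read range, so a tag first detected at one level lies in a ring between two radii around that reader, and the candidate region is taken as the set of grid points inside every ring. The names `Ring`, `candidate_region`, and `estimate_position` are hypothetical and are not the LOCTREC implementation.

```python
import math
from typing import List, Optional, Tuple

Point = Tuple[float, float]
# (reader position, inner radius, outer radius); the radii are assumed to come
# from a hypothetical power-level-to-read-range calibration.
Ring = Tuple[Point, float, float]


def candidate_region(rings: List[Ring], width: float, height: float,
                     step: float = 1.0) -> List[Point]:
    """Return the grid points that lie inside every reader's ring."""
    cells: List[Point] = []
    nx = int(round(width / step))
    ny = int(round(height / step))
    for i in range(nx + 1):
        for j in range(ny + 1):
            p = (i * step, j * step)
            # Keep the point only if it is consistent with all readers.
            if all(inner <= math.dist(p, center) <= outer
                   for center, inner, outer in rings):
                cells.append(p)
    return cells


def estimate_position(cells: List[Point]) -> Optional[Point]:
    """Use the centroid of the candidate region as the position estimate."""
    if not cells:
        return None  # the rings do not intersect on this grid
    n = len(cells)
    return (sum(x for x, _ in cells) / n, sum(y for _, y in cells) / n)
```

Adding a ring from another reader or from a finer power level can only shrink the candidate region, which is the sense in which the estimate is refined progressively in this sketch.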
Static gestures can convey certain meanings and act as specific transitions in dynamic gestures. Therefore, recognizing static gestures is one of the most important aspects of gesture recognition. In this paper, a new approach is presented for static gesture recognition based on Zernike moments (ZMs) and pseudo-Zernike moments (PZMs). The binary hand silhouette is first accommodated with a minimum bounding circle (MBC). The silhouette is then decomposed into a finger part and a palm part by morphological operations according to the radius of the MBC. After that, the ZMs & PZMs of different...
This paper presents an innovative access control system, based on human detection and path analysis, to reduce false automatic door system actions while increasing the added values for security applications. The proposed system can first identify a person from the scene, track his trajectory to predict his intention of accessing the entrance, and finally activate the door accordingly. The experimental results show that the proposed system has the advantages of high precision, safety, and reliability, and can be responsive to demands, while preserving the benefits of low cost and added value.
In the first part of this paper, we derived a source model describing the relationship between bits, distortion, and quantization step size for transform coders. Based on this model, a variable frame rate coding algorithm is developed. The basic idea is to select a proper picture frame rate to ensure a minimum quality for every frame. Because our model can predict approximately the number of coded bits when a certain quantization step size is used, we could predict the quality of coded images without going through the entire real-coding process. Therefore, we could skip the right frames to accomplish the goal of constant...
Efficient data hiding algorithms have been developed for video coders such as MPEG-4 and H.264/AVC to deliver embedded information. Lin et al. proposed an error-propagation-free, discrete cosine transform (DCT) based data hiding algorithm for H.264/AVC intra-coded frames. However, the state-of-the-art video codec, high efficiency video coding (HEVC), adopts both the DCT and the discrete sine transform (DST), so that previous algorithms cannot fully utilize the available capacity under the HEVC framework. We investigate the block DCT and DST coefficient characteristics...
The video codec H.266/VVC adopts a Quad-Tree-plus-Multi-type-Tree (QTMT) structure. It is efficient but has high time complexity. A convolutional neural network (CNN) and a random forest classifier are designed to predict the depth and splitting type of a 32 × 32 Coding Unit (CU) to eliminate exhaustive RDO tests. The CNN outputs a label that specifies the depth range for the coder to early-skip or early-terminate CU coding. We utilize the random forest (RF) algorithm to design six RF binary classifiers and a multi-level one. If it classifies the CU to be not...
The authors address the problem of providing fair multimedia quality-of-service (QoS) in IEEE 802.11 distributed co-ordination function-based wireless local area networks in infrastructure mode, where mobile hosts experience heterogeneous channel conditions due to mobility and fading effects. It was observed that unequal link qualities can pose significant unfairness in channel sharing, which may thereby lead to degradation of the QoS performed in adverse conditions. A cross-layer adaptation scheme provides fair QoS by online...
We proposed to utilize the scalable peer-to-peer (P2P) network to perform content-based image retrieval and mining, i.e., P2P-CBIRM. The decentralized unstructured P2P model with certain overheads, i.e., peer clustering and update procedures, is adopted to compromise with the structured one while still reserving flexible routing control when peers join/leave or the network fails. The CBIRM engine is designed to provide multi-instance query with multi-feature types to effectively reduce network traffic while maintaining high retrieval accuracy. It helps enhance knowledge...
With the advance of 5th generation (5G) wireless system technology, JVET has been developing a FVC (Future Video Coding) standard for ultra-high definition video (UHDV) since 2016. We study how to speed up H.266 coding without degrading the video quality. The FVC/H.266 adopts a QuadTree plus Binary Tree (QTBT) structure for Coding Units (CU). We proposed to reference the average depth information of the neighboring Largest Coding Units (LCU) to determine whether to early terminate the CU decomposition or not. By utilizing the modes of CUs, it can...
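A minimal sketch of such a neighbor-depth test is given below. The abstract states only that the average depth of the neighboring LCUs is referenced, so the exact rule here (stop splitting once the current depth reaches the neighbor average plus a margin) is an assumption, and the names `should_terminate_split` and `margin` are hypothetical.

```python
from statistics import mean
from typing import Sequence


def should_terminate_split(current_depth: int,
                           neighbor_depths: Sequence[int],
                           margin: float = 0.0) -> bool:
    """Decide whether to stop splitting the current CU.

    Assumed rule: terminate once the current depth has reached the average
    depth of the already-coded neighboring LCUs (plus an optional margin).
    """
    if not neighbor_depths:
        # No neighbors available (e.g. first LCU): keep the full search.
        return False
    return current_depth >= mean(neighbor_depths) + margin
```

For instance, with neighbor depths 1, 2, and 1 the average is about 1.33, so a CU at depth 2 would stop splitting while a CU at depth 1 would continue. A larger `margin` makes the test more conservative, trading speed for coding efficiency.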
Because the radio spectrum is limited, managing the limited amount of radio resources is an important issue, especially for high-speed data applications. This paper presents a distributed multiagent scheme (DMAS) that was developed for supporting resource allocation in a customer-accepted and cost-effective fashion. The scheme consists of a collection of problem-solving agents with three modules built into the scheme: a knowledge source, a blackboard system, and a control engine. Through the operations and cooperation among active agents, the policy...
The extended format of HEVC, Scalable HEVC (SHVC), has been developed to serve users with mobile devices under heterogeneous networks. SHVC can provide different bitrate, resolution, or quality formats of the same video through a one-time encoding process. As SHVC is based on HEVC, its time complexity is high and has to be reduced for practical applications. In SHVC, it has to select the one best mode, from CUs, PUs, and TUs, to achieve the best RD-cost encoding. In this work, we proposed to utilize two fast schemes to speed up the encoding process: (1) We utilized...
The newest video coding standard, Versatile Video Coding (VVC), adopts a QTMT, i.e., quad-tree plus multi-type tree (MTT), block partition structure and improves the compression performance by about 30%~50% compared with the previous High-Efficiency Video Coding (HEVC) standard, at the cost of higher time complexity. To make practical video communication applications feasible, we have to reduce the high time complexity resulting from an exhaustive rate-distortion optimization (RDO) search procedure. We proposed to predict the Coding Unit (CU) split...
We consider optimal encoding of video sequences (for ATM networks) under buffer and channel constraints. For independent coding, we develop an algorithm which employs multiple Lagrange multipliers to find the constrained bit allocation. Simulation results are presented. We also compare the coded quality and other characteristics of variable bit-rate and constant bit-rate transmission.
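The Lagrangian approach to bit allocation for independently coded units can be sketched as follows. This is a simplified, single-multiplier illustration under a total-rate budget only. It does not reproduce the multiple-multiplier algorithm of the abstract, and buffer and channel constraints are not modeled. The operating points and the function names `pick_points`, `total_rate`, and `allocate_bits` are hypothetical.

```python
from typing import List, Sequence, Tuple

# Each unit offers a list of (rate_in_bits, distortion) operating points.
Points = Sequence[Tuple[float, float]]


def pick_points(units: Sequence[Points], lam: float) -> List[int]:
    """Per unit, choose the operating point minimizing D + lam * R."""
    return [min(range(len(pts)), key=lambda i: pts[i][1] + lam * pts[i][0])
            for pts in units]


def total_rate(units: Sequence[Points], choice: Sequence[int]) -> float:
    """Sum the rates of the chosen operating points."""
    return sum(units[u][i][0] for u, i in enumerate(choice))


def allocate_bits(units: Sequence[Points], budget: float,
                  iters: int = 50) -> List[int]:
    """Bisect on the multiplier until the chosen points fit the budget."""
    if sum(min(r for r, _ in pts) for pts in units) > budget:
        raise ValueError("budget is below the minimum achievable rate")
    best = pick_points(units, 0.0)
    if total_rate(units, best) <= budget:
        return best  # the unconstrained optimum already fits
    lo, hi = 0.0, 1.0
    while total_rate(units, pick_points(units, hi)) > budget:
        hi *= 2.0  # grow the multiplier until the budget is met
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if total_rate(units, pick_points(units, mid)) > budget:
            lo = mid
        else:
            hi = mid
    return pick_points(units, hi)
```

Because the units are independent, each one can be minimized separately for a given multiplier, which is what keeps this case simple. With dependent coding the choices interact, and this per-unit decomposition no longer applies.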
The Internet Protocol Television (IPTV) service provides rich multimedia services over IP networks and is considered a potential killer application of the Internet. The intellectual property management protocol (IPMP) system becomes important in developing these network media applications. In this paper, the security function, streaming, and the user terminal that adopted IPMP are seamlessly integrated to provide a secured live service. In addition, a peer-to-peer (P2P) connecting method between user terminals is also developed...
Cloud video processing and streaming services have to be delivered under heterogeneous network and device environments. Scalable coding and transcoding are required to serve users. As the task scheduling algorithm pre-configures a Hadoop MapReduce platform with the assumption of homogeneous node capability and task complexity, it cannot well accommodate practical resources and tasks. In this research, we proposed a Dynamic Adjustment Slot Complexity Aware Scheduler (DASCAS) to assign tasks. Complexities of decomposed segments...
Performing Content-Based Image Retrieval (CBIR) from Internet databases connected through a Peer-to-Peer (P2P) network, abbreviated as P2P-CBIR, helps to effectively explore the large-scale image database distributed over the peers. A decentralized unstructured P2P framework is adopted in our system to compromise with the structured one while still reserving flexible routing control when peers join/leave or the network fails. The P2P-CBIR search engine is designed to provide multi-instance query with multi-feature types...
Multiple description coding (MDC) decomposes one single media into several descriptions and transmits them over different channels for error resilience. Each description contributes to improving the reconstructed quality when decoded. Distributed video coding (DVC) encodes multiple correlated images and utilizes error correction codes to shift the codec complexity to a joint decoder. Combining MDC with DVC (MDVC) yields stable mobile encoders. In this paper, to improve the MDVC performance, image correlations among processing modules...
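The MDC principle can be illustrated with a minimal sketch that splits a one-dimensional signal into two descriptions by even/odd subsampling and conceals a lost description by averaging the received neighbors. This is a generic textbook-style instance chosen for illustration, not the MDVC scheme of the abstract, and the names `split_descriptions` and `reconstruct` are hypothetical.

```python
from typing import List, Optional, Sequence


def split_descriptions(samples: Sequence[float]):
    """Form two descriptions from the even- and odd-indexed samples."""
    return list(samples[0::2]), list(samples[1::2])


def reconstruct(even: Optional[Sequence[float]],
                odd: Optional[Sequence[float]],
                length: int) -> List[float]:
    """Rebuild the signal; a lost description is passed as None."""
    if even is None and odd is None:
        raise ValueError("at least one description is required")
    out: List[Optional[float]] = [None] * length
    if even is not None:
        out[0::2] = even
    if odd is not None:
        out[1::2] = odd
    result: List[float] = []
    for i, value in enumerate(out):
        if value is None:
            # Conceal the missing sample from its received neighbors.
            near = [out[j] for j in (i - 1, i + 1)
                    if 0 <= j < length and out[j] is not None]
            value = sum(near) / len(near) if near else 0.0
        result.append(value)
    return result
```

With both descriptions the signal is recovered exactly, and with only one it is recovered approximately, which reflects the statement that each description contributes to the reconstructed quality.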
With the prevalence of personal computer devices and the Internet, a multimedia server has to perform video transcoding to serve users under heterogeneous network environments. In this paper, we propose a stability-driven hierarchical scheduling algorithm for a cloud-based video streaming system to speed up the transcoding process for real-time service applications. The Multi-layer Division Max-MCT (MDMCT) algorithm has been developed to reduce the convoy effect. It helps coordinate operations and dynamically adjust the number of slots so that the cloud cluster-s...
The use of IP (Internet protocol) technology in the information and communications industry constitutes a major global trend. A highly efficient service architecture, enabling technologies, and advanced applications are essential to rapid multimedia services in an all-IPv6 network environment. This work presents a platform based on open service architecture (OSA) to support a set of standard interfaces for applications. The environment was integrated using a network-processor-based IPv4/IPv6 translator and a mobile router (MR)...