Jar-Ferr Yang

ORCID: 0000-0001-6502-9961
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Vision and Imaging
  • Video Coding and Compression Technologies
  • Advanced Data Compression Techniques
  • Image Enhancement Techniques
  • Image and Video Quality Assessment
  • Face recognition and analysis
  • Image and Signal Denoising Methods
  • Face and Expression Recognition
  • Advanced Image Processing Techniques
  • Advanced Neural Network Applications
  • Video Surveillance and Tracking Methods
  • Digital Filter Design and Implementation
  • Neural Networks and Applications
  • Advanced Decision-Making Techniques
  • Infrastructure Maintenance and Monitoring
  • Numerical Methods and Algorithms
  • Advanced Image and Video Retrieval Techniques
  • Gaze Tracking and Assistive Technology
  • Industrial Vision Systems and Defect Detection
  • Advanced Computational Techniques and Applications
  • Spectroscopy Techniques in Biomedical and Chemical Research
  • Advanced Algorithms and Applications
  • Hand Gesture Recognition Systems
  • Advanced Wireless Communication Techniques
  • Advancements in PLL and VCO Technologies

National Cheng Kung University
2003-2019

Beijing University of Technology
2015

The H.264 advanced video coding (H.264/AVC) standard provides several features such as improved efficiency and error robustness for storage transmission. In order to improve the performance of H.264/AVC, control parameters group-of-pictures (GOP) sizes should be adaptively adjusted according different content variations (VCVs), which can extracted from temporal deviation between two consecutive frames. authors present a simple VCV estimation design adaptive GOP detection (AGD) scene change...

10.1049/iet-ipr:20070014 article EN IET Image Processing 2008-04-14

In this paper, we present efficient recursive architectures for realizing the modified discrete cosine transform (MDCT) and inverse MDCT (IMDCT) acquired in many audio coding systems. After data rearrangement, IMDCT can be represented as Chebyshev polynomials such that efficiently implement them structures. For verification, design an ASIC to realize IMDCT. The analyzed results show proposed infinite-impulse response (IIR) structures possess advantages of high efficiency throughput rate....

10.1109/tcsii.2003.808895 article EN IEEE Transactions on Circuits and Systems II Analog and Digital Signal Processing 2003-01-01

For future 3D TV broadcasting systems and navigation applications, it is necessary to have accurate stereo matching which could precisely estimate depth map from two distanced cameras. In this paper, we first suggest a trinary cross color (TCC) census transform, can help achieve disparity raw cost with low computational cost. The two-pass aggregation (TPCA) formed compute the cost, then be obtained by range winner-take-all (RWTA) process white hole filling procedure. To further enhance...

10.1186/s13634-017-0462-3 article EN cc-by EURASIP Journal on Advances in Signal Processing 2017-03-30

This paper presents recursive architectures for the modified discrete cosine transform (MDCT) and its inverse (IMDCT) which are most complex operations in layer 3 of MPEG audio coding standard. By rearranging input data, we first derive two trigonometric equations, can be represented as Chebyshev polynomials. Then demonstrate that general length MDCT IMDCT efficiently implemented by structure. The computational complexity each data throughput these is less than existing related systems many...

10.1109/sips.2000.886703 article EN 2002-11-08

In this paper, a novel class-specific kernel linear regression classification is proposed for face recognition under very low-resolution and severe illumination variation conditions. Since the problem coupled with variations makes ill-posed data distribution, nonlinear projection rendered by function would enhance modeling capability of distribution. The explicit knowledge mapping can be avoided using trick. To reduce redundancy, low-rank-r approximation suggested to make feasible...

10.1186/s13634-016-0328-0 article EN cc-by EURASIP Journal on Advances in Signal Processing 2016-02-27

For smart living applications, personal identification as well behavior and emotion detection becomes more important in our daily life. identity classification facial expression detection, features extracted from face images are the most popular low-cost information. The shape terms of landmarks estimated by a alignment method can be used for many applications including virtual animation real classification. In this paper, we propose robust based on multi-feature regression (MSR), which is...

10.1186/s13634-018-0572-6 article EN cc-by EURASIP Journal on Advances in Signal Processing 2018-08-02

A successive termination and elimination (STE) method to achieve fast inter mode decision is proposed. The detection starts from residual homogeneous then spatial performed for each 16×16 macroblock. For either the or case, authors can directly terminate prediction choose as best mode. non-homogeneous cases, carry out 8×8 subblock motion estimation. Based on cost analyses of modes, method, which could help remove unlikely 8×16 16×8 also suggested. Similarly, STE block be applied decide if...

10.1049/iet-spr.2008.0184 article EN IET Signal Processing 2009-04-28

In this paper, a simple fuzzy-based algorithm to remove the impulse noise from images is proposed. To achieve real-time applications, proposed filter architecture, which combines fuzzy detection and filtering, also designed. With low computational complexity, simulation results show that filters effectively noise.

10.1109/tim.2003.814677 article EN IEEE Transactions on Instrumentation and Measurement 2003-06-01

In this paper, we propose an approximate square criterion for H.264/AVC intra mode decision. The sum of difference (SSD) achieves the best video quality but takes much high computation due to operation. A (SASD) is proposed maintain and reduce computation. By applying characteristic SSD criterion, simulation results show rate-distortion performance SASD close that method. For hardware implementation, synthesized operation respectively reduces 75% 61% in area cost timing delay than function.

10.1109/icme.2008.4607439 article EN 2008-06-01

The ACELP method makes use of multipulse structure to represent the excitation pulses residual signal. With purpose computational complexity reduction, this paper provides maximum-take-precedence (MTP-ACELP) search under acceptable degradation in performance. Because maximum target signal is preferentially compensated, performance would be diminished. By predicting locations pulses, reduced. We not only reduce possible pulse combinations procedure but also avoid computation useless...

10.1109/icassp.2001.941009 article EN 2002-11-13

Model-based video coding has been adopted as a core experiment in the ISO MPEG-4 standard. The clip-and-paste technique for putting objects line is an important tool to reduce transmission rate. To assist clip-and-pasting method fitting into 2-D model, we propose several smoothing algorithms improving quality of reconstructed images. In this paper, proposed can adjust deformation zoom, tilt, and rotation object A luminance algorithm also applied compensate light source variations. Simulation...

10.1109/30.793428 article EN IEEE Transactions on Consumer Electronics 1999-05-01

To improve the discriminant nearest feature space analysis (DNFSA) methods [6], in this paper, we propose an improved DNFSA (IDNFSA) algorithm to increase robustness for variable lighting face recognition. The IDNFSA removes mean of each image and attempts minimize within-class (FS) distance maximize between-class FS simultaneously. In IDNFSA, first n eigenvectors are dropped a generalized whitening transformation is suggested. recognition phase, projected coefficients classified by rule...

10.1109/iscas.2013.6572506 article EN 2022 IEEE International Symposium on Circuits and Systems (ISCAS) 2013-05-01

Hardware designs that can support multiple standards are required for versatile media players. The study proposes a unified inverse transform architecture be efficiently used in Moving Picture Expert Group and ITU International Telecommunication Standardisation Sector (ITU-T) H.264/advanced video coding (AVC), Microsoft codec 1 (VC-1) Chinese Audio Video Coding Standard (AVS) decoders. For H.264/AVC 8-, 4- 2-point transforms, the computational complexity proposed is similar to defined...

10.1049/iet-ipr.2010.0241 article EN IET Image Processing 2012-08-24

Stereo matching of two distanced cameras and structured-light RGB-D are the common ways to capture depth map, which conveys per-pixel information image. However, results with mismatched occluded pixels would not provide accurately well-matched image information. The depth-image relations degrade performances view syntheses seriously in modern-day three-dimension video applications. Therefore, how effectively utilize enhance themselves becomes more important. In this paper, we propose an...

10.1186/s13634-017-0487-7 article EN cc-by EURASIP Journal on Advances in Signal Processing 2017-07-10

In this paper, a modified bit-rate estimation method is proposed to reduce the computation for 4×4 intra mode decision of H.264/AVC video encoder. The number coded bits modeled by linear combination existing coding parameters, which are highly related entropy H.264/AVC. Furthermore, improve accuracy estimation, scheme made adaptive information obtained from previously blocks. Comparing original rate distortion optimized (RDO) encoding process, needs calculate actual encoded each mode, can...

10.1109/icassp.2008.4517820 article EN Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing 2008-03-01

In this paper, we discuss an approach for designing the computational neural network, which is mainly composed of a hardlimiter neuron, updated and search function to solve some problems. The computation-by-search scheme can effectively complicated problems in condition that their functions be easily obtainable by existing networks. convergence suggested networks achieve solution are discussed analyzed. Both theoretical analyses simulated results show proposed network such they belong...

10.1109/icnn.1995.487334 article EN 2002-11-19

The authors present an effective spectral envelope (SE) quantisation scheme for parametric speech coders, based on human hearing properties. variable-dimension SE uniformly sampled vector in frequency is first converted into a fixed, but small, number of nonlinearly spaced bands the Bark scale. minimum distortion (BSD) criterion applied to enable hearing-based (HSEVQ) quantise vector, achieving slightly better perceptual quality than traditional method. A simplified HSEVQ (SSEVQ) developed...

10.1049/ip-vis:20040809 article EN IEE Proceedings - Vision Image and Signal Processing 2004-01-01

10.2495/acar140341 article EN WIT transactions on engineering sciences 2015-01-02

This paper presents a real-time vision-interactive guiding system, which could be interactive with users based on the computer vision technology. A front-view face detection using Harr-like features is used to decide when system should wake up and become user. After initialization, some feature points within detected area are going found. Then orientation of user's head will estimated via pyramidal Lucas-Kanade optical flow tracking. Compared traditional our has more flexibility. Guiding...

10.1109/iscas.2006.1693104 article EN 1993 IEEE International Symposium on Circuits and Systems 2006-09-22

10.36463/idw.2019.0157 article EN Proceedings of the International Display Workshops 2019-11-28
Coming Soon ...