NFDI4DS | UHH-SEMS - Publication Details

Hiromitsu Nishizaki

ORCID: 0000-0002-7717-8312

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5014342555

Research Areas

Speech Recognition and Synthesis
Speech and Audio Processing
Speech and dialogue systems
Natural Language Processing Techniques
Music and Audio Processing
Topic Modeling
Robotics and Automated Systems
Text and Document Classification Technologies
Advanced Text Analysis Techniques
Smart Agriculture and AI
Horticultural and Viticultural Research
Subtitles and Audiovisual Media
Phonetics and Phonology Research
Music Technology and Sound Studies
Indoor and Outdoor Localization Technologies
Handwritten Text Recognition Techniques
Sentiment Analysis and Opinion Mining
Video Analysis and Summarization
Non-Invasive Vital Sign Monitoring
AI in Service Interactions
Social Robot Interaction and HRI
Educational Technology and Assessment
Face recognition and analysis
Fermentation and Sensory Analysis
Computational and Text Analysis Methods

University of Yamanashi
2016-2025

Takeda (Japan)
2016-2024

Universiti Malaysia Perlis
2022-2024

Yamaha (Japan)
2021

University of Tsukuba
2019

Interdisciplinary Scientific Research
2015

Toyohashi University of Technology
1999-2004

Data augmentation and feature extraction using variational autoencoder for acoustic modeling

OPENALEX - Publications

Hiromitsu Nishizaki

A data augmentation and feature extraction method using a variational autoencoder (VAE) for acoustic modeling is described. VAE generative model based on Bayesian learning deep framework. can extract latent values its input variables to generate new information. VAEs are widely used pictures sentences. In this paper, applied speech corpus vector from modeling. First, the size of doubled by encoding extracted original utterances The waveforms have "meanings" waveforms. Therefore, be as...

10.1109/apsipa.2017.8282225 article EN 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2017-12-01

Real-Time In-Vehicle Air Quality Monitoring System Using Machine Learning Prediction Algorithm

OPENALEX - Publications

Chew Cheik Goh Latifah Munirah Kamarudin Ammar Zakaria Hiromitsu Nishizaki Nuraminah Ramli and 5 more

This paper presents the development of a real-time cloud-based in-vehicle air quality monitoring system that enables prediction current and future cabin quality. The designed provides predictive analytics using machine learning algorithms can measure drivers’ drowsiness fatigue based on presented in car. It consists five sensors level CO2, particulate matter, vehicle speed, temperature, humidity. Data from these were collected stored cloud database. A model multilayer perceptron, support...

10.3390/s21154956 article EN Sensors 2021-07-21

Handwritten Character Image Generation for Effective Data Augmentation

OPENALEX - Publications

Chee Siang Leow Tomoki Kitagawa Hideaki Yajima Hiromitsu Nishizaki

10.1587/transinf.2024edp7201 article EN IEICE Transactions on Information and Systems 2025-01-01

A hybrid approach of knowledge-driven and data-driven reasoning for activity recognition in smart homes

OPENALEX - Publications

Abdul Syafiq Abdull Sukor Ammar Zakaria Norasmadi Abdul Rahim Latifah Munirah Kamarudin Rossitza Setchi and 1 more

Accurate activity recognition plays a major role in smart homes to provide assistance and support for users, especially elderly cognitively impaired people. To realize this task, knowledge-driven approaches are one of the emerging research areas that have shown interesting advantages featu res. However, several limitations been associated with these approaches. The produced models usually incomplete capture all types human activities. This resulted limited ability accurately infer users’...

10.3233/jifs-169976 article EN Journal of Intelligent & Fuzzy Systems 2019-01-22

Chinese Character Recognition based on Swin Transformer-Encoder

OPENALEX - Publications

Ziying Li Haifeng Zhao Hiromitsu Nishizaki Chee Siang Leow Xingfa Shen

10.1016/j.dsp.2025.105080 article EN Digital Signal Processing 2025-02-01

Supporting table grape berry thinning with deep neural network and augmented reality technologies

OPENALEX - Publications

Prawit Buayai Kabin Yok-In Daisuke Inoue Hiromitsu Nishizaki Koji Makino and 1 more

Berry thinning is a crucial process in table grape cultivation. Such visual features as bunch compactness, form, and berry size are important factors affecting market value. Moreover, sufficient space for each to grow also largely influences the final product's quality, such sugar concentration. requires professional skills, it usually accomplished only by experienced farmers. Furthermore, appropriate period limited two weeks; hence, berry-thinning tasks have led bottleneck terms of...

10.1016/j.compag.2023.108194 article EN cc-by-nc-nd Computers and Electronics in Agriculture 2023-09-06

Evaluating Tree-based Ensemble Strategies for Imbalanced Network Attack Classification

OPENALEX - Publications

Hui Fern Soon Amiza Amir Hiromitsu Nishizaki Nik Adilah Hanin Zahri Latifah Munirah Kamarudin and 1 more

With the continual evolution of cybersecurity threats, development effective intrusion detection systems is increasingly crucial and challenging. This study tackles these challenges by exploring imbalanced multiclass classification, a common situation in network datasets mirroring real-world scenarios. The paper aims to empirically assess performance diverse classification algorithms managing class distributions. Experiments were conducted using UNSW-NB15 benchmark dataset, comprising ten...

10.14569/ijacsa.2024.01501111 article EN International Journal of Advanced Computer Science and Applications 2024-01-01

Non-Contact Breathing Monitoring Using Sleep Breathing Detection Algorithm (SBDA) Based on UWB Radar Sensors

OPENALEX - Publications

Muhammad Husaini Latifah Munirah Kamarudin Ammar Zakaria Intan Kartika Kamarudin Muhammad Amin Ibrahim and 3 more

Ultra-wideband radar application for sleep breathing monitoring is hampered by the difficulty of obtaining signals non-stationary subjects. This occurs due to imprecise signal clutter removal and poor body movement algorithms extracting accurate signals. Therefore, this paper proposed a Sleep Breathing Detection Algorithm (SBDA) address challenge. First, SBDA introduces combination variance feature with Discrete Wavelet Transform (DWT) tackle issue method used Daubechies wavelets five levels...

10.3390/s22145249 article EN cc-by Sensors 2022-07-13

RSSI-Based for Device-Free Localization Using Deep Learning Technique

OPENALEX - Publications

Abdul Syafiq Abdull Sukor Latifah Munirah Kamarudin Ammar Zakaria Norasmadi Abdul Rahim Sukhairi Sudin and 1 more

Device-free localization (DFL) has become a hot topic in the paradigm of Internet Things. Traditional methods are focused on locating users with attached wearable devices. This involves privacy concerns and physical discomfort especially to that need wear activate those devices daily. DFL makes use received signal strength indicator (RSSI) characterize user’s location based their influence wireless signals. Existing work utilizes statistical features extracted from However, some may not...

10.3390/smartcities3020024 article EN cc-by Smart Cities 2020-06-01

Constructing Japanese test collections for spoken term detection

OPENALEX - Publications

Yoshiaki Itoh Hiromitsu Nishizaki Xinhui Hu Hiroaki Nanjo Tomoyosi Akiba and 5 more

Spoken Document Retrieval (SDR) and Term Detection (STD) have been two of the most intensively investigated topics in spoken document processing research according to establishment SDR STD test collections by Text REtrieval Conference (TREC) NIST. Because Japanese researchers also requires such for STD, we established a working group develop these Special Interest Group -Spoken Language Processing (SIG-SLP) Information Society Japan. The has constructed made available collection SDR, is now...

10.21437/interspeech.2010-258 article EN Interspeech 2022 2010-09-26

Signal Classification Using Deep Learning

OPENALEX - Publications

Hiromitsu Nishizaki Koji Makino

Internet-of-Things (IoT) devices have rapidly become important in understanding conditions an environment. The sensed data from IoT (or sensor) device generally form a time sequential signal where the values vary with time. This study describes processing using recurrent-based neural network and particularly focuses on two sorts of classification tasks: sound tennis swing motion classification. We will introduce these tasks their evaluation results recurrent networks. experimental show that...

10.1109/sensorsnano44414.2019.8940077 article EN 2019-07-01

Dialogue Robot Competition for the Development of an Android Robot with Hospitality

OPENALEX - Publications

Ryuichiro Higashinaka Takashi Minato Kurima Sakai Tomo Funayama Hiromitsu Nishizaki and 1 more

To promote the research and development of an android robot with hospitality, we organized a dialogue competition in which task is to serve customer travel destination recommendation task. The acts as salesperson at agency needs help customers choose their desired tourist destinations. This paper describes setting, software distributed for competition, evaluation procedure, results preliminary final rounds competition.

10.1109/gcce56475.2022.10014410 article EN 2022 IEEE 11th Global Conference on Consumer Electronics (GCCE) 2022-10-18

End-to-end lightweight berry number prediction for supporting table grape cultivation

OPENALEX - Publications

Yan San Woo Prawit Buayai Hiromitsu Nishizaki Koji Makino Latifah Munirah Kamarudin and 1 more

The advent of smart agriculture has revolutionized and streamlined various manual tasks in grape cultivation, one which is berry thinning. This task necessitates experienced farmers to selectively remove a specific number berries from the working bunch, as guided by remaining bunch. In response, this paper introduces novel real-time edge computing application that automates process counting bunch using single 2D image. proposed employs YOLOv5-based object detection techniques (Jocher, 2021)...

10.1016/j.compag.2023.108203 article EN cc-by-nc-nd Computers and Electronics in Agriculture 2023-09-23

Overview of Dialogue Robot Competition 2023

OPENALEX - Publications

Takashi Minato Ryuichiro Higashinaka Kurima Sakai Tomo Funayama Hiromitsu Nishizaki and 1 more

We have held dialogue robot competitions in 2020 and 2022 to compare the performances of interactive robots using an android that closely resembles a human. In 2023, third competition DRC2023 was held. The task designed be more challenging than previous travel agent tasks. Since anyone can now develop system LLMs, participating teams are required effectively uses information about situation on spot (real-time information), which is not handled by ChatGPT other systems. has two rounds,...

10.48550/arxiv.2401.03547 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Construction of a Test Collection for Spoken Document Retrieval from Lecture Audio Data

OPENALEX - Publications

Tomoyosi Akiba Kiyoaki Aikawa Yoshiaki Itoh Tatsuya Kawahara Hiroaki Nanjo and 4 more

The lecture is one of the most valuable genres audiovisual data. Though spoken document processing a promising technology for utilizing in various ways, it difficult to evaluate because evaluation require subjective judgment and/or verification large quantities In this paper, test collection retrieval reported. consists target documents about 2, 700 lectures (604 hours) taken from Corpus Spontaneous Japanese (CSJ), 39 queries, relevant passages each query, and automatic transcription speech...

10.2197/ipsjjip.17.82 article EN Journal of Information Processing 2009-01-01

Japanese spoken term detection using syllable transition network derived from multiple speech recognizers' outputs

OPENALEX - Publications

Satoshi Natori Hiromitsu Nishizaki Yoshihiro Sekiguchi

10.21437/interspeech.2010-259 article EN Interspeech 2022 2010-09-26

Automatic Fluency Evaluation of Spontaneous Speech Using Disfluency-Based Features

OPENALEX - Publications

Huaijin Deng Youchao Lin Takehito Utsuro Akio Kobayashi Hiromitsu Nishizaki and 1 more

This paper describes an automatic fluency evaluation of spontaneous speech. Although we regularly observe a variety different disfluencies in speech, focus on two types phenomena, i.e., filled pauses and word fragments. aims to reveal that these have effects speech differently. To this end, conduct series SVM classification experiments the Japanese corpus. The experimental results show features derived from fragments are effective evaluating disfluent especially when combined with prosodic...

10.1109/icassp40776.2020.9053452 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020-04-09

End-to-End Inflorescence Measurement for Supporting Table Grape Trimming with Augmented Reality

OPENALEX - Publications

Prawit Buayai Kabin Yok-In Daisuke Inoue Chee Siang Leow Hiromitsu Nishizaki and 2 more

Inflorescence trimming is a crucial process to produce high-quality table grapes. It can eliminate nutrient competition in bunch and makes it less vulnerable disease development. After trimming, the remaining part of inflorescence should have target length decided by grape variety. This challenging for novice farmers because time constraint. The farmer needs finish before berries develop. paper proposes novel end-to-end measurement method supporting with augmented reality technology....

10.1109/cw52790.2021.00022 article EN 2021-09-01

Peer Collaborative Learning for Polyphonic Sound Event Detection

OPENALEX - Publications

Hayato Endo Hiromitsu Nishizaki

This paper describes how semi-supervised learning, called peer collaborative learning (PCL), can be applied to the polyphonic sound event detection (PSED) task, which is one of tasks in Detection and Classification Acoustic Scenes Events (DCASE) challenge. Many deep models have been studied determine what kind events occur where for long a given audio clip. The characteristic PCL used this combination ensemble-based knowledge distillation into sub-networks student-teacher model-based...

10.1109/icassp43922.2022.9746878 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

IV-AQMS: HTTP and MQTT Protocol from a Realistic Testbed

OPENALEX - Publications

Chew Cheik Goh E. Kanagaraj Latifah Munirah Kamarudin Ammar Zakaria Hiromitsu Nishizaki and 1 more

HTTP and MQTT messaging protocols widely support the Internet of things (IoT) application. These recent years in-vehicle air quality monitoring system has drawn public's attention. With assist Things technology, passengers will be updated with realtime through web page even in a mobile The customised was deployed on 40 shuttle buses, data have collected over year Universiti Malaysia Perlis (UniMAP). In this study, 11 buses were implemented protocol while rest 29 applied protocol. travelled...

10.1109/sensorsnano44414.2019.8940094 article EN 2019-07-01

Coming Soon ...