NFDI4DS | UHH-SEMS - Publication Details

Ryosuke Furuta

ORCID: 0000-0003-1441-889X

About

Contact & Profiles

OpenAlex author ID: A5091227949

Research Areas

Human Pose and Action Recognition
Multimodal Machine Learning Applications
Advanced Neural Network Applications
Video Analysis and Summarization
Hand Gesture Recognition Systems
Image Enhancement Techniques
Advanced Image and Video Retrieval Techniques
Advanced Vision and Imaging
Visual Attention and Saliency Detection
Robot Manipulation and Learning
Generative Adversarial Networks and Image Synthesis
Railway Systems and Energy Efficiency
Domain Adaptation and Few-Shot Learning
Electrical Contact Performance and Analysis
Text and Document Classification Technologies
Engineering Applied Research
Video Surveillance and Tracking Methods
Anomaly Detection Techniques and Applications
Computer Graphics and Visualization Techniques
Image Retrieval and Classification Techniques
Topic Modeling
Innovation in Digital Healthcare Systems
CCD and CMOS Imaging Sensors
Electric and Hybrid Vehicle Technologies
Robotics and Sensor-Based Localization

The University of Tokyo
2015-2025

Tokyo University of Science
2019

Tokyo University of Information Sciences
2015-2018

East Japan Railway (Japan)
2005-2014

Aims Community College
2005

University of Washington
1983

Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation

OPENALEX - Publications

Naoto Inoue, Ryosuke Furuta, Toshihiko Yamasaki, Kiyoharu Aizawa

Can we detect common objects in a variety of image domains without instance-level annotations? In this paper, we present a framework for a novel task, cross-domain weakly supervised object detection, which addresses this question. For this paper, we have access to images with instance-level annotations in a source domain (e.g., natural image) and images with image-level annotations in a target domain (e.g., watercolor). In addition, the classes to be detected in the target domain are all or a subset of those in the source domain. Starting from a fully supervised object detector, which is pre-trained on the source domain, we propose a two-step progressive domain adaptation...

10.1109/cvpr.2018.00525 preprint EN 2018-06-01

PixelRL: Fully Convolutional Network With Reinforcement Learning for Image Processing

OPENALEX - Publications

Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki

This article tackles a new problem setting: reinforcement learning with pixel-wise rewards (pixelRL) for image processing. After the introduction of the deep Q-network, deep RL has been achieving great success. However, the applications of deep reinforcement learning (RL) to image processing are still limited. Therefore, we extend deep RL to pixelRL for various image processing applications. In pixelRL, each pixel has an agent, and the agent changes the pixel value by taking an action. We also propose an effective learning method for pixelRL that significantly improves the performance by considering not only the future states...

10.1109/tmm.2019.2960636 article EN IEEE Transactions on Multimedia 2019-12-18

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

OPENALEX - Publications

Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, and 95 more

10.1109/cvpr52733.2024.01834 article EN IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Catenary and storage battery hybrid system for electric railcar series EV-E301

OPENALEX - Publications

Yohei Kono, Naoki Shiraki, Hiroshi Yokoyama, Ryosuke Furuta

East Japan Railway Company (JR East) has developed a catenary and storage battery hybrid train system using a test car, for the purpose of through operation service between electrified and non-electrified sections and of decreasing the environmental impact of operating diesel trains. We will develop the commercial Series EV-E301, to which this system is applied, with a start in March 2014. We introduce its technical items in this paper.

10.1109/ipec.2014.6869881 article EN 2014-05-01

Fully Convolutional Network with Multi-Step Reinforcement Learning for Image Processing

OPENALEX - Publications

Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki

This paper tackles a new problem setting: reinforcement learning with pixel-wise rewards (pixelRL) for image processing. After the introduction of the deep Q-network, deep RL has been achieving great success. However, the applications of deep RL to image processing are still limited. Therefore, we extend deep RL to pixelRL for various image processing applications. In pixelRL, each pixel has an agent, and the agent changes the pixel value by taking an action. We also propose an effective learning method for pixelRL that significantly improves the performance by considering not only the future states of its own pixel but...

10.1609/aaai.v33i01.33013598 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey

OPENALEX - Publications

Takehiko Ohkawa, Ryosuke Furuta, Yoichi Sato

In this survey, we present a systematic review of 3D hand pose estimation from the perspective of efficient annotation and learning. 3D hand pose estimation has been an important research area owing to its potential to enable various applications, such as video understanding, AR/VR, and robotics. However, the performance of models is tied to the quality and quantity of annotated 3D hand poses. Under the status quo, acquiring such annotated 3D hand poses is challenging, e.g., due to the difficulty of 3D annotation and the presence of occlusion. To reveal this problem, we review the pros and cons of existing annotation methods classified as manual,...

10.1007/s11263-023-01856-0 article EN cc-by International Journal of Computer Vision 2023-08-07

Detection of Door-Closing Defects by Learning from Physics-Based Simulations

OPENALEX - Publications

Ryota Takahashi, Yota Yamamoto, Ryosuke Furuta, Yukinobu Taniguchi

10.5220/0013148200003912 article EN Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications 2025-01-01

Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos

OPENALEX - Publications

Takehiko Ohkawa, Takuma Yagi, Taichi Nishimura, Ryosuke Furuta, Atsushi Hashimoto, and 2 more

10.1109/wacv61041.2025.00807 article EN IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

Learning Multiple Object States from Actions via Large Language Models

OPENALEX - Publications

Masatoshi Tateno, Takuma Yagi, Ryosuke Furuta, Yoichi Sato

10.1109/wacv61041.2025.00925 article EN IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

Prediction of User Ratings of Oral Presentations using Label Relations

OPENALEX - Publications

Toshihiko Yamasaki, Yusuke Fukushima, Ryosuke Furuta, Litian Sun, Kiyoharu Aizawa, and 1 more

Predicting the users' impressions on a video talk is an important step for recommendation tasks. We propose a method to accurately predict multiple impression-related user ratings for a given video talk. Our proposal considers (a) multimodal features including linguistic as well as acoustic features, (b) correlations between different user ratings (labels), and (c) correlations between different feature types. In particular, the proposed method models both label and feature correlations within a single Markov random field (MRF), and jointly optimizes the label assignment problem to obtain a consistent set...

10.1145/2813524.2813533 article EN 2015-10-30

Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos

OPENALEX - Publications

Zecheng Yu, Yifei Huang, Ryosuke Furuta, Takuma Yagi, Yusuke Goutsu, and 1 more

Object affordance is an important concept in hand-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property, thus benefiting tasks such as action anticipation and robot imitation learning. However, the definitions of affordance in existing datasets often 1) mix up affordance with object functionality; 2) confuse affordance with goal-related action; and 3) ignore human motor capacity. This paper proposes an efficient annotation scheme to address these issues by combining goal-irrelevant actions...

10.1109/wacv56688.2023.00219 article EN IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023-01-01

Fast Volume Seam Carving With Multipass Dynamic Programming

OPENALEX - Publications

Ryosuke Furuta, Ikuko Tsubaki, Toshihiko Yamasaki

In volume seam carving, i.e., seam carving for a 3D cost volume, an optimal seam surface can be derived by graph cuts, resulting from sophisticated graph construction. To date, the graph-cut algorithm is the only solution for volume seam carving. However, it is not suitable for practical use because it incurs a heavy computational load. We propose a multipass dynamic programming (DP)-based approach for volume seam carving, which reduces computation time and memory consumption while maintaining a similar image quality as that of graph cuts. Our multipass DP scheme is achieved by conducting...

10.1109/tcsvt.2016.2620563 article EN IEEE Transactions on Circuits and Systems for Video Technology 2016-11-23

Hybrid Traction Technologies with Energy Storage Devices for Nonelectrified Railway Lines

OPENALEX - Publications

Ryosuke Furuta, J. Kawasaki, Keiichiro Kondo

Hybrid traction systems of lithium‐ion batteries with fuel cells or diesel engines are one of the effective measures to reduce the emissions of railcars for nonelectrified lines. Some challenges, including revenue services, of hybrid railcars have been recently addressed in Japan. In this paper, the evolution and features of fuel cell and diesel hybrid traction systems are discussed with examples of these challenges. In addition, the technical trend in railway applications of energy storage is summarized with perspectives on future hybrid traction systems. Copyright © 2010 Institute of Electrical Engineers of Japan. Published by...

10.1002/tee.20532 article EN IEEJ Transactions on Electrical and Electronic Engineering 2010-04-20

Fooling Neural Networks in Face Attractiveness Evaluation: Adversarial Examples with High Attractiveness Score But Low Subjective Score

OPENALEX - Publications

Sijie Shen, Ryosuke Furuta, Toshihiko Yamasaki, Kiyoharu Aizawa

People are fond of taking and sharing photos in their social life, and a large part of it is face images, especially selfies. A lot of researchers are interested in analyzing the attractiveness of face images. Benefiting from deep neural networks (DNNs) and training data, researchers have been developing deep learning models that can evaluate the facial attractiveness of photos. However, recent development on DNNs showed that they could be easily fooled even when they are trained on a large dataset. In this paper, we used two approaches to generate adversarial examples that have high attractiveness scores but...

10.1109/bigmm.2017.64 article EN 2017-04-01

Simultaneous control of head pose and expressions in 3D facial keypoint-based GAN

OPENALEX - Publications

Tomoyuki Hatakeyama, Ryosuke Furuta, Yoichi Sato

In this work, we present a novel method for simultaneously controlling the head pose and the facial expressions of a given input image using a 3D keypoint-based GAN. Existing methods for simultaneous control of the head pose and expressions are not suitable for real images, or they generate unnatural results because it is not trivial to capture head pose (large changes) and expressions (small changes) simultaneously. In this work, we achieve simultaneous control by introducing 3D facial keypoints for GAN-based facial image synthesis, unlike the existing 2D landmark-based approach. As a result, our method can handle both large variations due...

10.1007/s11042-024-18449-9 article EN cc-by Multimedia Tools and Applications 2024-03-01

Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild

OPENALEX - Publications

Lin Nie, Takehiko Ohkawa, Mingfang Zhang, Yifei Huang, Ryosuke Furuta, and 1 more

We present a contrastive learning framework based on in-the-wild hand images tailored for pre-training 3D hand pose estimators, dubbed HandCLR. Pre-training on large-scale images achieves promising results in various tasks, but prior 3D hand pose pre-training methods have not fully utilized the potential of diverse hand images accessible from in-the-wild videos. To facilitate scalable pre-training, we first prepare an extensive pool of hand images from in-the-wild videos and design our method with contrastive learning. Specifically, we collected over 2.0M hand images from recent human-centric videos, such as 100DOH...

10.48550/arxiv.2409.09714 preprint EN arXiv (Cornell University) 2024-09-15

Efficient and interactive spatial-semantic image retrieval

OPENALEX - Publications

Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki

This paper proposes an efficient image retrieval system. When users wish to retrieve images with semantic and spatial constraints (e.g., a horse is located at the center of the image, and a person is riding on the horse), it is difficult for conventional text-based retrieval systems to retrieve such images exactly. In contrast, the proposed system can consider both semantic and spatial information, because it is based on semantic segmentation using fully convolutional networks (FCN). The proposed system can accept three types of images as queries: a segmentation map sketched by the user, a natural image, or a combination of the two. The distance...

10.1007/s11042-018-7148-1 article EN cc-by Multimedia Tools and Applications 2019-02-01

Fast volume seam carving with multi-pass dynamic programming

OPENALEX - Publications

Ryosuke Furuta, Ikuko Tsubaki, Toshihiko Yamasaki

In volume seam carving, which is seam carving for a three-dimensional (3D) cost volume, an optimal seam surface can be derived by graph cuts, resulting from sophisticated graph construction. However, the graph cuts algorithm is not suitable for practical use because it incurs a heavy computational load. We propose a multi-pass dynamic programming (DP) based approach for volume seam carving that reduces computation time to 60 times faster and memory consumption to 10 times smaller than those of graph cuts, while maintaining a similar image quality as that of graph cuts. In our multi-pass DP, a suboptimal...

10.1109/icip.2016.7532672 article EN IEEE International Conference on Image Processing (ICIP) 2016-08-17

Object detection refinement using Markov random field based pruning and learning based rescoring

OPENALEX - Publications

Naoto Inoue, Ryosuke Furuta, Toshihiko Yamasaki, Kiyoharu Aizawa

Contextual information such as the co-occurrence of objects and their location has played an important role in object detection. We present candidate pruning and rescoring methods that leverage contextual information and can improve state-of-the-art CNN-based object detection methods such as Fast R-CNN and Faster R-CNN. In our method, we formulate candidate reduction as a Markov random field optimization problem. We employ a machine learning technique to reconsider the scores of candidate windows. We experimentally demonstrate improvements in R-CNN-based object detection using two datasets. Moreover,...

10.1109/icassp.2017.7952437 article EN 2017-03-01

Painting Style-Aware Manga Colorization Based On Generative Adversarial Networks

OPENALEX - Publications

Yugo Shimizu, Ryosuke Furuta, Delong Ouyang, Yukinobu Taniguchi, Ryota Hinami, and 1 more

Japanese comics (called manga) are traditionally created in monochrome format. In recent years, in addition to monochrome comics, full color comics, a more attractive medium, have appeared. Unfortunately, color comics require manual colorization, which incurs high labor costs. Although automatic colorization methods have been recently proposed, most of them are designed for illustrations, not for comics. Unlike illustrations, since comics are composed of many consecutive images, the painting style must be consistent. To realize consistent colorization, we propose here...

10.1109/icip42928.2021.9506254 article EN IEEE International Conference on Image Processing (ICIP) 2021-08-23

Coarse-to-fine strategy for efficient cost-volume filtering

OPENALEX - Publications

Ryosuke Furuta, Satoshi Ikehata, Toshihiko Yamasaki, Kiyoharu Aizawa

Cost-volume filtering is one of the most widely known techniques to solve general multi-label problems; however, it is problematically inefficient when the label space size is extremely large. This paper presents a coarse-to-fine strategy for cost-volume filtering that efficiently and accurately handles multi-label problems with a large label space size. Based upon the observation that true labels at the same image coordinate of different scales are highly correlated, we truncate unimportant labels for cost-volume filtering by leveraging the labeling output of lower scales. Experimental...

10.1109/icip.2014.7025770 article EN IEEE International Conference on Image Processing (ICIP) 2014-10-01
