Fereshteh Sadeghi

ORCID: 0000-0003-4058-5261
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Multimodal Machine Learning Applications
  • Advanced Vision and Imaging
  • Advanced Image and Video Retrieval Techniques
  • Human Pose and Action Recognition
  • Reinforcement Learning in Robotics
  • Robotics and Sensor-Based Localization
  • Domain Adaptation and Few-Shot Learning
  • Video Analysis and Summarization
  • Robot Manipulation and Learning
  • Natural Language Processing Techniques
  • Robotic Locomotion and Control
  • Topic Modeling
  • Image and Object Detection Techniques
  • Image Processing Techniques and Applications
  • Muscle activation and electromyography studies
  • Video Surveillance and Tracking Methods
  • Data Visualization and Analytics
  • Image Processing and 3D Reconstruction
  • vaccines and immunoinformatics approaches
  • Cell Image Analysis Techniques
  • Influenza Virus Research Studies
  • Robotics and Automated Systems
  • Artificial Immune Systems Applications
  • Image Retrieval and Classification Techniques
  • Robotic Path Planning Algorithms

Tehran University of Medical Sciences
2024-2025

Iranshahr University
2021-2025

Google (United States)
2024

DeepMind (United Kingdom)
2023-2024

Google (United Kingdom)
2024

University College London
2023

Mashhad University of Medical Sciences
2021

University of Washington
2014-2019

Seattle University
2015

Walt Disney (United States)
2015

Deep reinforcement learning has emerged as a promising and powerful technique for automatically acquiring control policies that can process raw sensory inputs, such images, perform complex behaviors. However, extending deep RL to real-world robotic tasks proven challenging, particularly in safety-critical domains autonomous flight, where trial-and-error is often impractical. In this paper, we explore the following question: train vision-based navigation entirely simulation, then transfer...

10.15607/rss.2017.xiii.034 article EN 2017-07-12

We investigated whether deep reinforcement learning (deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies. used RL train play simplified one-versus-one soccer game. The resulting agent exhibits robust dynamic skills, such as rapid fall recovery, walking, turning, kicking, it transitions between them in smooth efficient manner. It also learned anticipate ball movements block...

10.1126/scirobotics.adi8022 article EN Science Robotics 2024-04-10

Humans are remarkably proficient at controlling their limbs and tools from a wide range of viewpoints. In robotics, this ability is referred to as visual servoing: moving tool or end-point desired location using primarily feedback. paper, we propose learning viewpoint invariant servoing skills in robot manipulation task. We train deep recurrent controller that can automatically determine which actions move the end-effector robotic arm object. This problem fundamentally ambiguous: under...

10.1109/cvpr.2018.00493 article EN 2018-06-01

How can we know whether a statement about our world is valid. For example, given relationship between pair of entities e.g., `eat(horse, hay)', how this true or false in general. Gathering such knowledge and their relationships one the fundamental challenges extraction. Most previous works on extraction have focused purely text-driven reasoning for verifying relation phrases. In work, introduce problem visual verification phrases developed Visual Knowledge Extraction system called VisKE....

10.1109/cvpr.2015.7298752 article EN 2015-06-01

Large-scale recognition problems with thousands of classes pose a particular challenge because applying the classifier requires more computation as number grows. The label tree model integrates classification traversal so that complexity grows logarithmically. In this paper, we show how parameters can be found using maximum likelihood estimation. This new probabilistic learning technique produces significantly improved accuracy.

10.1109/cvpr.2013.114 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2013-06-01

We present a system for applying sim2real approaches to "in the wild" scenes with realistic visuals, and policies which rely on active perception using RGB cameras. Given short video of static scene collected generic phone, we learn scene's contact geometry function novel view synthesis Neural Radiance Field (NeRF). augment NeRF rendering by overlaying other dynamic objects (e.g. robot's own body, ball). A simulation is then created engine in physics simulator computes dynamics from...

10.1109/icra48891.2023.10161544 article EN 2023-05-29

Humans are remarkably proficient at controlling their limbs and tools from a wide range of viewpoints angles, even in the presence optical distortions. In robotics, this ability is referred to as visual servoing: moving tool or end-point desired location using primarily feedback. paper, we study how viewpoint-invariant servoing skills can be learned automatically robotic manipulation scenario. To end, train deep recurrent controller that determine which actions move arm object. The problem...

10.48550/arxiv.1712.07642 preprint EN other-oa arXiv (Cornell University) 2017-01-01

We present MuSHR, the Multi-agent System for non-Holonomic Racing. MuSHR is a low-cost, open-source robotic racecar platform education and research, developed by Personal Robotics Lab in Paul G. Allen School of Computer Science & Engineering at University Washington. aspires to contribute towards democratizing field robotics as low-cost that can be built deployed following detailed, open documentation do-it-yourself tutorials. A set demos lab assignments Mobile Robots course Washington...

10.48550/arxiv.1908.08031 preprint EN other-oa arXiv (Cornell University) 2019-01-01

A scene category imposes tight distributions over the kind of objects that might appear in scene, appearance those and their layout. In this paper, we propose a method to learn structures can encode three main interlacing components scene: category, context-specific objects, Our experimental evaluations show our learned outperform state-of-the-art Deformable Part Models detecting scene. structure provides level understanding is amenable deep visual inferences. The also generate features...

10.1109/cvpr.2014.37 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2014-06-01

Train in SimulationFig. 1: Domain Invariant Visual Servoing (DIVIS) learns collision-free goal reaching entirely simulation using dense multi-step rollouts and a recurrent fully convolutional neural network (bottom).DIViS can be directly deployed on real physical robots with RGB cameras for servoing to visually indicated goals as well semantic object categories (top).Abstract-Robots should understand both semantics physics functional the world.While robot platforms provide means interacting...

10.15607/rss.2019.xv.055 article EN 2019-06-22

We investigate the use of prior knowledge human and animal movement to learn reusable locomotion skills for real legged robots. Our approach builds upon previous work on imitating or dog Motion Capture (MoCap) data a skill module. Once learned, this module can be reused complex downstream tasks. Importantly, due imposed by MoCap data, our does not require extensive reward engineering produce sensible natural looking behavior at time reuse. This makes it easy create well-regularized,...

10.48550/arxiv.2203.17138 preprint EN cc-by arXiv (Cornell University) 2022-01-01

We introduce Segment-Phrase Table (SPT), a large collection of bijective associations between textual phrases and their corresponding segmentations. Leveraging recent progress in object recognition natural language semantics, we show how can successfully build high-quality segment-phrase table using minimal human supervision. More importantly, demonstrate the unique value unleashed by this rich bimodal resource, for both vision as well understanding. First, that fine-grained labels...

10.1109/iccv.2015.10 article EN 2015-12-01

We propose the problem of automated photo album creation from an unordered image collection. The is difficult as it involves a number complex perceptual tasks that facilitate selection and ordering photos to create compelling visual narrative. To help solve this problem, we collect (and will make available) new benchmark dataset based on Flickr images. Album Dataset provides variety annotations useful for task, including manually created albums various lengths. analyze provide experimental...

10.1109/wacv.2015.74 article EN IEEE Winter Conference on Applications of Computer Vision 2015-01-01

In this paper, we study the problem of answering visual analogy questions. These questions take form image A is to B as C what. Answering these entails discovering mapping from and then extending searching for D such that relation holds D. We pose learning an embedding encourages pairs analogous images with similar transformations be close together using convolutional neural networks a quadruple Siamese architecture. introduce dataset in natural images, show first results its kind on solving images.

10.48550/arxiv.1510.08973 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Generalized Hough Transform (GHT) is an efficient method for detecting curves by exploiting the duality between points on a curve and parameters of that curve. However GHT has some practical limitations such as high computational cost huge memory requirement scaled rotated objects. In this paper new method, namely fuzzy generalized transform (FGHT), proposed alleviates these deficiencies utilizing concept inference system. FGHT R-table consists set rules which are fired gradient direction...

10.1109/fuzzy.2009.5277217 article EN IEEE International Conference on Fuzzy Systems 2009-08-01

Steganography is a branch in the information hiding research area which aims to conceal data transmission between two parties. In this paper new method based on predictive coding proposed employs Quantization Index Modulation (QIM) for quantizing error values and embedding simultaneously. Furthermore, correction mechanism preserve histogram of cover image make it resistant against histogram-based attacks. To evaluate performance method, several experiments gray-level images are carried out...

10.1109/socpar.2009.55 article EN 2009-01-01

Objectives: Considering the low rate of exclusive breastfeeding in mothers using drugs and role behavioral intention as one effective factors on breastfeeding, present study aimed to determine impact training based theory planned behavior continuation drug-dependent mothers. Materials Methods: This clinical trial was performed three hospitals Mashhad 2018. To this end, 52 drug-abusing were randomly divided into experimental control groups. The intervention protocol consisted four separate...

10.15296/ijwhr.2023.46 article EN International Journal of Women s Health and Reproduction Sciences 2021-05-06
Coming Soon ...