Fitsum A. Reda

ORCID: 0000-0003-3072-4109
Research Areas
  • Advanced Image Processing Techniques
  • Advanced Vision and Imaging
  • Medical Image Segmentation Techniques
  • Hearing Loss and Rehabilitation
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Neural Network Applications
  • Image and Signal Denoising Methods
  • Robotics and Sensor-Based Localization
  • Underwater Acoustics Research
  • Image Enhancement Techniques
  • Medical Imaging and Analysis
  • Video Coding and Compression Technologies
  • Ear Surgery and Otitis Media
  • Computer Graphics and Visualization Techniques
  • Image Processing Techniques and Applications
  • Retinal Imaging and Analysis
  • Human Pose and Action Recognition
  • Video Analysis and Summarization
  • Multimodal Machine Learning Applications
  • Dental Radiography and Imaging
  • Meningioma and schwannoma management
  • 3D Shape Modeling and Analysis
  • Facial Nerve Paralysis Treatment and Research
  • Nasal Surgery and Airway Studies
  • Craniofacial Disorders and Treatments

Google (United States)
2020-2023

META Health
2023

Meta (Israel)
2022

Nvidia (United States)
2017-2019

Nvidia (United Kingdom)
2019

Siemens Healthcare (United States)
2015

Vanderbilt University
2011-2014

Siemens (United States)
2014

St. John's Hospital
1991

Southern Illinois University School of Medicine
1991

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements...

10.1109/cvpr.2019.00906 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Given two images depicting a person and a garment worn by another person, our goal is to generate a visualization of how the garment might look on the input person. A key challenge is to synthesize a photorealistic detail-preserving visualization of the garment, while warping the garment to accommodate a significant body pose and shape change across the subjects. Previous methods either focus on garment detail preservation without effective pose and shape variation, or allow try-on with the desired shape and pose but lack garment details. In this paper, we propose a diffusion-based architecture that unifies two UNets...

10.1109/cvpr52729.2023.00447 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Objectives/Hypothesis: Minimally invasive image‐guided approach to cochlear implantation (CI) involves drilling a narrow, linear tunnel to the cochlea. Reported herein is the first clinical implementation of this approach. Study Design: Prospective cohort study. Methods: On preoperative computed tomography (CT), a safe trajectory through the facial recess targeting the scala tympani was planned. Intraoperatively, fiducial markers were bone‐implanted, a second CT was acquired, and the trajectory was transferred from the preoperative to the intraoperative CT. A...

10.1002/lary.24520 article EN The Laryngoscope 2013-11-24

Learning to synthesize high frame rate videos via interpolation requires large quantities of high frame rate training videos, which, however, are scarce, especially at high resolutions. Here, we propose unsupervised techniques to synthesize high frame rate videos directly from low frame rate videos using cycle consistency. For a triplet of consecutive frames, we optimize models to minimize the discrepancy between the center frame and its reconstruction, obtained by interpolating back from interpolated intermediate frames. This simple unsupervised constraint alone achieves results comparable with...

10.1109/iccv.2019.00098 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01
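
The cycle-consistency constraint described in the abstract above can be sketched as follows. This is a minimal illustration, not the paper's implementation: frames are flat lists of pixel intensities, and `midpoint` is a hypothetical stand-in (a per-pixel average) for a learned interpolation model. The function names are assumptions introduced here.

```python
def midpoint(frame_a, frame_b):
    """Hypothetical stand-in interpolator: per-pixel average of two frames."""
    return [(a + b) / 2.0 for a, b in zip(frame_a, frame_b)]


def cycle_consistency_loss(f0, f1, f2, interpolate=midpoint):
    """Mean absolute error between the center frame and its reconstruction.

    The center frame f1 is reconstructed by interpolating back from the two
    intermediate frames that were themselves interpolated from the triplet.
    """
    mid_01 = interpolate(f0, f1)          # estimated frame between f0 and f1
    mid_12 = interpolate(f1, f2)          # estimated frame between f1 and f2
    f1_rec = interpolate(mid_01, mid_12)  # interpolate back to the center time
    return sum(abs(x - y) for x, y in zip(f1, f1_rec)) / len(f1)


# Linearly changing intensities are reconstructed exactly by the averaging model.
linear_loss = cycle_consistency_loss([0.0, 0.0], [1.0, 2.0], [2.0, 4.0])
# A non-linear change leaves a residual that training would try to reduce.
nonlinear_loss = cycle_consistency_loss([0.0], [4.0], [0.0])
```

No ground-truth intermediate frame appears in the loss, which is why the constraint can be applied to low frame rate video alone.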

Physical AI needs to be trained digitally first. It needs a digital twin of itself, the policy model, and a digital twin of the world, the world model. In this paper, we present the Cosmos World Foundation Model Platform to help developers build customized world models for their Physical AI setups. We position a world foundation model as a general-purpose world model that can be fine-tuned into customized world models for downstream applications. Our platform covers a video curation pipeline, pre-trained world foundation models, examples of post-training of pre-trained world foundation models, and video tokenizers. To help Physical AI builders solve the most critical problems of our society,...

10.48550/arxiv.2501.03575 preprint EN arXiv (Cornell University) 2025-01-07

In this paper, we present a simple yet effective padding scheme that can be used as a drop-in module for existing convolutional neural networks. We call it partial convolution based padding, with the intuition that the padded region can be treated as holes and the original input as non-holes. Specifically, during the convolution operation, the convolution results are re-weighted near image borders based on the ratios between the padded area and the convolution sliding window area. Extensive experiments with various deep network models on ImageNet classification and semantic segmentation demonstrate...

10.48550/arxiv.1811.11718 preprint EN other-oa arXiv (Cornell University) 2018-01-01
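
The border re-weighting described in the abstract above can be illustrated with a small sketch. It assumes a square kernel of odd size, stride 1, and zero padding of `ksize // 2`, and it takes the ratio to be the full window area divided by the part of the window that lies inside the image. The function name and these conventions are assumptions made for illustration, not the paper's code.

```python
def border_reweight_ratios(height, width, ksize):
    """Per-position scale factors for zero-padded convolution outputs.

    Each factor is (window area) / (window cells inside the image), so
    interior positions get 1.0 and border positions get values above 1.0.
    """
    pad = ksize // 2
    ratios = []
    for i in range(height):
        row = []
        for j in range(width):
            # Count the window rows and columns that overlap real pixels.
            rows_in = min(i + pad, height - 1) - max(i - pad, 0) + 1
            cols_in = min(j + pad, width - 1) - max(j - pad, 0) + 1
            row.append((ksize * ksize) / (rows_in * cols_in))
        ratios.append(row)
    return ratios


ratios = border_reweight_ratios(4, 4, 3)
```

For a 3x3 kernel, a corner window overlaps 4 of its 9 cells with the image, an edge window overlaps 6, and an interior window overlaps all 9.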

Partial convolution weights convolutions with binary masks and renormalizes on valid pixels. It was originally proposed for the image inpainting task because a corrupted image processed by a standard convolution often leads to artifacts. Therefore, binary masks are constructed that define the valid and corrupted pixels, so that partial convolution results are only calculated based on valid pixels. It has been also used for the conditional image synthesis task, so that when a scene is generated, convolution results of an instance depend only on the feature values that belong to the same instance. One of the unexplored applications for partial convolution is padding which...

10.1109/tpami.2022.3209702 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-01-01

To test whether there are significant differences in pediatric and adult temporal bone anatomy as related to cochlear implant (CI) surgery.

10.1097/mao.0b013e318245cc9f article EN Otology & Neurotology 2012-02-29

A cochlear implant (CI) is a device that restores hearing using an electrode array that is surgically placed in the cochlea. After implantation, the CI is programmed to attempt to optimize hearing outcome. Currently, we are testing an image-guided CI programming (IGCIP) technique we recently developed that relies on knowledge of the relative position of intracochlear anatomy to implanted electrodes. IGCIP is enabled by a number of algorithms we developed that permit determining the positions of electrodes relative to intra-cochlear anatomy using a pre- and a post-implantation CT. One issue with this technique is that it...

10.1117/12.2043260 article EN Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE 2014-03-21

Purpose: Cochlear implant surgery is used to implant an electrode array in the cochlea to treat hearing loss. The authors recently introduced a minimally invasive image-guided technique termed percutaneous cochlear implantation. This approach achieves access to the cochlea by drilling a single linear channel from the outer skull into the cochlea via the facial recess, a region bounded by the facial nerve and chorda tympani. To exploit existing methods for computing automatically safe trajectories, the facial nerve and chorda tympani need to be segmented. The goal of this work is to segment...

10.1118/1.3634048 article EN Medical Physics 2011-09-22

Percutaneous cochlear implantation (PCI) is a minimally-invasive image-guided cochlear implant approach, where access to the cochlea is achieved by drilling a linear channel from the skull surface to the cochlea. The PCI approach requires pre- and intra-operative planning. Computation of a safe trajectory is performed in a preoperative CT. This trajectory is mapped to intraoperative space using the transformation matrix that registers the pre- and intra-operative CTs. However, the difference in orientation between the pre- and intra-operative CTs is too extreme to be recovered by standard, gradient descent-based...

10.1109/tbme.2012.2214775 article EN IEEE Transactions on Biomedical Engineering 2012-08-22

We propose an efficient neural network for RAW image denoising. Although neural network-based denoising has been extensively studied for image restoration, little attention has been given to efficient denoising for compute limited and power sensitive devices, such as smartphones and wearables. In this paper, we present a novel architecture and a suite of training techniques for high quality denoising in mobile devices. Our work is distinguished by three main contributions. (1) The Feature-Align layer that modulates the activations of an encoder-decoder architecture with the input noisy...

10.1109/wacvw54805.2022.00078 article EN 2022-01-01

Existing deep learning based image inpainting methods use a standard convolutional network over the corrupted image, using convolutional filter responses conditioned on both valid pixels as well as the substitute values in the masked holes (typically the mean value). This often leads to artifacts such as color discrepancy and blurriness. Post-processing is usually used to reduce such artifacts, but are expensive and may fail. We propose the use of partial convolutions, where the convolution is masked and renormalized to be conditioned on only valid pixels. We further include a mechanism...

10.48550/arxiv.1804.07723 preprint EN other-oa arXiv (Cornell University) 2018-01-01
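
The masking and renormalization described in the abstract above can be sketched for a single channel. This is a minimal pure-Python illustration under stated assumptions: no padding, stride 1, a binary mask where 1 marks a valid pixel, and an updated mask that marks an output valid whenever its window saw at least one valid input. The function name `partial_conv2d` is hypothetical.

```python
def partial_conv2d(image, mask, kernel, bias=0.0):
    """Single-channel partial convolution without padding (stride 1).

    Only valid pixels (mask == 1) contribute to each window sum, and the
    sum is rescaled by (window area) / (number of valid pixels).
    Returns the output feature map and the updated binary mask.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    window_area = kh * kw
    out = [[0.0] * out_w for _ in range(out_h)]
    new_mask = [[0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            valid = 0
            acc = 0.0
            for di in range(kh):
                for dj in range(kw):
                    m = mask[i + di][j + dj]
                    valid += m
                    acc += kernel[di][dj] * image[i + di][j + dj] * m
            if valid > 0:
                # Renormalize so holes do not shrink the response.
                out[i][j] = acc * (window_area / valid) + bias
                new_mask[i][j] = 1
    return out, new_mask


ones = [[1.0] * 3 for _ in range(3)]
image = [[2.0] * 3 for _ in range(3)]
hole_mask = [[1, 1, 1], [1, 0, 1], [1, 1, 1]]   # center pixel is a hole
full_mask = [[1] * 3 for _ in range(3)]
empty_mask = [[0] * 3 for _ in range(3)]

out_hole, mask_hole = partial_conv2d(image, hole_mask, ones)
out_full, mask_full = partial_conv2d(image, full_mask, ones)
out_empty, mask_empty = partial_conv2d(image, empty_mask, ones)
```

On this constant image the response with a hole matches the response without one, which shows the effect of the renormalization; a window with no valid pixels yields zero and stays masked.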

Objective: Minimally invasive image‐guided cochlear implantation (CI) involves accessing the cochlea via a linear path from the lateral skull to the cochlea, avoiding vital structures including the facial nerve. Herein, we describe and demonstrate the feasibility of the technique for pediatric patients. Study Design: Prospective. Setting: Children's Hospital. Subjects and Methods: Thirteen pediatric patients (1.5 to 8 years) undergoing traditional CI participated in this Institutional Review Board–approved study. Three fiducial markers were...

10.1177/0194599813519050 article EN Otolaryngology 2014-01-21

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements...

10.48550/arxiv.1812.01593 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Conventional CNNs for texture synthesis consist of a sequence of (de)-convolution and up/down-sampling layers, where each layer operates locally and lacks the ability to capture the long-term structural dependency required by texture synthesis. Thus, they often simply enlarge the input texture, rather than perform reasonable synthesis. As a compromise, many recent methods sacrifice generalizability by training and testing on the same single (or fixed set of) texture image(s), resulting in huge re-training time costs for unseen images. In this work,...

10.48550/arxiv.2007.07243 preprint EN other-oa arXiv (Cornell University) 2020-01-01

In video transmission applications, video signals are transmitted over lossy channels, resulting in low-quality received signals. To restore videos on recipient edge devices in real-time, we introduce an efficient video restoration network, EVRNet. EVRNet efficiently allocates parameters inside the network using alignment, differential, and fusion modules. With extensive experiments on different video restoration tasks (deblocking, denoising, and super-resolution), we demonstrate that EVRNet delivers competitive performance to existing...

10.1145/3474085.3475477 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

A cochlear implant (CI) is a neural prosthetic device that restores hearing by directly stimulating the auditory nerve with an electrode array. In CI surgery, the surgeon threads the electrode array into the cochlea, blind to internal structures. We have recently developed algorithms for determining the position of CI electrodes relative to intra-cochlear anatomy using pre- and post-implantation CT. We are currently using this approach to develop a CI programming assistance system that uses knowledge of electrode position to determine a patient-customized sound...

10.1117/12.2008098 article EN Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE 2013-03-08

McRackan, Theodore R.; Carlson, Matthew L.; Reda, Fitsum A.; Noble, Jack H.; Rivas, Alejandro

10.1097/mao.0000000000000274 article EN Otology & Neurotology 2014-04-23

Percutaneous cochlear implantation (PCI) is a minimally invasive image-guided cochlear implant approach, where access to the cochlea is achieved by drilling a linear channel from the outer skull to the cochlea. The PCI approach requires pre- and intra-operative planning. Segmentation of critical ear anatomy and computation of a safe trajectory are performed in a pre-operative CT. The computed trajectory must then be mapped to the intraoperative space. The mapping can be done using the transformation matrix that registers the pre- and intra-operative CTs. However, the difference in orientation...

10.1117/12.911803 article EN Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE 2012-02-13