- Advanced Image Processing Techniques
- Advanced Vision and Imaging
- Medical Image Segmentation Techniques
- Hearing Loss and Rehabilitation
- Generative Adversarial Networks and Image Synthesis
- Advanced Neural Network Applications
- Image and Signal Denoising Methods
- Robotics and Sensor-Based Localization
- Underwater Acoustics Research
- Image Enhancement Techniques
- Medical Imaging and Analysis
- Video Coding and Compression Technologies
- Ear Surgery and Otitis Media
- Computer Graphics and Visualization Techniques
- Image Processing Techniques and Applications
- Retinal Imaging and Analysis
- Human Pose and Action Recognition
- Video Analysis and Summarization
- Multimodal Machine Learning Applications
- Dental Radiography and Imaging
- Meningioma and schwannoma management
- 3D Shape Modeling and Analysis
- Facial Nerve Paralysis Treatment and Research
- Nasal Surgery and Airway Studies
- Craniofacial Disorders and Treatments
Google (United States)
2020-2023
META Health
2023
Meta (Israel)
2022
Nvidia (United States)
2017-2019
Nvidia (United Kingdom)
2019
Siemens Healthcare (United States)
2015
Vanderbilt University
2011-2014
Siemens (United States)
2014
St. John's Hospital
1991
Southern Illinois University School of Medicine
1991
Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology scale up training sets by synthesizing new samples in order improve the accuracy semantic networks. We exploit prediction models' ability predict future frames also labels. A joint propagation strategy is proposed alleviate mis-alignments synthesized samples. demonstrate that models on datasets augmented leads significant improvements...
Given two images depicting a person and garment worn by another person, our goal is to generate visualization of how the might look on input person. A key challenge synthesize photorealistic detail-preserving garment, while warping accommodate significant body pose shape change across subjects. Previous methods either focus detail preservation without effective variation, or allow tryon with desired but lack details. In this paper, we propose diffusion-based architecture that unifies UN ets...
Objectives/Hypothesis Minimally invasive image‐guided approach to cochlear implantation (CI) involves drilling a narrow, linear tunnel the cochlea. Reported herein is first clinical implementation of this approach. Study Design Prospective cohort study. Methods On preoperative computed tomography (CT), safe trajectory through facial recess targeting scala tympani was planned. Intraoperatively, fiducial markers were bone‐implanted, second CT acquired, and transferred from intraoperative CT. A...
Learning to synthesize high frame rate videos via interpolation requires large quantities of training videos, which, however, are scarce, especially at resolutions. Here, we propose unsupervised techniques directly from low using cycle consistency. For a triplet consecutive frames, optimize models minimize the discrepancy between center and its reconstruction, obtained by interpolating back interpolated intermediate frames. This simple constraint alone achieves results comparable with...
Physical AI needs to be trained digitally first. It a digital twin of itself, the policy model, and world, world model. In this paper, we present Cosmos World Foundation Model Platform help developers build customized models for their setups. We position foundation model as general-purpose that can fine-tuned into downstream applications. Our platform covers video curation pipeline, pre-trained models, examples post-training tokenizers. To builders solve most critical problems our society,...
In this paper, we present a simple yet effective padding scheme that can be used as drop-in module for existing convolutional neural networks. We call it partial convolution based padding, with the intuition padded region treated holes and original input non-holes. Specifically, during operation, results are re-weighted near image borders on ratios between area sliding window area. Extensive experiments various deep network models ImageNet classification semantic segmentation demonstrate...
Partial convolution weights convolutions with binary masks and renormalizes on valid pixels. It was originally proposed for image inpainting task because a corrupted processed by standard convolutional often leads to artifacts. Therefore, are constructed that define the pixels, so partial results only calculated based has been also used conditional synthesis task, when scene is generated, of an instance depend feature values belong same instance. One unexplored applications padding which...
To test whether there are significant differences in pediatric and adult temporal bone anatomy as related to cochlear implant (CI) surgery.
A cochlear implant (CI) is a device that restores hearing using an electrode array surgically placed in the cochlea. After implantation, CI programmed to attempt optimize outcome. Currently, we are testing imageguided programming (IGCIP) technique recently developed relies on knowledge of relative position intracochlear anatomy implanted electrodes. IGCIP enabled by number algorithms permit determining positions electrodes intra-cochlear pre- and post-implantation CT. One issue with this it...
Purpose: Cochlear implant surgery is used to an electrode array in the cochlea treat hearing loss.The authors recently introduced a minimally invasive image-guided technique termed percutaneous cochlear implantation.This approach achieves access by drilling single linear channel from outer skull into via facial recess, region bounded nerve and chorda tympani.To exploit existing methods for computing automatically safe trajectories, tympani need be segmented.The goal of this work segment...
Percutaneous cochlear implantation (PCI) is a minimally-invasive image-guided implant approach, where access to the cochlea achieved by drilling linear channel from skull surface cochlea. The PCI approach requires pre- and intra-operative planning. Computation of safe trajectory performed in preoperative CT. This mapped intraoperative space using transformation matrix that registers CTs. However, difference orientation between CTs too extreme be recovered standard, gradient descent-based...
We propose an efficient neural network for RAW image denoising. Although network-based denoising has been extensively studied restoration, little attention given to compute limited and power sensitive devices, such as smartphones wearables. In this paper, we present a novel architecture suite of training techniques high quality in mobile devices. Our work is distinguished by three main contributions. (1) The Feature-Align layer that modulates the activations encoder-decoder with input noisy...
Existing deep learning based image inpainting methods use a standard convolutional network over the corrupted image, using filter responses conditioned on both valid pixels as well substitute values in masked holes (typically mean value). This often leads to artifacts such color discrepancy and blurriness. Post-processing is usually used reduce artifacts, but are expensive may fail. We propose of partial convolutions, where convolution renormalized be only pixels. further include mechanism...
Objective Minimally invasive image‐guided cochlear implantation (CI) involves accessing the cochlea via a linear path from lateral skull to avoiding vital structures including facial nerve. Herein, we describe and demonstrate feasibility of technique for pediatric patients. Study Design Prospective. Setting Children's Hospital. Subjects Methods Thirteen patients (1.5 8 years) undergoing traditional CI participated in this Institutional Review Board–approved study. Three fiducial markers were...
Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology scale up training sets by synthesizing new samples in order improve the accuracy semantic networks. We exploit prediction models' ability predict future frames also labels. A joint propagation strategy is proposed alleviate mis-alignments synthesized samples. demonstrate that models on datasets augmented leads significant improvements...
Conventional CNNs for texture synthesis consist of a sequence (de)-convolution and up/down-sampling layers, where each layer operates locally lacks the ability to capture long-term structural dependency required by synthesis. Thus, they often simply enlarge input texture, rather than perform reasonable As compromise, many recent methods sacrifice generalizability training testing on same single (or fixed set of) image(s), resulting in huge re-training time costs unseen images. In this work,...
In video transmission applications, signals are transmitted over lossy channels, resulting in low-quality received signals. To re- store videos on recipient edge devices real-time, we introduce an efficient restoration network, EVRNet. EVRNet efficiently allocates parameters inside the network using alignment, differential, and fusion modules. With extensive experiments different tasks (deblocking, denoising, super-resolution), demonstrate that delivers competitive performance to existing...
A cochlear implant (CI) is a neural prosthetic device that restores hearing by directly stimulating the auditory nerve with an electrode array. In CI surgery, surgeon threads array into cochlea, blind to internal structures. We have recently developed algorithms for determining position of electrodes relative intra-cochlear anatomy using pre- and post-implantation CT. are currently this approach develop programming assistance system uses knowledge determine patient-customized sound...
McRackan, Theodore R.; Carlson, Matthew L.; Reda, Fitsum A.; Noble, Jack H.; Rivas, Alejandro Author Information
Percutaneous cochlear implantation (PCI) is a minimally invasive image-guided implant approach, where access to the cochlea achieved by drilling linear channel from outer skull cochlea. The PCI approach requires pre- and intra-operative planning. Segmentation of critical ear anatomy computation safe trajectory are performed in pre-operative CT. computed must then be mapped intraoperative space. mapping can done using transformation matrix that registers CTs. However, difference orientation...