NFDI4DS | UHH-SEMS - Publication Details

Simon See

ORCID: 0000-0002-4958-9237

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5077539496

Research Areas

Distributed and Parallel Computing Systems
Topic Modeling
Parallel Computing and Optimization Techniques
Advanced Neural Network Applications
Human Pose and Action Recognition
Natural Language Processing Techniques
Cloud Computing and Resource Management
Advanced Data Storage Technologies
Domain Adaptation and Few-Shot Learning
Multimodal Machine Learning Applications
Scientific Computing and Data Management
Video Surveillance and Tracking Methods
Advanced Vision and Imaging
Anomaly Detection Techniques and Applications
Image Enhancement Techniques
Interconnection Networks and Systems
Adversarial Robustness in Machine Learning
Advanced Graph Neural Networks
Neural Networks and Applications
Advanced Image Processing Techniques
Video Analysis and Summarization
Speech Recognition and Synthesis
Sentiment Analysis and Opinion Mining
Computer Graphics and Visualization Techniques
Context-Aware Activity Recognition Systems

Nvidia (United States)
2011-2025

Nvidia (United Kingdom)
2011-2024

Technology Centre Prague
2018-2024

University of Indonesia
2024

Shanghai Jiao Tong University
2011-2023

University of Alberta
2023

Coventry University
2023

Mahindra University
2023

Mahindra Group (India)
2023

The University of Tokyo
2023

DeepHunter: a coverage-guided fuzz testing framework for deep neural networks

OPENALEX - Publications

Xiaofei Xie Lei Ma Felix Juefei-Xu Minhui Xue Hongxu Chen and 5 more

The past decade has seen the great potential of applying deep neural network (DNN) based software to safety-critical scenarios, such as autonomous driving. Similar traditional software, DNNs could exhibit incorrect behaviors, caused by hidden defects, leading severe accidents and losses. In this paper, we propose DeepHunter, a coverage-guided fuzz testing framework for detecting defects general-purpose DNNs. To end, first metamorphic mutation strategy generate new semantically preserved...

10.1145/3293882.3330579 article EN 2019-07-10

FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation

OPENALEX - Publications

Xiaoyu Shi Zhaoyang Huang Dasong Li Manyuan Zhang Ka Chun Cheung and 4 more

FlowFormer [24] introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance. The core component of is the transformer-based cost-volume encoder. Inspired by recent success masked autoencoding (MAE) pretraining in unleashing transformers' capacity encoding visual representation, we propose Masked Cost Volume Autoencoding (MCVA) to enhance encoder with novel MAE scheme. Firstly, introduce block-sharing masking strategy prevent information...

10.1109/cvpr52729.2023.00160 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

OPENALEX - Publications

Xiaoyu Shi Zhaoyang Huang Weikang Bian Dasong Li Manyuan Zhang and 5 more

We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to previous methods that learn estimate from two frames, VideoFlow concurrently estimates bi-directional flows multiple frames are available in videos by sufficiently exploiting temporal cues.We first propose TRi-frame Optical Flow (TROF) module the center frame three-frame manner. The information of triplet is iteratively fused onto frame. To extend TROF handling more we further MOtion Propagation...

10.1109/iccv51070.2023.01146 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

A Simple Baseline for Video Restoration with Grouped Spatial-Temporal Shift

OPENALEX - Publications

Dasong Li Xiaoyu Shi Yi Zhang Ka Chun Cheung Simon See and 3 more

Video restoration, which aims to restore clear frames from degraded videos, has numerous important applications. The key video restoration depends on utilizing inter-frame information. However, existing deep learning methods often rely complicated network architectures, such as optical flow estimation, deformable convo-lution, and cross-frame self-attention layers, resulting in high computational costs. In this study, we propose a sim-ple yet effective framework for restoration. Our approach...

10.1109/cvpr52729.2023.00947 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Predicting blood–brain barrier permeability of molecules with a large language model and machine learning

OPENALEX - Publications

Eddie Huang Jai‐Sing Yang Ken Ying-Kai Liao Warren C. W. Tseng Chien-Yu Lee and 4 more

Abstract Predicting the blood–brain barrier (BBB) permeability of small-molecule compounds using a novel artificial intelligence platform is necessary for drug discovery. Machine learning and large language model on (AI) tools improve accuracy shorten time new development. The primary goal this research to develop computing models deep architectures capable predicting whether molecules can permeate human (BBB). in silico (computational) vitro (experimental) results were validated by Natural...

10.1038/s41598-024-66897-y article EN cc-by Scientific Reports 2024-07-09

CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation

OPENALEX - Publications

Jun Wang Abhir Bhalerao Terry Yin Simon See Yulan He

Radiology report generation (RRG) has gained increasing research attention because of its huge potential to mitigate medical resource shortages and aid the process disease decision making by radiologists. Recent advancements in Report Generation are largely driven improving a model's capabilities encoding single-modal feature representations, while few studies explicitly explore cross-modal alignment between image regions words. Radiologists typically focus first on abnormal before composing...

10.1109/jbhi.2024.3354712 article EN IEEE Journal of Biomedical and Health Informatics 2024-01-16

Understanding Top-k Sparsification in Distributed Deep Learning

OPENALEX - Publications

Shaohuai Shi Xiaowen Chu Ka Chun Cheung Simon See

Distributed stochastic gradient descent (SGD) algorithms are widely deployed in training large-scale deep learning models, while the communication overhead among workers becomes new system bottleneck. Recently proposed sparsification techniques, especially Top-$k$ with error compensation (TopK-SGD), can significantly reduce traffic without an obvious impact on model accuracy. Some theoretical studies have been carried out to analyze convergence property of TopK-SGD. However, existing do not...

10.48550/arxiv.1911.08772 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Continual Semantic Segmentation with Automatic Memory Sample Selection

OPENALEX - Publications

Lanyun Zhu Tianrun Chen Jianxiong Yin Simon See Jun Liu

Continual Semantic Segmentation (CSS) extends static semantic segmentation by incrementally introducing new classes for training. To alleviate the catastrophic forgetting issue in CSS, a memory buffer that stores small number of samples from previous is constructed replay. However, existing methods select either randomly or based on single-factor-driven handcrafted strategy, which has no guarantee to be optimal. In this work, we propose novel sample selection mechanism selects informative...

10.1109/cvpr52729.2023.00301 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Designing an Educational Metaverse: A Case Study of NTUniverse

OPENALEX - Publications

Jing Kai Sim Kaichao William Xu Y. Jin Zhi Yu Lee Yi Jie Teo and 9 more

An up-and-coming concept that seeks to transform how students learn about and study complex systems, as well industrial workers are trained, metaverse technology is characterized in this context by its use virtual simulation analysis. In work, a environment created duplicates real-world situations enables immersive interactive learning the educational metaverse. For purpose, we built digital twin of Nanyang Technological University (NTU) campus foundation, called NTUniverse. It designed an...

10.3390/app14062559 article EN cc-by Applied Sciences 2024-03-19

Sensor grid: integration of wireless sensor networks and the grid

OPENALEX - Publications

Hock Beng Lim Yong Meng Teo Proshikshya Mukherjee Vinh The Lam Weng‐Fai Wong and 1 more

Wireless sensor networks have emerged as an exciting technology for a wide range of important applications that acquire and process information from the physical world. Grid computing has evolved standards-based approach coordinated resource sharing. Sensor grids combine these two promising technologies by extending grid paradigm to sharing resources in wireless networks. There are several issues challenges design grids. In this paper, we propose architecture, called scalable proxy-based...

10.1109/lcn.2005.123 article EN 2005-01-01

An Evaluation of Unified Memory Technology on NVIDIA GPUs

OPENALEX - Publications

Wenqiang Li Guanghao Jin Xuewen Cui Simon See

Unified Memory is an emerging technology which supported by CUDA 6.X. Before 6.X, the existing programming model relies on programmers to explicitly manage data between CPU and GPU hence increases complexity. 6.X provides a new called as provide that defines memory space single coherent (imaging same common address space). The system manages access without explicit copy functions. This paper evaluate through different applications GPUs show users how use of efficiently. include Diffusion3D...

10.1109/ccgrid.2015.105 article EN 2015-05-01

Multi-class Twitter sentiment classification with emojis

OPENALEX - Publications

Mengdi Li Eugene Ch’ng Alain Yee‐Loong Chong Simon See

Purpose Recently, various Twitter Sentiment Analysis (TSA) techniques have been developed, but little has paid attention to the microblogging feature – emojis, and few works conducted on multi-class sentiment analysis of tweets. The purpose this paper is consider popularity emojis investigate feasibility an emoji training heuristic for classification Tweets from “2016 Orlando nightclub shooting” were used as a source study. Besides, study also aims demonstrate how mapping can contribute...

10.1108/imds-12-2017-0582 article EN Industrial Management & Data Systems 2018-09-05

An empirical analysis of emoji usage on Twitter

OPENALEX - Publications

Mengdi Li Eugene Ch’ng Alain Yee‐Loong Chong Simon See

Purpose Emoji has become an essential component of any digital communication and its importance can be attested to by sustained popularity widespread use. However, research in Emojis is rarely seen due the lack data at a greater scale. The purpose this paper systematically analyse compare usage cross-cultural manner. Design/methodology/approach This conducted empirical analysis using large-scale, cross-regional emoji set from Twitter, platform where limited 140 characters allowance made it...

10.1108/imds-01-2019-0001 article EN Industrial Management & Data Systems 2019-09-04

Improving Individual Brain Age Prediction Using an Ensemble Deep Learning Framework

OPENALEX - Publications

Chen-Yuan Kuo Tsung-Ming Tai Pei‐Lin Lee Chiu-Wang Tseng Chieh-Yu Chen and 5 more

Brain age is an imaging-based biomarker with excellent feasibility for characterizing individual brain health and may serve as a single quantitative index clinical domain-specific usage. has been successfully estimated using extensive neuroimaging data from healthy participants various feature extraction conventional machine learning (ML) approaches. Recently, several end-to-end deep (DL) analytical frameworks have proposed alternative approaches to predict higher accuracy. However, the...

10.3389/fpsyt.2021.626677 article EN cc-by Frontiers in Psychiatry 2021-03-23

Learning Gabor Texture Features for Fine-Grained Recognition

OPENALEX - Publications

Lanyun Zhu Tianrun Chen Jianxiong Yin Simon See Jun Liu

Extracting and using class-discriminative features is critical for fine-grained recognition. Existing works have demonstrated the possibility of applying deep CNNs to exploit that distinguish similar classes. However, suffer from problems including frequency bias loss detailed local information, which restricts performance recognizing categories. To address challenge, we propose a novel texture branch as complimentary CNN feature extraction. We innovatively utilize Gabor filters powerful...

10.1109/iccv51070.2023.00156 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation

OPENALEX - Publications

Zhehua Zhou Jiayang Song Xuan Xie Zhan Shu Lei Ma and 3 more

As a representative cyber-physical system (CPS), robotic manipulators have been widely adopted in various academic research and industrial processes, indicating their potential to act as universal interface between the cyber physical worlds. Recent studies robotics manipulation started employing artificial intelligence (AI) approaches controllers achieve better adaptability performance. However, inherent challenge of explaining AI components introduces uncertainty unreliability these...

10.1145/3639477.3639740 article EN 2024-04-14

FedDrip: Federated Learning With Diffusion-Generated Synthetic Image

OPENALEX - Publications

Karin Huangsuwan Timothy Liu Simon See Aik Beng Ng Peerapon Vateekul

In the realm of machine learning in healthcare, federated (FL) is often recognized as a practical solution for addressing issues related to data privacy and distribution. However, many real-world datasets are not identically independently distributed (non-IID). That is, characteristics differ from one institute another. Non-IID poses challenges convergence FL models, such client drifting, where model weights drift towards local optima instead global optimum. As solution, leveraging synthetic...

10.1109/access.2025.3525806 article EN cc-by IEEE Access 2025-01-01

Hybrid Two-Level MCMC with Deep Learning Surrogates for Bayesian Inverse Problems

OPENALEX - Publications

Yang Juntao Gianmarco Mengaldo Jeff Adie Simon See Adriano Gualandi

Bayesian inverse problems arise in various scientific and engineering domains, solving them can be computationally demanding. This is especially the case for governed by partial differential equations, where repeated evaluation of forward operator extremely expensive. Recent advances Deep Learning (DL)-based surrogate models have shown promising potential to accelerate solution such problems. However, despite their ability learn from complex data, DL-based generally cannot match accuracy...

10.2139/ssrn.5084031 preprint EN 2025-01-01

Validating Large-Scale Quantum Machine Learning: Efficient Simulation of Quantum Support Vector Machines Using Tensor Networks

OPENALEX - Publications

Kuan-Cheng Chen Tai-Yue Li Yun-Yuan Wang Simon See Chun‐Chieh Wang and 4 more

Abstract We present an efficient tensor-network-based approach for simulating large-scale quantum circuits exemplified by Quantum Support Vector Machines (QSVMs). Experimentally, leveraging the cuTensorNet library on multiple GPUs, our method effectively reduces exponential runtime growth to near-quadratic scaling with respect number of qubits in practical scenarios. Traditional state-vector simulations become computationally infeasible beyond approximately 50 qubits; contrast, simulator...

10.1088/2632-2153/adb4ba article EN cc-by Machine Learning Science and Technology 2025-02-11

Feature-based Graph Attention Networks Improve Online Continual Learning

OPENALEX - Publications

A J W Sim Zhengkui Wang Aik Beng Ng Shalini De Mello Simon See and 1 more

Online continual learning for image classification is crucial models to adapt new data while retaining knowledge of previously learned tasks. This capability essential address real-world challenges involving dynamic environments and evolving distributions. Traditional approaches predominantly employ Convolutional Neural Networks, which are limited processing images as grids primarily capture local patterns rather than relational information. Although the emergence transformer architectures...

10.48550/arxiv.2502.09143 preprint EN arXiv (Cornell University) 2025-02-13

LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning

OPENALEX - Publications

Tianshi Zheng Jiayang Cheng Chunyang Li Haochen Shi Zihao Wang and 4 more

Modern large language models (LLMs) employ various forms of logical inference, both implicitly and explicitly, when addressing reasoning tasks. Understanding how to optimally leverage these inference paradigms is critical for advancing LLMs' capabilities. This paper adopts an exploratory approach by introducing a controlled evaluation environment analogical -- fundamental cognitive task that systematically parameterized across three dimensions: modality (textual, visual, symbolic),...

10.48550/arxiv.2502.11176 preprint EN arXiv (Cornell University) 2025-02-16

CondensNet: Enabling stable long-term climate simulations via hybrid deep learning models with adaptive physical constraints

OPENALEX - Publications

Xin Wang Yang Juntao Jeff Adie Simon See Kalli Furtado and 4 more

Accurate and efficient climate simulations are crucial for understanding Earth's evolving climate. However, current general circulation models (GCMs) face challenges in capturing unresolved physical processes, such as cloud convection. A common solution is to adopt resolving models, that provide more accurate results than the standard subgrid parametrisation schemes typically used GCMs. also referred super paramtetrizations, remain computationally prohibitive. Hybrid modeling, which...

10.48550/arxiv.2502.13185 preprint EN arXiv (Cornell University) 2025-02-18

Coming Soon ...