- Tactile and Sensory Interactions
- Subtitles and Audiovisual Media
- Multimodal Machine Learning Applications
- Video Analysis and Summarization
- Biochemical Analysis and Sensing Techniques
- Face Recognition and Perception
- Mobile Crowdsensing and Crowdsourcing
- Olfactory and Sensory Function Studies
- Visual perception and processing mechanisms
- Nuclear Receptors and Signaling
- Macrophage Migration Inhibitory Factor
- Circular RNAs in diseases
- Data Visualization and Analytics
- Cardiac Ischemia and Reperfusion
- Migraine and Headache Studies
- Human-Automation Interaction and Safety
- Multi-Agent Systems and Negotiation
- Speech and dialogue systems
- Biometric Identification and Security
- Digital Games and Media
- Cultural Differences and Values
- Humor Studies and Applications
- Sleep and Wakefulness Research
- Erythropoietin and Anemia Treatment
- Meromorphic and Entire Functions
Xuzhou Medical College
2021-2024
First Affiliated Hospital of Bengbu Medical College
2023-2024
Sichuan University
2024
West China Hospital of Sichuan University
2024
Tongji University
2024
Pudong New Area People's Hospital
2024
Zhejiang University
2024
Shenzhen Polytechnic
2024
Hong Kong Polytechnic University
2024
Yeshiva University
2024
User-generated videos are an increasingly important source of information online, yet most online inaccessible to blind and visually impaired (BVI) people. To find that accessible, or understandable without additional description the visual content, BVI people in our formative studies reported they used a time-consuming trial-and-error approach: clicking on video, watching portion, leaving repeating process. also video accessibility heuristics characterize accessible videos. We instantiate 7...
Olfactory dysfunction is an early pre-motor symptom of Parkinson's disease (PD) but the neural mechanisms underlying this remain largely unknown. Aggregation α-synuclein observed in olfactory bulb (OB) during stages PD, indicating a relationship between pathology and hyposmia. Here we investigate whether how aggregates modulate activity OB at single-cell synaptic levels. We induced aggregation specifically via overexpression double-mutant human by adeno-associated viral (AAV) vector. found...
In recent years, there has been a proliferation of multimedia applications that leverage machine learning (ML) for interactive experiences. Prototyping ML-based is, however, still challenging, given complex workflows are not ideal design and experimentation. To better understand these challenges, we conducted formative study with seven ML practitioners to gather insights about common evaluation workflows.
Video conferencing solutions like Zoom, Google Meet, and Microsoft Teams are becoming increasingly popular for facilitating conversations, recent advancements such as live captioning help people better understand each other. We believe that the addition of visuals based on context conversations could further improve comprehension complex or unfamiliar concepts. To explore potential capabilities, we conducted a formative study through remote interviews (N=10) crowdsourced dataset over 1500...
Images on social media platforms are inaccessible to people with vision impairments due a lack of descriptions that can be read by screen readers. Providing accurate alternative text for all visual content is not yet feasible, but certain subsets images, such as internet memes, offer affordances automatic or semi-automatic generation text. We present two methods making memes accessible semi-automatically through (1) the rich and (2) creation audio macro memes. Meme authors create templates...
The fusiform face area (FFA) is a widely studied region causally involved in perception. Even though cognitive neuroscientists have been studying the FFA for over two decades, answers to foundational questions regarding function, architecture, and connectivity of from large (N>1000) group participants are still lacking. To fill this gap knowledge, we quantified these multimodal features face-selective regions 1053 Human Connectome Project. After manually defining 4,000 regions, report five...
With the incredible growth of scale and complexity datasets, creating proper visualizations for users becomes more challenging in large datasets. Though several visualization recommendation systems have been proposed, so far, lack practical engineering inputs is still a major concern regarding usage recommendations industry. In this paper, we proposed AVA, an open-sourced web-based framework Automated Visual Analytics. AVA contains both empiric-driven insight-driven methods to meet demands...
With the rapid development of virtual reality (VR) technology, how to further improve user's experience in this field has become a research hotspot. Based on Large Language Model (LLM), paper discusses its application and optimization path VR field. Firstly, basic principle core technology LLM are expounded, working mechanism is analyzed emphatically. Then, discussed, including assistant, intelligent recommendation, natural language interaction multi-modal collaboration. Finally, for...
This paper describes the results of 2023 edition "LivDet" series iris presentation attack detection (PAD) competitions. New elements in this fifth competition include (1) GAN-generated images as a category instruments (PAI), and (2) an evaluation human accuracy at detecting PAI reference benchmark. Clarkson University Notre Dame contributed image datasets for competition, composed samples representing seven different categories, well baseline PAD algorithms. Fraunhofer IGD, Beijing Civil...
Authors make their videos visually accessible by adding audio descriptions (AD), and auditorily closed captions (CC). However, creating AD CC is challenging tedious, especially for non-professional describers captioners, due to the difficulty of identifying accessibility problems in videos. A video author will have watch through manually check inaccessible information frame-by-frame, both visual auditory modalities. In this paper, we present CrossA11y, a system that helps authors efficiently...
Situationally Induced Impairments and Disabilities (SIIDs) can significantly hinder user experience in contexts such as poor lighting, noise, multi-tasking. While prior research has introduced algorithms systems to address these impairments, they predominantly cater specific tasks or environments fail accommodate the diverse dynamic nature of SIIDs. We introduce Human I/O, a unified approach detecting wide range SIIDs by gauging availability human input/output channels. Leveraging egocentric...
In mammals, odour information within the olfactory bulb (OB) is processed by complex neural circuits before being ultimately represented in action potential activity of mitral/tufted cells (M/Ts). Cholecystokinin-expressing (CCK
We studied the movable singularities of solutions autonomous non-algebraic first-order ordinary differential equations in form y′=I(y(t)) and y′=I1(y(t))+I2(y(t))+⋯+In(y(t)), aiming to prove that all complex these are at most algebraic branch points. This study explores use constructing triangle method analyze equations. For y=w+iv, we treat as a way construct right-angled plane, with lengths adjacent sides being w v. definitions trigonometric functions sin cos (the ratio side hypotenuse)...
In recent years, live captions have gained significant popularity through its availability in remote video conferences, mobile applications, and the web. Unlike preprocessed subtitles, require real-time responsiveness by showing interim speech-to-text results. As prediction confidence changes, may update, leading to visual instability that interferes with user's viewing experience. this paper, we characterize stability of proposing a vision-based flickering metric using luminance contrast...
We demonstrate Visual Captions, a real-time system that integrates with video conferencing platform to enrich verbal communication. Captions leverages fine-tuned large language model proactively suggest visuals are relevant the context of ongoing conversation. implemented as user-customizable Chrome plugin three levels AI proactivity: Auto-display (AI autonomously adds visuals), Auto-suggest recommends and On-demand-suggest suggests when prompted). showcase usage in open-vocabulary settings,...
Many theorize that cultural similarities in moral judgments arise from a specialized cognitive system devoted to morality. We claim, contrast, people make using general-purpose, value-based decision-making process. present computational model predict response time and choice dilemmas valuations as input. Cultural judgment are explained by culturally stable set of drives choices aid survival. Corresponding differences changes decisional bias parameter accounts for the perceived costs making...
<sec> <title>BACKGROUND</title> Background: Depression is the most disabling and prevalent psychiatric disorder; transcranial magnetic stimulation (TMS) widely used in treatment of depression because its remarkable efficacy. </sec> <title>OBJECTIVE</title> Objective: To investigate current status, hotspots frontiers this research field, advance on TMS for treating depression, paper provides a visualization bibliometric analysis studies related to depression. <title>METHODS</title> Methods:...
<title>Abstract</title> Background The prevalence of youth depression is rising, making the identification reliable biomarkers for early detection increasingly challenging. This study explores potential in experiencing their first depressive episode, with comorbid anxiety, and metabolic or thyroid imbalances. Methods We recruited 399 participants measured stimulating hormone (TSH), triiodothyronine (FT3), free thyroxine (FT4), fasting blood glucose (FBG), cholesterol levels, body mass index...
This study presents the Knowledge-Aware Model (KAM), a pioneering approach in sports analytics for predicting highlights badminton matches. Utilizing extensive rally-by-rally data from significant tournaments, model integrates domain-specific insights with data-driven techniques. Our analysis of over 5,180 rallies 140 singles matches reveals model`s effectiveness, outperforming baseline and state-of-the-art methods an F1-score 0.793. KAM`s innovative use match statistics rally-specific opens...
One of the long-standing aspirations in conversational AI is to allow them autonomously take initiatives conversations, i.e., being proactive. This especially challenging for multi-party conversations. Prior NLP research focused mainly on predicting next speaker from contexts like preceding In this paper, we demonstrate limitations such methods and rethink what it means be proactive multi-party, human-AI We propose that just humans, rather than merely reacting turn-taking cues, a formulates...
Abstract Here we report the results of a speeded relative quantity task with Chinese participants. On each trial single numeral (the probe) was presented and instructions were to respond as whether it signified less than or greater five standard). In separate blocks trials, numerals either in Mandarin Arabic number formats. addition standard influence numerical distance, significant predictor performance degree physical similarity between probe depicted Mandarin. Additionally, competing...
ABSTRACT The Fusiform Face Area (FFA) is a widely studied region causally involved in face perception. Even though cognitive neuroscientists have been studying the FFA for over two decades, answers to foundational questions regarding structure, function, and connectivity of from large (N>1000) group participants are still lacking. To fill this gap, we quantified structural, functional, features fusiform face-selective regions 1080 Human Connectome Project (HCP). After manually defining...