NFDI4DS | UHH-SEMS - Publication Details

Jiaxian Guo

ORCID: 0000-0002-0212-8329

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5084631859

Research Areas

Topic Modeling
Multimodal Machine Learning Applications
Domain Adaptation and Few-Shot Learning
Natural Language Processing Techniques
Generative Adversarial Networks and Image Synthesis
Advanced Neural Network Applications
Human Pose and Action Recognition
Glaucoma and retinal disorders
Robot Manipulation and Learning
Video Surveillance and Tracking Methods
Retinal Diseases and Treatments
Image Retrieval and Classification Techniques
COVID-19 diagnosis using AI
Human Motion and Animation
Robotic Path Planning Algorithms
Fungal Infections and Studies
Explainable Artificial Intelligence (XAI)
Connective Tissue Growth Factor Research
3D Shape Modeling and Analysis
Anomaly Detection Techniques and Applications
Brain Tumor Detection and Classification
Cancer-related molecular mechanisms research
Reinforcement Learning in Robotics
Adversarial Robustness in Machine Learning
Handwritten Text Recognition Techniques

The University of Tokyo
2024

Beijing University of Posts and Telecommunications
2024

The University of Sydney
2020-2023

China Earthquake Administration
2023

Tianjin University of Technology
2022

Civil Aviation University of China
2022

Huawei Technologies (China)
2020

National University of Singapore
2020

Shanghai First People's Hospital
2016-2018

Shanghai Jiao Tong University
2016-2018

Long Text Generation via Adversarial Training with Leaked Information

OPENALEX - Publications

Jiaxian Guo Sidi Lu Han Cai Weinan Zhang Yong Yu and 1 more

Automatically generating coherent and semantically meaningful text has many applications in machine translation, dialogue systems, image captioning, etc. Recently, by combining with policy gradient, Generative Adversarial Nets(GAN) that use a discriminative model to guide the training of generative as reinforcement learning shown promising results generation. However, scalar guiding signal is only available after entire been generated lacks intermediate information about structure during...

10.1609/aaai.v32i1.11957 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-27

Texygen

OPENALEX - Publications

Yaoming Zhu Sidi Lu Lei Zheng Jiaxian Guo Weinan Zhang and 2 more

We introduce Texygen, a benchmarking platform to support research on open-domain text generation models. Texygen has not only implemented majority of models, but also covered set metrics that evaluate the diversity, quality and consistency generated texts. The could help standardize improve reproductivity reliability future work in generation.

10.1145/3209978.3210080 article EN 2018-06-27

From Images to Textual Prompts: Zero-shot Visual Question Answering with Frozen Large Language Models

OPENALEX - Publications

Jiaxian Guo Junnan Li Dongxu Li Anthony Meng Huat Tiong Boyang Li and 2 more

Large language models (LLMs) have demonstrated excellent zero-shot generalization to new tasks. However, effective utilization of LLMs for visual question-answering (VQA) remains challenging, primarily due the modality disconnect and task between LLM VQA End-to-end training on multimodal data may bridge disconnects, but is inflexible computationally expensive. To address this issue, we propose Img2LLM, a plug-and-play module that provides prompts enable perform zeroshot tasks without...

10.1109/cvpr52729.2023.01046 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Texygen: A Benchmarking Platform for Text Generation Models

OPENALEX - Publications

Yaoming Zhu Sidi Lu Lei Zheng Jiaxian Guo Weinan Zhang and 2 more

We introduce Texygen, a benchmarking platform to support research on open-domain text generation models. Texygen has not only implemented majority of models, but also covered set metrics that evaluate the diversity, quality and consistency generated texts. The could help standardize facilitate sharing fine-tuned open-source implementations among researchers for their work. As consequence, this would in improving reproductivity reliability future work generation.

10.48550/arxiv.1802.01886 preprint EN cc-by arXiv (Cornell University) 2018-01-01

Long Text Generation via Adversarial Training with Leaked Information

OPENALEX - Publications

Jiaxian Guo Sidi Lu Han Cai Weinan Zhang Yong Yu and 1 more

Automatically generating coherent and semantically meaningful text has many applications in machine translation, dialogue systems, image captioning, etc. Recently, by combining with policy gradient, Generative Adversarial Nets (GAN) that use a discriminative model to guide the training of generative as reinforcement learning shown promising results generation. However, scalar guiding signal is only available after entire been generated lacks intermediate information about structure during...

10.48550/arxiv.1709.08624 preprint EN cc-by arXiv (Cornell University) 2017-01-01

Repeatability and Reproducibility of Foveal Avascular Zone Area Measurements Using AngioPlex Spectral Domain Optical Coherence Tomography Angiography in Healthy Subjects

OPENALEX - Publications

Jiaxian Guo Xiangjun She Xiaoying Liu Xiaodong Sun

Purpose: The aim of this study was to evaluate the repeatability and reproducibility foveal avascular zone (FAZ) area measurements using AngioPlex spectral domain optical coherence tomography (OCT) angiography in normal subjects. Methods: Twenty-two healthy subjects (25 eyes) underwent FAZ with OCT. Each volunteer separately examined 3 consecutive times by 2 experienced observers. measured ImageJ software....

10.1159/000453112 article EN Ophthalmologica 2017-01-01

Reliability of Vessel Density Measurements in the Peripapillary Retina and Correlation with Retinal Nerve Fiber Layer Thickness in Healthy Subjects Using Optical Coherence Tomography Angiography

OPENALEX - Publications

Xiangjun She Jiaxian Guo Xiaoying Liu Hong Zhu Tong Li and 3 more

To evaluate the reliability of vessel density measurements in peripapillary retina using optical coherence tomography angiography (OCT-A) and to analyze correlation with retinal nerve fiber layer (RNFL) thickness healthy subjects.Thirty-five volunteers were recruited study. The optic disc region was scanned three times spectral-domain OCT (SD-OCT) split-spectrum amplitude decorrelation by two skilled examiners. Vessel automatically calculated software RTVue-XR (version 2015.1.1.98). RNFL on...

10.1159/000485957 article EN Ophthalmologica 2018-01-01

Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint

OPENALEX - Publications

Jiaxian Guo Jiachen Li Huan Fu Mingming Gong Kun Zhang and 1 more

Unsupervised image-to-image (I21) translation aims to learn a domain mapping function that can preserve the semantics of input images without paired data. However, because underlying distributions in source and target domains are often mismatched, current distribution matching-based methods may distort when matching distributions, resulting inconsistency between translated images, which is known as distortion problem. In this paper, we focus on low-level I21 translation, where structure...

10.1109/cvpr52688.2022.01771 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Paste and Harmonize via Denoising: Subject-Driven Image Editing with Frozen Pre-Trained Diffusion Model

OPENALEX - Publications

Xin Zhang Jiaxian Guo Paul Yoo Yutaka Matsuo Yusuke Iwasawa

Text-to-Image generative models have shown a remarkable ability to produce high-quality images. However, existing methods still face difficulties in exemplar-guided image editing without destroying the given objects' identity exemplar image. To address this problem, we propose new framework called Paste and Harmonize via Denoising, which leverages pre-trained diffusion facilitate text-driven transfer of objects from an edited while preserving their appearance characteristics. The consists...

10.1109/icassp48485.2024.10448510 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Real-Time Data-Efficient Portrait Stylization Via Geometric Alignment

OPENALEX - Publications

Xinrui Wang Zhuoru Li Xuanyu Yin Xiaofang Zhou Yusuke Iwasawa and 2 more

10.2139/ssrn.5113384 preprint EN 2025-01-01

Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection

OPENALEX - Publications

Bo Yang Jiaxian Guo Yusuke Iwasawa Yutaka Matsuo

Recent studies have increasingly demonstrated that large language models (LLMs) possess significant theory of mind (ToM) capabilities, showing the potential for simulating tracking mental states in generative agents. In this study, we propose a novel paradigm called ToM-agent, designed to empower LLMs-based agents simulate ToM open-domain conversational interactions. ToM-agent disentangles confidence from states, facilitating emulation an agent's perception its counterpart's such as beliefs,...

10.48550/arxiv.2501.15355 preprint EN arXiv (Cornell University) 2025-01-25

From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models

OPENALEX - Publications

Jiaxian Guo Junnan Li Dongxu Li Anthony Meng Huat Tiong Boyang Li and 2 more

Large language models (LLMs) have demonstrated excellent zero-shot generalization to new tasks. However, effective utilization of LLMs for visual question-answering (VQA) remains challenging, primarily due the modality disconnection and task between LLM VQA task. End-to-end training on vision data may bridge disconnections, but is inflexible computationally expensive. To address this issue, we propose \emph{Img2Prompt}, a plug-and-play module that provides prompts can aforementioned so...

10.48550/arxiv.2212.10846 preprint EN cc-by arXiv (Cornell University) 2022-01-01

DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views

OPENALEX - Publications

Paul D. Yoo Jiaxian Guo Yutaka Matsuo Shixiang Gu

Synthesizing novel view images from a few views is challenging but practical problem. Existing methods often struggle with producing high-quality results or necessitate per-object optimization in such few-view settings due to the insufficient information provided. In this work, we explore leveraging strong 2D priors pre-trained diffusion models for synthesizing images. models, nevertheless, lack 3D awareness, leading distorted image synthesis and compromising identity. To address these...

10.48550/arxiv.2306.03414 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Antiangiogenic Effects of Doxazosin on Experimental Choroidal Neovascularization in Mice

OPENALEX - Publications

Jiaxian Guo Xueting Luo Jian Liang Meichun Xiao Xiaodong Sun

The present study was designed to evaluate the effects of doxazosin on experimental choroidal neovascularization (CNV) in mice.Six- 8-week-old male C57BL/6 mice were divided into a control group and doxazosin-treated (5 mg/kg, i.p., daily). Experimental CNV induced by laser photocoagulation. Seven 14 days after induction, fluorescein angiography, flat mounts, histological studies performed fluorescence leakage, area, thickness lesions, respectively. In addition, western blot analysis carried...

10.1089/jop.2016.0153 article EN Journal of Ocular Pharmacology and Therapeutics 2016-12-19

PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination

OPENALEX - Publications

Xingzhou Lou Jiaxian Guo Junge Zhang Jun Wang Kaiqi Huang and 1 more

Zero-shot human-AI coordination holds the promise of collaborating with humans without human data. Prevailing methods try to train ego agent a population partners via self-play. However, these suffer from two problems: 1) The diversity finite is limited, thereby limiting capacity trained collaborate novel human; 2) Current only provide common best response for every partner in population, which may result poor zero-shot performance or humans. To address issues, we first propose policy...

10.48550/arxiv.2301.06387 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4

OPENALEX - Publications

Jiaxian Guo Bo Yang Paul D. Yoo Yu‐Chen Lin Yusuke Iwasawa and 1 more

Unlike perfect information games, where all elements are known to every player, imperfect games emulate the real-world complexities of decision-making under uncertain or incomplete information. GPT-4, recent breakthrough in large language models (LLMs) trained on massive passive data, is notable for its knowledge retrieval and reasoning abilities. This paper delves into applicability GPT-4's learned games. To achieve this, we introduce \textbf{Suspicion-Agent}, an innovative agent that...

10.48550/arxiv.2309.17277 preprint EN cc-by arXiv (Cornell University) 2023-01-01

A real-time moving objects detection and classification approach for static cameras

OPENALEX - Publications

Hong-Son Vu Jiaxian Guo Kuan-Hung Chen Shu-Jui Hsieh Desheng Chen

Moving objects recognition plays an important role in camera-only active safety systems and intelligent autonomous vehicles. For these applications, reliable detection performance is required; however, pedestrian challenging due to their divergent dressing action variety. Besides, real-time also critical. This paper aims optimize the by combining both temporal-domain spatial-domain methods. Accordingly, we first use Background Subtraction (BS) technique detect moving objects. Then, AdaBoost...

10.1109/icce-tw.2016.7521014 article EN 2016-05-01

DNQ: Dynamic Network Quantization

OPENALEX - Publications

Yuhui Xu Shuai Zhang Yingyong Qi Jiaxian Guo Weiyao Lin and 1 more

Network quantization is an effective method for the deployment of neural networks on memory and energy constrained mobile devices. In this paper, we propose a Dynamic Quantization (DNQ) framework which composed two modules: bit-width controller quantizer. Unlike most existing methods that use universal whole network, utilize policy gradient to train agent learn each layer by controller. This can make trade-off between accuracy compression ratio. Given sequence, quantizer adopts distance as...

10.48550/arxiv.1812.02375 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Weight fused functional sliced average variance estimation

OPENALEX - Publications

Wenjuan Hu Jiaxian Guo Guochang Wang Baoxue Zhang

Selecting the number of slice is a key step for implement sliced average variance estimation (SAVE) method. To our knowledge, there no widely accepted method it in practical application. And an incorrect may leads to inaccurate conclusion. In traditional multivariate sufficient dimension reduction procedure, usually adopt fuze approach which combined kernel operators SAVE with various numbers slices solve this problem. Due infinite functional data, can not be directly applied (FSAVE). Hence...

10.1080/03610918.2020.1752382 article EN Communications in Statistics - Simulation and Computation 2020-04-14

Design of Potted Automatic Irrigation Intelligent Control System Based on MCU

OPENALEX - Publications

Jiaxian Guo

With the further development of China's society and economy, modern people are not only satisfied with adequate food clothing, but also spiritual satisfaction has become an indispensable part people's busy life, potted plants have necessities life adjustment for some people. In a developed pace is getting faster faster, it easy to forget watering, go out frequently, family often no one water several days, so watering big problem. order solve this problem, set automatic irrigation control...

10.1109/icpeca53709.2022.9719065 article EN 2022 IEEE 2nd International Conference on Power, Electronics and Computer Applications (ICPECA) 2022-01-21

The HW-TSC Video Speech Translation System at IWSLT 2020

OPENALEX - Publications

Minghan Wang Hao Yang Yao Deng Ying Qin Lizhi Lei and 5 more

Minghan Wang, Hao Yang, Yao Deng, Ying Qin, Lizhi Lei, Daimeng Wei, Hengchao Shang, Ning Xie, Xiaochun Li, Jiaxian Guo. Proceedings of the 17th International Conference on Spoken Language Translation. 2020.

10.18653/v1/2020.iwslt-1.23 article EN cc-by 2020-01-01

Coming Soon ...