Zheng Zhang

ORCID: 0000-0002-1805-7705
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Flood Risk Assessment and Management
  • Machine Learning in Healthcare
  • Precipitation Measurement and Analysis
  • Meteorological Phenomena and Simulations
  • Methane Hydrates and Related Phenomena
  • Arctic and Antarctic ice dynamics
  • Remote Sensing and Land Use
  • Radar Systems and Signal Processing
  • Remote-Sensing Image Classification
  • AI in Service Interactions
  • Anomaly Detection Techniques and Applications
  • Data-Driven Disease Surveillance
  • Privacy-Preserving Technologies in Data
  • Maritime Navigation and Safety
  • Marine and Coastal Research
  • Speech and Audio Processing
  • Internet of Things and Social Network Interactions
  • Monetary Policy and Economic Impact
  • Data Stream Mining Techniques
  • Multi-Agent Systems and Negotiation
  • Mental Health via Writing
  • Advanced Neural Network Applications
  • Human Mobility and Location-Based Analysis

Harbin Institute of Technology
2022-2024

Shanghai Chengtou (China)
2024

Shanghai Industrial Technology Institute
2024

Shanghai Tongji Urban Planning and Design Institute
2024

Tongji University
2024

Sichuan University
2023

State Key Laboratory of Biotherapy
2023

University of Michigan
2023

Amazon (United States)
2023

Beijing Institute of Technology
2023

Abstract. Natural disasters caused by heavy rainfall often cause huge loss of life and property. Hence, the task of precipitation nowcasting is of great importance. To solve this problem, several deep learning methods have been proposed to forecast future radar echo images, and the predicted maps are then converted to the distribution of rainfall. The prevailing spatiotemporal sequence prediction methods apply a ConvRNN structure, which combines convolution and a recurrent neural network. Although they achieve remarkable success,...

10.5194/gmd-15-5407-2022 article EN cc-by Geoscientific model development 2022-07-15

Automatic classification of sea ice and open water plays a vital role in climate change research, polar shipping, and other applications. Many deep learning-based methods have been proposed to automatically classify sea ice and open water. Even though these methods have achieved remarkable success, the noise phenomenon in SAR images still causes considerable limitations on model performance. Meanwhile, existing methods ignore multi-scale global information from large-scale SAR images, which tends to produce misclassification. In this paper, we...

10.1109/jstars.2024.3354912 article EN cc-by-nc-nd IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2024-01-01

As a key technology for maritime applications, trajectory prediction can effectively help ships reduce risks such as collisions and groundings at sea. Currently, although the combination of rich automatic identification system (AIS) data and deep learning brings new possibilities for ship trajectory prediction, it is still hugely challenging due to the complexity of ship motion. In this paper, we propose an improved model based on TrAISformer. On the one hand, the multi-dimensional data is sparsified through dictionary coding and mapped into probability...

10.1117/12.3015737 article EN 2024-02-20

Narrative reasoning relies on the understanding of eventualities in story contexts, which requires a wealth of background world knowledge. To help machines leverage such knowledge, existing solutions can be categorized into two groups. Some focus on implicitly modeling eventuality knowledge by pretraining language models (LMs) with eventuality-aware objectives. However, this approach breaks down knowledge structures and lacks interpretability. Others explicitly collect structured eventuality-centric graphs...

10.48550/arxiv.2404.00209 preprint EN arXiv (Cornell University) 2024-03-29

Recent advances in large language models (LLMs) have enhanced their ability to process long input contexts. This development is particularly crucial for tasks that involve retrieving knowledge from an external datastore, which can result in long inputs. However, recent studies show a positional bias in LLMs, demonstrating varying performance depending on the location of useful information within the input sequence. In this study, we conduct extensive experiments to investigate the root causes of positional bias. Our findings...

10.48550/arxiv.2404.01430 preprint EN arXiv (Cornell University) 2024-04-01

Tianhang Zhang, Lin Qiu, Qipeng Guo, Cheng Deng, Yue Zhang, Zheng Zhang, Chenghu Zhou, Xinbing Wang, Luoyi Fu. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023.

10.18653/v1/2023.emnlp-main.58 article EN cc-by Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing 2023-01-01

Recent works show the effectiveness of cache-based neural coreference resolution models on long documents. These models incrementally process a long document from left to right and extract relations between mentions and entities in a cache, resulting in much lower memory and computation cost compared to computing all mentions in parallel. However, they do not handle cache misses when high-quality entities are purged from the cache, which causes wrong assignments and leads to prediction errors. We propose a new hybrid cache that integrates two eviction policies to capture...

10.18653/v1/2023.acl-long.851 article EN cc-by 2023-01-01
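
The cache-miss problem described in the abstract above can be illustrated with a small sketch. It is not the paper's model: the least-recently-used (LRU) eviction policy, the class name `LRUEntityCache`, and the helper `count_purged_misses` are all assumptions chosen for illustration. The sketch only shows how an entity that was mentioned often early in a document can be purged by a burst of new entities and then missed when it reappears.

```python
from collections import OrderedDict


class LRUEntityCache:
    """Hypothetical fixed-size entity cache with least-recently-used eviction."""

    def __init__(self, capacity):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self._entities = OrderedDict()  # entity id -> placeholder, oldest first

    def lookup(self, entity_id):
        """Return True on a hit, False on a miss; the entity is cached afterwards."""
        hit = entity_id in self._entities
        if hit:
            # Mark the entity as most recently used.
            self._entities.move_to_end(entity_id)
        else:
            self._entities[entity_id] = True
            if len(self._entities) > self.capacity:
                # Purge the least recently used entity.
                self._entities.popitem(last=False)
        return hit


def count_purged_misses(mentions, capacity):
    """Count mentions whose entity was seen before but had already been purged."""
    cache = LRUEntityCache(capacity)
    seen = set()
    misses = 0
    for entity_id in mentions:
        hit = cache.lookup(entity_id)
        if not hit and entity_id in seen:
            misses += 1
        seen.add(entity_id)
    return misses


# "alice" is mentioned three times, then three other entities push it out
# of a two-slot cache before it is mentioned again.
mentions = ["alice", "alice", "alice", "bob", "carol", "dave", "alice"]
print(count_purged_misses(mentions, capacity=2))
print(count_purged_misses(mentions, capacity=4))
```

With a two-slot cache the final mention of "alice" is a miss on a previously seen entity; with four slots nothing is purged and no such miss occurs.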

Large language models (LLMs) have achieved remarkable success in NLP and multimodal tasks, among others. Despite these successes, two main challenges remain in developing LLMs: (i) high computational cost, and (ii) fair and objective evaluations. In this paper, we report a solution to significantly reduce LLM training cost through a growth strategy. We demonstrate that a 101B-parameter LLM with 0.31T tokens can be trained with a budget of 100K US dollars. Inspired by IQ tests, we also consolidate an additional range...

10.48550/arxiv.2309.03852 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Precipitation nowcasting plays an important role in our life. Over the past years, many deep learning-based methods have been proposed for precipitation nowcasting by predicting the radar echo sequence, achieving better performance than traditional approaches. However, all of them are based on a static model, which is trained by offline learning and does not adapt to real-time changing data. Recently, online incremental learning (OIL) has been used to dynamically update the model continually with new data while preventing forgetting of historical...

10.1109/tgrs.2023.3330303 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-11-06

Abstract. Precipitation forecasting plays an important role in disaster warning, agricultural production, and other fields. To solve this issue, some deep learning methods have been proposed to forecast future radar echo images and convert them into rainfall distributions. Prevailing spatiotemporal sequence prediction methods are usually based on a ConvRNN structure that combines a Convolutional Neural Network and a Recurrent Neural Network. However, these existing methods ignore the image change prediction, which causes the coherence of...

10.1049/cit2.12184 article EN cc-by-nc-nd CAAI Transactions on Intelligence Technology 2023-01-26

å›¾åƒè‡ªé€‚åº”æ»¤æ³¢æ˜¯éžçº¿æ€§çš„å›¾åƒå˜æ¢ï¼Œæœ‰å¹¿æ³›çš„åº”ç”¨åœºæ™¯ã€‚ä¼ ç»Ÿçš„å›¾åƒè‡ªé€‚åº”æ»¤æ³¢å™¨å‡æ˜¯ä¸“å®¶è®¾è®¡çš„ï¼Œå¦‚åŒè¾¹æ»¤æ³¢å™¨å’Œå½¢çŠ¶è‡ªé€‚åº”æ»¤æ³¢ç­‰ã€‚CNNä½œä¸ºç‰¹å¾æå–å’Œéžçº¿æ€§èƒ½åŠ›è¡¨è¾¾çš„æœ‰æ•ˆå·¥å ·ï¼Œå¯ç”¨äºŽå­¦ä¹ æž„é€ å›¾åƒè‡ªé€‚åº”æ»¤æ³¢å™¨ã€‚æœ¬æ–‡é¦–å...

10.11834/jrs.20232174 article ZH National Remote Sensing Bulletin 2023-01-01

Identifying anomalous human spatial trajectory patterns can indicate dynamic changes in mobility behavior, with applications in domains like infectious disease monitoring and elderly care. Recent advancements in large language models (LLMs) have demonstrated their ability to reason in a manner akin to humans. This presents significant potential for analyzing temporal patterns in human mobility. In this paper, we conduct empirical studies to assess the capabilities of leading LLMs like GPT-4 and Claude-2 in detecting anomalous behaviors from mobility data,...

10.48550/arxiv.2310.04942 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Introduction: Clinicians iteratively adjust treatment approaches to improve outcomes, but to date, automatable approaches for continuous learning of risk factors as these adjustments are made are lacking. We combined a large-scale comprehensive real-world Learning Health System infrastructure (LHSI) with automated statistical profiling, visualization, and an artificial intelligence (AI) approach to test evidence-based discovery of clinical factors for three use cases: dysphagia, xerostomia, and 3-year survival for head and neck cancer...

10.1101/2023.10.24.23297349 preprint EN cc-by medRxiv (Cold Spring Harbor Laboratory) 2023-10-25

Objectives: When detecting changes in synthetic aperture radar (SAR) images, the quality of the difference map has an important impact on the detection results, and the speckle noise in the image interferes with the extraction of change information. In order to improve the detection accuracy of the SAR image difference map, this paper proposes a method that combines a popular deep neural network with a clustering algorithm. Methods: Firstly, ... was constructed, the FFDNet architecture was used to retrain on the SAR image, and parameters with a better speckle noise suppression effect were obtained. Then the log ratio...

10.30564/jees.v5i2.5980 article EN Journal of Environmental & Earth Sciences 2023-11-13
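
The log-ratio step named in the abstract above is a standard way to build a difference map in SAR change detection: because speckle acts as multiplicative noise, taking the logarithm of the per-pixel intensity ratio turns it into an additive term. The sketch below is a minimal pure-Python version of that operator only. The function name `log_ratio_difference`, the `eps` guard against zero intensities, and the toy 2x2 images are assumptions for illustration; the denoising and clustering stages of the paper are not reproduced.

```python
import math


def log_ratio_difference(before, after, eps=1e-6):
    """Per-pixel |log(after / before)| for two equally sized 2-D intensity grids.

    eps is a hypothetical guard that keeps the ratio finite for zero pixels.
    """
    if len(before) != len(after) or any(
        len(row_b) != len(row_a) for row_b, row_a in zip(before, after)
    ):
        raise ValueError("images must have the same shape")
    return [
        [abs(math.log((a + eps) / (b + eps))) for b, a in zip(row_b, row_a)]
        for row_b, row_a in zip(before, after)
    ]


# Toy intensities: two pixels are unchanged, one doubles, one drops to a quarter.
before = [[1.0, 2.0], [4.0, 8.0]]
after = [[1.0, 4.0], [4.0, 2.0]]
for row in log_ratio_difference(before, after):
    print([round(value, 3) for value in row])
```

Unchanged pixels map to values near zero, while the magnitude for changed pixels depends only on the ratio of intensities, not on their absolute level.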

Large Language Models (LLMs) have shown impressive capabilities but also a concerning tendency to hallucinate. This paper presents RefChecker, a framework that introduces claim-triplets to represent claims in LLM responses, aiming to detect fine-grained hallucinations. In RefChecker, an extractor generates claim-triplets from a response, which are then evaluated by a checker against a reference. We delineate three task settings: Zero, Noisy and Accurate Context, to reflect various real-world use cases. We curated a benchmark spanning NLP...

10.48550/arxiv.2405.14486 preprint EN arXiv (Cornell University) 2024-05-23

With the development of Human-AI Collaboration in Classification (HAI-CC), integrating users and AI predictions becomes challenging due to the complex decision-making process. This process has three options: 1) AI autonomously classifies, 2) learning to complement, where AI collaborates with users, and 3) learning to defer, where AI defers to users. Despite their interconnected nature, these options have been studied in isolation rather than as components of a unified system. In this paper, we address this weakness with a novel HAI-CC methodology,...

10.48550/arxiv.2407.07003 preprint EN arXiv (Cornell University) 2024-07-09

Parameter-efficient fine-tuning (PEFT) methods typically assume that Large Language Models (LLMs) are trained on data from a single device or client. However, real-world scenarios often require fine-tuning these models on private data distributed across multiple devices. Federated Learning (FL) offers an appealing solution by preserving user privacy, as sensitive data remains on local devices during training. Nonetheless, integrating PEFT methods into FL introduces two main challenges: communication overhead and data heterogeneity....

10.48550/arxiv.2410.13097 preprint EN arXiv (Cornell University) 2024-10-16

The combination of Oblivious RAM (ORAM) with Trusted Execution Environments (TEE) has found numerous real-world applications due to their complementary nature. TEEs alleviate the performance bottlenecks of ORAM, such as network bandwidth and roundtrip latency, and ORAM provides general-purpose protection for TEE applications against attacks exploiting memory access patterns. The defining property of this combination, which sets it apart from traditional ORAM designs, is its ability to ensure that memory accesses, both inside and outside...

10.48550/arxiv.2409.07167 preprint EN arXiv (Cornell University) 2024-09-11