Kun Sun

ORCID: 0000-0001-9766-269X
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Neurobiology of Language and Bilingualism
  • Language and cultural evolution
  • Text Readability and Simplification
  • Advanced Text Analysis Techniques
  • Authorship Attribution and Profiling
  • Syntax, Semantics, Linguistic Variation
  • Language, Metaphor, and Cognition
  • Medical Image Segmentation Techniques
  • Language, Discourse, Communication Strategies
  • Second Language Acquisition and Learning
  • Image and Object Detection Techniques
  • Linguistic Variation and Morphology
  • Evolutionary Game Theory and Cooperation
  • AI in cancer detection
  • Advanced Image Processing Techniques
  • Multisensory perception and integration
  • Categorization, perception, and language
  • Speech Recognition and Synthesis
  • Translation Studies and Practices
  • Color perception and design
  • linguistics and terminology studies
  • Swearing, Euphemism, Multilingualism
  • Laser and Thermal Forming Techniques

University of Southern Denmark
2025

University of Tübingen
2018-2024

Yangzhou University
2024

Tongji University
2024

Jiangxi Normal University
2023

Oklahoma State University
2022

University of Stuttgart
2021

Hangzhou Dianzi University
2018-2021

Beijing Foreign Studies University
2021

Ruijin Hospital
2020

Precise identification and categorization of building materials are essential for informing strategies related to embodied carbon reduction, retrofitting, and circularity in urban environments. However, existing material databases are typically limited to individual projects or specific geographic areas, offering only approximate assessments. Acquiring large-scale precise data is hindered by inadequate records and financial constraints. Here, we introduce a novel automated framework that harnesses recent...

10.1016/j.ese.2025.100538 article EN cc-by-nc-nd Environmental Science and Ecotechnology 2025-02-03

Global change such as atmospheric nitrogen (N) deposition can facilitate alien plant invasions, which is often attributed to the increase in soil N availability. However, few studies have considered the effects of global change-driven alterations in soil N forms, especially under conditions with interspecific competition. In this study, we first determined the differences in growth, biomass allocation, and photosynthesis under different N forms and levels between three noxious invasive species and their respective related...

10.20944/preprints202504.1738.v1 preprint EN 2025-04-23

Using creativity to promote recreational services is crucial. Accordingly, creative linguistic landscapes (CLLs) are being used to improve visitors’ experiences in some recreational zones. However, relevant research is still in its early stages. Therefore, this study was conducted. It summarized the leisure function categories and evaluation indicators of CLLs in recreational zones, respectively, based on image materials and related online reviews. The outcomes of all CLL types were ranked using the fuzzy PROMETHEE method; based on the ranking, a...

10.1371/journal.pone.0299775 article EN cc-by PLoS ONE 2024-03-22

The majority of research in computational psycholinguistics has concentrated on the processing of words. This study introduces innovative methods for computing sentence-level metrics using multilingual large language models. The metrics developed, sentence surprisal and sentence relevance, are then tested and compared to validate whether they can predict how humans comprehend sentences as a whole across languages. These metrics offer significant interpretability and achieve high accuracy in predicting human reading speeds. Our results...

10.48550/arxiv.2403.15822 preprint EN arXiv (Cornell University) 2024-03-23

Background: A generative adversarial network could be used for high‐resolution (HR) medical image synthesis with reduced scan time. Purpose: To evaluate the potential of using a deep convolutional generative adversarial network (DCGAN) for generating HR pre and HR post images based on their corresponding low‐resolution (LR) images (LR pre and LR post). Study Type: This was a retrospective analysis of a prospectively acquired cohort. Population: In all, 224 subjects were randomly divided into 200 training subjects and an independent set of 24 testing subjects. Field Strength/Sequence...

10.1002/jmri.27256 article EN Journal of Magnetic Resonance Imaging 2020-07-12

The analysis of punctuation in philology is mainly carried out with a view to better understand the meaning of the literature concerned. Punctuation is generally believed to play the role of ‘assisting the written language in indicating those elements of speech that cannot be conveniently set down on paper: chiefly pause, pitch and stress in speech’ (Markwardt, 1942: 156). Most of us often ignore the importance of punctuation in writing systems and tend to believe that punctuation only depends on tradition and the personal styles of writers. In fact, punctuation marks may contribute significantly...

10.1017/s0266078418000512 article EN English Today 2018-12-17

Scientific writings, as one essential part of human culture, have evolved over centuries into their current form. Knowing how scientific writings have evolved is particularly helpful in understanding how trends in scientific culture developed. It also allows us to better understand how scientific culture was interwoven with human culture generally. The availability of massive digitized texts and the progress in computational technologies today provide a convenient and credible way to discern evolutionary patterns by examining diachronic linguistic changes....

10.1007/s11192-020-03816-8 article EN cc-by Scientometrics 2020-12-17

The double‐nominal construction (DNC), also called ‘topic construction’, is a common occurrence in Chinese and other East Asian languages. It is characterized by two initial NPs which appear before the predicate verb. The DNC has mostly been analyzed from a purely syntactic angle. The topic (the initial nominal phrase, abbreviated as NP1) needs to syntactically establish some connection with the comment (the rest of the construction) but this has, unfortunately, not been the case due to numerous counterexamples. This so...

10.1111/stul.12085 article EN Studia Linguistica 2018-02-26

The topic chain, one of the essential organization devices in Chinese discourse, is highlighted by the use of many co-referential zero forms. Although the topic chain has been realized to play an important role in organizing Chinese discourse, few attempts have been made to explore how zero forms are integrated into a meaningful unit and how the topic chain facilitates discourse organization, which are called “integration functions” in this paper. This study, based on a comprehensive review of topic chain studies, re-examines the core characteristics of the topic chain. After this, the integration functions...

10.4312/ala.9.1.29-57 article EN cc-by-sa Acta Linguistica Asiatica 2019-01-30

Hyphenated compounds have largely been neglected in the studies of compounding, which have seldom been analysed in context. In this study, we argue that hyphen use is strongly motivated. Hyphenation is used when words form a unit, which reduces the possibility of parsing them into separate units or other forms. The current study adopts a new perspective on contextual factors, namely, which part of speech (PoS) the compound as a whole belongs to and how people correctly parse the unit. This process can be observed by considering examples....

10.1016/j.langsci.2020.101326 article EN cc-by-nc-nd Language Sciences 2020-10-08

Frequency distribution of words, syntax and semantics in many languages abides by certain laws. However, because of the shortage of discourse corpora, few studies have examined whether the frequency of discourse relations follows some distributional patterns. Although there is research based on the Rhetorical Structure Theory treebank (RST-DT), each of these studies is limited to a single language. Apart from RST-DT, the Penn Discourse Treebank (PDTB), adopting another annotation system, has had an enormous influence on the study of discourse structure...

10.1080/09296174.2017.1390934 article EN Journal of Quantitative Linguistics 2018-01-05

This study proposes a new ensemble model (NEM) designed to simulate the maximum water level increases caused by storm surges in a frequently cyclone‐affected coastal water of Hong Kong, China. The model relies on storm and water level data spanning 1978–2022. The NEM amalgamates three machine learning algorithms: Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and XGBoost (XGB), employing a stacking technique for integration. Six parameters, determined using Recursive Feature Elimination algorithms...

10.1029/2023ea003243 article EN cc-by-nc Earth and Space Science 2023-12-01

In past studies, the few quantitative approaches to discourse structure were mostly confined to the presentation of the frequency of discourse relations. However, quantitative approaches should take into account both hierarchical and relational layers in discourse structure. This study considers these factors and addresses the issue of how discourse relations and discourse units are related. It draws upon the available corpora (rhetorical structure theory-discourse treebank (RST-DT)) from a new perspective. Since an RST tree can be converted into a syntactic dependency tree, the data extracted from RST-DT are useful...

10.1177/1461445619866985 article EN Discourse Studies 2019-08-02

The notion of sentencehood in Mandarin Chinese is much less well-defined than in many other languages, with a block of clauses often joined by commas without conjunctions and the period occurring at the end to indicate meaning completeness rather than sentential structure. The potential factors that may affect native speakers’ judgment and perception of sentence boundaries have not yet been systematically examined. In light of this research gap, this study investigates the factors that play a role in sentence boundary perception. To this end, we...

10.1007/s11145-022-10272-8 article EN cc-by Reading and Writing 2022-03-17

Amidst the rapid evolution of LLMs, the significance of evaluation in comprehending and propelling these models forward is increasingly paramount. Evaluations have revealed that factors such as scaling, training types, architectures and other factors profoundly impact the performance of LLMs. However, the extent and nature of these impacts continue to be subjects of debate because most assessments have been restricted to a limited number of data points. Clarifying the effects of these factors on performance scores can be more effectively achieved through a statistical lens. Our study...

10.48550/arxiv.2403.15250 preprint EN arXiv (Cornell University) 2024-03-22

In recent years, several influential computational models and metrics have been proposed to predict how humans comprehend and process sentences. One particularly promising approach is contextual semantic similarity. Inspired by the attention algorithm in Transformer and human memory mechanisms, this study proposes an "attention-aware" approach for computing contextual semantic relevance. This new approach takes into account the different contributions of contextual parts and the expectation effect, allowing it to incorporate contextual information fully. The...

10.48550/arxiv.2403.18542 preprint EN arXiv (Cornell University) 2024-03-27

This study employs deep learning techniques to explore four speaker profiling tasks on the TIMIT dataset, namely gender classification, accent classification, age estimation, and speaker identification, highlighting the potential and challenges of multi-task learning versus single-task models. The motivation for this research is twofold: firstly, to empirically assess the advantages and drawbacks of multi-task learning over single-task models in the context of speaker profiling; secondly, to emphasize the undiminished significance of skillful feature engineering for speaker recognition tasks. The findings reveal...

10.48550/arxiv.2404.12077 preprint EN arXiv (Cornell University) 2024-04-18

Data-driven approaches have revolutionized scientific research. Machine learning and statistical analysis are commonly utilized in this type of research. Despite their widespread use, these methodologies differ significantly in their techniques and objectives. Few studies have utilized a consistent dataset to demonstrate these differences within the social sciences, particularly in language and cognitive sciences. This study leverages the Buckeye Speech Corpus to illustrate how both machine learning and statistical analysis are applied in data-driven research to obtain distinct insights....

10.48550/arxiv.2404.14052 preprint EN arXiv (Cornell University) 2024-04-22

Machine Translation (MT) Quality Estimation (QE) assesses translation reliability without reference texts. This study introduces "textual similarity" as a new metric for QE, using sentence transformers and cosine similarity to measure semantic closeness. Analyzing data from the MLQE-PE dataset, we found that textual similarity exhibits stronger correlations with human scores than traditional metrics (hter, model evaluation etc.). Employing GAMMs as a statistical tool, we demonstrated that textual similarity consistently outperforms...

10.48550/arxiv.2406.07440 preprint EN arXiv (Cornell University) 2024-06-11
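
The cosine-similarity step mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `cosine_similarity` and the toy vectors are assumptions made for illustration, and in practice the vectors would be sentence embeddings produced by a sentence-transformer model, which is not loaded here.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: a zero vector has no direction,
        # so its similarity to anything is reported as 0.0.
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical stand-ins for the embeddings of a source sentence and its
# machine translation; real embeddings have hundreds of dimensions.
source_vec = [0.2, 0.7, 0.1]
translation_vec = [0.25, 0.65, 0.05]
score = cosine_similarity(source_vec, translation_vec)
```

A score close to 1 means the two embeddings point in nearly the same direction, which is the sense of "semantic closeness" used above; how the score relates to human quality judgments is an empirical question the sketch does not address.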

In recent years, several influential computational models and metrics have been proposed to predict how humans comprehend and process sentences. One particularly promising approach is contextual semantic similarity. Inspired by the attention algorithm in Transformer and human memory mechanisms, this study proposes an "attention-aware" approach for computing contextual semantic relevance. This new approach takes into account the different contributions of contextual parts and the expectation effect, allowing it to incorporate contextual information fully. The attention-aware...

10.1016/j.cognition.2024.105991 article EN cc-by-nc Cognition 2024-11-26