Yao Yao

ORCID: 0000-0001-8825-2680
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Phonetics and Phonology Research
  • Linguistic Variation and Morphology
  • Speech Recognition and Synthesis
  • Natural Language Processing Techniques
  • Topic Modeling
  • Speech and dialogue systems
  • Language Development and Disorders
  • Language, Metaphor, and Cognition
  • Speech and Audio Processing
  • Mobile Crowdsensing and Crowdsourcing
  • Neurobiology of Language and Bilingualism
  • Syntax, Semantics, Linguistic Variation
  • Multilingual Education and Policy
  • Categorization, perception, and language
  • Advanced Text Analysis Techniques
  • Second Language Acquisition and Learning
  • EFL/ESL Teaching and Learning
  • Sentiment Analysis and Opinion Mining
  • Reading and Literacy Development
  • Color perception and design
  • Semantic Web and Ontologies
  • Language and cultural evolution
  • Social Robot Interaction and HRI
  • Data Stream Mining Techniques
  • Discourse Analysis in Language Studies

Hong Kong Polytechnic University
2013-2023

Peking University
2019-2023

Hwa Chong Institution
2023

Shanghai Jiao Tong University
2023

Zhejiang International Studies University
2022

Zhejiang University
2022

Research Institute of Highway
2021

Ministry of Transport
2021

Nanjing Forestry University
2021

East China Normal University
2017

Mobile apps are ubiquitous, operate in complex environments and developed under the time-to-market pressure. Ensuring their correctness reliability thus becomes an important challenge. This paper introduces Stoat, a novel guided approach to perform stochastic model-based testing on Android apps. Stoat operates two phases: (1) Given app as input, it uses dynamic analysis enhanced by weighted UI exploration strategy static reverse engineer model of app's GUI interactions; (2) adapts Gibbs...

10.1145/3106237.3106298 article EN 2017-08-02

This study investigates the source and status of a recent sound change in Shanghainese (Wu, Sinitic) that has been attributed to language contact with Mandarin. The involves two vowels, /e/ /ɛ/, reported be merged three decades ago but produced distinctly contemporary Shanghainese. Results production experiments show speaker age, mode (monolingual vs. bilingual Shanghainese-Mandarin), crosslinguistic phonological similarity all influence these vowels. These findings provide evidence for as...

10.1353/lan.2016.0031 article EN Language 2016-01-01

This study tested the hypothesis that heritage speakers of a minority language, due to their childhood experience with two languages, would outperform late learners in producing contrast: language-internal phonological contrast, as well cross-linguistic phonetic contrast between similar, yet acoustically distinct, categories different languages. To this end, production Mandarin and English by was compared native American English-speaking three experiments. In experiment 1, back vowels were...

10.1121/1.3569736 article EN The Journal of the Acoustical Society of America 2011-06-01

Individual variation is key to understanding phenomena in phonetic and change, including the production-perception link. To test generalizability of this relationship, study compares community- individual-level across three long-standing consonant mergers Hong Kong Cantonese speakers: [n]→[l], [ŋ̩]→[m̩], [ŋ]↔Ø. Concurrently, we document these understudied a community that has undergone rapid social change recent decades. Younger (college-aged) older (middle-aged) Kongers completed reading...

10.16995/labphon.6461 article EN cc-by Laboratory Phonology Journal of the Association for Laboratory Phonology 2022-05-26

Abstract Children acquire their language in different ways. In this paper we propose some new measures from a network approach quantifying these individual differences. Children's and care-takers' speech data are represented as series of networks, word forms being taken nodes collocation words links. First, compare two independent indices on growth, including the size connectivity. with small vocabulary (i.e. size) may have more flexibility combination large connectivity), vice versa....

10.1080/09296170701794286 article EN Journal of Quantitative Linguistics 2008-01-17

Driver's under-arousal occurred in automated driving systems (ADS) impairs takeover safety. This study aims to determine electrodermal activity (EDA) features' importance for driver's arousal quantification. A car-following simulator was conducted with participants concurrently executing four levels of cognitive tasks, triggering arousal. Participants' skin conductance (SC) data were collected and decomposed into tonic (skin level, SCL) phasic response, SCR) components. Seventeen features...

10.1109/tits.2021.3135266 article EN IEEE Transactions on Intelligent Transportation Systems 2021-12-24

In recent years, corpus phonetics has become a rapidly expanding field.However, the lack of appropriate tools for automatic acoustic analysis hinders further development field.In this paper, we present methodological study on extraction vowel formants using both robust linear predictive coding (RLPC; Lee, 1988) and dynamic formant tracking (Talkin, 1987).Acoustic data were taken from Buckeye English conversations.We varied two aspects -preemphasis LPC order -to optimize results by speaker...

10.5070/p72pm9c9sq article EN UC Berkeley Phonology Lab Annual Reports 2010-01-01

In previous work examining heritage language phonology, speakers have often patterned differently from native and late-onset second (L2) learners with respect to overall accent segmentals. The current study extended this line of inquiry suprasegmentals, comparing the properties lexical tones produced by heritage, native, L2 Mandarin living in U.S. We hypothesized that would approximate norms for more closely than speakers, yet diverge these one or ways. further that, due their unique...

10.46538/hlj.13.2.4 article EN Heritage Language Journal 2016-08-31

Abstract Plant color landscape plays an important role in improving the quality of visual landscapes, regulating emotion space, and highlighting characteristics urban landscapes. How to reasonably quantify create rich plant landscapes achieve best perception at different scales, so as better meet aesthetic needs public, has become a hot difficult issue design application. Therefore, this article selects four typical parks Nanjing study communities. The natural system card is used extract...

10.1002/col.22713 article EN Color Research & Application 2021-07-17

Whether tone language experience facilitates non-native perception is an area of research that previously yielded conflicting results, potentially because the lack systematical control speaker normalization effects across studies. Under a high-variability testing condition with controlled cues, Cantonese (native controls), Mandarin (Cantonese-naive listeners), and English (non-tone listeners) listeners identified three level tones. The results indicate facilitatory effect on when for...

10.1121/1.4976037 article EN The Journal of the Acoustical Society of America 2017-02-01

Despite the increasing interest in emotion and sentiment analysis Chinese text, field lacks reliable, normative ratings of emotional content valence words. This paper reports first large-scale survey average language users' judgment perceived type (e.g., anger, happiness), intensity, positive, negative) The results reveal significant differences from previously proposed lexicons, which mostly relied on a few researchers' or automatic annotation. Furthermore, current study also explores issue...

10.1186/s40655-016-0015-y article EN cc-by Lingua Sinica 2016-10-19

Previous literature has documented phonetic accommodation for various segmental and suprasegmental features, but the of tone remains under-explored. The current study contributes to by investigating two merging tones in Hong Kong Cantonese, mid-level Tone 3 (T3) low-level 6 (T6), a speech shadowing experiment. Specifically, we ask whether shadowers will reverse trend after exposure model talker with distinct T3–T6 productions if so, what factors modulate accommodative behaviors. Evidence...

10.1016/j.wocn.2021.101060 article EN cc-by-nc-nd Journal of Phonetics 2021-05-11

Researchers are witnessing knowledge-inspired natural language processing shifts the focus from entity-level to event-level, whereas event coreference resolution is one of core challenges. This paper proposes a novel model for within-document resolution. On basis but not entity as before, our learns and integrates multiple representations both alone pair. For former, we introduce linguistics-motivated features more discriminative representations. latter, consider similarity measures capture...

10.18653/v1/2023.findings-acl.855 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2023-01-01

This is a corpus study on closure duration and VOT in English voiceless stops word-initial position.19 speakers' (10 female, 9 male) data from the Buckeye Speech are used study.The first half of paper introduces novel approach automatically finding point stop release large speech database, using Mel spectral templates similarity scores.The performance robustness algorithm discussed detail.To our knowledge, this also automatic measure that reported detail literature.The second studies as...

10.5070/p71hs7h769 article EN UC Berkeley Phonology Lab Annual Reports 2007-01-01

This study investigated the production of five Mandarin and English sibilant fricatives by heritage speakers in comparison to native late learners. Almost all were found distinguish retroflex alveolo-palatal, as well alveolo-palatal palato-alveolar. However, fewer distinguished palato-alveolar or alveolars, with majority falling into this group distinguishers both cases. These results indicate that speakers, addition most learners, do not have much trouble post-alveolar contrast,...

10.5070/p75t2092k9 article EN UC Berkeley Phonology Lab Annual Reports 2008-01-01

This paper reports a corpus study on the variation of VOT in voiceless stops spontaneous speech.Two speakers' data from Buckeye are used: one is an older female speaker with low speaking rate while other younger male extremely high rate.Linear regression analysis shows that place articulation, word frequency, phonetic context, speech and utterance position all have effect length VOT.However, altogether less than 20% explained both speakers, which suggests pronunciation highly complicated...

10.5070/p76dd1x6cs article EN UC Berkeley Phonology Lab Annual Reports 2009-01-01

Abstract Differential affective processing has been widely documented for bilinguals: L1 words elicit higher levels of arousal and stronger emotionality ratings than L2 (Pavlenko, 2012). In this study, we focus on two closely related Chinese languages, Mandarin Cantonese, whose lexicons are highly overlapping, with shared lexical items that only differ in pronunciation across languages. We recorded Cantonese – bilinguals’ pupil responses to auditory tokens words. Our results showed...

10.1017/s1366728922000931 article EN cc-by Bilingualism Language and Cognition 2023-01-17

Phonological neighborhood effects have been found in spoken word recognition, production and phonetic variation (Gahl, Yao, & Johnson, 2012; Luce Pisoni, 1998; Vitevitch, 2002). Overall, words from dense neighborhoods are harder to recognize but easier produce. However, most previous studies focused on English, while evidence suggests that these may not generalize cross-linguistically due language-specific configurations of the lexicon (Michael S Vitevitch Stamer, 2006, 2009). In current...

10.3765/plsa.v2i0.4090 article EN Proceedings of the Linguistic Society of America 2017-06-12

Acoustic analyses of normal voiced and whispered Mandarin Chinese reveal significant differences in duration intensity among the four lexical tones, that are moreover similar across two speech genres.In contrast to previous claims, however, these tones found shrink whisper rather than being exaggerated facilitate perception.Furthermore, individual variation exists production which shorten or lengthen with respect depending on speaker.

10.5070/p71581q7qr article EN UC Berkeley Phonology Lab Annual Reports 2007-01-01

Abstract Semantic transparency deals with the interface between lexical semantics and morphology. It is an important linguistic phenomenon in Chinese context of prediction meanings compounds from their constituents. Given prominence compounding morpho-lexical processes, to date there no semantic dataset available support verifiable replicable quantitative analysis Mandarin Chinese. In addition, relation morphological structure has not been systematically examined. This paper reports a...

10.1075/lali.00035.wan article EN cc-by-nc Language and Linguistics 語言暨語言學 2019-04-05
Coming Soon ...