NFDI4DS | UHH-SEMS - Publication Details

Murat Saraçlar

ORCID: 0000-0002-7435-8510

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5055086464

Research Areas

Speech Recognition and Synthesis
Natural Language Processing Techniques
Topic Modeling
Music and Audio Processing
Speech and Audio Processing
Speech and dialogue systems
Hand Gesture Recognition Systems
Hearing Impairment and Communication
Blind Source Separation Techniques
Advanced Text Analysis Techniques
Algorithms and Data Compression
Handwritten Text Recognition Techniques
Text and Document Classification Technologies
Human Pose and Action Recognition
Phonocardiography and Auscultation Techniques
Video Analysis and Summarization
Gait Recognition and Analysis
Advanced Data Compression Techniques
Sentiment Analysis and Opinion Mining
Web Data Mining and Analysis
Time Series Analysis and Forecasting
Advanced Adaptive Filtering Techniques
Neural Networks and Applications
Flow Measurement and Analysis
Phonetics and Phonology Research

Stantec (Canada)
2022

Boğaziçi University
2012-2021

IBM (United States)
2013

Brigham Young University - Idaho
2012

Philips (Netherlands)
2008

AT&T (United States)
2002-2006

Google (United States)
2004

Johns Hopkins University
2000-2002

Carnegie Mellon University
1999

Discriminative n-gram language modeling

OPENALEX - Publications

Brian Roark Murat Saraçlar Michael Collins

10.1016/j.csl.2006.06.006 article EN Computer Speech & Language 2006-08-07

Retrieval and browsing of spoken content

OPENALEX - Publications

Ciprian Chelba Timothy J. Hazen Murat Saraçlar

Ever-increasing computing power and connectivity bandwidth, together with falling storage costs, are resulting in an overwhelming amount of data various types being produced, exchanged, stored. Consequently, information search retrieval has emerged as a key application area. Text-based is the most active area, applications that range from Web local network to searching for personal residing on one's own hard-drive. Speech received less attention perhaps because large collections spoken...

10.1109/msp.2008.917992 article EN IEEE Signal Processing Magazine 2008-04-23

Lattice Indexing for Spoken Term Detection

OPENALEX - Publications

Doğan Can Murat Saraçlar

This paper considers the problem of constructing an efficient inverted index for spoken term detection (STD) task. More specifically, we construct a deterministic weighted finite-state transducer storing soft-hits in form (utterance ID, start time, end posterior score) quadruplets. We propose generalized factor structure which retains time information necessary performing STD. The required is embedded into path weights without disrupting inherent optimality. also describe how to all...

10.1109/tasl.2011.2134087 article EN IEEE Transactions on Audio Speech and Language Processing 2011-04-26

Discriminative language modeling with conditional random fields and the perceptron algorithm

OPENALEX - Publications

Brian Roark Murat Saraçlar Michael Collins Mark Johnson

This paper describes discriminative language modeling for a large vocabulary speech recognition task. We contrast two parameter estimation methods: the perceptron algorithm, and method based on conditional random fields (CRFs). The models are encoded as deterministic weighted finite state automata, applied by intersecting automata with word-lattices that output from baseline recognizer. algorithm has benefit of automatically selecting relatively small feature set in just couple passes over...

10.3115/1218955.1218962 article EN 2004-01-01

Morph-based speech recognition and modeling of out-of-vocabulary words across languages

OPENALEX - Publications

Mathias Creutz Teemu Hirsimäki Mikko Kurimo Antti Puurula Janne Pylkkönen and 5 more

We explore the use of morph-based language models in large-vocabulary continuous-speech recognition systems across four so-called morphologically rich languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic. The morphs are subword units discovered an unsupervised, data-driven way using Morfessor algorithm. By estimating n -gram over sequences instead words, quality model is improved through better vocabulary coverage reduced data sparsity. Standard word suffer from high...

10.1145/1322391.1322394 article EN ACM Transactions on Speech and Language Processing 2007-12-01

Turkish Broadcast News Transcription and Retrieval

OPENALEX - Publications

Ebru Arısoy Doğan Can Siddika Parlak Haşim Sak Murat Saraçlar

This paper summarizes our recent efforts for building a Turkish Broadcast News transcription and retrieval system. The agglutinative nature of leads to high number out-of-vocabulary (OOV) words which in turn lower automatic speech recognition (ASR) accuracy. situation compromises the performance systems based on ASR output. Therefore using word-based is not adequate transcribing Turkish. To alleviate this problem, various sub-word-based units are utilized. These solve OOV problem with...

10.1109/tasl.2008.2012313 article EN IEEE Transactions on Audio Speech and Language Processing 2009-06-10

Stochastic pronunciation modelling from hand-labelled phonetic corpora

OPENALEX - Publications

Michael Riley Bill Byrne Michael Finke Sanjeev Khudanpur Andrej Ljolje and 5 more

10.1016/s0167-6393(99)00037-0 article FR Speech Communication 1999-11-01

Pronunciation modeling by sharing Gaussian densities across phonetic models

OPENALEX - Publications

Murat Saraçlar Harriet J. Nock Sanjeev Khudanpur

Conversational speech exhibits considerable pronunciation variability, which has been shown to have a detrimental effect on the accuracy of automatic recognition. There many attempts model variation, including use decision trees generate alternate word pronunciations from phonemic baseforms. Use models during recognition is known improve accuracy. This paper describes incorporation into acoustic training in addition Subtle difficulties straightforward alternatives canonical are first...

10.1006/csla.2000.0140 article EN cc-by-nc-nd Computer Speech & Language 2000-04-01

General indexation of weighted automata

OPENALEX - Publications

Cyril Allauzen Mehryar Mohri Murat Saraçlar

Much of the massive quantities digitized data widely available, e.g., text, speech, hand-written sequences, are either given directly, or, as a result some prior processing, weighted automata. These compact representations large number alternative sequences and their weights reflecting uncertainty or variability data. Thus, indexation such requires indexing

10.3115/1626307.1626314 article EN 2004-01-01

Spoken Term Detection for Turkish Broadcast News

OPENALEX - Publications

Siddika Parlak Murat Saraçlar

In this paper, we present a baseline spoken term detection (STD) system for Turkish broadcast news. The agglutinative structure of causes high out-of-vocabulary (OOV) rate and increases word error (WER) in automatic speech recognition. Several approaches are attempted to reduce negative effect on the STD system. Sub-word units used handle OOV queries lattice-based indexing is obtain different operating points WER cases. A recently proposed method setting specific thresholds also evaluated...

10.1109/icassp.2008.4518842 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2008-03-01

A Comparison of SVM and GMM-Based Classifier Configurations for Diagnostic Classification of Pulmonary Sounds

OPENALEX - Publications

İpek Şen Murat Saraçlar Yasemin P. Kahya

The aim of this study is to find a useful methodology classify multiple distinct pulmonary conditions including the healthy condition and various pathological types, using sounds data.Fourteen-channel data 40 subjects (healthy pathological, where pathologies are obstructive restrictive types) modeled second order 250-point vector autoregressive model. estimated model parameters fed support machine Gaussian mixture (GMM) classifiers which used in configurations, resulting eight different...

10.1109/tbme.2015.2403616 article EN IEEE Transactions on Biomedical Engineering 2015-02-12

Discriminative syntactic language modeling for speech recognition

OPENALEX - Publications

Michael Collins Brian Roark Murat Saraçlar

We describe a method for discriminative training of language model that makes use syntactic features. follow reranking approach, where baseline recogniser is used to produce 1000-best output each acoustic input, and second "reranking" then choose an utterance from these lists. The features together with parameter estimation based on the perception algorithm. experiments Switchboard speech recognition task. provide additional 0.3% reduction in test-set error rate beyond (Roark et al., 2004a;...

10.3115/1219840.1219903 article EN 2005-01-01

Unlimited vocabulary speech recognition for agglutinative languages

OPENALEX - Publications

Mikko Kurimo Antti Puurula Ebru Arısoy Vesa Siivola Teemu Hirsimäki and 3 more

It is practically impossible to build a word-based lexicon for speech recognition in agglutinative languages that would cover all the relevant words. The problem words are generally built by concatenating several prefixes and suffixes word roots. Together with compounding inflections this leads millions of different, but still frequent forms. Due inflections, ambiguity other phenomena, it also not trivial automatically split into meaningful parts. Rule-based morphological analyzers can...

10.3115/1220835.1220897 article EN 2006-01-01

The AT&T WATSON Speech Recognizer

OPENALEX - Publications

Vincent Goffin Cyril Allauzen Enrico Bocchieri Dilek Hakkani‐Tür Andrej Ljolje and 4 more

This paper describes the AT&T WATSON real-time speech recognizer, product of several decades research at AT&T. The recognizer handles a wide range vocabulary sizes and is based on continuous-density hidden Markov models for acoustic modeling finite state networks language modeling. recognition network optimized efficient search. We identify algorithms used high-accuracy, low-latency recognition. present results small large tasks taken from VoiceTone/sup /spl reg// service, showing word...

10.1109/icassp.2005.1415293 article EN 2006-10-11

Resources for Turkish morphological processing

OPENALEX - Publications

Haşim Sak Tunga Güngör Murat Saraçlar

10.1007/s10579-010-9128-6 article EN Language Resources and Evaluation 2010-08-09

Effect of pronounciations on OOV queries in spoken term detection

OPENALEX - Publications

Doğan Can Erica Cooper Abhinav Sethy Chris White Bhuvana Ramabhadran and 1 more

The spoken term detection (STD) task aims to return relevant segments from a archive that contain the query terms whether or not they are in system vocabulary. This paper focuses on pronunciation modeling for out-of-vocabulary (OOV) which frequently occur STD queries. described this indexes word-level and sub-word level lattices confusion networks produced by an LVCSR using weighted finite state transducers (WFST).We investigate inclusion of n-best variants OOV (obtained letter-to-sound...

10.1109/icassp.2009.4960494 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2009-04-01

Wind Speed Forecasting Based on Second Order Blind Identification and Autoregressive Model

OPENALEX - Publications

Umut Fırat Şeref Naci Engin Murat Saraçlar A. Ertüzün

Wind power may present undesirable discontinuities and fluctuations due to considerable variations in wind speed, which affect adversely the smooth operation of grid. Effective forecast is essential order report amount energy supply with high accuracy, crucial for planning resources system operators. Variations cannot be sufficiently estimated by persistence type basic forecasting methods particularly medium long terms. Therefore a new statistical method presented here this paper based on...

10.1109/icmla.2010.106 article EN 2010-12-01

Hybrid language models for out of vocabulary word detection in large vocabulary conversational speech recognition

OPENALEX - Publications

Akan Yazgan Murat Saraçlar

In this paper, we propose a method for out-of-vocabulary (OOV) word detection and take step toward open vocabulary automatic speech recognition. The proposed uses hybrid language model combining words subword units such as phones or syllables. We describe algorithm based on the posterior count of OOV given model, compare it to using probability best string conventional only model. Experimental results Switchboard corpus are presented different sizes. new yields gain over 10% in detection....

10.1109/icassp.2004.1326093 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2004-09-28

Corrective language modeling for large vocabulary ASR with the perceptron algorithm

OPENALEX - Publications

Brian Roark Murat Saraçlar Michael J. Collins

This paper investigates error-corrective language modeling using the perceptron algorithm on word lattices. The resulting model is encoded as a weighted finite-state automaton, and used by intersecting with lattices, making it simple inexpensive to apply during decoding. We present results for various training scenarios Switchboard task, including n-gram features of different orders, performing n-best extraction versus full demonstrate importance conditions close possible testing conditions....

10.1109/icassp.2004.1326094 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2004-09-28

Morphology-based and sub-word language modeling for Turkish speech recognition

OPENALEX - Publications

Haşim Sak Murat Saraçlar Tunga Güngör

We explore morphology-based and sub-word language modeling approaches proposed for morphologically rich languages, evaluate contrast them Turkish broadcast news transcription task. In addition, as a model, we improve our previously morphology-integrated model automatic speech recognition. This is built by composing the finite-state transducer of morphological parser with over lexical morphemes. approach provides search network an unlimited vocabulary, generating only valid word forms while...

10.1109/icassp.2010.5494927 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2010-03-01

Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition

OPENALEX - Publications

Haşim Sak Murat Saraçlar Tunga Güngör

This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and exploit morphology as a knowledge source. The first model, morpholexical is generative <formula formulatype="inline" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex Notation="TeX">$n$</tex></formula> -gram where units are lexical-grammatical morphemes instead of commonly used words...

10.1109/tasl.2012.2201477 article EN IEEE Transactions on Audio Speech and Language Processing 2012-05-29

An empirical study of confusion modeling in keyword search for low resource languages

OPENALEX - Publications

Murat Saraçlar Abhinav Sethy Bhuvana Ramabhadran Lidia Mangu Jia Cui and 3 more

Keyword search, in the context of low resource languages, has emerged as a key area research. The dominant approach keyword search is to use Automatic Speech Recognition (ASR) front end produce representation audio that can be indexed. biggest drawback this lies its inability deal with out-of-vocabulary words and query terms are not ASR system output. In paper we present an empirical study evaluating various approaches based on using confusion models expansion techniques address problem. We...

10.1109/asru.2013.6707774 article EN 2013-12-01

Coming Soon ...