NFDI4DS | UHH-SEMS - Publication Details

M. Lang

ORCID: 0009-0001-8080-0010

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5088367992

Research Areas

Speech and Audio Processing
Music and Audio Processing
Emotion and Mood Recognition
Speech and dialogue systems
Speech Recognition and Synthesis
Handwritten Text Recognition Techniques
Hand Gesture Recognition Systems
Natural Language Processing Techniques
Human Pose and Action Recognition
Semiconductor Lasers and Optical Devices
Video Analysis and Summarization
Music Technology and Sound Studies
Mathematics, Computing, and Information Processing
Radio Frequency Integrated Circuit Design
Advancements in PLL and VCO Technologies
Infant Health and Development
Usability and User Interface Design
Gaze Tracking and Assistive Technology
Gait Recognition and Analysis
Advanced Photonic Communication Systems
Interactive and Immersive Displays
Photonic and Optical Devices
Image Retrieval and Classification Techniques
Face and Expression Recognition
Augmented Reality Applications

New York Hospital Queens
2024

NewYork–Presbyterian Hospital
2024

Friedrich-Alexander-Universität Erlangen-Nürnberg
2022

Technical University of Munich
1992-2009

University of Toronto
2005

Fraunhofer Institute for Applied Solid State Physics
1995-2004

Ludwig-Maximilians-Universität München
2002-2003

Fraunhofer Society
1999

Siemens (Germany)
1988

Hidden Markov model-based speech emotion recognition

OPENALEX - Publications

Björn W. Schuller Gerhard Rigoll M. Lang

In this contribution we introduce speech emotion recognition by use of continuous hidden Markov models. Two methods are propagated and compared throughout the paper. Within first method a global statistics framework an utterance is classified Gaussian mixture models using derived features raw pitch energy contour signal. A second introduces increased temporal complexity applying considering several states low-level instantaneous instead statistics. The paper addresses design working engines...

10.1109/icme.2003.1220939 article EN 2003-01-01

Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture

OPENALEX - Publications

Björn W. Schuller Gerhard Rigoll M. Lang

In this paper we introduce a novel approach to the combination of acoustic features and language information for most robust automatic recognition speaker's emotion. Seven discrete emotional states are classified throughout work. Firstly model emotion by is presented. The derived signal-, pitch-, energy, spectral contours ranked their quantitative contribution estimation an Several different classification methods including linear classifiers, Gaussian mixture models, neural nets, support...

10.1109/icassp.2004.1326051 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2004-09-28

Hidden Markov model-based speech emotion recognition

OPENALEX - Publications

Björn W. Schuller Gerhard Rigoll M. Lang

We introduce speech emotion recognition by use of continuous hidden Markov models. Two methods are propagated and compared. In the first method, a global statistics framework an utterance is classified Gaussian mixture models using derived features raw pitch energy contour signal. A second method introduces increased temporal complexity, applying considering several states low-level instantaneous instead statistics. The paper addresses design working engines, results achieved with respect to...

10.1109/icassp.2003.1202279 article EN 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003-12-22

Speaker Independent Speech Emotion Recognition by Ensemble Classification

OPENALEX - Publications

Björn W. Schuller S.A. Reiter Ronald Müller Marc Al-Hames M. Lang and 1 more

Emotion recognition grows to an important factor in future media retrieval and man machine interfaces. However, even human deciders often experience problems realizing one's emotion, especially of strangers. In this work we strive recognize emotion independent the person concentrating on speech channel. Single feature relevance acoustic features is a critical point, which address by filter-based gain ratio calculation starting at basis 276 features. As optimization minimum set as whole...

10.1109/icme.2005.1521560 article EN 2005-10-24

Meta-Classifiers in Acoustic and Linguistic Feature Fusion-Based Affect Recognition

OPENALEX - Publications

Björn W. Schuller R.J. Villar Gerhard Rigoll M. Lang

We suggest a novel approach to affect recognition based on acoustic and linguistic analysis of spoken utterances. In order achieve maximum discrimination power within robust integration these information sources, fusion the feature level is introduced. Considering classification, we use meta-classifiers, such as StackingC Boosting, for stabilized performance, combination classifiers ensembles. Extensive comparisons diverse base-classifiers, including support vector machines, neural networks,...

10.1109/icassp.2005.1415116 article EN 2006-10-11

Segmentation and recognition of symbols within handwritten mathematical expressions

OPENALEX - Publications

M. Koschinski Hanspeter Winkler M. Lang

An efficient on-line recognition system for symbols within handwritten mathematical expressions is proposed. The based on the generation of a symbol hypotheses net and classification elements net. final done by calculating most probable path through under regard stroke group probabilities obtained recognizer hidden Markov models.

10.1109/icassp.1995.479986 article EN International Conference on Acoustics, Speech, and Signal Processing 2002-11-19

Multimodal emotion recognition in audiovisual communication

OPENALEX - Publications

Björn W. Schuller M. Lang Gerhard Rigoll

This paper discusses innovative techniques to automatically estimate a user's emotional state analyzing the speech signal and haptical interaction on touch-screen or via mouse. The knowledge of emotion permits adaptive strategies striving for more natural robust interaction. We classify seven states: surprise, joy, anger, fear, disgust, sadness, neutral user state. is extracted by parallel stochastic analysis his spoken machine interactions while understanding desired intention. introduced...

10.1109/icme.2002.1035889 article EN 2003-06-25

A soft-decision approach for structural analysis of handwritten mathematical expressions

OPENALEX - Publications

Hanspeter Winkler H. Fahrner M. Lang

An efficient system for structural analysis of handwritten mathematical expressions is proposed. To handle the problems caused by handwriting, this based on a soft-decision approach. This means that alternatives solution are generated during process if relation between two symbols within expression ambiguous. Finally string containing information and syntactical verified each alternative. Strings failing verification considered as invalid.

10.1109/icassp.1995.480046 article EN International Conference on Acoustics, Speech, and Signal Processing 2002-11-19

A real-time system for hand gesture controlled operation of in-car devices

OPENALEX - Publications

Martin Zobl M. Geiger Björn W. Schuller M. Lang Gerhard Rigoll

The integration of more and functionality into the human machine interface (HMI) vehicles increases complexity device handling. Thus optimal use different sensory channels is an approach to simplify interaction with in-car devices. This way user convenience as much distraction may decrease. In this paper a video based real-time hand gesture recognition system for presented. It was developed in course extensive usability studies. combination optimized HMI it allows intuitive effective...

10.1109/icme.2003.1221368 article EN 2003-01-01

Spotting dynamic hand gestures in video image sequences using hidden Markov models

OPENALEX - Publications

P. Morguet M. Lang

A new and general stochastic approach to find identify dynamic gestures in continuous video streams is presented. Hidden Markov models (HMMs) are used solve this combined problem of temporal segmentation classification an integral way. Basically, improved normalized Viterbi algorithm allows one continuously observe the output scores HMMs at every time step. Characteristic peaks respective indicate presence gestures. Our experiments domain hand gesture spotting provided excellent recognition...

10.1109/icip.1998.999009 article EN 2002-11-27

ISI mitigation using decision feedback loop demonstrated with PMD distorted 10 Gbit/s signals

OPENALEX - Publications

L. Möller A. Thiede S. Chandrasekhar W. Benz M. Lang and 2 more

Electrical polarisation mode dispersion (PMD) and receiver bandwidth generated intersymbol interference (ISI) mitigation using an analogue decision feedback loop for 10 Gbit/s NRZ signals is demonstrated. ISI caused by first-order PMD of up to 120 ps differential group delay was equalised. Error free recovery with completely closed eye diagrams achieved.

10.1049/el:19991418 article EN Electronics Letters 1999-11-25

Optical methods for non-contact measurements of membranes

OPENALEX - Publications

S. Roose Yvan Stockman Pierre Rochus Thomas Kuhn M. Lang and 3 more

10.1016/j.actaastro.2009.03.061 article EN Acta Astronautica 2009-04-24

A soft-decision approach for symbol segmentation within handwritten mathematical expressions

OPENALEX - Publications

S. Lehmberg Hanspeter Winkler M. Lang

A soft-decision approach for symbol segmentation within on-line sampled handwritten mathematical expressions is presented. Based on stroke-specific features as well geometrical between the strokes a hypotheses net generated. For assistance additional knowledge obtained by prerecognition stage used. The results achieved and experiments indicate performance of our approach.

10.1109/icassp.1996.550766 article EN 2002-12-24

A new benchmark of soft X-ray transition energies of $$\mathrm {Ne}$$, $$\mathrm {CO}_2$$, and $$\mathrm {SF}_6$$: paving a pathway towards ppm accuracy

OPENALEX - Publications

Jakob Stierhof Steffen Kühn M. Winter P. Micke René Steinbrügge and 22 more

A key requirement for the correct interpretation of high-resolution X-ray spectra is that transition energies are known with high accuracy and precision. We investigate K-shell features Ne, CO$_2$, SF$_6$ gases, by measuring their photo ion-yield at BESSY II synchrotron facility simultaneously 1s-np fluorescence emission He-like ions produced in Polar-X EBIT. Accurate ab initio calculations transitions these provide basis calibration. While CO$_2$ result agrees well previous measurements,...

10.1140/epjd/s10053-022-00355-0 article EN cc-by The European Physical Journal D 2022-03-01

Online symbol segmentation and recognition in handwritten mathematical expressions

OPENALEX - Publications

Hanspeter Winkler M. Lang

This paper is concerned with the symbol segmentation and recognition task in context of online sampled handwritten mathematical expressions, first processing stage an overall system for understanding arithmetic formulas. Within our a statistical approach used tolerating ambiguities within decision stages resolving them either automatically by additional knowledge acquired following or interaction user. The results obtained different writers expressions demonstrate performance approach.

10.1109/icassp.1997.595518 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2002-11-22

Nonlinear Speed-control for a Continuum Theory of Traffic Flow

OPENALEX - Publications

Henning Lenz Rudolf Sollacher M. Lang

10.1016/s1474-6670(17)57422-4 article EN IFAC Proceedings Volumes 1999-07-01

Complete monolithic integrated 2.5 Gbit/s optoelectronic receiver with large area MSM photodiode for 850 nm wavelength

OPENALEX - Publications

M. Lang W. Bronner W. Benz M. Ludwig V. Hurm and 4 more

A novel optoelectronic receiver chip for a data rate of 2.5 Gbit/s has been developed and tested. It integrates metal-semiconductor-metal photodiode with GaAs HEMT transimpedance amplifier, high gain amplifier limiting output buffer which is able to drive 50 Ω load. special feature the that it comprises very large 300 µm diameter, eliminating need expensive fibre alignment. Measurements reveal achieves required sensitivity –15.7 dBm at bit error 10-9.

10.1049/el:20010859 article EN Electronics Letters 2001-01-01

Feature Selection and Stacking for Robust Discrimination of Speech, Monophonic Singing, and Polyphonic Music

OPENALEX - Publications

Björn W. Schuller B.J.B. Schmitt Dejan Arsić S.A. Reiter M. Lang and 1 more

In this work we strive to find an optimal set of acoustic features for the discrimination speech, monophonic singing, and polyphonic music robustly segment media streams annotation interaction purposes. Furthermore introduce ensemble-based classification approaches within task. From a basis 276 attributes select most efficient by SVM SFFS. Additionally relevance single calculation information gain ratio is presented. As comparison reduce dimensionality PCA. We show extensive analysis...

10.1109/icme.2005.1521554 article EN 2005-10-24

Comparison of approaches to continuous hand gesture recognition for a visual dialog system

OPENALEX - Publications

P. Morguet M. Lang

Continuous hand gesture recognition requires the detection of gestures in a video stream and their classification. In this paper two continuous solutions using hidden Markov models (HMMs) are compared. The first approach uses motion algorithm to isolate candidates followed by HMM step. second is single-stage, HMM-based spotting method improved new implicit duration modeling. Both strategies have been tested on data containing 41 different types embedded random motion. has derived from...

10.1109/icassp.1999.757609 article EN 1999-01-01

HMM-based music retrieval using stereophonic feature information and framelength adaptation

OPENALEX - Publications

Björn W. Schuller Gerhard Rigoll M. Lang

Music retrieval methods are in the focus of recent interest due to increasing size music databases as e.g. Internet. Among different query content-based media analyzing intrinsic characteristics source seems form most intuitive access. The key-melody a song can be regarded major characteristic and leads by humming or singing. In this paper we turn our attention both, features algorithm matching audio retrieval. Nowadays approaches propagate use dynamic time warping for process. As reference...

10.1109/icme.2003.1221716 article EN 2003-01-01

Emotion recognition in the manual interaction with graphical user interfaces

OPENALEX - Publications

Björn W. Schuller Gerhard Rigoll M. Lang

We introduce a novel approach to human emotion recognition, based on manual computer interaction. The presented methods rely conventional graphical input devices. Firstly, standard mouse as used desktop PCs, and, secondly, the interaction with touch-screens or -pads in public information terminals, palm-top devices tablet PCs is considered. Additionally, gain of integration touch pressure evaluated. Four discrete emotional states are classified: irritation, annoyance, reflectiveness, and...

10.1109/icme.2004.1394440 article EN 2005-03-21

Multimodal music retrieval for large databases

OPENALEX - Publications

Björn W. Schuller Gerhard Rigoll M. Lang

We present a novel multi-modal access to large MP3 music databases. Retrieval can be fulfilled either in content-based manner or by keywords. As input modalities, speech natural language utterances singing, and manual interaction handwriting, typing hardkeys are used. In order achieve especially robust retrieval results automatically suggest the user, contextual knowledge of time, date, season, user emotion, listening habits is integrated process. The system communicates with visual...

10.1109/icme.2004.1394310 article EN 2005-03-21

Coming Soon ...