NFDI4DS | UHH-SEMS - Publication Details

Matt McVicar

ORCID: 0000-0003-0212-4093

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5017327131

Research Areas

Music and Audio Processing
Music Technology and Sound Studies
Speech and Audio Processing
Educational Technology in Learning
Social Sciences and Policies
Literacy and Educational Practices
Neuroscience and Music Perception
Speech Recognition and Synthesis
Music History and Culture
Advanced Text Analysis Techniques
Data Mining Algorithms and Applications
Semantic Web and Ontologies
Advanced Adaptive Filtering Techniques
Diverse Musicological Studies
Copyright and Intellectual Property
Computational Physics and Python Applications
Artificial Intelligence in Law
Human Mobility and Location-Based Analysis
Advanced Data Compression Techniques
Text and Document Classification Technologies
Animal Vocal Communication and Behavior
Emotion and Mood Recognition
Neural Networks and Applications
Magnetic Field Sensors Techniques
Water Systems and Optimization

Apple (United Kingdom)
2024

University of Bristol
2010-2018

National Institute of Advanced Industrial Science and Technology
2014-2015

Baidu (China)
2015

librosa: Audio and Music Signal Analysis in Python

OPENALEX - Publications

Brian McFee Colin Raffel Dawen Liang Daniel P. W. Ellis Matt McVicar and 2 more

This document describes version 0.4.0 of librosa: a Python package for audio and music signal processing. At high level, librosa provides implementations variety common functions used throughout the field information retrieval. In this document, brief overview library's functionality is provided, along with explanations design goals, software development practices, notational conventions.

10.25080/majora-7b98e3ed-003 article EN cc-by Proceedings of the Python in Science Conferences 2015-01-01

An End-to-End Machine Learning System for Harmonic Analysis of Music

OPENALEX - Publications

Yizhao Ni Matt McVicar Raúl Santos‐Rodríguez Tijl De Bie

We present a new system for the harmonic analysis of popular musical audio. It is focused on chord estimation, although proposed additionally estimates key sequence and bass notes. distinct from competing approaches in two main ways. First, it makes use improved chromagram representation audio that takes human perception loudness into account. Furthermore, first joint estimation chords, keys, notes fully based machine learning, requiring no expert knowledge to tune parameters. This means...

10.1109/tasl.2012.2188516 article EN IEEE Transactions on Audio Speech and Language Processing 2012-02-22

Understanding Effects of Subjectivity in Measuring Chord Estimation Accuracy

OPENALEX - Publications

Yizhao Ni Matt McVicar Raúl Santos‐Rodríguez Tijl De Bie

To assess the performance of an automatic chord estimation system, reference annotations are indispensable. However, owing to complexity music and sometimes ambiguous harmonic structure polyphonic music, inherently subjective, as a result any derived accuracy estimates will be subjective well. In this paper, we investigate extent confounding effect subjectivity in annotations. Our results show that is important, they affect different types systems ways. have implications for research on...

10.1109/tasl.2013.2280218 article EN IEEE Transactions on Audio Speech and Language Processing 2013-08-29

librosa 0.5.0

OPENALEX - Publications

Brian McFee Matt McVicar Oriol Nieto Stefan Balke Carl Thome and 17 more

10.5281/zenodo.293021 article 2017-02-17

librosa/librosa: 0.7.2

OPENALEX - Publications

Brian McFee Vincent Lostanlen Matt McVicar Alexandros Metsai Stefan Balke and 25 more

10.5281/zenodo.3606573 article ES 2020-01-13

Leveraging repetition for improved automatic lyric transcription in popular music

OPENALEX - Publications

Matt McVicar Daniel P. W. Ellis Masataka Goto

Transcribing lyrics from musical audio is a challenging research problem which has not benefited many advances made in the related field of automatic speech recognition, owing to prevalent accompaniment and differences between spoken sung voice. However, one aspect this yet be exploited by researchers that significant portions will repeated throughout song. In paper we investigate how information can leveraged form consensus transcription with improved consistency accuracy. Our results show...

10.1109/icassp.2014.6854174 article EN 2014-05-01

StructureNet: Inducing Structure in Generated Melodies

OPENALEX - Publications

Gabriele Medeot Srikanth Cherla Katerina Kosta Matt McVicar Samer Abdallah and 3 more

10.5072/zenodo.251848 article EN International Symposium/Conference on Music Information Retrieval 2018-09-23

librosa/librosa: 0.8.1rc2

OPENALEX - Publications

Brian McFee Alexandros Metsai Matt McVicar Stefan Balke Carl Thomé and 25 more

10.5281/zenodo.4792298 article ES 2021-05-25

librosa/librosa: 0.6.3

OPENALEX - Publications

Brian McFee Matt McVicar Stefan Balke Vincent Lostanlen Carl Thomé and 25 more

10.5281/zenodo.2564164 article ES 2019-02-13

AutoLeadGuitar: Automatic generation of guitar solo phrases in the tablature space

OPENALEX - Publications

Matt McVicar Satoru Fukayama Masataka Goto

We present AutoLeadGuitar, a system for automatically generating guitar solo tablatures from an input chord and key sequence. Our generates solos in distinct musical phrases, is trained using existing digital sourced the web. When AutoLeadGuitar assigns phrase boundaries, rhythms fretboard positions within probabilistic framework, guided towards tones by two user-specified parameters (chord tone preference during at end of phrases). Furthermore, guitar-specific ornaments such as hammer-ons,...

10.1109/icosp.2014.7015074 article EN 2014-10-01

librosa: v0.4.0

OPENALEX - Publications

Brian McFee Matt McVicar Colin Raffel Dawen Liang Oriol Nieto and 6 more

10.5281/zenodo.18369 article 2015-06-07

SuMoTED: An intuitive edit distance between rooted unordered uniquely-labelled trees

OPENALEX - Publications

Matt McVicar Benjamin Sach Cédric Mesnage Jefrey Lijffijt Eirini Spyropoulou and 1 more

Defining and computing distances between tree structures is a classical area of study in theoretical computer science, with practical applications the areas computational biology, information retrieval, text analysis, many others. In this paper, we focus on rooted, unordered, uniquely-labelled trees such as taxonomies other hierarchies. For these, introduce intuitive concept 'local move' operation an atomic edit tree. We then SuMoTED, new distance measure trees, defined minimal number local...

10.1016/j.patrec.2016.04.012 article EN cc-by Pattern Recognition Letters 2016-05-05

Using Online Chord Databases to Enhance Chord Recognition

OPENALEX - Publications

Matt McVicar Yizhao Ni Raúl Santos‐Rodríguez Tijl De Bie

Abstract Advances in chord recognition research using machine learning are hampered by two factors: the scarcity of annotated training data, and limited complexity features models used. Both problems intertwined, as with few examples, increasing model would inevitably lead to overfitting. In this paper we develop a way address first problem exploiting annotations from online databases. We show how such annotations, despite being noisy lacking exact onset times, can be put use both during...

10.1080/09298215.2011.573564 article EN Journal of New Music Research 2011-06-01

AutoGuitarTab: Computer-aided Composition of Rhythm \& Lead Guitar Parts in the Tablature Space

OPENALEX - Publications

Matt McVicar Satoru Fukayama Masataka Goto

We present AutoGuitarTab, a system for generating realistic guitar tablature given an input symbolic chord and key sequence. Our consists of two modules: AutoRhythmGuitar AutoLeadGuitar. The first these generates rhythm tablatures which outline the sequence in particular style (using Markov chains to ensure playability) performs structural analysis produce structurally consistent composition. AutoLeadGuitar lead parts distinct musical phrases, guiding pitch classes towards tones steering...

10.1109/taslp.2015.2419976 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2015-01-01

librosa: 0.4.1

OPENALEX - Publications

Brian McFee Matt McVicar Colin Raffel Dawen Liang Oriol Nieto and 9 more

10.5281/zenodo.32193 article 2015-10-17

Learning to separate vocals from polyphonic mixtures via ensemble methods and structured output prediction

OPENALEX - Publications

Matt McVicar Raúl Santos‐Rodríguez Tijl De Bie

Separating the singing from a polyphonic mixed audio signal is challenging but important task, with wide range of applications across music industry and informatics research. Various methods have been devised over years, ranging Deep Learning approaches to dedicated ad hoc solutions. In this paper, we present novel machine learning method for using Conditional Random Field (CRF) approach structured output prediction. We exploit diversity previously proposed by their predictions as input...

10.1109/icassp.2016.7471715 article EN 2016-03-01

librosa/librosa: 0.6.2

OPENALEX - Publications

Brian McFee Matt McVicar Stefan Balke Carl Thomé Vincent Lostanlen and 25 more

10.5281/zenodo.1342708 article ES 2018-08-09

Enhancing chord recognition accuracy using web resources

OPENALEX - Publications

Matt McVicar Tijl De Bie

Machine learning methods for chord recognition have improved considerably in the past few years. However, further progress seems constrained by scarcity of training data. In this paper, we show that problem can be partially solved exploiting noisy but freely and abundantly available online resources, addition to fully labeled We use these data restrict output Viterbi algorithm, resulting significant improvements over standard decoding process.

10.1145/1878003.1878017 article EN 2010-10-25

librosa/librosa: 0.6.1

OPENALEX - Publications

Brian McFee Matt McVicar Stefan Balke Carl Thomé Colin Raffel and 25 more

10.5281/zenodo.1252297 article ES 2018-05-24

Leveraging noisy online databases for use in chord recognition

OPENALEX - Publications

Matt McVicar Yizhao Ni Tijl De Bie Raúl Santos‐Rodríguez

The most significant problem faced by Machine Learningbased chord recognition systems is arguably the lack of highquality training examples. In this paper, we address leveraging availability annotations from guitarist websites. We show that such can be used as partial supervision a semi-supervised method—partial since accurate timing information lacking. A particular challenge in exploitation these data their low quality, potentially even leading to performance degradation if directly....

10.5281/zenodo.1418311 article EN 2011-10-24

Coming Soon ...