AI Audio Datasets
A list of speech, music, and sound-effect datasets that can provide training data for generative AI (AIGC), AI model training, intelligent audio tool development, and audio applications. The datasets are mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, and related tasks.
·github.com·
ACA-Slides
Slides and code for "An Introduction to Audio Content Analysis," also taught at Georgia Tech as MUSI-6201. This introductory course on Music Information Retrieval is based on the textbook "An Introduction to Audio Content Analysis" (Wiley, 2012/2022).
·github.com·
DepAudioNet_reproduction
Reproduction of DepAudioNet by Ma et al. ("DepAudioNet: An Efficient Deep Model for Audio based Depression Classification", https://dl.acm.org/doi/10.1145/2988257.2988267, AVEC 2016).
·github.com·
ISM2017
Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and singing voice analysis", published at IEEE ISM 2017.
·github.com·
aiSFX
Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022). Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.
·github.com·
CHAD
Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)
·github.com·