Found 211 bookmarks
Custom sorting
GitHub aojsamuraiWebcamhtml5recordvideoaudiogetUserMedia Webcam capture using getUserMedia the stream is recorded via js and saved using php in two files inside the folder video one file for video as webm and one file for audio as wav
GitHub aojsamuraiWebcamhtml5recordvideoaudiogetUserMedia Webcam capture using getUserMedia the stream is recorded via js and saved using php in two files inside the folder video one file for video as webm and one file for audio as wav
Webcam capture using getUserMedia the stream is recorded via js and saved using php in two files inside the folder video one file for video as webm and one file for audio as wav aojsamuraiWebcamhtml5recordvideoaudiogetUserMedia
·github.com·
GitHub aojsamuraiWebcamhtml5recordvideoaudiogetUserMedia Webcam capture using getUserMedia the stream is recorded via js and saved using php in two files inside the folder video one file for video as webm and one file for audio as wav
GitHub theadamsabraLearningfromAudio Understand of the fundamentals of digital signal processing for Machine LearningDeep Learning applications
GitHub theadamsabraLearningfromAudio Understand of the fundamentals of digital signal processing for Machine LearningDeep Learning applications
Understand of the fundamentals of digital signal processing for Machine LearningDeep Learning applications theadamsabraLearningfromAudio
·github.com·
GitHub theadamsabraLearningfromAudio Understand of the fundamentals of digital signal processing for Machine LearningDeep Learning applications
GitHub mjhydriSingingVocalBeatTracking This repo contains the source code of the first deep learningbase singing voice beat tracking system It leverages WavLM and DistilHuBERT pretrained speech models to create vocal embeddings and trains linear multihead selfattention layers on top of them to extract vocal beat activations Then it uses HMM decoder to infer signing beats and tempo
GitHub mjhydriSingingVocalBeatTracking This repo contains the source code of the first deep learningbase singing voice beat tracking system It leverages WavLM and DistilHuBERT pretrained speech models to create vocal embeddings and trains linear multihead selfattention layers on top of them to extract vocal beat activations Then it uses HMM decoder to infer signing beats and tempo
This repo contains the source code of the first deep learningbase singing voice beat tracking system It leverages WavLM and DistilHuBERT pretrained speech models to create vocal embeddings and trains linear multihead selfattention layers on top of them to extract vocal beat activations Then it uses HMM decoder to infer signing beats and tempo GitHub mjhydriSingingVocalBeatTracking This repo contains the source code of the first deep learningbase singing voice beat tracking system It leverages WavLM and DistilHuBERT pretrained speech models to create vocal embeddings and trains linear multihead selfattention layers on top of them to extract vocal beat activations Then it uses HMM decoder to infer signing beats and tempo
·github.com·
GitHub mjhydriSingingVocalBeatTracking This repo contains the source code of the first deep learningbase singing voice beat tracking system It leverages WavLM and DistilHuBERT pretrained speech models to create vocal embeddings and trains linear multihead selfattention layers on top of them to extract vocal beat activations Then it uses HMM decoder to infer signing beats and tempo
GitHub andrebolacontrastivemirlearning This repo contains the code to reproduce the paper Enriched Music Representations with Multiple Crossmodal Contrastive Learning
GitHub andrebolacontrastivemirlearning This repo contains the code to reproduce the paper Enriched Music Representations with Multiple Crossmodal Contrastive Learning
This repo contains the code to reproduce the paper Enriched Music Representations with Multiple Crossmodal Contrastive Learning andrebolacontrastivemirlearning
·github.com·
GitHub andrebolacontrastivemirlearning This repo contains the code to reproduce the paper Enriched Music Representations with Multiple Crossmodal Contrastive Learning
GitHub TheMorpheus407OpenAIAudiobookGenerator This project is a webbased application that converts text into audio primarily focusing on creating audiobooks
GitHub TheMorpheus407OpenAIAudiobookGenerator This project is a webbased application that converts text into audio primarily focusing on creating audiobooks
This project is a webbased application that converts text into audio primarily focusing on creating audiobooks GitHub TheMorpheus407OpenAIAudiobookGenerator This project is a webbased application that converts text into audio primarily focusing on creating audiobooks
·github.com·
GitHub TheMorpheus407OpenAIAudiobookGenerator This project is a webbased application that converts text into audio primarily focusing on creating audiobooks
muscaps
muscaps
Source code for "MusCaps Generating Captions for Music Audio" (IJCNN 2021)
·github.com·
muscaps
GitHub alexanderlerchACASlides Slides and Code for An Introduction to Audio Content Analysis also taught at Georgia Tech as MUSI6201 This introductory course on Music Information Retrieval is based on the text book An Introduction to Audio Content Analysis Wiley 20122022
GitHub alexanderlerchACASlides Slides and Code for An Introduction to Audio Content Analysis also taught at Georgia Tech as MUSI6201 This introductory course on Music Information Retrieval is based on the text book An Introduction to Audio Content Analysis Wiley 20122022
Slides and Code for An Introduction to Audio Content Analysis also taught at Georgia Tech as MUSI6201 This introductory course on Music Information Retrieval is based on the text book An Introduction to Audio Content Analysis Wiley 20122022 alexanderlerchACASlides
·github.com·
GitHub alexanderlerchACASlides Slides and Code for An Introduction to Audio Content Analysis also taught at Georgia Tech as MUSI6201 This introductory course on Music Information Retrieval is based on the text book An Introduction to Audio Content Analysis Wiley 20122022
GitHub adbailey1DepAudioNet_reproduction Reproduction of DepAudioNet by Ma et al DepAudioNet An Efficient Deep Model for Audio based Depression Classificationhttpsdlacmorgdoi10114529882572988267 AVEC 2016
GitHub adbailey1DepAudioNet_reproduction Reproduction of DepAudioNet by Ma et al DepAudioNet An Efficient Deep Model for Audio based Depression Classificationhttpsdlacmorgdoi10114529882572988267 AVEC 2016
Reproduction of DepAudioNet by Ma et al DepAudioNet An Efficient Deep Model for Audio based Depression Classificationhttpsdlacmorgdoi10114529882572988267 AVEC 2016 adbailey1DepAudioNet_reproduction
·github.com·
GitHub adbailey1DepAudioNet_reproduction Reproduction of DepAudioNet by Ma et al DepAudioNet An Efficient Deep Model for Audio based Depression Classificationhttpsdlacmorgdoi10114529882572988267 AVEC 2016
GitHub ybayleISM2017 Reproducible research code for the experiments presented in our article Kara1k a karaoke dataset for cover song identification and singing voice analysis published at IEEE ISM 2017
GitHub ybayleISM2017 Reproducible research code for the experiments presented in our article Kara1k a karaoke dataset for cover song identification and singing voice analysis published at IEEE ISM 2017
Reproducible research code for the experiments presented in our article Kara1k a karaoke dataset for cover song identification and singing voice analysis published at IEEE ISM 2017 ybayleISM2017
·github.com·
GitHub ybayleISM2017 Reproducible research code for the experiments presented in our article Kara1k a karaoke dataset for cover song identification and singing voice analysis published at IEEE ISM 2017
GitHub alisonbmaaiSFX Representation Learning for the Automatic Indexing of Sound Effects Libraries ISMIR 2022 Deep audio embeddings pretrained on UCS NonUCScompliant datasets
GitHub alisonbmaaiSFX Representation Learning for the Automatic Indexing of Sound Effects Libraries ISMIR 2022 Deep audio embeddings pretrained on UCS NonUCScompliant datasets
Representation Learning for the Automatic Indexing of Sound Effects Libraries ISMIR 2022 Deep audio embeddings pretrained on UCS NonUCScompliant datasets alisonbmaaiSFX
·github.com·
GitHub alisonbmaaiSFX Representation Learning for the Automatic Indexing of Sound Effects Libraries ISMIR 2022 Deep audio embeddings pretrained on UCS NonUCScompliant datasets