Datasets and codes
Data
- ASVspoof 5 Database
- ASVspoof 5 Evaluation Plan
- ASVspoof 2019 LA Listening Test Data for Partial Rank Similarity MOS Prediction
- Voice Conversion Challenge 2020 database
- Corpus of Age-related Voice DisguiseCorpus of Age-related Voice Disguise (AVOID)
- Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof). See [IEEE-J-STSP overview paper of the ASVspoof challenge]
- ASVspoof 2021 challenge, webpage, workshop, data[LA][PA][DF], metadata[LA][PA][DF], baseline systems
- ASVSpoof2019 challenge data and webpage
- ASVSpoof2019 “Real PA” set
- ASVspoof2017 challenge data (audio replay attack detection)  [NOTE! this is patched v2.0 of the corpus, described here and recommended to be used instead of the original one] See also ICASSP 2017 paper about data collection  and Interspeech 2017 challenge overview paper [NOTE! this is patched v2.0 of the corpus, described here and recommended to be used instead of the original one] See also ICASSP 2017 paper about data collection  and Interspeech 2017 challenge overview paper
- ASVspoof2015 challenge data (voice conversion and text-to-speech attack detection task).  
 
- The Voice Conversion Challenge 2018: database and results (VCC18). See also the challenge overview paper and another paper containing supplementary speech artifact analysis (both will be presented at Odyssey 2018)
- I-vectors (~420 MB) used in IEEE-T-IFS paper (hosted at IDIAP).
- I4U consortium filelists for NIST SRE12 development purposes (from Rahim Saeidi’s pages) used in [Interspeech 2013 paper]
Program codes
- t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators, and Jupyter Notebook
- Household ASV baseline system, described in [Speaker Odyssey workshop paper]
- ASVTorch toolkit, described in ASVtorch toolkit: Speaker verification with deep neural networks
- Code for training an agent to play Minecraft by learning from human demonstrations.
- Program code for reading the image of a Windows/Linux game window and emulating keyboard/mouse controls.
- Toribash learning environment for training agents in a hand-to-hand combat setup.
- GPU accelerated implementation of i-vector extractor (training / extraction) using PyTorch.
- Semi-supervised speech activity detector, described in [Computer Speech & Language paper]
- Audio replay attack detection baseline code (Matlab), for ASVspoof 2017 challenge
- Local variability features (Matlab). See [the related paper in Digital Signal Processing]
- PLDA for anti-spoofing (Python). See also [IEEE-T-IFS paper]
- Fast probabilistic linear discriminant analysis (PLDA) implementation (Matlab and Python). See the related S+SSPR paper [PDF]
- Utterance-by-utterance adaptive speech activity detector (SAD) presented in ICASSP 2013 [PDF].
- Multiple window (multitaper) spectrum estimators (Matlab). See also the related publications in IEEE T-ASLP, Speech Communication, IEEE SPL, Interspeech 2010 and ASRU 2011.
- Regularized all-pole methods as an appendix of Odyssey 2012 paper. See also the related publication in IEEE SPL.
- Temporally weighted linear predictors (from Jouni Pohjalainen’s page)