{"id":64,"date":"2020-07-24T18:42:41","date_gmt":"2020-07-24T15:42:41","guid":{"rendered":"https:\/\/sites.uef.fi\/speech\/?page_id=64"},"modified":"2025-09-08T16:34:41","modified_gmt":"2025-09-08T13:34:41","slug":"data","status":"publish","type":"page","link":"https:\/\/sites.uef.fi\/speech\/data\/","title":{"rendered":"Datasets and codes"},"content":{"rendered":"\n<h3 class=\"wp-block-heading\">Data<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/zenodo.org\/records\/14498691\">ASVspoof 5 Database<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.asvspoof.org\/file\/ASVspoof5___Evaluation_Plan.pdf\">ASVspoof 5 Evaluation Plan<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/erepo.uef.fi\/handle\/123456789\/30639\">ASVspoof 2019 LA Listening Test Data for Partial Rank Similarity MOS Prediction<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/nii-yamagishilab\/VCC2020-database\">Voice Conversion Challenge 2020 database<\/a><\/li>\n\n\n\n<li><a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/AVOID_instructions.pdf\"><i>Corpus of Age-related Voice Disguise<\/i> (AVOID)<\/a><\/li>\n\n\n\n<li><a href=\"http:\/\/www.asvspoof.org\/\">Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof).<\/a> See [<a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/IEEE_J_STSP_ASVspoof.pdf\">IEEE-J-STSP overview paper of the ASVspoof challenge<\/a>]\n<ul class=\"wp-block-list\">\n<li>ASVspoof 2021 challenge, <a href=\"https:\/\/www.asvspoof.org\/index2021.html\">webpage<\/a>, <a href=\"https:\/\/www.isca-speech.org\/archive\/asvspoof_2021\/\">workshop<\/a>, data[<a href=\"https:\/\/lnkd.in\/g9FKAbi\">LA<\/a>][<a href=\"https:\/\/lnkd.in\/ga36Ktz\">PA<\/a>][<a href=\"https:\/\/lnkd.in\/gA5zvRz\">DF<\/a>], metadata[<a href=\"https:\/\/lnkd.in\/d5ivZHzc\">LA<\/a>][<a href=\"https:\/\/lnkd.in\/dqY7NY3N\">PA<\/a>][<a 
href=\"https:\/\/lnkd.in\/d4fUHrDs\">DF<\/a>], <a href=\"http:\/\/github.com\/asvspoof-challenge\/2021\">baseline systems<\/a><\/li>\n\n\n\n<li><a href=\"http:\/\/www.asvspoof.org\/\">ASVspoof2019 challenge data and webpage<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.asvspoof.org\/user\/register\">ASVspoof2019 &#8220;Real PA&#8221; set<\/a><\/li>\n\n\n\n<li><a href=\"http:\/\/dx.doi.org\/10.7488\/ds\/2332\">ASVspoof2017 challenge data (audio replay attack detection)<\/a><span style=\"color: #ff0000\"><b>&nbsp;<img loading=\"lazy\" decoding=\"async\" title=\"enlightened\" src=\"http:\/\/www.uef.fi\/html\/js\/editor\/ckeditor\/plugins\/smiley\/images\/lightbulb.gif\" alt=\"enlightened\" width=\"20\" height=\"20\"> <\/b><span style=\"color: #009900\"><b>[NOTE: this is the patched v2.0 of the corpus, described <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/ASVspoof2.0_Odyssey2018.pdf\">here<\/a>; it is recommended over the <\/b><a href=\"https:\/\/datashare.is.ed.ac.uk\/handle\/10283\/2778\">original one<\/a><b>]<\/b><\/span><\/span> See also<b> <\/b><span style=\"color: #009900\"><a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/reddots-replayed-replayed_icassp2017.pdf\">ICASSP 2017 paper about data collection<\/a><\/span><b><span style=\"color: #009900\">&nbsp; <\/span><\/b><span style=\"color: #009900\"><span style=\"color: #000000\">and <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/asvspoof-2017-challenge-overview.pdf\">Interspeech 2017 challenge overview paper<\/a><\/span><\/span><\/li>\n\n\n\n<li><a href=\"http:\/\/datashare.is.ed.ac.uk\/handle\/10283\/853\">ASVspoof2015 challenge data (voice conversion and text-to-speech attack detection task). 
<img loading=\"lazy\" decoding=\"async\" title=\"enlightened\" src=\"http:\/\/www.uef.fi\/html\/js\/editor\/ckeditor\/plugins\/smiley\/images\/lightbulb.gif\" alt=\"enlightened\" width=\"20\" height=\"20\"><\/a><\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><a href=\"http:\/\/dx.doi.org\/10.7488\/ds\/2337\">The Voice Conversion Challenge 2018: database and results (VCC18). <\/a>See also <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/VCC18_overview_Odyssey2018.pdf\">the challenge overview paper<\/a> and <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/Spoofing_benchmark_VCC18.pdf\">another paper containing supplementary speech artifact analysis<\/a> (both presented at <a href=\"http:\/\/www.odyssey2018.org\/\">Odyssey 2018<\/a>)<\/li>\n\n\n\n<li><a href=\"http:\/\/www.idiap.ch\/resource\/biometric\/data\/TIFS2015.zip\">I-vectors<\/a> (~420 MB) used in the <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/TIFS2015_joint.pdf\">IEEE-T-IFS paper<\/a> (hosted at <a href=\"http:\/\/idiap.ch\">IDIAP<\/a>).<\/li>\n\n\n\n<li><a href=\"http:\/\/cls.ru.nl\/%7Esaeidi\/file_library\/I4U.tgz\">I4U consortium filelists for NIST SRE12 development purposes<\/a> (from Rahim Saeidi&#8217;s pages) used in [<a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/I4ULargescale_IS2013.pdf\">Interspeech 2013 paper<\/a>]<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">Program codes<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/github.com\/TakHemlata\/T-EER\">t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators<\/a>, and <a href=\"https:\/\/colab.research.google.com\/drive\/1ga7eiKFP11wOFMuZjThLJlkBcwEG6_4m?usp=sharing\">Jupyter Notebook<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/underdogliu\/household-speaker-recognition\">Household ASV baseline system<\/a>, described in [<a href=\"https:\/\/arxiv.org\/abs\/2205.00288\">Speaker Odyssey workshop paper<\/a>]<\/li>\n\n\n\n<li><a 
href=\"https:\/\/github.com\/ElsevierSoftwareX\/SOFTX-D-20-00038\">ASVtorch toolkit<\/a>, described in <a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S235271102100042X?via%3Dihub\">ASVtorch toolkit: Speaker verification with deep neural networks<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/Miffyli\/minecraft-bc-2020\">Code for training an agent to play Minecraft by learning from human demonstrations.<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/joonaspu\/ViControl\">Program code for reading the image of a Windows\/Linux game window and emulating keyboard\/mouse controls.<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/Miffyli\/ToriLLE\">Toribash learning environment for training agents in a hand-to-hand combat setup.<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/github.com\/vvestman\/pytorch-ivectors\">GPU-accelerated implementation of an i-vector extractor (training \/ extraction) using PyTorch.<\/a><\/li>\n\n\n\n<li><a href=\"http:\/\/cs.uef.fi\/pages\/tkinnu\/SSGMM_SAD\/SSGMM.zip\">Semi-supervised speech activity detector<\/a>, described in [<a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/semi-supervised-SAD-CSL.pdf\">Computer Speech &amp; Language paper<\/a>]<\/li>\n\n\n\n<li><a href=\"http:\/\/www.spoofingchallenge.org\/data2017\/baseline_CM.zip\">Audio replay attack detection baseline code (Matlab)<\/a>, for the <a href=\"http:\/\/www.spoofingchallenge.org\/\">ASVspoof 2017 challenge<\/a><\/li>\n\n\n\n<li><a href=\"http:\/\/cs.joensuu.fi\/%7Esahid\/codes\/local_variability.zip\">Local variability features (Matlab)<\/a>. See [<a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/local_variability_DSP.pdf\">the related paper in Digital Signal Processing<\/a>]<\/li>\n\n\n\n<li><a href=\"https:\/\/pypi.python.org\/pypi\/xspear.fast_plda\">PLDA for anti-spoofing (Python)<\/a>. 
See also <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/TIFS2015_joint.pdf\">[IEEE-T-IFS paper]<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/sites.google.com\/site\/fastplda\/\">Fast probabilistic linear discriminant analysis (PLDA) implementation (Matlab and Python)<\/a>. See the related <a href=\"http:\/\/cs.uef.fi\/ssspr2014\/\">S+SSPR<\/a> paper [<a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/unifying_PLDA_ssspr2014.pdf\">PDF<\/a>]<\/li>\n\n\n\n<li><a href=\"http:\/\/cs.uef.fi\/pages\/tkinnu\/VQVAD\/VQVAD.zip\">Utterance-by-utterance adaptive speech activity detector (SAD)<\/a> presented at ICASSP 2013 [<a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/adaptive_vad.pdf\">PDF<\/a>].<\/li>\n\n\n\n<li><a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/multitaper\/multitaperspectrum_functions.zip\">Multiple window (multitaper) spectrum estimators (Matlab)<\/a>. See also the related publications in <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/multitaperMFCC_IEEE_TASLP_doublecolumn.pdf\">IEEE T-ASLP<\/a>, <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/multitaper_speechcomm.pdf\">Speech Communication<\/a>, <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/multitaper_SPL2010.pdf\">IEEE SPL<\/a>, <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/MultiTaper_Interspeech2010.pdf\">Interspeech 2010<\/a> and <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/ASRU2011_multitaper_ivector.pdf\">ASRU 2011<\/a>.<\/li>\n\n\n\n<li>Regularized all-pole methods, provided as an appendix of the <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/Odyssey_2012_RLP.pdf\">Odyssey 2012 paper<\/a>. 
See also the related publication in <a href=\"http:\/\/cs.joensuu.fi\/pages\/tkinnu\/webpage\/pdf\/RLP_IEEE_SPL_2012.pdf\">IEEE SPL<\/a>.<\/li>\n\n\n\n<li><a href=\"http:\/\/www.acoustics.hut.fi\/%7Ejpohjala\/xlp\/\">Temporally weighted linear predictors<\/a> (from Jouni Pohjalainen&#8217;s page)<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Data Program codes<\/p>\n","protected":false},"author":24,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"class_list":["post-64","page","type-page","status-publish","hentry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Datasets and codes - Computational speech group<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/sites.uef.fi\/speech\/data\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Datasets and codes - Computational speech group\" \/>\n<meta property=\"og:description\" content=\"Data Program codes\" \/>\n<meta property=\"og:url\" content=\"https:\/\/sites.uef.fi\/speech\/data\/\" \/>\n<meta property=\"og:site_name\" content=\"Computational speech group\" \/>\n<meta property=\"article:modified_time\" content=\"2025-09-08T13:34:41+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/www.uef.fi\/html\/js\/editor\/ckeditor\/plugins\/smiley\/images\/lightbulb.gif\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/\",\"url\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/\",\"name\":\"Datasets and codes - Computational speech group\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/#primaryimage\"},\"thumbnailUrl\":\"http:\\\/\\\/www.uef.fi\\\/html\\\/js\\\/editor\\\/ckeditor\\\/plugins\\\/smiley\\\/images\\\/lightbulb.gif\",\"datePublished\":\"2020-07-24T15:42:41+00:00\",\"dateModified\":\"2025-09-08T13:34:41+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/#primaryimage\",\"url\":\"http:\\\/\\\/www.uef.fi\\\/html\\\/js\\\/editor\\\/ckeditor\\\/plugins\\\/smiley\\\/images\\\/lightbulb.gif\",\"contentUrl\":\"http:\\\/\\\/www.uef.fi\\\/html\\\/js\\\/editor\\\/ckeditor\\\/plugins\\\/smiley\\\/images\\\/lightbulb.gif\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/data\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Datasets and codes\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/#website\",\"url\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/\",\"name\":\"Computational speech 
group\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/sites.uef.fi\\\/speech\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Datasets and codes - Computational speech group","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/sites.uef.fi\/speech\/data\/","og_locale":"en_US","og_type":"article","og_title":"Datasets and codes - Computational speech group","og_description":"Data Program codes","og_url":"https:\/\/sites.uef.fi\/speech\/data\/","og_site_name":"Computational speech group","article_modified_time":"2025-09-08T13:34:41+00:00","og_image":[{"url":"http:\/\/www.uef.fi\/html\/js\/editor\/ckeditor\/plugins\/smiley\/images\/lightbulb.gif","type":"","width":"","height":""}],"twitter_card":"summary_large_image","twitter_misc":{"Est. 
reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/sites.uef.fi\/speech\/data\/","url":"https:\/\/sites.uef.fi\/speech\/data\/","name":"Datasets and codes - Computational speech group","isPartOf":{"@id":"https:\/\/sites.uef.fi\/speech\/#website"},"primaryImageOfPage":{"@id":"https:\/\/sites.uef.fi\/speech\/data\/#primaryimage"},"image":{"@id":"https:\/\/sites.uef.fi\/speech\/data\/#primaryimage"},"thumbnailUrl":"http:\/\/www.uef.fi\/html\/js\/editor\/ckeditor\/plugins\/smiley\/images\/lightbulb.gif","datePublished":"2020-07-24T15:42:41+00:00","dateModified":"2025-09-08T13:34:41+00:00","breadcrumb":{"@id":"https:\/\/sites.uef.fi\/speech\/data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/sites.uef.fi\/speech\/data\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/sites.uef.fi\/speech\/data\/#primaryimage","url":"http:\/\/www.uef.fi\/html\/js\/editor\/ckeditor\/plugins\/smiley\/images\/lightbulb.gif","contentUrl":"http:\/\/www.uef.fi\/html\/js\/editor\/ckeditor\/plugins\/smiley\/images\/lightbulb.gif"},{"@type":"BreadcrumbList","@id":"https:\/\/sites.uef.fi\/speech\/data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/sites.uef.fi\/speech\/"},{"@type":"ListItem","position":2,"name":"Datasets and codes"}]},{"@type":"WebSite","@id":"https:\/\/sites.uef.fi\/speech\/#website","url":"https:\/\/sites.uef.fi\/speech\/","name":"Computational speech 
group","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/sites.uef.fi\/speech\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/pages\/64","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/users\/24"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/comments?post=64"}],"version-history":[{"count":1,"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/pages\/64\/revisions"}],"predecessor-version":[{"id":1244,"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/pages\/64\/revisions\/1244"}],"wp:attachment":[{"href":"https:\/\/sites.uef.fi\/speech\/wp-json\/wp\/v2\/media?parent=64"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}