Licensed under Creative Commons BY-NC-ND 3.0.


The “RoDigits” speech corpus was collected by the Speech and Dialogue Research Laboratory. The recordings were made under varying conditions (different microphones and audio recording systems), using an online audio recording application developed by the same research group. The speakers were mainly students of the Faculty of Electronics, Telecommunications and Information Technology at the University “Politehnica” of Bucharest.

The corpus consists of 15,389 audio files collected from 154 native Romanian speakers. Each audio file contains one utterance of 12 random digits (0-9) spoken in Romanian. Most speakers contributed 100 audio files; the exceptions are 11 speakers for whom the corpus contains only 99 audio files each. The total duration of the database is around 38 hours, and the average length of an utterance is 8.7 seconds.

The “RoDigits” speech corpus is split into training, development and evaluation sets, as follows:

  • training set: 11,120 files – 80 files from each of 139 speakers (file IDs 1-50 and 71-100)
  • development set: 2,780 files – 20 files from each of the same 139 speakers (file IDs 51-70)
  • evaluation set: 1,489 files – ~100 files from each of 15 speakers
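
The split rule above can be sketched in a few lines of Python. This is only an illustration, not the tooling used to build the corpus: the function name assign_split, the speaker ID strings, and the eval_speakers argument are hypothetical, and the sketch assumes that the speaker ID and the numeric file ID (1-100) of each recording have already been extracted. It also assumes that the 15 evaluation speakers are disjoint from the 139 training/development speakers, which the file counts imply.

```python
from typing import AbstractSet


def assign_split(speaker_id: str, file_id: int, eval_speakers: AbstractSet[str]) -> str:
    """Return 'train', 'dev' or 'eval' for one recording.

    eval_speakers is a hypothetical set holding the IDs of the 15
    evaluation speakers; the description does not list them.
    """
    if not 1 <= file_id <= 100:
        raise ValueError(f"file ID out of range: {file_id}")
    # All files of an evaluation speaker go to the evaluation set.
    if speaker_id in eval_speakers:
        return "eval"
    # For the remaining 139 speakers, file IDs 51-70 form the development set.
    if 51 <= file_id <= 70:
        return "dev"
    # File IDs 1-50 and 71-100 form the training set.
    return "train"


# Sanity check of the file counts stated in the description.
train_files = 139 * 80   # 11,120
dev_files = 139 * 20     # 2,780
eval_files = 1489        # as stated; 15 speakers with ~100 files each
total_files = train_files + dev_files + eval_files
```

With these numbers, total_files matches the 15,389 files reported for the whole corpus, so the three sets together cover every recording.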

If you use this corpus in your research, please cite the following paper:

  • Alexandru Lucian Georgescu, Alexandru Caranica, Horia Cucu, Corneliu Burileanu, “RoDigits – a Romanian connected-digits speech corpus for automatic speech and speaker recognition,” in University “Politehnica” of Bucharest Scientific Bulletin, Series C, vol. 80, issue 3, pp. 45-62, Bucharest, 2018, ISSN: 2286-3540.

Download RoDigits Speech Corpus (pass: rodigits)

Note: the first version of the corpus, available online before December 8, 2017, contained some corrupted files. If you downloaded the corpus before this date, please download the corrected version, which is now available.