Applications

Rich Speech Transcription Service demo

The Rich Speech Transcription (RST) service is the output of the LVCSR-ROM project, carried out by the SpeeD research group between October 2013 and June 2014. The web service lets users access the textual content of a multimedia document (news bulletin, interview, lecture, meeting recording, etc.) without watching it. This capability is important in applications such as multimedia database indexing and retrieval, real-time radio/TV monitoring, and transcription of self-recorded documents.

The SpeeD Speech-to-Text demo

The SpeeD ASR online demo includes a 10k-word news recognition system and several small-vocabulary recognition systems for common speech recognition tasks, such as proper names, city names, numbers, and dates. The demo can be easily customized and adapted to any speech domain or any other small-vocabulary ASR task.

Dictation demo

The SpeeD dictation demo.

The Online Speech Recorder

The SpeeD Online Speech Recorder is a by-product of our LVCSR project. It was developed and used to expand the speech database needed to train the Romanian acoustic models.