Horia Cucu


Lecturer, PhD, Faculty of Electronics, Telecommunications and Information Technology,
University Politehnica of Bucharest

Email: horia.cucu@upb.ro
Tel: +4021 402 4635
Office: B042, Leu Campus


Last update: Aug 2017

  • Scientific Activity
    • Scientific traceroute and achievements
    • International research projects
    • National research projects
    • International Collaborations
    • National Collaborations
    • Benchmarking
    • Affiliation to scientific community
  • List of publications
    • PhD Thesis
    • Books and book chapters
    • Journal and conference papers
  • List of citations

Research Interests

  • Spoken Language Technology (Automatic Speech/Speaker Recognition, Speech Indexing, Spoken Term Detection)
  • Natural Language Processing (Statistical Language Modeling, Phonetization, Diacritics Restoration)
  • Autonomous Robotic Systems
  • Statistical Machine Translation

Degrees and Professional Experience

Degrees

  • PhD Degree, in Electronics and Telecommunications (Oct 2011), University “Politehnica” of Bucharest.
  • Engineer Degree, in Applied Electronics (June 2008), University “Politehnica” of Bucharest.
  • Bachelor Degree (June 2003), “Dr. Ioan Mesota” National College, Braşov.

Professional experience

  • Lecturer (Oct 2012 – present) at Faculty of Electronics, Telecommunications and Information Technology, University Politehnica of Bucharest. Teach Microcontrollers and Embedded Systems, Microprocessor Architectures and Spoken Language Technology. Supervised BSc, MSc and PhD projects in speech recognition and embedded systems.
  • Project Manager and Research Engineer (July 2014 – Sep 2017) at University Politehnica of Bucharest, in the ANVSIB project (research and development project funded by the Romanian Government through UEFISCDI, project coordinator University Politehnica of Bucharest).
  • Research Engineer (Oct 2014 – Sep 2017) at University Politehnica of Bucharest, in the AFLR project (research and development project funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group, Bucharest, Romania).
  • Research Engineer (Oct 2014 – Jul 2017) at University Politehnica of Bucharest, in the SPLANN project (research and development project funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group, Bucharest, Romania).
  • Post-doctoral Researcher (May 2014 – Nov 2015) at University Politehnica of Bucharest, in the KNOWLEDGE project (FSE – European Structural Funds POS-DRU project), project coordinator University Politehnica of Bucharest. Developed several enhancement modules for the Large Vocabulary Continuous Speech Recognition (LVCSR) system for the Romanian language.
  • Implementing expert (May 2014 – Nov 2015) at University Politehnica of Bucharest, in the PRACSIS project (FSE – European Structural Funds POS-DRU project), project coordinator University Politehnica of Bucharest. Career-counselled over 45 students.
  • Research Engineer (Jan 2014 – Jan 2016) at University Politehnica of Bucharest, in the eWALL project (research and development project funded by the European Commission through the 7th Framework Programme). Coordinated the development of an audio-based visitors monitoring application, a multilingual spoken command detection system and a cough detection system for monitoring cough crises.
  • Project Manager and Research Engineer (Oct 2013 – Jun 2014) at University Politehnica of Bucharest, in the LVCSR-ROM project (research and development project funded by the Romanian-American Foundation, Bucharest, Romania). Coordinated a team of four senior researchers. Developed several enhancement modules for the Large Vocabulary Continuous Speech Recognition (LVCSR) system for the Romanian language. Created a web-service which provides rich speech transcriptions for multimedia files.
  • IT Consultant (Dec 2013 – Jan 2014) for Intelligent IT, Sibiu, Romania. Designed and implemented a Natural Language Processing software module for a personal assistant smartphone application (OmniBuddy).
  • Implementing Expert (Jun 2013 – Sep 2013) at University Politehnica of Bucharest, in the CASIA project (FSE – European Structural Funds POS-DRU project), project coordinator Research Institute for Artificial Intelligence, Romanian Academy. Coordinated and supervised 14 students for their summer internships in speech and language processing.
  • Implementing Expert (Jan 2011 – Aug 2013) at University Politehnica of Bucharest, in the PROMISE project (FSE – European Structural Funds POS-DRU project), project coordinator University Politehnica of Bucharest. Developed the teaching infrastructure for several subjects within a new Master programme (BIOSINF).
  • Teaching Assistant (Oct 2008 – Oct 2012) at Faculty of Electronics, Telecommunications and Information Technology, University Politehnica of Bucharest. Teaching Microcontrollers and Embedded Systems, Microprocessor Architectures, Object-Oriented Programming and Spoken Language Technology. Supervising BSc and MSc projects in speech recognition and embedded systems.
  • IT Consultant (Jan 2011 – Feb 2011) for RSM Scot, Bucharest, Romania. Designed and implemented an employees timesheet software application.
  • IT Consultant (Oct 2009 – June 2010) for Grob Technologies Inc., Massachusetts, USA. Designed and implemented the server-side software application of a web-based service for social-networking tracking (What You Post).
  • Research Engineer (Aug 2007 – Mar 2009) at University Politehnica of Bucharest, in the PALIROM project (research and development project funded by the Romanian Government, project coordinator Softwin Group, Bucharest, Romania). Developed natural language resources for Romanian, implemented and tested a natural language compiler.
  • Research Engineer (Aug 2007 – Mar 2009) at University Politehnica of Bucharest, in the BIOACS project (research and development project funded by the Romanian Government, project coordinator Softwin Group, Bucharest, Romania). Collected a database of digital handwritten signatures and developed digital signal processing algorithms for handwritten signature recognition.
  • Software Engineer (Jan 2006 – Jan 2009) at Ubicore Technology, Bucharest, Romania. Designed and implemented several video processing algorithms on a new massive parallel CPU, developed a distributed-computing application for highly computational tasks, developed a complete debugger tool for a new CPU architecture.

Academic Activity

Teaching

All my teaching activity takes place in the Faculty of Electronics, Telecommunications and Information Technology, University “Politehnica” of Bucharest.

Master curricula

  • Spoken Language Technology, research project (2012 – present);
  • Microcontrollers and Embedded Systems, laboratory (2010 – present).

Bachelor curricula

  • Microprocessors Architecture, course (2012 – present) and laboratory (2008 – present);
  • Microcontrollers, course (2012 – present) and laboratory (2008 – present);
  • Object Oriented Programming (Java), laboratory (2010).

Teaching books

  • Horia Cucu, “Research and Development Project in Spoken Language Technology”, Laboratory Guide, Politehnica Press Publishing House, Bucharest, 2013, ISBN: 978-606-515-482-7.
  • Electronic support for “Microprocessors Architecture” laboratory;
  • Electronic support for “Microcontrollers” laboratory;
  • Electronic support for “Spoken Language Technology” research project.

Strategic programmes

  • 2014-2015: implementing expert, project PRACSIS (“Partnership for a successful career in information security and information systems”), FSE – European Structural Funds POS-DRU project, project coordinator University Politehnica of Bucharest, ID POSDRU/161/2.1/G/135813.
  • 2013: implementing expert, project CASIA (“Support for a successful career in artificial intelligence”), FSE – European Structural Funds POS-DRU project, project coordinator University „Politehnica” of Bucharest, ID POSDRU/109/2.1/G/81772.
  • 2010-2013: implementing expert, project PROMISE (“An Integrated Master’s Degree Programme in the Fields of Sound, Image and Multimedia Engineering”), FSE – European Structural Funds POS-DRU project, project coordinator University „Politehnica” of Bucharest, ID POSDRU/86/1.2/S/61810.

National collaborations

PhD student coordination

Since 2012, I was part of the PhD Coordination Commission for several PhD students:

  • 2016 – present: Elena-Diana Şandru, “Sisteme de prelucrare digitală a semnalelor de inspiraţie bio”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2016 – present: Andrei Sandu, “Reţele neuronale pentru sisteme adaptive autonome”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2014 – present: Mircea Sorin Rusu, “Baby language classification methods”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2014 – present: Gheorghe Pop, “Contributions to forensics expertise for audio recordings”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2012 – 2015: Alexandru Caranica, “Optimizations in spoken language recognition“, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2012 – 2015: Valentin Andrei, “Contributions to computational auditory scene analysis methods for continuous speech recognition“, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2012 – 2015: Andrei Purică, “Semantic Video Coding”, co-tutelle thesis between University Politehnica of Bucharest (supervisor prof. C. Burileanu) and Telecom ParisTech (supervisor prof. F. Dufaux, prof. B. Pesquet).
  • 2012 – 2014: Anca-Livia Radu, “Large Scale Media Analysis via Media Fusion and Crowdsourcing”, co-tutelle thesis between University Politehnica of Bucharest (supervisor prof. C. Burileanu) and University of Trento (supervisor prof. F. Giunchiglia).
  • Aurelian Derviş: “Contribuţii la sinteza vorbirii în limba română”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).

Bachelor and Master student supervision

Since 2013, I supervised and coordinated over 20 students during the SpeeD “Speech&Language” Internships.

Since 2009, I co-supervised/coordinated numerous Bachelor of Science and Master of Science projects in Spoken Language Technology, Object Oriented Programming and Embedded Systems:

2017

2016

2015

2014

2013

  • Mihai Cătălin Safta, “Spoken Term Detection for Romanian Language”.

2012

  • Sorin Duţulescu, “Management application for Internet Service Providers (ISP)”.
  • Alexandra Jica, “Topic-Based Language Model Adaptation for Automatic Speech Recognition”.
  • Adrian Liţă, “Multi-Processor 6-Propeller Tridimensional Flying Apparatus”.
  • Florin Matei, “Home Automation System Using Speech Recognition on an Embedded Platform”.
  • Cătălin Stănculescu, “Continuous Speech Recognition Agent for Mobile Devices”.
  • Iulia Stănoiu, “Noisy Speech Enhancement in Automatic Speech Recognition for Romanian Language”.
  • Vanessa Voinic, “Continuous Speech Recognition in Romanian Language for Medical Applications”.

2011

  • Daria Ion, “Speaker-dependent speech recognition: An in-depth analysis of speech features”.
  • Aniela Milea, “Time-domain analysis and compression methods applied to the speech signal”.
  • Radu-Mihai Pană-Tălpeanu, “Language models in speaker-dependent speech recognition”.

2010

  • Tudor Mihailescu, Ioana Rolea, “Speaker recognition systems”.
  • Adina Popa, Diana Uzum, “Speaker-dependent speech recognition”.

Scientific Activity

Scientific research trace-route and achievements

  • Automatic speech recognition for Romanian language (2012 – 2017). Key role in a team which developed the first DNN-based LVCSR system for Romanian language in 2017.
  • Automatic speech recognition for Romanian language (2008 – 2011). Key role in a team which developed the first Large Vocabulary Continuous Speech Recognition (LVCSR) system for Romanian in 2011. Developed the first statistical language model for Romanian language in 2011.
  • Digital signal processing (2007 – 2009). Part of a team which developed several dynamic (accelerometer-based) handwritten signature recognition techniques.
  • Natural language processing (2007 – 2009). Part of a team which developed a natural language compiler for Romanian and several other natural language applications for Romanian.
  • Image and video processing (2006 – 2007). Ported state-of-the-art scaling, de-noising and de-interlacing algorithms on a new massive parallel CPU.

International research projects

  • 2014 – 2016: research engineer (in the UPB research team), project eWALL (“eWALL for Active Long Living”), funded by the European Commission through the 7th Framework Programme, ID FP7-ICT-2013-10, no. 610658.

National research projects

  • 2014 – 2016: project manager and research engineer,  project ANVSIB (“Natural-language, Voice-controlled Assistive System for Intelligent Buildings”), funded by the Romanian Government through UEFISCDI, project coordinator University Politehnica of Bucharest, ID PN-II-PT-PCCA-2013-4-0789.
  • 2014 – 2016: research engineer, project AFLR (“Phonetic Analysis of the Romanian Language”), funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group, ID PN-II-PT-PCCA-2013-4-1451.
  • 2014 – 2016: research engineer, project SPLANN (“Automatic Baby-Language Recognition System”), funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group, ID PN-II-PT-PCCA-2013-4-1443.
  • 2013 – 2014: project manager and research engineer, project LVCSR-ROM (“Noise-robust, domain-adaptable, large-vocabulary automatic speech recognition system for the Romanian language”), funded by Romanian-American Foundation through the Applied Research, Technological Innovation and Entrepreneurship (ARTIE) Fellowship Program.
  • 2007 – 2009: research engineer, project PALIROM (“Applications Package for the Romanian Language”), funded by the Romanian Government through the National Research Authority (“Inovare” programme), project coordinator Softwin Group, ID 10018/26.09.2007.
  • 2007 – 2009: research engineer, project BIOACS (“Biometric System for the Acquisition and Verification of Dynamic Signature”), funded by the Romanian Government through the National Research Authority (“Inovare” programme), project coordinator Softwin Group, ID 10143/28.09.2007.

Patents

  • Andi Buzo, Horia Cucu, Lucian Petrică and Dragoş Burileanu, “Metodă și sistem pentru diarizare în timp real a semnalelor audio, utilizate pentru recunoașterea automată a vorbirii și a vorbitorului” (Method and system for real-time diarization of audio signals, with applications in automatic speech and speaker recognition), patent registered at OSIM, patent no. RO130883-A2 / 29.01.2016.
  • Lucian Petrică, Horia Cucu and Andi Buzo, “Metodă pentru restaurarea automată a semnelor diacritice, folosind texte achiziționate electronic, utilizată în procesarea limbajului natural” (Automatic diacritics restoration method using electronically collected texts with applications in natural language processing), patent registered at OSIM, patent no. RO130875-A2 / 29.01.2016.

International collaborations

National collaborations

  • Softwin R&D: several research projects in digital signal processing and natural language processing.

Prizes & Awards

  • Romanian Academy prize “Mihail Drăgănescu” (2016), for outstanding research contributions in Spoken Language Technology.

Benchmarking

  • MediaEval 2014 (Benchmarking Initiative for Multimedia Evaluation): Query by Example Search on Speech Task with SpeeD, University Politehnica of Bucharest, Romania.
  • MediaEval 2013 (Benchmarking Initiative for Multimedia Evaluation): Spoken Web Search Task with SpeeD, University Politehnica of Bucharest, Romania.
  • MediaEval 2012 (Benchmarking Initiative for Multimedia Evaluation): Spoken Web Search Task with LAPI & SpeeD, University Politehnica of Bucharest, Romania.

Affiliation to scientific community

  • Member of International Speech Communication Association (ISCA), Sep 2014 – present;
  • Member of European Association for Signal Processing (EURASIP), Jan 2013 – present;
  • Member of SpeeD Laboratory, Oct 2008 – present;

Reviewer

  • Signal, Image and Video Processing Journal, Springer Verlag, Jun 2016 – present;
  • Language Resources and Evaluation Journal, Springer Verlag, Oct 2013 – present;
  • Scientific Bulletin of University “Politehnica” of Bucharest (Series C), Jan 2013 – present;
  • International Workshop on Content-based Multimedia Indexing – CBMI 2017
  • International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2011, 2013, 2015, 2017
  • European Signal Processing Conference – EUSIPCO 2012

Chair and organizer

  • Publicity co-chair for the Annual ACM International Conference on Multimedia Retrieval – ICMR 2017
  • Publications chair for the 9th International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2017
  • Proceedings co-chair and session chair for the 14th International Workshop on Content-Based Multimedia Indexing – CBMI 2016
  • Member of the organizing committee for the 8th International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2015

List of publications

PhD thesis

Horia Cucu, “Towards a speaker-independent, large-vocabulary continuous speech recognition system for Romanian”, PhD Thesis, University “Politehnica” of Bucharest, Oct 2011 (scientific coordinator: prof. Corneliu Burileanu).

Books and book chapters

  • Horia Cucu, “Research and Development Project in Spoken Language Technology”, Laboratory Guide, Politehnica Press Publishing House, Bucharest, 2013, ISBN: 978-606-515-482-7.
  • Corneliu Burileanu, Cristina Sorina Petrea, Andi Buzo, and Horia Cucu, “Speech Recognition Experiments Starting from Isolated Words for Spoken Romanian Language”, book chapter in D. Tufiş, Corina Forăscu (Eds.), “Multilinguality and Interoperability in Language Processing with Emphasis on Romanian”, Publishing House of the Romanian Academy, Bucharest 2010, pp. 229-242, ISBN: 978-973-27-1972-5.

Journal papers

2016

2015

2014

2013

  • Radu-Sebastian Marinescu, Andi Buzo, Horia Cucu, Corneliu Burileanu, “Applying the Accumulation of Cross-Power Spectrum Technique for Traditional Generalized Cross-Correlation Time Delay Estimation”, in International Journal on Advances in Telecommunications, vol 6, no 3-4, Dec 2013, pp 98-108, ISSN: 1942-2601.

2011

2008

  • Corneliu Burileanu, Cristina-Sorina Petrea, Andi Buzo, Horia Cucu and Alina Pasca, “Report on building a tool for Romanian spontaneous speech recognition”, in “The Phonetician”, Number 97 / 2008-I-II, 2008, pp. 68-98, ISSN: 0741-6164.

Conference papers

2017

  • Valentin Andrei, Horia Cucu, Corneliu Burileanu, “Detecting overlapped speech on short timeframes using deep learning,” in the Proceedings of the 18th Annual Conference of the International Speech Communication Association (Interspeech), Stockholm, Sweden 2017.
  • Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu, “SpeeD’s DNN Approach to Romanian Speech Recognition,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 8p, ISBN 978-1-5090-6496-0.
  • Alexandru Caranica, Horia Cucu, Corneliu Burileanu, François Portet, Michel Vacher, “Speech Recognition Results for Voice-controlled Assistive Applications,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 8p, ISBN 978-1-5090-6496-0.
  • Gheorghe Pop, Dragoş Drăghicescu, Dragoş Burileanu, Horia Cucu, Corneliu Burileanu, “Fast Method for ENF Database Build and Search,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 6p, ISBN 978-1-5090-6496-0.
  • Ștefan-Stelian Diaconescu, Monica-Mihaela Rizea, Mihaela Ionescu, Andrei Mincă, Liviu Dorobanțu, Ștefan Fulea, Monica Rădulescu, Horia Cucu, Dragoș Burileanu, “Building a Representative Audio Base of Syllables for Romanian Language,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 10p, ISBN 978-1-5090-6496-0.

2016

  • Tiberiu Boros, Stefan Daniel Dumitrescu, Horia Cucu, “Voice Controlled Home Automation System”, in the Proceedings of CONSILR 2016, Romania, 2016.
  • Ioana-Alina Bănică, Horia Cucu, Andi Buzo, Dragoş Burileanu and Corneliu Burileanu, “Automatic Methods for Infant Cry Classification,” in the Proceedings of the 11th International Conference on Communications (COMM), Bucharest, Romania, 2016, pp. 51-54.
  • Alexandru Caranica, Horia Cucu, Andi Buzo, “Exploring an Unsupervised, Language Independent, Spoken Document Retrieval System,” in the Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing (CBMI), Buchareet, Romania, 2016.
  • Ioana-Alina Bănică, Horia Cucu, Andi Buzo, Dragoş Burileanu and Corneliu Burileanu, “Baby Cry Recognition in Real-World Conditions,” in the Proceedings of the 39th International Conference on Telecommunications and Signal Processing (TSP), Vienna, Austria, 2016, pp. 315-318, ISSN 1805-5435.

2015

  • Horia Cucu, Alexandru Caranica, Andi Buzo, Corneliu Burileanu, “On transcribing informally-pronounced numbers in Romanian speech,” in the Proceedings of the 38th International Conference on Telecommunications and Signal Processing (TSP), Prague, Czech Republic, 2015, pp. 372-376.
  • Valentin Andrei, Horia Cucu, Andi Buzo, Corneliu Burileanu, “Counting competing speakers in a time frame – human versus computer,” in the Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015, pp. 3999-4003, ISSN 1990-9770.
  • Horia Cucu, Andi Buzo, Corneliu Burileanu, “ASR errors in transcribing informal pronunciations of Romanian numbers,” in the Proceedings of the 2nd Workshop on Errors by Humans and Machines in multimedia, multimodal and multilingual data processing (ERRARE), Sinaia, Romania.
  • Bogdan Luduşan, Alexandru Caranica, Horia Cucu, Andi Buzo, Corneliu Burileanu, Emmanuel Dupoux, “Exploring Multi-Language Resources for Unsupervised Spoken Term Discovery,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 17-22, ISBN 978-1-4673-7559-7.
  • Gheorghe Pop, Alexandru Caranica, Horia Cucu, Dragoş Burileanu, “Sound Event Recognition in Smart Environments,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 45-50, ISBN 978-1-4673-7559-7.
  • Valentin Andrei, Horia Cucu, Andi Buzo, Corneliu Burileanu, “Estimating Competing Speaker Count for Blind Speech Source Separation,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 165-172, ISBN 978-1-4673-7559-7.
  • Mihai Dogariu, Horia Cucu, Andi Buzo, Dragoş Burileanu, Octavian Fratu, “Speech Database Acquisition for Assisted Living Environment Applications,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 191-196, ISBN 978-1-4673-7559-7.
  • Mihai Dogariu, Horia Cucu, Andi Buzo, Dragoş Burileanu, Octavian Fratu, “Speech Applications in the eWALL Project,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 197-204, ISBN 978-1-4673-7559-7.
  • Alexandru Caranica, Andi Buzo, Horia Cucu, Corneliu Burileanu, “SpeeD @ MediaEval 2015: Multilingual phone recognition approach to Query by example STD,” in the Proceedings of the MediaEval 2015 Multimedia Benchmark Workshop, Wurzen, Germany, 2015, ISSN: 1613-0073.

2014

2013

2012

2011

2010

List of citations

An up-to-date list of citations for my research work is provided by the Google Scholar Citations web-service. For convenience, a selection of the citations is also provided below.

2017

  • Torres, Rafael, Daniele Battaglino, and Ludovick Lepauloux. “Baby Cry Sound Detection: A Comparison of Hand Crafted Features and Deep Learning Approach.” In International Conference on Engineering Applications of Neural Networks, pp. 168-179. Springer, Cham, 2017.
  • Shen, Chia-Hao, Janet Y. Sung, and Hung-Yi Lee. “Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data.” arXiv preprint arXiv:1707.06519 (2017).
  • Stan, Adriana, Florina Dinescu, Cristina Ţiple, Şerban Meza, Bogdan Orza, Magdalena Chirilă, and Mircea Giurgiu. “The SWARA speech corpus: A large parallel Romanian read speech dataset.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Toma, Ştefan-Adrian, Adriana Stan, Mihai-Lică Pura, and Traian Bârsan. “MaRePhoR—An open access machine-readable phonetic dictionary for Romanian.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Suciu, George, Ştefan-Adrian Toma, and Romulus Cheveresan. “Towards a continuous speech corpus for banking domain automatic speech recognition.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Dumitrescu, Stefan Daniel. “Cassandra smart-home system description.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Boros, Tiberiu, Stefan Daniel Dumitrescu, and Sonia Pipa. “CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface.” EACL 2017 (2017): 33.
  • Rostamzadeh, Negar, “Video Scene Understanding: Semantic-based representation, Temporal Variation Modeling, Multi-Task Learning,” PhD Thesis, Universita degli Studi di Trento, Apr 2017 (scientific coordinator: prof. Nicu Sebe).
  • Tengtrairat, N., P. Parathai, and W. L. Woo. “Blind 2D signal direction for limited-sensor space using maximum likelihood estimation.” Asia-Pacific Journal of Science and Technology 22.2 (2017): 42-49.
  • Alaoui, EM Ismaili, and E. Ibn-Elhaj. “A comparative study of new HOS-based estimators for moving objects in noisy video sequence.” Signal, Image and Video Processing (2017): 1-8.

2016

  • Sun, Q., and Zhao, X. “Speech enhancement based on maximum likelihood adaptive subspace estimation.” Revista De La Facultad De Ingenieria, vol. 31, iss. 9 (2016), pp. 48-59. doi:10.21311/002.31.9.06
  • Lopez-Otero, Paula, Laura Docio-Fernandez and Carmen Garcia-Mateo, “Better Phoneme Recognisers Lead to Better Phoneme Posteriorgrams for Search on Speech? An Experimental Analysis”, In Advances in Speech and Language Technologies for Iberian Languages, Lecture Notes in Computer Science, vol. 10077, pp 128-137, November 2016.
  • Koctúr, Tomáš, Ján Staš, and Jozef Juhár. “Unsupervised acoustic corpora building based on variable confidence measure thresholding.” ELMAR, 2016 International Symposium. 2016.
  • Gorin, Arseniy, Rasa Lileikyte, Guangpu Huang, Lori Lamel, Jean-Luc Gauvain, and Antoine Laurent. “Language Model Data Augmentation for Keyword Spotting in Low-Resourced Training Conditions.” In 17th Annual Conference of the International Speech Communication Association (INTERSPEECH). 2016.
  • Chen, Hongjie, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li. “Unsupervised bottleneck features for low-resource query-by-example spoken term detection.” In 17th Annual Conference of the International Speech Communication Association (INTERSPEECH). 2016.
  • Yu, Ling, Tian-shuang Qiu, and Ai-min Song. “A Time Delay Estimation Algorithm Based on the Weighted Correntropy Spectral Density.” Circuits, Systems, and Signal Processing (2016): 1-14.
  • Egorova, Ekaterina, and Jordi Luque Serrano, “Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data,” Procedia Computer Science, vol. 81 (2016): 114-120, doi:10.1016/j.procs.2016.04.038.
  • Koctúr, Tomas, Peter Viszlay, Jan Staš, Martin Lojka and Jozef Juhár. “Unsupervised speech transcription and alignment based on two complementary ASR systems.” In 2016 26th International Conference Radioelektronika, pp. 358-362. IEEE, 2016.

2015

  • Mao, H., and L. Zhang. “An improved accumulated cross-power spectrum phase method for time delay estimation.” In 2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), pp. 563-566. IEEE, 2015.
  • Voiron, Nicolas, “Structuration de bases multimédia pour une exploration visuelle,” PhD Thesis, Grenoble Alpes University, Dec 2015 (scientific coordinator: prof. Patrick Lambert).
  • Vasilescu, Ioana, Camille Dutrey, and Lori Lamel, “Large scale data based linguistic investigations using speech technology tools: The case of Romanian,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015.
  • Diaconescu, Ştefan-Stelian, Monica-Mihaela Rizea, Felicia-Carmen Codîrlaşu, Mihaela Ionescu, Monica Rădulescu, Andrei Mincă, Ştefan Fulea, “Methods for Automatic Generation of GRAALAN-based Phonetic Databases,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 135-142, ISBN 978-1-4673-7559-7.
  • Lee, Lin-shan, James Glass, Hung-yi Lee, and Chun-an Chan. “Spoken Content Retrieval—Beyond Cascading Speech Recognition with Text Retrieval.” Audio, Speech, and Language Processing, IEEE/ACM Transactions on, vol. 23, no. 9 (2015): 1389-1420.
  • Segal, Natalia, Hélène Bonneau-Maynard, and François Yvon. “Traduire la parole: le cas des TED Talks.” Traitement automatique des langues (TAL) (2015).
  • Teodorescu, Horia-Nicolai. “Fuzzy Logic in Speech Technology-Introductory and Overviewing Glimpses.” Fifty Years of Fuzzy Logic and its Applications. Springer International Publishing, 2015. 581-608.
  • Karpov, Alexey and Vasilisa Verkhodanova. “Speech Technologies for Under-Resourced Languages of the World.” Voprosy Jazykoznanija, vol. 2015, no. 2 (2015): 117-135.
  • Choi, Junhwi, S. Ryu, K. Lee, G.G. Lee. “One-step error detection and correction approach for voice word processor.” IEICE Transactions on Information and Systems, vol. E98D, no. 8 (2015): 1517-1525.

2014

  • Metze, Florian, Xavier Anguera, Etienne Barnard, Marelie Davel, and Guillaume Gravier. “Language independent search in MediaEval’s Spoken Web Search task.” Computer Speech & Language 28, no. 5 (2014): 1066-1082.
  • Schiopu, Daniela, and Mihaela Oprea. “Using neural networks for a discriminant speech recognition system.” In Development and Application Systems (DAS), 2014 International Conference on, pp. 165-169. IEEE, 2014.
  • Pinnis, Mārcis, Ilze Auziņa, and Kārlis Goba. “Designing the Latvian Speech Recognition Corpus.” In Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC’14). 2014.
  • Anguera, Xavier, Luis J. Rodriguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, and Mikel Penagarikano. “Query-by-Example Spoken Term Detection on Multilingual Unconstrained Speech.” In Fifteenth Annual Conference of the International Speech Communication Association. 2014.
  • Anguera, Xavier, Luis J. Rodriguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, and Mikel Penagarikano. “Query-by-Example Spoken Term Detection Evaluation on Low-Resource Languages.” In Spoken Language Technologies for Under-Resourced Languages. 2014.
  • Besacier, Laurent, Etienne Barnard, Alexey Karpov, and Tanja Schultz. “Automatic speech recognition for under-resourced languages: A survey.” Speech Communication 56 (2014): 85-100.
  • Anguera, Xavier, Jordi Luque, and Ciro Gracia. “Audio-to-text alignment for speech recognition with very limited resources.” In Fifteenth Annual Conference of the International Speech Communication Association. 2014.
  • Domokos, József, Ovidiu Buza, and Gavril Toderean. “Romanian phonetic transcription dictionary for speeding up language technology development.” Language Resources and Evaluation (2014): 1-15.
  • Şchiopu, Daniela. “Applying Nonlinear Techniques for an Automatic Speech Recognition System.” In Nonlinear Dynamics of Electronic Systems, pp. 371-378. Springer International Publishing, 2014.
  • Sigappi, A. N., and S. Palanivel. “Spoken query based word spotting in digitized Tamil documents.” AI & society 29, no. 1 (2014): 113-121.
  • Vasilescu, Ioana, Bianca Vieru, and Lori Lamel. “Exploring pronunciation variants for Romanian speech-to-text transcription.” In Spoken Language Technologies for Under-Resourced Languages. 2014.
  • Tarján, Balázs, Tibor Fegyό, and Péter Mihajlik. “A Bilingual Study on the Prediction of Morph-Based Improvement.” In Spoken Language Technologies for Under-Resourced Languages. 2014.
  • Hafeez, Aurish Hammad, Khawaja Mohiuddin, and Sohaib Ahmed. “Speaker-Dependent Live Quranic Verses Recitation Recognition System Using Sphinx-4 Framework.” In Multi-Topic Conference (INMIC), 2014 IEEE 17th International, pp. 333-337, Karachi, Pakistan, 2014.
  • Stahlberg, Felix. “Towards Automatic Speech Recognition for Non-Written Languages Using Translations From Other Languages.”, Master Thesis, Karlsruhe Institute of Technology, 2014.

2013

  • Metze, Florian, Xavier Anguera, Etienne Barnard, Marelie Davel, and Guillaume Gravier. “The spoken web search task at MediaEval 2012.” In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8121-8125. IEEE, 2013.
  • Tejedor, Javier, Doroteo T. Toledano, Xavier Anguera, Amparo Varona, Lluís F. Hurtado, Antonio Miguel, and José Colás. “Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion.” EURASIP Journal on Audio, Speech, and Music Processing 2013, no. 1 (2013): 1-17.
  • Mironica, Ionut, Jasper Uijlings, Negar Rostamzadeh, Bogdan Ionescu, and Nicu Sebe. “Time matters!: capturing variation in time in video using fisher kernels.” In Proceedings of the 21st ACM international conference on multimedia, pp. 701-704. ACM, 2013.
  • Schmiedeke, Sebastian, Peng Xu, Isabelle Ferrané, Maria Eskevich, Christoph Kofler, Martha A. Larson, Yannick Estève, Lori Lamel, Gareth JF Jones, and Thomas Sikora. “Blip10000: a social video dataset containing SPUG content for tagging and retrieval.” In Proceedings of the 4th ACM Multimedia Systems Conference, pp. 96-101. ACM, 2013.
  • Mironica, Ionut, Bogdan Ionescu, Peter Knees, and Patrick Lambert. “An in-depth evaluation of multimodal video genre categorization.” In Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on, pp. 11-16. IEEE, 2013.
  • Younessian, Ehsan, and Deepu Rajan. “Multi-modal fusion for associated news story retrieval.” Multimedia Tools and Applications (2013): 1-23.
  • Mironica, Ionut, Bogdan Ionescu, Christoph Rasche, and Patrick Lambert. “A visual-based late-fusion framework for video genre classification.” In Signals, Circuits and Systems (ISSCS), 2013 International Symposium on, pp. 1-4. IEEE, 2013.
  • Nakajima, Kaisuke, and Brian Strope. “Cross-lingual initialization of language models.” U.S. Patent 8,442,830, issued May 14, 2013.
  • Ungurean, Cătălin, Dragoş Burileanu, and Mihai Surmei. “Statistically augmented preprocessing/normalization module for a Romanian text-to-speech system.” In Speech Technology and Human-Computer Dialogue (SpeD), 2013 7th Conference on, pp. 1-6. IEEE, 2013.

2012

  • A.M. Riad, Hamdy K.Elmonier, Samaa. M. Shohieb, A.S. Asem, “SignsWorld; Deeping Into the Silence World and Hearing Its Signs (State of the Art),” International Journal of Computer Science & Information Technology (IJCSIT), Vol 4, No 1, Feb 2012.
  • Ordean, Mihai Alexandru, Andrei Şaupe, Mihaela Ordean, Gheorghe Cosmin Silaghi, and Corina Giurgea. “A Romanian Language Corpus for a Commercial Text-To-Speech Application.” In Text, Speech and Dialogue, pp. 405-414. Springer Berlin Heidelberg, 2012.
  • Ananthi, S., and P. Dhanalakshmi. “Speech Recognition System and Isolated Word Recognition based on Hidden Markov Model (HMM) for Hearing Impaired.” International Journal of Computer Applications 73, no. 20 (2012).
  • Tarján, B., T. Mozsolics, A. Balog, D. Halmos, T. Fegyó, and P. Mihajlik. “Broadcast news transcription in Central-East European languages.” In IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom), pp. 59-64. 2012.
  • Karpov, Alexey, Irina S. Kipyatkova, and Andrey Ronzhin. “Speech recognition for east Slavic languages: the case of Russian.” In SLTU, pp. 84-89. 2012.

2011

  • Ong, H. F., and A. M. Ahmad. “Malay Language Speech Recogniser with Hybrid Hidden Markov Model and Artificial Neural Network (HMM/ANN).” International Journal of Information and Education Technology 1.2 (2011): 114.