Horia Cucu


Associate Professor, PhD, Faculty of Electronics, Telecommunications and Information Technology,
University Politehnica of Bucharest

Email: horia.cucu@upb.ro
Tel: +4021 402 4635
Office: Room 204, UPB CAMPUS Research Center


Last update: June 2024

  • Scientific Activity
    • Scientific traceroute and achievements
    • International research projects
    • National research projects
    • International Collaborations
    • National Collaborations
    • Benchmarking
    • Affiliation to scientific community
  • List of publications
    • PhD Thesis
    • Books and book chapters
    • Journal and conference papers
  • List of citations

Research Interests

  • Machine learning and Deep learning
  • Spoken Language Technology (Automatic Speech/Speaker Recognition, Speech Indexing, Spoken Term Detection)
  • Natural Language Processing (Statistical Language Modeling, Phonetization, Diacritics Restoration)
  • Autonomous Robotic Systems
  • Statistical Machine Translation

Degrees and Professional Experience

Degrees

  • PhD Degree, in Electronics and Telecommunications (Oct 2011), University “Politehnica” of Bucharest.
  • Engineer Degree, in Applied Electronics (June 2008), University “Politehnica” of Bucharest.
  • Bachelor Degree (June 2003), “Dr. Ioan Mesota” National College, Braşov.

Professional experience

  • Associate Professor (Oct 2017 – present) at Faculty of Electronics, Telecommunications and Information Technology, University Politehnica of Bucharest. Teach Microcontrollers and Embedded Systems, Microprocessor Architectures and Spoken Language Technology. Supervised BSc, MSc and PhD projects in speech recognition and embedded systems.
  • Project Responsible and Researcher (Jan 2023 – present) at POLITEHNICA Bucharest, in the project “AI-based-technologies for trustworthy solutions against disinformation” (AI4TRUST), R&D project funded by the European Union through Horizon Europe programme, ID 101070190, project coordinator Fondazione Bruno Kessler, Trento, Italy.
  • Project Director and Researcher (June 2024 – May 2025) at POLITEHNICA Bucharest, in the project “Generative AI for Requirement Engineering (GenAI4RE)”, R&D project funded by Infineon Technologies Romania & Co. SCS, contract no. 4520242912.
  • Project Director and Researcher (Jan 2024 – Dec 2025) at POLITEHNICA Bucharest, in the project “Tehnologii bazate pe inteligență artificială pentru soluții de încredere împotriva dezinformării” (AI4TRUST-RO), R&D project funded by the Romanian Government through UEFISCDI, ID PN-IV-P8-8.1-PRE-HE-ORG-2023-0078, contract no. 27PHE/ 2023, project coordinator POLITEHNICA Bucharest.
  • Project Responsible and Researcher (Jun 2022 – Jul 2024) at POLITEHNICA Bucharest, in the project “AI-assisted Verification of Smart Integrated Circuits” (AVESIC), R&D project funded by the Romanian Government through UEFISCDI, ID PN-III-P2-2.1-PTE-2021-0460, contract no. 90PTE ⁄ 2022, project coordinator Infineon Technologies Romania & Co. SCS.
  • Project Director and Researcher (June 2023 – May 2024) at POLITEHNICA Bucharest, in the project “Efficiency and Effectiveness of Circuit Sizing using Machine Learning Methods (EFFSIZE)”, R&D project funded by Infineon Technologies Romania & Co. SCS, contract no. 4520158364.
  • Project Responsible and Researcher (Mar 2023 – Dec 2023) at University Politehnica of Bucharest, in the project “Integrated voice to text analytics system” (VOITA), R&D project funded by the Romanian Government through UEFISCDI, SMIS ID 156387, contract no. 450/390126/31.03.2023, project coordinator Bold Technologies SRL.
  • Project Director and Researcher (June 2022 – June 2023) at University Politehnica of Bucharest, in the project “Efficiency and Effectiveness of Circuit Verification using Machine Learning Methods v3”, R&D project funded by Infineon Technologies Romania & Co. SCS.
  • Project Director and Researcher (June 2021 – June 2022) at University Politehnica of Bucharest, in the project “Efficiency and Effectiveness of Circuit Verification using Machine Learning Methods v2”, R&D project funded by Infineon Technologies Romania & Co. SCS.
  • Project Director (Sep 2020 – Aug 2022) at University Politehnica of Bucharest, in the project “Artificial intelligence-assisted intelligent integrated circuit design” (DAIA), R&D project funded by the Romanian Government through UEFISCDI, ID PN-III-P2-2.1-PTE-2019-0861, contract 63PTE ⁄ 2020, project coordinator Infineon Technologies Romania & Co. SCS.
  • Team Lead and Researcher (Aug 2020 – present) at University Politehnica of Bucharest, in the project “Aerosol climatology – from remote sensing measurements to deep learning” (CLARA), R&D project funded by the Romanian Government through UEFISCDI, project coordinator National Institute of Research and Development for Optoelectronics.
  • Project Director and Researcher (May 2020 – May 2021) at University Politehnica of Bucharest, in the project “Efficiency and Effectiveness of Circuit Verification using Machine Learning Methods”, R&D project funded by Infineon Technologies Romania & Co. SCS.
  • Project Manager and Researcher (Aug 2019 – Dec 2019) at University Politehnica of Bucharest, in the project “RISC V-based hardware-software system for Machine Learning Applications”,  R&D project funded by NXP Semiconductors Romania SRL.
  • Project Director and Researcher (Feb 2019 – May 2020) at University Politehnica of Bucharest, in the project “Multi-objective Optimization for Analog / Mixed-Signal Circuit Designs”, R&D project funded by Infineon Technologies Romania & Co. SCS.
  • Project Manager and Researcher (Mar 2018 – Apr 2021) at University Politehnica of Bucharest, in the project “Technologies for automatic annotation of audio data and for the creation of automatic speech recognition interfaces” (TADARAV) within the complex project “Resources and Technologies for Developing of Human-Machine Interfaces in Romanian” (ReTeRom), R&D project funded by the Romanian Government through UEFISCDI, project coordinator Research Institute for Artificial Intelligence, Romanian Academy.
  • Researcher (Feb 2017 – Apr 2020) at University Politehnica of Bucharest, in the project “Intelligent Systems for Video and Audio Analysis — Technologies and Innovative Video Systems for Person Re-identification and Analysis of Dissimulated Behavior” (SPIA-VA), R&D project funded by the Romanian Government through UEFISCDI, project coordinator University Politehnica of Bucharest.
  • Project Director and Researcher (Apr 2019 – Jul 2019) at University Politehnica of Bucharest, in the project “Non-native English Automatic Speech Recognition System”, R&D project funded by Arnia Software SRL.
  • Project Manager (May 2017 – May 2019) at Autonomous Systems, in the project “Automatic Interpretation of Images and Video Sequences Using Natural Language Processing” (IAVPLN), R&D project funded by Ministry of Investments and European Projects, SMIS ID 109513, contract no. 160/13.01.2017.
  • Lecturer (Oct 2012 – Oct 2017) at Faculty of Electronics, Telecommunications and Information Technology, University Politehnica of Bucharest. Teach Microcontrollers and Embedded Systems, Microprocessor Architectures and Spoken Language Technology. Supervised BSc, MSc and PhD projects in speech recognition and embedded systems.
  • Project Director and Researcher (July 2014 – Sep 2017) at University Politehnica of Bucharest, in the project “Natural-language, Voice-controlled Assistive System for Intelligent Buildings” (ANVSIB), R&D project funded by the Romanian Government through UEFISCDI, project coordinator University Politehnica of Bucharest.
  • Researcher (Oct 2014 – Sep 2017) at University Politehnica of Bucharest, in the project “Romanian Language Phonetic Analysis: Study and applications” (AFLR), R&D project funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group.
  • Researcher (Oct 2014 – Jul 2017) at University Politehnica of Bucharest, in the project “Automatic Infant Crying Recognition System” (SPLANN), R&D project funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group.
  • Post-doctoral Researcher (May 2014 – Nov 2015) at University Politehnica of Bucharest, in the KNOWLEDGE project (FSE – European Structural Funds POS-DRU project), project coordinator University Politehnica of Bucharest. Developed several enhancement modules for the Large Vocabulary Continuous Speech Recognition (LVCSR) system for the Romanian language.
  • Implementing expert (May 2014 – Nov 2015) at University Politehnica of Bucharest, in the PRACSIS project (FSE – European Structural Funds POS-DRU project), project coordinator University Politehnica of Bucharest. Career-counselled over 45 students.
  • Research Engineer (Jan 2014 – Jan 2016) at University Politehnica of Bucharest, in the project “eWALL for Active Long Living” (eWALL), R&D project funded by the European Commission through the 7th Framework Programme. Coordinated the development of an audio-based visitors monitoring application, a multilingual spoken command detection system and a cough detection system for monitoring cough crises.
  • Project Director and Research Engineer (Oct 2013 – Jun 2014) at University Politehnica of Bucharest, in the project “Noise-robust, domain-adaptable, large-vocabulary automatic speech recognition system for the Romanian language” (LVCSR-ROM), R&D project funded by the Romanian-American Foundation. Coordinated a team of four senior researchers. Developed several enhancement modules for the Large Vocabulary Continuous Speech Recognition (LVCSR) system for the Romanian language. Created a web-service which provides rich speech transcriptions for multimedia files.
  • IT Consultant (Dec 2013 – Jan 2014) for Intelligent IT, Sibiu, Romania. Designed and implemented a Natural Language Processing software module for a personal assistant smartphone application (OmniBuddy).
  • Implementing Expert (Jun 2013 – Sep 2013) at University Politehnica of Bucharest, in the CASIA project (FSE – European Structural Funds POS-DRU project), project coordinator Research Institute for Artificial Intelligence, Romanian Academy. Coordinated and supervised 14 students for their summer internships in speech and language processing.
  • Implementing Expert (Jan 2011 – Aug 2013) at University Politehnica of Bucharest, in the PROMISE project (FSE – European Structural Funds POS-DRU project), project coordinator University Politehnica of Bucharest. Developed the teaching infrastructure for several subjects within a new Master programme (BIOSINF).
  • Teaching Assistant (Oct 2008 – Oct 2012) at Faculty of Electronics, Telecommunications and Information Technology, University Politehnica of Bucharest. Teaching Microcontrollers and Embedded Systems, Microprocessor Architectures, Object-Oriented Programming and Spoken Language Technology. Supervising BSc and MSc projects in speech recognition and embedded systems.
  • IT Consultant (Jan 2011 – Feb 2011) for RSM Scot, Bucharest, Romania. Designed and implemented an employees timesheet software application.
  • IT Consultant (Oct 2009 – June 2010) for Grob Technologies Inc., Massachusetts, USA. Designed and implemented the server-side software application of a web-based service for social-networking tracking (What You Post).
  • Research Engineer (Aug 2007 – Mar 2009) at University Politehnica of Bucharest, in the project PALIROM, R&D project funded by the Romanian Government, project coordinator Softwin Group. Developed natural language resources for Romanian, implemented and tested a natural language compiler.
  • Research Engineer (Aug 2007 – Mar 2009) at University Politehnica of Bucharest, in the project BIOACS, R&D project funded by the Romanian Government, project coordinator Softwin Group. Collected a database of digital handwritten signatures and developed digital signal processing algorithms for handwritten signature recognition.
  • Software Engineer (Jan 2006 – Jan 2009) at Ubicore Technology, Bucharest, Romania. Designed and implemented several video processing algorithms on a new massive parallel CPU, developed a distributed-computing application for highly computational tasks, developed a complete debugger tool for a new CPU architecture.

Academic Activity

Teaching

All my teaching activity takes place in the Faculty of Electronics, Telecommunications and Information Technology, University “Politehnica” of Bucharest.

Master curricula

  • Spoken Language Technology, research project (2012 – present);
  • Microcontrollers and Embedded Systems, laboratory (2010 – present).

Bachelor curricula

  • Microprocessors Architecture, course (2012 – present) and laboratory (2008 – present);
  • Microcontrollers, course (2012 – present) and laboratory (2008 – present);
  • Object Oriented Programming (Java), laboratory (2010).

Teaching books

  • Elena-Diana Şandru, George V. Popescu, Horia Cucu, Corneliu Burileanu, “Microprocessor Architecture”, Laboratory Guide, 150p, MatrixRom Publishing House, Bucharest, 2020, ISBN 978-606-25-0548-6.
  • Elena-Diana Şandru, Horia Cucu, Corneliu Burileanu, “Microprocessors Architecture”, Laboratory Guide, 120p, MatrixRom Publishing House, Bucharest, 2018, ISBN 978-606-25-0443-4.
  • Horia Cucu, “Research and Development Project in Spoken Language Technology”, Laboratory Guide, Politehnica Press Publishing House, Bucharest, 2013, ISBN: 978-606-515-482-7.
  • Electronic support for “Microprocessors Architecture” laboratory;
  • Electronic support for “Microcontrollers” laboratory;
  • Electronic support for “Spoken Language Technology” research project.

Strategic programmes

  • 2014-2015: implementing expert, project PRACSIS (“Partnership for a successful career in information security and information systems”), FSE – European Structural Funds POS-DRU project, project coordinator University Politehnica of Bucharest, ID POSDRU/161/2.1/G/135813.
  • 2013: implementing expert, project CASIA (“Support for a successful career in artificial intelligence”), FSE – European Structural Funds POS-DRU project, project coordinator University „Politehnica” of Bucharest, ID POSDRU/109/2.1/G/81772.
  • 2010-2013: implementing expert, project PROMISE (“An Integrated Master’s Degree Programme in the Fields of Sound, Image and Multimedia Engineering”), FSE – European Structural Funds POS-DRU project, project coordinator University „Politehnica” of Bucharest, ID POSDRU/86/1.2/S/61810.

National collaborations

PhD student supervision and PhD jury member

Since 2012, I was part of the PhD supervision committee for several PhD students. Some of them already defended their theses:

  • 2024 – present: Dan Curăvale
  • 2024 – present: Gabriel Pîrlogeanu
  • 2024 – present: Octavian Pascu
  • 2024 – present: Alexandru-Nicolae Guzu
  • 2021 – present: Cristian Manolache, “Optimization Techniques and Machine Learning Applied in Analogue Circuit Verification”, POLITEHNICA Bucharest (supervisor prof. C. Burileanu).
  • 2021 – 2024: Cătălin Vișan, “Artificial Intelligence Techniques for Integrated Circuit Design Automation”, POLITEHNICA Bucharest (supervisor prof. C. Burileanu).
  • 2019 – 2024: Andrei Gaiţă, “Machine Learning Methods for Supporting Verification of Analog Integrated Circuits”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2019 – 2023: Georgian Nicolae, “Machine Learning Applications in Power MOSFET Design”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2018 – 2022: Lucian Georgescu, “Methods and Technologies of Artificial Intelligence Applied in Speech Technology”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2018 – 2022: Mihai Boldeanu, “Automatic Pollen Classification Using Deep Learning Techniques”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2018 – 2022: Constantin Dragos Nicolae, “Aplicatii NLP pentru dialogurile cu un robot orientate catre sarcini”, Romanian Academy Research Institute for Artificial Intelligence “Mihai Drăgănescu” (supervisor acad. I.-D. Tufiș).
  • 2017 – 2022: Șerban Mihalache, “Speech Signal Analysis and Processing Techniques in Forensic Speech”, University Politehnica of Bucharest (supervisor prof. D. Burileanu).
  • 2016 – 2022: Elena-Diana Şandru, “Data-Driven Fabrication Process Variation Assessment in Circuit Design Analysis Using Machine Learning”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2015 – 2020: Gheorghe Pop, “Contributions to forensics expertise for audio recordings”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2015 – 2020: Ciprian Pop, “Application-Aware Lifetime Estimation of Power Devices,” University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2015 – 2020: Mihai Gabriel Constantin, “Automatic analysis the of visual impact of multimedia data,” University Politehnica of Bucharest (supervisor prof. B. Ionescu).
  • 2015 – 2019: Bogdan Boteanu, “Machine learning techniques for information retrieval systems,” University Politehnica of Bucharest (supervisor prof. B. Ionescu).
  • 2013 – 2016: Andrei Purică, “Semantic Video Coding”, co-tutelle thesis between University Politehnica of Bucharest (supervisor prof. C. Burileanu) and Telecom ParisTech (supervisor prof. F. Dufaux, prof. B. Pesquet).
  • 2012 – 2015: Alexandru Caranica, “Optimizations in spoken language recognition“, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2012 – 2015: Valentin Andrei, “Contributions to computational auditory scene analysis methods for continuous speech recognition“, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • 2012 – 2015: Anca-Livia Radu, “Large Scale Media Analysis via Media Fusion and Crowdsourcing”, co-tutelle thesis between University Politehnica of Bucharest (supervisor prof. C. Burileanu) and University of Trento (supervisor prof. F. Giunchiglia).

Since 2022 I was a member of several national and international juries for PhD Theses:

  • Rishabh Jain (2024), “Child Speech Understanding and Generation via Neural ASR and TTS Models”, School of Electrical and Electronics Engineering, University of Galway (supervisor prof. Peter Corcoran).
  • Sajal Sasmal (2024), “Development of an ASR System for ‘Adi’, a Zero-Resource Indigenous Language of Arunachal Pradesh”, National Institute of Technology, Arunachal Pradesh (supervisor Dr. Yang Saring).
  • Cătălin Vișan (2024), “Artificial Intelligence Techniques for Integrated Circuit Design Automation”, POLITEHNICA Bucharest (supervisor prof. C. Burileanu).
  • Lucian Georgescu (2022), “Methods and Technologies of Artificial Intelligence Applied in Speech Technology”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).
  • Mihai Boldeanu (2022), “Automatic Pollen Classification Using Deep Learning Techniques”, University Politehnica of Bucharest (supervisor prof. C. Burileanu).

Bachelor and Master student supervision

Since 2013, I supervised and coordinated over 20 students during the SpeeD “Speech&Language” Internships.

Since 2009, I co-supervised/coordinated numerous Bachelor of Science and Master of Science projects in Spoken Language Technology, Object Oriented Programming and Embedded Systems:

2020

2019

2018

2017

2016

2015

2014

2013

  • Mihai Cătălin Safta, “Spoken Term Detection for Romanian Language”.

2012

  • Sorin Duţulescu, “Management application for Internet Service Providers (ISP)”.
  • Alexandra Jica, “Topic-Based Language Model Adaptation for Automatic Speech Recognition”.
  • Adrian Liţă, “Multi-Processor 6-Propeller Tridimensional Flying Apparatus”.
  • Florin Matei, “Home Automation System Using Speech Recognition on an Embedded Platform”.
  • Cătălin Stănculescu, “Continuous Speech Recognition Agent for Mobile Devices”.
  • Iulia Stănoiu, “Noisy Speech Enhancement in Automatic Speech Recognition for Romanian Language”.
  • Vanessa Voinic, “Continuous Speech Recognition in Romanian Language for Medical Applications”.

2011

  • Daria Ion, “Speaker-dependent speech recognition: An in-depth analysis of speech features”.
  • Aniela Milea, “Time-domain analysis and compression methods applied to the speech signal”.
  • Radu-Mihai Pană-Tălpeanu, “Language models in speaker-dependent speech recognition”.

2010

  • Tudor Mihailescu, Ioana Rolea, “Speaker recognition systems”.
  • Adina Popa, Diana Uzum, “Speaker-dependent speech recognition”.

Scientific Activity

Scientific research trace-route and achievements

  • Automatic speech recognition for Romanian language (2012 – 2017). Key role in a team which developed the first DNN-based LVCSR system for Romanian language in 2017.
  • Automatic speech recognition for Romanian language (2008 – 2011). Key role in a team which developed the first Large Vocabulary Continuous Speech Recognition (LVCSR) system for Romanian in 2011. Developed the first statistical language model for Romanian language in 2011.
  • Digital signal processing (2007 – 2009). Part of a team which developed several dynamic (accelerometer-based) handwritten signature recognition techniques.
  • Natural language processing (2007 – 2009). Part of a team which developed a natural language compiler for Romanian and several other natural language applications for Romanian.
  • Image and video processing (2006 – 2007). Ported state-of-the-art scaling, de-noising and de-interlacing algorithms on a new massive parallel CPU.

International research projects

  • 2014 – 2016: research engineer (in the UPB research team), project eWALL (“eWALL for Active Long Living”), funded by the European Commission through the 7th Framework Programme, ID FP7-ICT-2013-10, no. 610658.

National research projects

  • 2014 – 2016: project manager and research engineer,  project ANVSIB (“Natural-language, Voice-controlled Assistive System for Intelligent Buildings”), funded by the Romanian Government through UEFISCDI, project coordinator University Politehnica of Bucharest, ID PN-II-PT-PCCA-2013-4-0789.
  • 2014 – 2016: research engineer, project AFLR (“Phonetic Analysis of the Romanian Language”), funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group, ID PN-II-PT-PCCA-2013-4-1451.
  • 2014 – 2016: research engineer, project SPLANN (“Automatic Baby-Language Recognition System”), funded by the Romanian Government through UEFISCDI, project coordinator Softwin Group, ID PN-II-PT-PCCA-2013-4-1443.
  • 2013 – 2014: project manager and research engineer, project LVCSR-ROM (“Noise-robust, domain-adaptable, large-vocabulary automatic speech recognition system for the Romanian language”), funded by Romanian-American Foundation through the Applied Research, Technological Innovation and Entrepreneurship (ARTIE) Fellowship Program.
  • 2007 – 2009: research engineer, project PALIROM (“Applications Package for the Romanian Language”), funded by the Romanian Government through the National Research Authority (“Inovare” programme), project coordinator Softwin Group, ID 10018/26.09.2007.
  • 2007 – 2009: research engineer, project BIOACS (“Biometric System for the Acquisition and Verification of Dynamic Signature”), funded by the Romanian Government through the National Research Authority (“Inovare” programme), project coordinator Softwin Group, ID 10143/28.09.2007.

Patents

  • Stelian Stefan Diaconescu, Adrian Dinescu, Andrei Minca, Stefan Fulea, Mircea Sorin Rusu, Corneliu Burileanu, Horia Cucu, Andi Buzo, “Sistem de recunoastere automata a caracteristicilor din plansetul nou-nascutilor” (Baby-cry classification system), patent application registered at OSIM with no. A/10047/2017.
  • Andi Buzo, Horia Cucu, Lucian Petrică and Dragoş Burileanu, “Metodă și sistem pentru diarizare în timp real a semnalelor audio, utilizate pentru recunoașterea automată a vorbirii și a vorbitorului” (Method and system for real-time diarization of audio signals, with applications in automatic speech and speaker recognition), patent no. 130883 B1 / 27.02.2019, registered at OSIM.
  • Lucian Petrică, Horia Cucu and Andi Buzo, “Metodă pentru restaurarea automată a semnelor diacritice, folosind texte achiziționate electronic, utilizată în procesarea limbajului natural” (Automatic diacritics restoration method using electronically collected texts with applications in natural language processing), patent no. 130875 / 30.09.2020, registered at OSIM.

International collaborations

National collaborations

  • Softwin R&D: several research projects in digital signal processing and natural language processing.

Prizes & Awards

  • Romanian Academy prize “Mihail Drăgănescu” (2016), for outstanding research contributions in Spoken Language Technology.
  • PatriotFest 2019: first prize in section “Agilitatea cunoaşterii”, with the application “Speech Transcriber”.
  • PatriotFest 2018: second prize in section “Agilitatea cunoaşterii”, with the application “LiveTranscriber – Automatic Speech Recognition system for the Romanian language”.

Benchmarking

  • MediaEval 2014 (Benchmarking Initiative for Multimedia Evaluation): Query by Example Search on Speech Task with SpeeD, University Politehnica of Bucharest, Romania.
  • MediaEval 2013 (Benchmarking Initiative for Multimedia Evaluation): Spoken Web Search Task with SpeeD, University Politehnica of Bucharest, Romania.
  • MediaEval 2012 (Benchmarking Initiative for Multimedia Evaluation): Spoken Web Search Task with LAPI & SpeeD, University Politehnica of Bucharest, Romania.

Affiliation to scientific community

  • Member of IEEE, Dec 2018 – present;
  • Member of International Speech Communication Association (ISCA), Sep 2014 – Jun 2017;
  • Member of European Association for Signal Processing (EURASIP), Jan 2013 – Dec 2018;
  • Member of SpeeD Laboratory, Oct 2008 – present;

Reviewer

  • ACM Computing Surveys, 2018 – 2019;
  • IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018-2020;
  • Signal, Image and Video Processing Journal, Springer Verlag, 2016, 2020;
  • Language Resources and Evaluation Journal, Springer Verlag, 2013, 2016, 2017;
  • Romanian Journal of Information Science and Technology, Romanian Academy, 2017;
  • Scientific Bulletin of University “Politehnica” of Bucharest (Series C), Jan 2013 – present;
  • International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2011, 2013, 2015, 2017, 2019
  • International Conference on Communications – COMM 2018
  • International Conference Telecommunications and Signal Processing – TSP 2018, 2019, 2020
  • International Workshop on Content-based Multimedia Indexing – CBMI 2017
  • European Signal Processing Conference – EUSIPCO 2012

Chair and organizer

  • Publications chair and member of the scientific committee for the 11th International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2021
  • Session Chair for the International Conference on Linguistic Resources and Tools for Natural Language Processing –  ConsILR 2020.
  • Publications chair and member of the scientific committee for the 10th International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2019
  • Session Chair for the International Conference on Linguistic Resources and Tools for Natural Language Processing –  ConsILR 2019.
  • Member of the technical program committee for the 12th International Conference on Communications – COMM 2018
  • Publicity co-chair for the Annual ACM International Conference on Multimedia Retrieval – ICMR 2017
  • Publications chair and member of the scientific committee for the 9th International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2017
  • Proceedings co-chair and session chair for the 14th International Workshop on Content-Based Multimedia Indexing – CBMI 2016
  • Member of the organizing committee for the 8th International Conference on Speech Technology and Human-Computer Dialogue – SpeD 2015

List of publications

PhD thesis

Horia Cucu, “Towards a speaker-independent, large-vocabulary continuous speech recognition system for Romanian”, PhD Thesis, University “Politehnica” of Bucharest, Oct 2011 (scientific coordinator: prof. Corneliu Burileanu).

Books and book chapters

  • Horia Cucu, “Research and Development Project in Spoken Language Technology”, Laboratory Guide, Politehnica Press Publishing House, Bucharest, 2013, ISBN: 978-606-515-482-7.
  • Corneliu Burileanu, Cristina Sorina Petrea, Andi Buzo, and Horia Cucu, “Speech Recognition Experiments Starting from Isolated Words for Spoken Romanian Language”, book chapter in D. Tufiş, Corina Forăscu (Eds.), “Multilinguality and Interoperability in Language Processing with Emphasis on Romanian”, Publishing House of the Romanian Academy, Bucharest 2010, pp. 229-242, ISBN: 978-973-27-1972-5.

Journal papers

2024

  • Jain, Rishabh, Andrei Barcovschi, Mariam Yahayah Yiwere, Peter Corcoran, and Horia Cucu. “Exploring Native and Non-Native English Child Speech Recognition With Whisper.” IEEE Access 12 (2024): 41601-41610. ISI IF 3.4 (Q2).
  • Manolache, Cristian, Cristina Andronache, Alexandru Guzu, Alexandru Caranica, Horia Cucu, Andi Buzo, and Georg Pelz. “Synthetic Benchmark for Data-Driven Pre-Si Analogue Circuit Verification.” Electronics 13, no. 13 (2024): 2600. ISI IF 2.6 (Q2).

2023

  • Gaita, A., E. David, A. Buzo, M. Grigore, C. Burileanu, H. Cucu, and G. Pelz. “Convolutional neural network model used for aiding IC analog/mixed signal verification.” UPB Sci. Bull. Ser. C Electr. Eng. Comput. Sci. Politech. Univ. Buchar 85 (2023): 151-162. ISSN 2286-3540. ISI WOS:001015488500009
  • Jain, Rishabh, Andrei Barcovschi, Mariam Yahayah Yiwere, Dan Bigioi, Peter Corcoran, and Horia Cucu. “A wav2vec2-based experimental study on self-supervised learning methods to improve child speech recognition.” IEEE Access 11 (2023): 46938-46948. ISI IF 3.4 (Q2). ISSN 2169-3536. ISI WOS:001010118900001
  • Nicolae, G., H. Cucu, C. Burileanu, A. Buzo, C. Feuerbaum, and G. Pelz. “Automatic design optimization of microelectronic power switches.” Scientific Bulletin of University Politehnica Bucharest 85, no. 1 (2023). ISSN 2286-3540. ISI WOS:000957721700001
  • Yiwere, Mariam Y., Andrei Barcovschi, Rishabh Jain, Horia Cucu, and Peter Corcoran. “Augmentation Techniques for Adult-Speech to Generate Child-Like Speech Data Samples at Scale.” IEEE Access 11 (2023): 109066-109081. ISI IF 3.4 (Q2). ISSN 2169-3536. ISI WOS:001084546800001

2022

2021

2020

2019

2008 – 2018

Conference papers

2023

  • Jain, Rishabh, Andrei Barcovschi, Mariam Yiwere, Peter Corcoran, and Horia Cucu. “Adaptation of Whisper models to child speech recognition.” In Proc. INTERSPEECH 2023, pp. 5242-5246. ISSN 2308-457X. ISI WOS:001186650305079
  • Nicolae, Georgian, Catalin Visan, Dan Curavale, Mihai Boldeanu, Horia Cucu, Andi Buzo, and Georg Pelz. “A Study on Initial Population Sampling for Multi-Objective Optimization based on Differential Evolution and Bayesian Inference.” In 2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 128-132. IEEE, 2023. ISSN 2286-3540. ISI WOS:000957721700001
  • Pascu, Octavian, Catalin Visan, Georgian Nicolae, Mihai Boldeanu, Horia Cucu, Cristian Diaconu, Andi Buzo, and Georg Pelz. “Efficient Multi-Objective Optimization for PVT Variation-Aware Circuit Sizing Using Surrogate Models and Smart Corner Sampling.” In 2023 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED), pp. 1-6. IEEE, 2023. ISBN 979-8-3503-1175-4. ISI WOS:001073659300049
  • Pîrlogeanu, Gabriel, Dan Oneață, Alexandru-Lucian Georgescu, and Horia Cucu. “The SpeeD–ZevoTech submission at DISPLACE 2023.” In Proc. INTERSPEECH 2023, pp. 3572-3576, ISSN 2308-457X. ISI WOS:001186650303146
  • Manolache, Cristian, Cristina Andronache, Alexandru Caranica, Horia Cucu, Andi Buzo, Cristian Diaconu, and Georg Pelz. “Adaptive Planning Search Algorithm for Analog Circuit Verification.” In 2023 International Semiconductor Conference (CAS), pp. 81-84. IEEE, 2023. ISSN 2377-0678
  • Manolache, Cristian, Cristina Maria Andronache, Alexandru Caranica, Horia Cucu, Andi Buzo, Cristian Vasile Diaconu, and Georg Pelz. “Applying Multi-objective Acquisition Function Ensemble for a candidate proposal algorithm.” In 2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 116-121. IEEE, 2023.
  • Vişan, Cătălin, Michael Sieberer, and Horia Cucu. “Designer-like Automated Circuit Sizing for Multiloop LDO.” In 2023 International Semiconductor Conference (CAS), pp. 103-106. IEEE, 2023. ISSN 2377-0678
  • Nicolae, Georgian, Catalin Visan, Dan Curavale, Mihai Boldeanu, Horia Cucu, Andi Buzo, and Georg Pelz. “A Study on Initial Population Sampling for Multi-Objective Optimization based on Differential Evolution and Bayesian Inference.” In 2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 128-132. IEEE, 2023.

2022

  • Gaiță, Andrei, Emilian David, Andi Buzo, Horia Cucu, and Georg Pelz. “Waveform clustering based on dynamic time warping used in analog IC verification.” In 2022 International Symposium ELMAR, pp. 49-52. IEEE, 2022.
  • Gaiță, Andrei, Emilian David, Andi Buzo, Horia Cucu, and Georg Pelz. “A machine learning based wafer test ranking for root cause analysis.” In 2022 International Symposium ELMAR, pp. 45-48. IEEE, 2022.
  • Manolache, Cristian, Alexandru Caranica, Marius Stănescu, Horia Cucu, Andi Buzo, Cristian Diaconu, and Georg Pelz. “Advanced operating conditions search applied in analog circuit verification.” In 2022 18th International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD), pp. 1-4. IEEE, 2022.
  • Manolache, Cristian, Alexandru Caranica, Horia Cucu, Andi Buzo, Cristian Diaconu, and Georg Pelz. “Enhanced candidate selection algorithm for analog circuit verification.” In 2022 International Semiconductor Conference (CAS), pp. 137-140. IEEE, 2022.
  • Manolache, Cristian, Mihai Boldeanu, Camelia Talianu, and Horia Cucu. “Unsupervised deep learning models for aerosol layers segmentation.” In 2022 14th International Conference on Communications (COMM), pp. 1-6. IEEE, 2022.
  • Nicolae, Georgian, Andi Buzo, Horia Cucu, Corneliu Burileanu, and Georg Pelz. “Manufacturing Variation Estimation of On Resistance in Power Semiconductors.” In 2022 18th International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD), pp. 1-4. IEEE, 2022.
  • Pascu, Octavian, Cătălin Visan, Marius Stănescu, Horia Cucu, Cristian Diaconu, Andi Buzo, and Georg Pelz. “Efficient Modeling of PVT Variation for Mixed-Signal Circuit Sizing.” In 2022 International Semiconductor Conference (CAS), pp. 105-108. IEEE, 2022.

2021

  • Boldeanu, Mihai, Cristina Marin, Dragos Ene, Luminiţa Marmureanu, Horia Cucu, and Corneliu Burileanu. “MARS: the First Romanian Pollen Dataset using a Rapid-E Particle Analyzer.” In 2021 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 145-150. IEEE, 2021.
  • Vişan, Cătălin, Octavian Pascu, Marius Stănescu, Horia Cucu, Cristian Diaconu, Andi Buzo, and Georg Pelz. “Versatility and Population Diversity of Evolutionary Algorithms in Automated Circuit Sizing Applications.” In 2021 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 68-73. IEEE, 2021.
  • Georgescu, Alexandru-Lucian, Horia Cucu, and Corneliu Burileanu. “Improvements of SpeeD’s Romanian ASR system during ReTeRom project.” In 2021 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 177-182. IEEE, 2021.
  • Oneaţă, Dan, Adriana Stan, and Horia Cucu. “Speaker disentanglement in video-to-speech conversion.” In 2021 29th European Signal Processing Conference (EUSIPCO), pp. 46-50. IEEE, 2021.
  • Dan Oneață, Lucian Georgescu, Horia Cucu, Dragoş Burileanu, and Corneliu Burileanu. “Revisiting SincNet: An Evaluation of Feature and Network Hyperparameters for Speaker Recognition.” In 2020 28th European Signal Processing Conference (EUSIPCO), pp. 1-5. IEEE, 2021.
  • Alexandru-Lucian Georgescu, Cristian Manolache, Dan Oneaţă, Horia Cucu, and Corneliu Burileanu. “Data-Filtering Methods for Self-Training of Automatic Speech Recognition Systems.” In 2021 IEEE Spoken Language Technology Workshop (SLT), pp. 1-7. IEEE, 2021.
  • Dan Oneaţă, Alexandru Caranica, Adriana Stan, and Horia Cucu. “An evaluation of word-level confidence estimation for end-to-end automatic speech recognition.” In 2021 IEEE Spoken Language Technology Workshop (SLT), pp. 258-265. IEEE, 2021.
  • Stănescu, Marius, Cătălin Vişan, Gabriel Sandu, Horia Cucu, Cristian Diaconu, Andi Buzo, and Georg Pelz. “Multi-Objective Optimization Algorithms for Automated Circuit Sizing of Analog/Mixed-Signal Circuits.” In 2021 International Semiconductor Conference (CAS), pp. 117-120. IEEE, 2021.
  • Nicolae, G., A. Buzo, C. Feuerbaum, C. V. Diaconu, H. Cucu, G. Pelz, and C. Burileanu. “Metamodel-based prediction of On Resistance for microelectronic power switches.” In 2021 IEEE Electrical Design of Advanced Packaging and Systems (EDAPS), pp. 1-3. IEEE, 2021.

2020

  • Alexandru-Lucian Georgescu, Horia Cucu, Andi Buzo, Corneliu Burileanu, “RSC: A Romanian Read Speech Corpus for Automatic Speech Recognition,” in the Proceedings of The 12th Language Resources and Evaluation Conference (LREC), pp. 6606-6612, 2020, Marseille, France.
  • Dan Oneaţă, Alexandru-Lucian Georgescu, Horia Cucu, Dragoș Burileanu, Corneliu Burileanu, “Revisiting SincNet: An Evaluation of Feature and Network Hyperparameters for Speaker Recognition,” in the Proceedings of the 28th European Signal Processing Conference (EUSIPCO), Amsterdam, The Netherlands, 2020.
  • Cristian Manolache, Alexandru-Lucian Georgescu, Alexandru Caranica, Horia Cucu, “Automatic Annotation of Speech Corpora using Approximate Transcripts,” in the Proceedings of the 43rd International Conference on Telecommunications and Signal Processing (TSP), 2020, Milano, Italy.
  • Manolache, Cristian, Alexandru-Lucian Georgescu, Horia Cucu, Verginica Barbu Mititelu, and Corneliu Burileanu. “Improved text normalization and language models for SpeeD’s Automatic Speech Recognition System.” In the Proceedings of the 13th International Conference “Linguistic Resources and Tools for Processing the Romanian Language”, ConsILR 2020, Bucharest, Romania.
  • Georgian Nicolae, Cristian Boianceanu, Andi Buzo, Cristian Vasile Diaconu, Horia Cucu, Georg Pelz, and Corneliu Burileanu, “Automatic Parameter Tuning in Finite Element Analysis of Semiconductor Packages,” In 2020 International Semiconductor Conference (CAS), pp. 41-44. IEEE, 2020.

2019

  • Dan Oneaţă, Horia Cucu, “Kite: Automatic Speech Recognition for Unmanned Aerial Vehicles,” in the Proceedings of the 20th Annual Conference of the International Speech Communication Association (Interspeech), pp. 2998-3002, 2019, Graz, Austria.
  • Florin Iordache, Alexandru-Lucian Georgescu, Dan Oneaţă, Horia Cucu, “Romanian Automatic Diacritics Restoration Challenge,” in the Proceedings of the 14th International Conference on Linguistics Resources and Tools for Natural Language Processing, pp. 65-74, 2019, Cluj-Napoca, Romania.
  • Dan Oneaţă, Cosmin George Alexandru, Marius Stănescu, Octavian Pascu, Alexandru Magan, Adrian Postelnicu, and Horia Cucu, “The Quo Vadis submission at Traffic4cast 2019,” arXiv preprint arXiv:1910.12363 (2019).
  • Ciprian Pop, Andi Buzo, Cristian-Vasile Diaconu, Georg Pelz, Horia Cucu, Corneliu Burileanu, “Application-Aware Lifetime Model for Power Devices based on Electro-Thermal Simulation,” in the Proceedings of the 42nd International Semiconductor Conference, 2019, Sinaia, Romania.
  • Cristian Manolache, Horia Cucu, Corneliu Burileanu, “Lemma-based Dynamic Time Warping Search for Keyword Spotting Applications in Romanian,” in the Proceedings of the 10th Conference on Speech Technology and Human-Computer Dialogue (SpeD), 2019, Timișoara, Romania.
  • Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu, “Kaldi-based DNN architectures for speech recognition in Romanian,” in the Proceedings of the 10th Conference on Speech Technology and Human-Computer Dialogue (SpeD), 2019, Timișoara, Romania.
  • Rodica Ileana Tuduce, Mircea Sorin Rusu, Horia Cucu, Corneliu Burileanu, “Automated Baby Cry Classification on a Hospital-Acquired Baby Cry Database,” in the Proceedings of the 42nd International Conference on Telecommunications and Signal Processing (TSP), pp. 343-346, 2019, Budapest, Hungary.
  • Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu, “Progress on automatic annotation of speech corpora using complementary ASR systems,”  in the Proceedings of the 42nd International Conference on Telecommunications and Signal Processing (TSP), pp. 571-574, 2019, Budapest, Hungary.

2018

  • Alexandru-Lucian Georgescu, Horia Cucu, “Automatic annotation of speech corpora using complementary GMM and DNN acoustic models,”  in the Proceedings of the 41st International Conference on Telecommunications and Signal Processing (TSP), 2018, Athens, Greece.
  • Rodica Ileana Tuduce, Horia Cucu, Corneliu Burileanu, “Why is my Baby Crying? An in-depth Analysis of Paralinguistic Features and Classical Machine Learning Algorithms for Baby Cry Classification,” in the Proceedings of the 41st International Conference on Telecommunications and Signal Processing (TSP), 2018, Athens, Greece.
  • Alexandru-Lucian Georgescu, Horia Cucu, “GMM-UBM modeling for speaker recognition on a Romanian large speech corpora,” in the Proceedings of the 12th Romanian International Conference on Communications (COMM), 2018, Bucharest, Romania.
  • Ciprian Pop, Andi Buzo, Georg Pelz, Horia Cucu, Corneliu Burileanu, “Methodology for Determining the Influencing Factors of the Power Devices Lifetime Variation,” in the Proceedings of the 23rd IEEE European Test Symposium (ETS), 2018, 2p, Bremen, Germany.
  • Ana Neacșu, Cristina Ionescu, Bianca Bănică, Claudiu Anegroaiei, Marian Bănică, Horia Cucu, “Using EMG-based Armband to Control Different Robotic Systems,” International Conference on Automation, Quality and Testing, Robotics, AQTR 2018, Cluj-Napoca, Romania.
  • Alexandru Caranica, Lucian Georgescu, Alexandru Vulpe, Horia Cucu, “Multilingual Low-Resourced Prototype System for Voice-Controlled Intelligent Building Applications,” in Rocha Á., Adeli H., Reis L., Costanzo S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST’18 2018. Advances in Intelligent Systems and Computing, vol 747. Springer, Cham.
  • Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu, “Comparison of i-vector and GMM-UBM speaker recognition on a Romanian large speech corpus,” in the Proceedings of the 13th International Conference “Linguistic Resources and Tools for Processing the Romanian Language”, pp. 25-32, ConsILR 2018, Iaşi, Romania.

2017

  • Valentin Andrei, Horia Cucu, Corneliu Burileanu, “Detecting overlapped speech on short timeframes using deep learning,” in the Proceedings of the 18th Annual Conference of the International Speech Communication Association (Interspeech), Stockholm, Sweden, 2017, pp. 1198-1202.
  • Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu, “SpeeD’s DNN Approach to Romanian Speech Recognition,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 8p, ISBN 978-1-5090-6496-0.
  • Alexandru Caranica, Horia Cucu, Corneliu Burileanu, François Portet, Michel Vacher, “Speech Recognition Results for Voice-controlled Assistive Applications,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 8p, ISBN 978-1-5090-6496-0.
  • Gheorghe Pop, Dragoş Drăghicescu, Dragoş Burileanu, Horia Cucu, Corneliu Burileanu, “Fast Method for ENF Database Build and Search,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 6p, ISBN 978-1-5090-6496-0.
  • Ștefan-Stelian Diaconescu, Monica-Mihaela Rizea, Mihaela Ionescu, Andrei Mincă, Liviu Dorobanțu, Ștefan Fulea, Monica Rădulescu, Horia Cucu, Dragoș Burileanu, “Building a Representative Audio Base of Syllables for Romanian Language,” in the Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2017, 10p, ISBN 978-1-5090-6496-0.
  • Elena-Diana Şandru, Andi Buzo, Horia Cucu, Corneliu Burileanu, “Recent Experiments and Findings in Baby Cry Classification,” in the Proceedings of the 3rd EAI International Conference on Future Access Enablers of Ubiquitous and Intelligent Infrastructures (FABULOUS), Bucharest, 2017, pp. 253-260.
  • Ana Antonia Neacşu, Corneliu Burileanu, Horia Cucu, “Autonomous System for Performing Dexterous, Human-Level Manipulation Tasks as Response to External Stimuli in Real Time,” in the Proceedings of the 3rd EAI International Conference on Future Access Enablers of Ubiquitous and Intelligent Infrastructures (FABULOUS), Bucharest, 2017, pp. 246-252.
  • Mădălin Frunzete, Horia Cucu, “Observability Coefficient for 2D Dynamical Systems,” in the Proceedings of the Signal Processing Algorithms, Architectures, Arrangements, and Applications, Poznan, Poland, 2017, 4p.

2016

  • Tiberiu Boros, Stefan Daniel Dumitrescu, Horia Cucu, “Voice Controlled Home Automation System”, in the Proceedings of CONSILR 2016, Romania, 2016.
  • Ioana-Alina Bănică, Horia Cucu, Andi Buzo, Dragoş Burileanu and Corneliu Burileanu, “Automatic Methods for Infant Cry Classification,” in the Proceedings of the 11th International Conference on Communications (COMM), Bucharest, Romania, 2016, pp. 51-54.
  • Alexandru Caranica, Horia Cucu, Andi Buzo, “Exploring an Unsupervised, Language Independent, Spoken Document Retrieval System,” in the Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing (CBMI), Buchareet, Romania, 2016.
  • Ioana-Alina Bănică, Horia Cucu, Andi Buzo, Dragoş Burileanu and Corneliu Burileanu, “Baby Cry Recognition in Real-World Conditions,” in the Proceedings of the 39th International Conference on Telecommunications and Signal Processing (TSP), Vienna, Austria, 2016, pp. 315-318, ISSN 1805-5435.

2015

  • Horia Cucu, Alexandru Caranica, Andi Buzo, Corneliu Burileanu, “On transcribing informally-pronounced numbers in Romanian speech,” in the Proceedings of the 38th International Conference on Telecommunications and Signal Processing (TSP), Prague, Czech Republic, 2015, pp. 372-376.
  • Valentin Andrei, Horia Cucu, Andi Buzo, Corneliu Burileanu, “Counting competing speakers in a time frame – human versus computer,” in the Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015, pp. 3999-4003, ISSN 1990-9770.
  • Horia Cucu, Andi Buzo, Corneliu Burileanu, “ASR errors in transcribing informal pronunciations of Romanian numbers,” in the Proceedings of the 2nd Workshop on Errors by Humans and Machines in multimedia, multimodal and multilingual data processing (ERRARE), Sinaia, Romania.
  • Bogdan Luduşan, Alexandru Caranica, Horia Cucu, Andi Buzo, Corneliu Burileanu, Emmanuel Dupoux, “Exploring Multi-Language Resources for Unsupervised Spoken Term Discovery,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 17-22, ISBN 978-1-4673-7559-7.
  • Gheorghe Pop, Alexandru Caranica, Horia Cucu, Dragoş Burileanu, “Sound Event Recognition in Smart Environments,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 45-50, ISBN 978-1-4673-7559-7.
  • Valentin Andrei, Horia Cucu, Andi Buzo, Corneliu Burileanu, “Estimating Competing Speaker Count for Blind Speech Source Separation,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 165-172, ISBN 978-1-4673-7559-7.
  • Mihai Dogariu, Horia Cucu, Andi Buzo, Dragoş Burileanu, Octavian Fratu, “Speech Database Acquisition for Assisted Living Environment Applications,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 191-196, ISBN 978-1-4673-7559-7.
  • Mihai Dogariu, Horia Cucu, Andi Buzo, Dragoş Burileanu, Octavian Fratu, “Speech Applications in the eWALL Project,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 197-204, ISBN 978-1-4673-7559-7.
  • Alexandru Caranica, Andi Buzo, Horia Cucu, Corneliu Burileanu, “SpeeD @ MediaEval 2015: Multilingual phone recognition approach to Query by example STD,” in the Proceedings of the MediaEval 2015 Multimedia Benchmark Workshop, Wurzen, Germany, 2015, ISSN: 1613-0073.

2014

2013

2012

2011

2010

List of citations

An up-to-date list of citations for my research work is provided by the Google Scholar Citations web-service. For convenience, a selection of the citations is also provided below.

2021

  • Abuelnaga, Ahmed, Mehdi Narimani, and Amir Sajjad Bahman. “A Review on IGBT Module Failure Modes and Lifetime Testing.” IEEE Access 9 (2021): 9643-9663.
  • Zhang, Yichi, Xinglai Ge, Yi Zhang, Dong Xie, Bo Yao, and Huimin Wang. “A Novel Three-Pulse Equivalent Power Loss Profile for Simplified Thermal Estimation.” IEEE Journal of Emerging and Selected Topics in Power Electronics (2021).

2020

  • Kovacs, Ingrid, Marina Țopa, Ciprian Pop, Elena-Diana Șandru, Andi Buzo, and Georg Pelz. “Correlating electrical and process parameters for yield detractors’ detection.” In 2020 International Symposium on Electronics and Telecommunications (ISETC), pp. 1-4. IEEE, 2020.
  • Avram, Andrei-Marius, P. A. I. Ş. Vasile, and Dan Tufis. “Towards a Romanian end-to-end automatic speech recognition based on Deepspeech2.” In Proc. Rom. Acad. Ser. A, vol. 21, pp. 395-402. 2020.
  • Naing, Hay Mar Soe, Risanuri Hidayat, Rudy Hartanto, and Yoshikazu Miyanaga. “A Front-End Technique for Automatic Noisy Speech Recognition.” In 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), pp. 49-54. IEEE, 2020.
  • Cheng, Longbiao, Xingwei Sun, Dingding Yao, Junfeng Li, and Yonghong Yan. “Estimation Reliability Function Assisted Sound Source Localization With Enhanced Steering Vector Phase Difference.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2020): 421-435.
  • Chang, Chun-Min, Huan Yu Chen, Hsiang-Chun Chen, and Chi-Chun Lee. “Sensing with Contexts: Crying Reason Classification for Infant Care Center with Environmental Fusion.” In 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 314-318. IEEE, 2020.
  • Chen, Wenwan. “AmbianceCount: An Objective Social Ambiance Measure from Unconstrained Day-long Audio Recordings.” PhD Thesis, Rice University, 2020.
  • Ruseti, Stefan, Teodor-Mihai Cotet, and Mihai Dascalu. “Romanian Diacritics Restoration Using Recurrent Neural Networks.” arXiv preprint arXiv:2009.02743 (2020).
  • Prongnuch, Sethakarn, and Suchada Sitjongsataporn. “Thai voice-controlled analysis for car parking Assistance in System-on-Chip Architecture.” Advances in Technology Innovation 5, no. 4 (2020): 203.
  • Ji, Chunyan, Sunitha Basodi, Xueli Xiao, and Yi Pan. “Infant Sound Classification on Multi-stage CNNs with Hybrid Features and Prior Knowledge.” In International Conference on AI and Mobile Services, pp. 3-16. Springer, Cham, 2020.
  • Li, Zhen, Jiao Zhang, Mengwan Li, Jizhuo Huang, and Xiangyu Wang. “A Review of Smart Design Based on Interactive Experience in Building Systems.” Sustainability 12, no. 17 (2020): 6760.
  • Ogawa, Atsunori, Naohiro Tawara, and Marc Delcroix. “Language Model Data Augmentation Based on Text Domain Transfer.” Proc. Interspeech 2020 (2020): 4926-4930.
  • Liserre, Marco, Giampaolo Buticchi, Jose Ignacio Leon, Abraham Marquez Alcaide, Vivek Raveendran, Youngjong Ko, Markus Andresen, Vito Giuseppe Monopoli, and Leopoldo Franquelo. “Power routing: a new paradigm for maintenance scheduling.” IEEE Industrial Electronics Magazine 14, no. 3 (2020): 33-45.
  • Kubis, Marek, Zygmunt Vetulani, Mikołaj Wypych, and Tomasz Ziętkiewicz. “Open Challenge for Correcting Errors of Speech Recognition Systems.” arXiv preprint arXiv:2001.03041 (2020).
  • Diwan, Anuj, and Preethi Jyothi. “Reduce and Reconstruct: Improving Low-resource End-to-end ASR Via Reconstruction Using Reduced Vocabularies.” arXiv preprint arXiv:2010.09322 (2020).
  • Sharma, Atal, and Deepti Malhotra. “Speech recognition based IICC-Intelligent Infant Cry Classifier.” In 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT), pp. 992-998. IEEE, 2020.
  • Raj, Desh, Zili Huang, and Sanjeev Khudanpur. “Multi-class spectral clustering with overlaps for speaker diarization.” In 2021 IEEE Spoken Language Technology Workshop (SLT), pp. 582-589. IEEE, 2021.
  • Raj, Desh, Leibny Paola Garcia-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, and Sanjeev Khudanpur. “DOVER-Lap: A method for combining overlap-aware diarization outputs.” In 2021 IEEE Spoken Language Technology Workshop (SLT), pp. 881-888. IEEE, 2021.
  • Cristea, Dan, Ionuț Pistol, Șerban Boghiu, Anca-Diana Bibiri, Daniela Gîfu, Andrei Scutelnicu, Mihaela Plamada-Onofrei, Diana Trandabat, and George Bugeag. “CoBiLiRo: A research platform for bimodal corpora.” In Proceedings of the 1st International Workshop on Language Technology Platforms, pp. 22-27. 2020.
  • Sri, Karra Venkata Lakshmi, Mayuka Srinivasan, Radhika Rajeev Nair, K. Jeeva Priya, and Deepa Gupta. “Kaldi recipe in Hindi for word level recognition and phoneme level transcription.” Procedia Computer Science 171 (2020): 2476-2485.
  • Lim, Yeonsoo, Deokjin Seo, Jeong-sik Park, and Yuchul Jung. “An automatic data construction approach for Korean speech command recognition.” Journal of the Korea Society of Computer and Information 24, no. 12 (2019): 17-24.
  • Deekshitha, G., and Leena Mary. “Multilingual spoken term detection: a review.” International Journal of Speech Technology 23, no. 3 (2020): 653-667.
  • Cornell, Samuele, Maurizio Omologo, Stefano Squartini, and Emmanuel Vincent. “Detecting and counting overlapping speakers in distant speech scenarios.” In INTERSPEECH 2020. 2020.
  • Hutin, Mathilde, Oana Niculescu, Ioana Vasilescu, Lori Lamel, and Martine Adda-Decker. “Lenition and fortition of stop codas in Romanian.” In SLTU-CCURL. 2020.
  • Rhinehart, Tessa A., Lauren M. Chronister, Trieste Devlin, and Justin Kitzes. “Acoustic localization of terrestrial wildlife: Current practices and future opportunities.” Ecology and Evolution 10, no. 13 (2020): 6794-6818.
  • Ionescu, Bogdan, Marian Ghenescu, Florin Răstoceanu, Răzvan Roman, and Marian Buric. “Artificial intelligence fights crime and terrorism at a new level.” IEEE MultiMedia 27, no. 2 (2020): 55-61.
  • Bredin, Hervé, and Leibny-Paola Garcia-Perera. “Overlap-aware diarization: Resegmentation using neural end-to-end overlapped speech detection.” In IEEE International Conference on Acoustics, Speech, and Signal Processing. 2020.
  • Jitaru, Andrei Cosmin, Şeila Abdulamit, and Bogdan Ionescu. “LRRo: a lip reading data set for the under-resourced romanian language.” In Proceedings of the 11th ACM Multimedia Systems Conference, pp. 267-272. 2020.
  • Zaharia, George-Eduard, Andrei-Marius Avram, Dumitru-Clementin Cercel, and Traian Rebedea. “Exploring the power of Romanian BERT for dialect identification.” In Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, pp. 232-241. 2020.
  • Zheng, Qi, and Chunhui Zhao. “Gaussian Mixture Model Based Fault Diagnosis for Elevator Overspeed and Automatic Reset.” In 2020 39th Chinese Control Conference (CCC), pp. 4210-4215. IEEE, 2020.
  • Mubin, Siti Azreena, Annie Toh Mei Yi, Aida Zamnah Zainal Abidin, and Matthew Wee Ann Poh. “Designing Digital Interaction for Ageing People.” Fusion 2020 (2020): 1.
  • Feng, Chenwei, and Huimin Xie. “The Smart Home System Based on Voice Control.” In Advances in 3D Image and Graphics Representation, Analysis, Computing and Information Technology, pp. 383-392. Springer, Singapore, 2020.
  • Hanani, Ajib, and Mokhamad Amin Hariyadi. “Smart Home Berbasis IoT Menggunakan Suara Pada Google Assistant.” Jurnal Ilmiah Teknologi Informasi Asia 14, no. 1 (2020): 49-56.
  • Jubjainai, Phumpichet, Sittichai Pathomwong, Poom Siripujaka, Na Chiengmai, Anek Chaiboot, and Paramote Wardkein. “Chainsaw location finding based on travelling of sound wave in air and ground.” In IOP Conference Series: Earth and Environmental Science, vol. 467, no. 1, p. 012065. IOP Publishing, 2020.
  • Liao, Junwei, Sefik Emre Eskimez, Liyang Lu, Yu Shi, Ming Gong, Linjun Shou, Hong Qu, and Michael Zeng. “Improving readability for automatic speech recognition transcription.” arXiv preprint arXiv:2004.04438 (2020).
  • Nanyan, Shen. “An Intelligent Evaluation Model of English Pronunciation Quality Based on Sphinx.” In 2020 12th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA), pp. 1012-1016. IEEE, 2020.
  • Yousefi, Midia, and John HL Hansen. “Frame-based overlapping speech detection using convolutional neural networks.” In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6744-6748. IEEE, 2020.
  • Stöter, Fabian-Robert. “Separation and Count Estimation for Audio Sources Overlapping in Time and Frequency.” PhD Thesis, 2020.
  • Tanveer, Maham. “Classification of anomalous machine sounds using i-vectors.” PhD Thesis, Georgia Institute of Technology, 2020.
  • Tejaswini, S., N. Sriraam, and G. C. M. Pradeep. “Identification of High Risk and Low Risk Preterm Neonates in NICU: Pattern Recognition Approach.” In Biomedical and Clinical Engineering for Healthcare Advancement, pp. 119-140. IGI Global, 2020.

2019

  • Grollmisch, Sascha, Estefanıa Cano, Fernando Mora Ángel, and Gustavo López Gil. “Ensemble size classification in Colombian Andean string music recordings.” In 14th International Symposium on Computer Music Multidisciplinary Research, p. 565. 2019.
  • Zhang, Wangyou, Man Sun, Lan Wang, and Yanmin Qian. “End-to-End Overlapped Speech Detection and Speaker Counting with Raw Waveform.” In 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 660-666. IEEE, 2019.
  • Nuţu, Maria, Beáta Lőrincz, and Adriana Stan. “Deep Learning for Automatic Diacritics Restoration in Romanian.” In 2019 IEEE 15th International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 235-240. IEEE, 2019.
  • Wolf, Dennis, Daniel Besserer, Karolina Sejunaite, Anja Schuler, Matthias Riepe, and Enrico Rukzio. “cARe: an augmented reality support system for geriatric inpatients with mild cognitive impairment.” In Proceedings of the 18th International Conference on Mobile and Ubiquitous Multimedia, pp. 1-11. 2019.
  • Punjabi, Surabhi, Harish Arsikere, and Sri Garimella. “Language Model Bootstrapping Using Neural Machine Translation for Conversational Speech Recognition.” In 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 487-493. IEEE, 2019.
  • Stan, Adriana. “Input Encoding for Sequence-to-Sequence Learning of Romanian Grapheme-to-Phoneme Conversion.” In 2019 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 1-6. IEEE, 2019.
  • Zhang, Chao, Wei Chen, and Chen Xu. “Depthwise Separable Convolutions for Short Utterance Speaker Identification.” In 2019 IEEE 8th Joint International Information Technology and Artificial Intelligence Conference (ITAIC), pp. 962-966. IEEE, 2019.
  • Poongothai, M., K. Sundar, and B. Vinayak Prabhu. “Implementation of IoT based Intelligent Voice Controlled Laboratory using Google Assistant.” International Journal of Computer Applications 975: 8887.
  • Butler, Janine, Brian Trager, and Byron Behm. “Exploration of Automatic Speech Recognition for Deaf and Hard of Hearing Students in Higher Education Classes.” In The 21st International ACM SIGACCESS Conference on Computers and Accessibility, pp. 32-42. 2019.
  • Kahrizi, Mohammad Rasoul, and Seyed Jahanshah Kabudian. “Long-Term Spectral Pseudo-Entropy (LTSPE): A New Robust Feature for Speech Activity Detection.” Information Systems & Telecommunication (2018): 204.
  • Tejedor, Javier, Doroteo T. Toledano, Paula Lopez-Otero, Laura Docio-Fernandez, Ana R. Montalvo, Jose M. Ramirez, Mikel Peñagarikano, and Luis Javier Rodriguez-Fuentes. “ALBAYZIN 2018 spoken term detection evaluation: a multi-domain international evaluation in Spanish.” EURASIP Journal on Audio, Speech, and Music Processing 2019, no. 1 (2019): 16.
  • Tsai, Wen-Chung, You-Jyun Shih, and Nien-Ting Huang. “Hardware-Accelerated, Short-Term Processing Voice and Nonvoice Sound Recognitions for Electric Equipment Control.” Electronics 8, no. 9 (2019): 924.
  • Pop, Gheorghe, and Dragoș Burileanu. “Speech enhancement for forensic purposes.” UPB Scientific Bulletin, Series C 81, no. 3 (2019): 41-52.
  • Kunešová, Marie, Marek Hrúz, Zbyněk Zajíc, and Vlasta Radová. “Detection of Overlapping Speech for the Purposes of Speaker Diarization.” In International Conference on Speech and Computer, pp. 247-257. Springer, Cham, 2019.
  • Gunawan, D., A. Amalia, and O. N. Maringga. “Building the Application to Identify Incorrect Capital Letters Writing in Bahasa Indonesia.” In Journal of Physics: Conference Series, vol. 1235, no. 1, p. 012108. IOP Publishing, 2019.
  • Wu, Kun, Chao Zhang, Xiaopei Wu, De Wu, and Xia Niu. “Research on Acoustic Feature Extraction of Crying for Early Screening of Children with Autism.” In 2019 34rd Youth Academic Annual Conference of Chinese Association of Automation (YAC), pp. 290-295. IEEE, 2019.
  • Lieskovska, Eva, Maros Jakubec, and Roman Jarina. “Acoustic surveillance system for children’s emotion detection.” In 2019 42nd International Conference on Telecommunications and Signal Processing (TSP), pp. 525-528. IEEE, 2019.
  • Mati, Diellza Nagavci, Jaumin Ajdari, Bujar Raufi, Mentor Hamiti, and Besnik Selimi. “A Systematic Mapping Study of Language Features Identification from Large Text Collection.” In 2019 8th Mediterranean Conference on Embedded Computing (MECO), pp. 1-5. IEEE, 2019.
  • Rusu, Alexandru-George, Radu-Sebastian Marinescu, Corneliu Burileanu, and Dumitru Bica. “Evaluation of Simultaneous Speech Detection Based on MFCC-DTW with Two-Stage Normalization.” International Journal of Advances in Telecommunications, Electrotechnics, Signals and Systems 8, no. 2 (2019): 29-34.
  • Koložvari, Andrej, Radovan Stojanović, Anton Zupan, Eugene Semenkin, Vladimir Stanovov, Davorin Kofjač, and Andrej Škraba. “Speech-recognition cloud harvesting for improving the navigation of cyber-physical wheelchairs for disabled persons.” Microprocessors and Microsystems 69 (2019): 179-187.
  • Ibrahim, Zein Al Abidin, Marwa Saab, and Ihab Sbeity. “VideoToVecs: a new video representation based on deep learning techniques for video classification and clustering.” SN Applied Sciences 1, no. 6 (2019): 560.
  • Agustin, Eva Inaiyah, Riky Tri Yunardi, and Aji Akbar Firdaus. “Voice recognition system for controlling electrical appliances in smart hospital room.” Telkomnika 17, no. 2 (2019): 965-972.
  • Ibrahim, Zein Al Abidin, Siba Haidar, and Ihab Sbeity. “Large-scale Text-based Video Classification using Contextual Features.” European Journal of Electrical Engineering and Computer Science 3, no. 2 (2019).
  • Shumailov, Ilia, Laurent Simon, Jeff Yan, and Ross Anderson. “Hearing your touch: A new acoustic side channel on smartphones.” arXiv preprint arXiv:1903.11137 (2019).
  • Iancu, Bogdan. “Evaluating Google Speech-to-Text API’s Performance for Romanian e-Learning Resources.” Informatica Economica 23, no. 1 (2019).
  • Padois, Thomas, Olivier Doutres, and Franck Sgard. “On the use of modified phase transform weighting functions for acoustic imaging with the generalized cross correlation.” The Journal of the Acoustical Society of America 145, no. 3 (2019): 1546-1555.
  • Ricossa, Davide, Enrico Baccaglini, Elvira Di Nardo, Emilia Parodi, and Riccardo Scopigno. “On the automatic audio analysis and classification of cry for infant pain assessment.” International Journal of Speech Technology 22, no. 1 (2019): 259-269.
  • Shivakumar, Prashanth Gurunath, Haoqi Li, Kevin Knight, and Panayiotis Georgiou. “Learning from past mistakes: improving automatic speech recognition output via noisy-clean phrase context modeling.” APSIPA Transactions on Signal and Information Processing 8 (2019).
  • Stöter, Fabian-Robert, Soumitro Chakrabarty, Bernd Edler, and Emanuël AP Habets. “CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning.” IEEE/ACM Transactions on Audio, Speech, and Language Processing 27, no. 2 (2019): 268-282.
  • Lopez-Otero, Paula, Javier Parapar, and Alvaro Barreiro. “Efficient query-by-example spoken document retrieval combining phone multigram representation and dynamic time warping.” Information Processing & Management 56, no. 1 (2019): 43-60.

2018

  • Tsai, Wen-Chung, Yu-Ruei Lian, Shih-Hung Hsu, Qi-Xun Zheng, Yu-Chen Su, and Jia-Xian Chen. “An implementation of voice recognition and control system for electric equipment.” In 2018 International Symposium on Computer, Consumer and Control (IS3C), pp. 356-359. IEEE, 2018.
  • Tufiş, Dan, and Dan Cristea. “A Bird’s-eye View of Language Processing Projects at the Romanian Academy.” In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). 2018.
  • Bekli, Zeid, William Ouda, “A performance measurement of a Speaker Verification system based on a variance in data collection for Gaussian Mixture Model and Universal Background Model.” Master Thesis, Malmo University, 2017 (scientific coordinator: Prof. Arezoo Sarkheyli Haegele).
  • Shruthi, S., G. Yashaswi, V. Shruti, and J. Manikandan. “Design and Evaluation of a Real-Time Speech Recognition System.” In 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 425-430. IEEE, 2018.
  • Galliakis, Michael, Christos Skourlas, Eleni Galiotou, and Ioannis Voyiatzis. “A low-cost smart home for the assistance of elderly persons and patients.” In Proceedings of the 22nd Pan-Hellenic Conference on Informatics, pp. 93-98. ACM, 2018.
  • Stan, Adriana, Mircea Giurgiu, “A Comparison Between Traditional Machine Learning Approaches and Deep Neural Networks for Text Processing in Romania.”  In 13th International Conference “Linguistic Resources and Tools for Processing the Romanian Language” (ConsILR), pp. 33-42. 2018.
  • Guo, Shuxiang, Zhi Wang, Jian Guo, Qiang Fu, and Nan Li. “Design of the Speech Control System for a Upper Limb Rehabilitation Robot Based on Wavelet De-noising.” In 2018 IEEE International Conference on Mechatronics and Automation (ICMA), pp. 2300-2305. IEEE, 2018.
  • Krstev, Cvetana, Ranka Stankovic, Dusko Vitas, “Knowledge and Rule-Based Diacritic Restoration in Serbian.” In 3rd International Conference Computational Linguistics in Bulgaria (CLIB), pp. 41-51. 2018.
  • Kazimirova, Evdokia, A. Belyaev. “Automatic detection of multi-speaker fragments with high time resolution.” In 19th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 1388-1392. 2018. doi:10.21437/interspeech.2018-1878.
  • Marinescu, Radu-Sebastian, Alexandru George Rusu, Corneliu Burileanu, and Dumitru Bica. “Simultaneous Speech Detection Based on MFCC-DTW with Two-Stage Normalization.” In 2018 41st International Conference on Telecommunications and Signal Processing (TSP), pp. 1-5. IEEE, 2018.
  • Franti, Eduard, Ioan Ispas, and Monica Dascalu. “Testing the Universal Baby Language Hypothesis-Automatic Infant Speech Recognition with CNNs.” In 2018 41st International Conference on Telecommunications and Signal Processing (TSP), pp. 1-4. IEEE, 2018.
  • Stoter, Fabian-Robert, Soumitro Chakrabarty, Bernd Edler, Emanuël AP Habets. “Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation.” In Acoustics, Speech and Signal Processing (ICASSP), 2018 IEEE International Conference on, pp. 436-440. IEEE, 2018.
  • Casebeer, Jonah, Hillol Sarker, Murtaza Dhuliawala, Nicholas Fay, Mary Pietrowicz, and Amar Das. “Verbal Protest Recognition in Children with Autism.” In Acoustics, Speech and Signal Processing (ICASSP), 2018 IEEE International Conference on, pp. 301-305. IEEE, 2018.
  • Pop, Gheorghe, Dragoş Burileanu, and Şerban Mihalache. “An Evaluative ENF-based Framework for Forensic Authentication of Digital Audio Recordings.” Proceedings of the Romanian Academy Series A-mathematics Physics Technical Sciences Information Science 19, No. 4 (2018): pp. 605-612.
  • Hsieh, Sung-Hsien, Chun-Shien Lu, and Soo-Chang Pei. “Fast computing position of maximum of circulant convolution.” Digital Signal Processing (2018): 83, pp. 83-97, ISSN 1051-2004, doi:10.1016/j.dsp.2018.08.009.
  • Vavrek, Jozef, Peter Viszlay, Martin Lojka, Jozef Juhár, and Matúš Pleva. “Weighted fast sequential DTW for multilingual audio Query-by-Example retrieval.” Journal of Intelligent Information Systems (2018): pp. 1-17.
  • Sahak, R., W. Mansor, Khuan Y. Lee, and A. Zabidi. “Performance of Principal Component Analysis and Orthogonal Least Square on Optimized Feature Set in Classifying Asphyxiated Infant Cry Using Support Vector Machine.” Indonesian Journal of Electrical Engineering and Computer Science, 9.1 (2018): pp. 139-145.
  • Tejedor, Javier, et al. “ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation.” EURASIP Journal on Audio, Speech, and Music Processing 2018.1 (2018): 2.
  • Polyakov, E. V., M. S. Mazhanov, A. Y. Rolich, L. S. Voskov, M. V. Kachalova, and S. V. Polyakov. “Investigation and development of the intelligent voice assistant for the Internet of Things using machine learning.” In Electronic and Networking Technologies (MWENT), 2018 Moscow Workshop on, pp. 1-5. IEEE, 2018.

2017

  • Torres, Rafael, Daniele Battaglino, and Ludovick Lepauloux. “Baby Cry Sound Detection: A Comparison of Hand Crafted Features and Deep Learning Approach.” In International Conference on Engineering Applications of Neural Networks, pp. 168-179. Springer, Cham, 2017.
  • Shen, Chia-Hao, Janet Y. Sung, and Hung-Yi Lee. “Language Transfer of Audio Word2Vec: Learning Audio Segment Representations without Target Language Data.” arXiv preprint arXiv:1707.06519 (2017).
  • Stan, Adriana, Florina Dinescu, Cristina Ţiple, Şerban Meza, Bogdan Orza, Magdalena Chirilă, and Mircea Giurgiu. “The SWARA speech corpus: A large parallel Romanian read speech dataset.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Toma, Ştefan-Adrian, Adriana Stan, Mihai-Lică Pura, and Traian Bârsan. “MaRePhoR—An open access machine-readable phonetic dictionary for Romanian.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Suciu, George, Ştefan-Adrian Toma, and Romulus Cheveresan. “Towards a continuous speech corpus for banking domain automatic speech recognition.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Dumitrescu, Stefan Daniel. “Cassandra smart-home system description.” In Speech Technology and Human-Computer Dialogue (SpeD), 2017 International Conference on, pp. 1-6. IEEE, 2017.
  • Boros, Tiberiu, Stefan Daniel Dumitrescu, and Sonia Pipa. “CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface.” EACL 2017 (2017): 33.
  • Rostamzadeh, Negar, “Video Scene Understanding: Semantic-based representation, Temporal Variation Modeling, Multi-Task Learning,” PhD Thesis, Universita degli Studi di Trento, Apr 2017 (scientific coordinator: prof. Nicu Sebe).
  • Tengtrairat, N., P. Parathai, and W. L. Woo. “Blind 2D signal direction for limited-sensor space using maximum likelihood estimation.” Asia-Pacific Journal of Science and Technology 22.2 (2017): 42-49.
  • Tengtrairat, Naruephorn, and Wai Lok Woo. “Blind 3D sound source direction using stereo microphones based on time-delay estimation and polar-pattern histogram.” Information Technology (INCIT), 2017 2nd International Conference on. IEEE, 2017.
  • Alaoui, EM Ismaili, and E. Ibn-Elhaj. “A comparative study of new HOS-based estimators for moving objects in noisy video sequence.” Signal, Image and Video Processing (2017): 1-8.
  • Korvel, Gražina, and Bożena Kostek. “Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System.” Archives of Acoustics 42, no. 3 (2017): pp. 375-383.
  • Masmoudi, Abir, et al. “Automatic speech recognition system for Tunisian dialect.” Language Resources and Evaluation (2017): 1-19.
  • Jiang, C., Fan, P., Liang, K., Wang, Z. “Complex sound recognition method based on cluster labels.” International Journal of Signal Processing, Image Processing and Pattern Recognition, vol. 10, iss. 3 (2017), pp. 41-52.
  • Duong, Long. “Natural language processing for resource-poor languages.” PhD Thesis, The University of Melbourne, Oct 2017 (scientific coordinators: A/Prof. Steven Bird, A/Prof. Trevor Cohn).
  • Cocioceanu, A., T. Ivănoaica, A. I. Nicolin, and M. C. Raportaru. “Computer-based statistical description of phonetical balance for Romanian utterances.” In International Conference on ICT Innovations, pp. 59-67. Springer, Cham, 2016.
  • Kapočiūtė-Dzikienė, Jurgita, Andrius Davidsonas, and Aušra Vidugirienė. “Character-Based Machine Learning vs. Language Modeling for Diacritics Restoration.” Information Technology And Control 46, no. 4 (2017): 508-520.

2016

  • Tejaswini, S., Natarajan Sriraam, and G. C. M. Pradeep. “Recognition of Infant Cries Using Wavelet Derived Mel Frequency Feature with SVM Classification.” In Circuits, Controls, Communications and Computing (I4C), 2016 International Conference on, 2016.
  • Sun, Q., and Zhao, X. “Speech enhancement based on maximum likelihood adaptive subspace estimation.” Revista De La Facultad De Ingenieria, vol. 31, iss. 9 (2016), pp. 48-59. doi:10.21311/002.31.9.06
  • Lopez-Otero, Paula, Laura Docio-Fernandez and Carmen Garcia-Mateo, “Better Phoneme Recognisers Lead to Better Phoneme Posteriorgrams for Search on Speech? An Experimental Analysis”, In Advances in Speech and Language Technologies for Iberian Languages, Lecture Notes in Computer Science, vol. 10077, pp 128-137, November 2016.
  • Koctúr, Tomáš, Ján Staš, and Jozef Juhár. “Unsupervised acoustic corpora building based on variable confidence measure thresholding.” ELMAR, 2016 International Symposium. 2016.
  • Gorin, Arseniy, Rasa Lileikyte, Guangpu Huang, Lori Lamel, Jean-Luc Gauvain, and Antoine Laurent. “Language Model Data Augmentation for Keyword Spotting in Low-Resourced Training Conditions.” In 17th Annual Conference of the International Speech Communication Association (INTERSPEECH). 2016.
  • Chen, Hongjie, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li. “Unsupervised bottleneck features for low-resource query-by-example spoken term detection.” In 17th Annual Conference of the International Speech Communication Association (INTERSPEECH). 2016.
  • Yu, Ling, Tian-shuang Qiu, and Ai-min Song. “A Time Delay Estimation Algorithm Based on the Weighted Correntropy Spectral Density.” Circuits, Systems, and Signal Processing (2016): 1-14.
  • Egorova, Ekaterina, and Jordi Luque Serrano, “Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data,” Procedia Computer Science, vol. 81 (2016): 114-120, doi:10.1016/j.procs.2016.04.038.
  • Koctúr, Tomas, Peter Viszlay, Jan Staš, Martin Lojka and Jozef Juhár. “Unsupervised speech transcription and alignment based on two complementary ASR systems.” In 2016 26th International Conference Radioelektronika, pp. 358-362. IEEE, 2016.

2015

  • Mao, H., and L. Zhang. “An improved accumulated cross-power spectrum phase method for time delay estimation.” In 2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), pp. 563-566. IEEE, 2015.
  • Voiron, Nicolas, “Structuration de bases multimédia pour une exploration visuelle,” PhD Thesis, Grenoble Alpes University, Dec 2015 (scientific coordinator: prof. Patrick Lambert).
  • Vasilescu, Ioana, Camille Dutrey, and Lori Lamel, “Large scale data based linguistic investigations using speech technology tools: The case of Romanian,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015.
  • Diaconescu, Ştefan-Stelian, Monica-Mihaela Rizea, Felicia-Carmen Codîrlaşu, Mihaela Ionescu, Monica Rădulescu, Andrei Mincă, Ştefan Fulea, “Methods for Automatic Generation of GRAALAN-based Phonetic Databases,” in the Proceedings of the 8th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, 2015, pp. 135-142, ISBN 978-1-4673-7559-7.
  • Lee, Lin-shan, James Glass, Hung-yi Lee, and Chun-an Chan. “Spoken Content Retrieval—Beyond Cascading Speech Recognition with Text Retrieval.” Audio, Speech, and Language Processing, IEEE/ACM Transactions on, vol. 23, no. 9 (2015): 1389-1420.
  • Segal, Natalia, Hélène Bonneau-Maynard, and François Yvon. “Traduire la parole: le cas des TED Talks.” Traitement automatique des langues (TAL) (2015).
  • Teodorescu, Horia-Nicolai. “Fuzzy Logic in Speech Technology-Introductory and Overviewing Glimpses.” Fifty Years of Fuzzy Logic and its Applications. Springer International Publishing, 2015. 581-608.
  • Karpov, Alexey and Vasilisa Verkhodanova. “Speech Technologies for Under-Resourced Languages of the World.” Voprosy Jazykoznanija, vol. 2015, no. 2 (2015): 117-135.
  • Choi, Junhwi, S. Ryu, K. Lee, G.G. Lee. “One-step error detection and correction approach for voice word processor.” IEICE Transactions on Information and Systems, vol. E98D, no. 8 (2015): 1517-1525.

2014

  • Metze, Florian, Xavier Anguera, Etienne Barnard, Marelie Davel, and Guillaume Gravier. “Language independent search in MediaEval’s Spoken Web Search task.” Computer Speech & Language 28, no. 5 (2014): 1066-1082.
  • Schiopu, Daniela, and Mihaela Oprea. “Using neural networks for a discriminant speech recognition system.” In Development and Application Systems (DAS), 2014 International Conference on, pp. 165-169. IEEE, 2014.
  • Pinnis, Mārcis, Ilze Auziņa, and Kārlis Goba. “Designing the Latvian Speech Recognition Corpus.” In Proceedings of the 9th edition of the Language Resources and Evaluation Conference (LREC’14). 2014.
  • Anguera, Xavier, Luis J. Rodriguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, and Mikel Penagarikano. “Query-by-Example Spoken Term Detection on Multilingual Unconstrained Speech.” In Fifteenth Annual Conference of the International Speech Communication Association. 2014.
  • Anguera, Xavier, Luis J. Rodriguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, and Mikel Penagarikano. “Query-by-Example Spoken Term Detection Evaluation on Low-Resource Languages.” In Spoken Language Technologies for Under-Resourced Languages. 2014.
  • Besacier, Laurent, Etienne Barnard, Alexey Karpov, and Tanja Schultz. “Automatic speech recognition for under-resourced languages: A survey.” Speech Communication 56 (2014): 85-100.
  • Anguera, Xavier, Jordi Luque, and Ciro Gracia. “Audio-to-text alignment for speech recognition with very limited resources.” In Fifteenth Annual Conference of the International Speech Communication Association. 2014.
  • Domokos, József, Ovidiu Buza, and Gavril Toderean. “Romanian phonetic transcription dictionary for speeding up language technology development.” Language Resources and Evaluation (2014): 1-15.
  • Şchiopu, Daniela. “Applying Nonlinear Techniques for an Automatic Speech Recognition System.” In Nonlinear Dynamics of Electronic Systems, pp. 371-378. Springer International Publishing, 2014.
  • Sigappi, A. N., and S. Palanivel. “Spoken query based word spotting in digitized Tamil documents.” AI & society 29, no. 1 (2014): 113-121.
  • Vasilescu, Ioana, Bianca Vieru, and Lori Lamel. “Exploring pronunciation variants for Romanian speech-to-text transcription.” In Spoken Language Technologies for Under-Resourced Languages. 2014.
  • Tarján, Balázs, Tibor Fegyό, and Péter Mihajlik. “A Bilingual Study on the Prediction of Morph-Based Improvement.” In Spoken Language Technologies for Under-Resourced Languages. 2014.
  • Hafeez, Aurish Hammad, Khawaja Mohiuddin, and Sohaib Ahmed. “Speaker-Dependent Live Quranic Verses Recitation Recognition System Using Sphinx-4 Framework.” In Multi-Topic Conference (INMIC), 2014 IEEE 17th International, pp. 333-337, Karachi, Pakistan, 2014.
  • Stahlberg, Felix. “Towards Automatic Speech Recognition for Non-Written Languages Using Translations From Other Languages.”, Master Thesis, Karlsruhe Institute of Technology, 2014.

2013

  • Metze, Florian, Xavier Anguera, Etienne Barnard, Marelie Davel, and Guillaume Gravier. “The spoken web search task at MediaEval 2012.” In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8121-8125. IEEE, 2013.
  • Tejedor, Javier, Doroteo T. Toledano, Xavier Anguera, Amparo Varona, Lluís F. Hurtado, Antonio Miguel, and José Colás. “Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion.” EURASIP Journal on Audio, Speech, and Music Processing 2013, no. 1 (2013): 1-17.
  • Mironica, Ionut, Jasper Uijlings, Negar Rostamzadeh, Bogdan Ionescu, and Nicu Sebe. “Time matters!: capturing variation in time in video using fisher kernels.” In Proceedings of the 21st ACM international conference on multimedia, pp. 701-704. ACM, 2013.
  • Schmiedeke, Sebastian, Peng Xu, Isabelle Ferrané, Maria Eskevich, Christoph Kofler, Martha A. Larson, Yannick Estève, Lori Lamel, Gareth JF Jones, and Thomas Sikora. “Blip10000: a social video dataset containing SPUG content for tagging and retrieval.” In Proceedings of the 4th ACM Multimedia Systems Conference, pp. 96-101. ACM, 2013.
  • Mironica, Ionut, Bogdan Ionescu, Peter Knees, and Patrick Lambert. “An in-depth evaluation of multimodal video genre categorization.” In Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on, pp. 11-16. IEEE, 2013.
  • Younessian, Ehsan, and Deepu Rajan. “Multi-modal fusion for associated news story retrieval.” Multimedia Tools and Applications (2013): 1-23.
  • Mironica, Ionut, Bogdan Ionescu, Christoph Rasche, and Patrick Lambert. “A visual-based late-fusion framework for video genre classification.” In Signals, Circuits and Systems (ISSCS), 2013 International Symposium on, pp. 1-4. IEEE, 2013.
  • Nakajima, Kaisuke, and Brian Strope. “Cross-lingual initialization of language models.” U.S. Patent 8,442,830, issued May 14, 2013.
  • Ungurean, Cătălin, Dragoş Burileanu, and Mihai Surmei. “Statistically augmented preprocessing/normalization module for a Romanian text-to-speech system.” In Speech Technology and Human-Computer Dialogue (SpeD), 2013 7th Conference on, pp. 1-6. IEEE, 2013.

2012

  • A.M. Riad, Hamdy K.Elmonier, Samaa. M. Shohieb, A.S. Asem, “SignsWorld; Deeping Into the Silence World and Hearing Its Signs (State of the Art),” International Journal of Computer Science & Information Technology (IJCSIT), Vol 4, No 1, Feb 2012.
  • Ordean, Mihai Alexandru, Andrei Şaupe, Mihaela Ordean, Gheorghe Cosmin Silaghi, and Corina Giurgea. “A Romanian Language Corpus for a Commercial Text-To-Speech Application.” In Text, Speech and Dialogue, pp. 405-414. Springer Berlin Heidelberg, 2012.
  • Ananthi, S., and P. Dhanalakshmi. “Speech Recognition System and Isolated Word Recognition based on Hidden Markov Model (HMM) for Hearing Impaired.” International Journal of Computer Applications 73, no. 20 (2012).
  • Tarján, B., T. Mozsolics, A. Balog, D. Halmos, T. Fegyó, and P. Mihajlik. “Broadcast news transcription in Central-East European languages.” In IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom), pp. 59-64. 2012.
  • Karpov, Alexey, Irina S. Kipyatkova, and Andrey Ronzhin. “Speech recognition for east Slavic languages: the case of Russian.” In SLTU, pp. 84-89. 2012.

2011

  • Ong, H. F., and A. M. Ahmad. “Malay Language Speech Recogniser with Hybrid Hidden Markov Model and Artificial Neural Network (HMM/ANN).” International Journal of Information and Education Technology 1.2 (2011): 114.