ANR BRUEL Project

1 January 2023

Development of a methodology for evaluating voice identification systems The BRUEL project concerns the evaluation/certification of voice identification systems against adversarial attacks. Indeed, speaker recognition systems are vulnerable not only to speech artificially produced by voice synthesis but also to other forms of attacks such as voice identity conversion and replay attacks. The artifacts created during the creation or manipulation of these fraudulent attacks leave marks in the signal by voice synthesis algorithms, thus distinguishing the original real voice from a forged voice. Under these conditions, detecting identity theft requires evaluating identity theft countermeasures concurrently with speaker recognition systems. The BRUEL project aims to propose the first methodology for evaluating/certifying voice identification systems based on a Common Criteria approach. List of partners: CEA Eurecom Service National de Police Scientifique IRCAM LIA Laboratoire d’Informatique d’Avignon Project Coordinator: LIA Scientific Manager for LIA: Driss Matrouf Start Date: 01/01/2023 End Date: 30/06/2026 More

ANR EVA Project

1 January 2023

Explicit Voice Attributes Describing a voice in a few words remains a very arbitrary task. We can speak with a “deep”, “breathy”, “bright” or “hoarse” voice, but the full characterization of a voice would require a close set of rigorously defined attributes constituting an ontology. However, such a description grid does not exist. Machine learning applied to speech also suffers the same weakness : in most automatic processing tasks, when a speaker is modeled, abstract global representations are used without making their characteristics explicit. For instance, automatic speaker verification / identification is usually tackled thanks to the x-vectors paradigm, which consists in describing a speaker’s voice by an embedding vector only designed to distinguish speakers. Despite their very good accuracy for speaker identification, x-vectors are usually unsuitable to detect similarities between different voices with common characteristics. The same observations can be made for speech generation. We propose to carry out a comprehensive set of analyses to extract salient, unaddressed voice attributes to enrich structured representations usable for synthesis and voice conversion. Partner list: Project leader: Orange Scientific leader for LIA: Yannick Estève Start date: 01/01/2023 — End date: 31/12/2025 More

ANR UMICROWD Project

1 September 2022

Understanding, Modeling and Improving the outcome of Crowdfunding campaigns UMICrowd project explores CF from economical and sociological perspectives, using advancedmathematical modeling tools, Artificial Intelligence (AI) and empirical analysis. It aims to proposedecision-making tools that help entrepreneurs in designing their campaigns and CFP managers inselecting, classifying and promoting projects. Partners: CentraleSupelec CRAN FPF ESCE LIA Period: 2022-2026 ANR Webpage: https://anr.fr/Projet-ANR-22-CE38-0013

H2020 SELMA Project

1 January 2021

Stream Learning for Multilingual Knowledge Transfer The internet contains vast amounts of data and information in various languages, both written and audiovisual. There’s an increasing need to leverage this largely untapped resource. The SELMA project, funded by the EU, focuses on ingesting and monitoring large quantities of data. It systematically trains machine learning models to perform tasks in natural language and utilizes these models to monitor data streams, aiming to enhance multilingual media monitoring and real-time content production. Ultimately, the project will advance cutting-edge techniques in language modeling, automatic translation, speech recognition, and synthesis. Project Coordinator: Deutsche Welle, DE Scientific Lead for LIA: Yannick ESTEVE Start Date: 01/01/2021 End Date: 30/12/2023 More

ANR muDialBot Project

1 January 2021

MUlti-party perceptually-active situated DIALog for human-roBOT interaction In muDialBot, our ambition is to proactively incorporate human-like behavioral traits in human-robot spoken communication. We aim to reach a new stage in harnessing the rich information provided by audio and visual data streams from humans. In particular, extracting verbal and non-verbal events should enhance the decision-making abilities of robots to manage turns of speech more naturally and also switch from group interactions to face-to-face dialogues according to the situation. There has been growing interest recently in companion robots capable of assisting individuals in their daily lives and effectively communicating with them. These robots are perceived as social entities, and their relevance to health and psychological well-being has been highlighted in studies. Patients, their families, and healthcare professionals will better appreciate the potential of these robots as certain limitations are quickly overcome, such as their ability to move, see, and listen to communicate naturally with humans, beyond what touchscreen displays and voice commands already enable. The scientific and technological outcomes of the project will be implemented on a commercial social robot and tested and validated with multiple use cases in the context of a day hospital unit. Large-scale data collection will complement in-situ Plus d'infos

H2020 ESPERANTO Project

1 January 2021

Exchanges for SPEech ReseArch aNd TechnOlogies Speech processing technologies are crucial for numerous commercial applications. The ESPERANTO project, funded by the EU, aims to make the next generation of AI algorithms used in speech processing applications more accessible. For instance, they should consider human involvement and be interpretable to allow sensitive applications and safeguard personal data. ESPERANTO envisions disseminating these technologies across European SMEs, expanding and ensuring their implementation for forensic, healthcare, and educational purposes. The project will support the development of freely accessible tools, conduct seminars on various speech processing themes to assist new students, researchers, and engineers working in speech AI, and contribute to the collection and sharing of linguistic and speech-related resources. Project Coordinator: University of Le Mans, FR Scientific Manager for LIA: Jean-François Bonastre Start Date: 01/01/2021 End Date: 30/06/2025 More

ANR AISSPER Project

1 January 2020

AISSPER: Artificial Intelligence for Semantically controlled SPEech undeRstanding Artificial Intelligence (AI) holds strategic importance at the national level due to impressive outcomes achieved by deep learning algorithms in various domains such as natural language processing (NLP), medicine, and political analytics across a wide range of applications. France has emerged as a leader in deep learning owing to recent political efforts highlighted in recent years. Over the last decade, substantial efforts have been dedicated to end-to-end Spoken Language Understanding (SLU) systems, driven by the feasibility of applications like personal assistants and conversational systems. Superior results have been observed in automatic speech recognition (ASR) with architectures based on hyper-complex number algebra called quaternions, requiring less processing time (Morchid 2018) and fewer parameters to estimate compared to conventional models (Parcollet et al 2018; 2019). Reducing model parameters efficiently trains neural architectures with limited data quantities, often challenging to obtain for specific semantic concepts and contexts from specific domains. Intrinsically linked learning processes like ASR and SLU hinder the parallelization of learning examples, critical for lengthy sequences as memory constraints limit batch processing using examples. Furthermore, error analysis conducted on completed projects like M2CR, JOKER, VERA, SUMACC, Media, or DECODA highlighted the importance of Plus d'infos

ANR DeCoMaP Project

1 September 2019

DeCoMaP: Detecting Corruption in Public Procurement The societal benefits of opening up public data are expected to be huge. This is particularly true with Public Procurement Data which are supposed to help discover and dismantle corrupt activities by facilitating critical information, tools, and mechanism for judicial enforcement. In a multidisciplinary project, bridging computer science, economics and law, DeCoMap is intended to collect, process and analyze French procurement data in order to create a software tool for automatic identification of corruption and fraud in public procurement (automated red flagging) and provide normative analytical grid by highlighting the main factors that public authorities should identify and pay attention to. Supported by Transparency International France and Open Contracting Partnership, DeCoMap brings together academic researchers from 7 universities, with strong expertise in procurement and digital law, procurement economics and econometrics, law and economics, graph optimization and complex network analysis. 4 members of Datactivist, a cooperative company that assists organizations from the public, private and non-profit sectors in producing and re-using Open Data, with strong expertise with open data of public procurement, open contracting and open government, complement the consortium. Date: 2019–2024 Website: https://decomap.univ-avignon.fr ANR page: https://anr.fr/Projet-ANR-19-CE38-0004  

ANR Project VoicePersonae

1 February 2019

With recent advancements in automatic speech and language processing, humans are increasingly interacting vocally with intelligent artificial agents. The use of voice in applications is expanding rapidly, and this mode of interaction is becoming more widely accepted. Nowadays, vocal systems can offer synthesized messages of such quality that discerning them from human-recorded messages is difficult. They are also capable of understanding requests expressed in natural language, albeit within their specific application framework. Furthermore, these systems frequently recognize or identify their users by their voices. Plus d'infos

ANR RUGBI Project

1 February 2019

Searching for Linguistic Units to Improve the Measurement of Speech Intelligibility Altered by Pathological Production Disorders In the context of speech production disorders observed in ORL cancers, neurological, sensory, or structural pathologies, the goal of the RUGBI project is to enhance the measurement of intelligibility deficits. List of Partners: IRIT Institut de Recherche en Informatique de Toulouse CHU Toulouse Direction de la Recherche LPL Laboratoire Parole et Langage LIA Laboratoire d’Informatique d’Avignon OCTOGONE UNITE DE RECHERCHE INTERDISCIPLINAIRE OCTOGON Project Coordinator: Jérome Farinas (IRIT) Scientific Lead for LIA: Corinne Fredouille Period: 2019-2022 More

1 2 3