14 December 2023

Planning And leaRning For AI-Edge compuTing Partners: Period: 2023-2027


13 December 2023

Validity of a task of acoustic-phonetic decoding on anatomic deficits in paramedical assessment of speech disorders for patients treated for oral or oropharyngeal cancer Plus d'infos


1 January 2023

Tracking and Detecting Fake News and Deepfakes in Arabic Social Networks The 4th Generation War (4GW) is known as an information war involving non-military populations. It is conducted by national or transnational groups following ideologies based on cultural beliefs, religion, economic or political interests, aiming to sow chaos in a targeted region globally. In 1989, authors discussing the 4th generation war, some of whom were military experts, explained that it would be widespread and challenging to define in the decades to come. With the emergence of social networks, the previously vague battlefield found a platform for 4GW. One of its penetration points is the extensive use of social networks to manipulate opinions, aiming to shape the targeted region’s perspective to accept a certain state of affairs and render it socially and politically acceptable. Much like the 4th generation war, cognitive warfare aims to blur comprehension mechanisms regarding politics, economy, religion, etc. Its consequence is destabilizing and weakening the adversary. This cognitive war targets what is assumed to be the enemy’s brain, altering reality by flooding the adversary’s population with misleading information, rumors, or manipulated videos. Furthermore, the proliferation of social bots today enables automated dissemination of disinformation on social networks. Plus d'infos

ANR ESSL Project

1 January 2023

Efficient Self-Supervised Learning for Inclusive and Innovative Speech Technologies Self-Supervised Learning (SSL) has recently emerged as an incredibly promising artificial intelligence (AI) method. Through this method, massive amounts of unlabeled data that are accessible can be utilized by AI systems to surpass known performances. Particularly, the field of Automatic Speech Processing (ASP) is swiftly being transformed by the arrival of SSL, thanks in part to massive industrial investments and the explosion of data, both provided by a handful of companies. The performance gains are impressive, but the complexity of SSL models requires researchers and industry professionals in the field to have extraordinary computational capacity, drastically limiting access to fundamental research in this area and its deployment in everyday products. For instance, a significant portion of work using an SSL model for ASP relies on a system maintained and provided by a single company (wav2vec 2.0). The entire lifecycle of the technology, from its theoretical foundations to its practical deployment and societal analysis, therefore depends solely on institutions with the physical and financial means to support the intensity of this technique’s development. The E-SSL project aims to restore to the scientific community and ASP industry the necessary control over self-supervised learning Plus d'infos


1 January 2023

Development of a methodology for evaluating voice identification systems The BRUEL project concerns the evaluation/certification of voice identification systems against adversarial attacks. Indeed, speaker recognition systems are vulnerable not only to speech artificially produced by voice synthesis but also to other forms of attacks such as voice identity conversion and replay attacks. The artifacts created during the creation or manipulation of these fraudulent attacks leave marks in the signal by voice synthesis algorithms, thus distinguishing the original real voice from a forged voice. Under these conditions, detecting identity theft requires evaluating identity theft countermeasures concurrently with speaker recognition systems. The BRUEL project aims to propose the first methodology for evaluating/certifying voice identification systems based on a Common Criteria approach. List of partners: CEA Eurecom Service National de Police Scientifique IRCAM LIA Laboratoire d’Informatique d’Avignon Project Coordinator: LIA Scientific Manager for LIA: Driss Matrouf Start Date: 01/01/2023 End Date: 30/06/2026 More

ANR EVA Project

1 January 2023

Explicit Voice Attributes Describing a voice in a few words remains a very arbitrary task. We can speak with a “deep”, “breathy”, “bright” or “hoarse” voice, but the full characterization of a voice would require a close set of rigorously defined attributes constituting an ontology. However, such a description grid does not exist. Machine learning applied to speech also suffers the same weakness : in most automatic processing tasks, when a speaker is modeled, abstract global representations are used without making their characteristics explicit. For instance, automatic speaker verification / identification is usually tackled thanks to the x-vectors paradigm, which consists in describing a speaker’s voice by an embedding vector only designed to distinguish speakers. Despite their very good accuracy for speaker identification, x-vectors are usually unsuitable to detect similarities between different voices with common characteristics. The same observations can be made for speech generation. We propose to carry out a comprehensive set of analyses to extract salient, unaddressed voice attributes to enrich structured representations usable for synthesis and voice conversion. Partner list: Project leader: Orange Scientific leader for LIA: Yannick Estève Start date: 01/01/2023 — End date: 31/12/2025 More


1 September 2022

Understanding, Modeling and Improving the outcome of Crowdfunding campaigns UMICrowd project explores CF from economical and sociological perspectives, using advancedmathematical modeling tools, Artificial Intelligence (AI) and empirical analysis. It aims to proposedecision-making tools that help entrepreneurs in designing their campaigns and CFP managers inselecting, classifying and promoting projects. Partners: CentraleSupelec CRAN FPF ESCE LIA Period: 2022-2026 ANR Webpage:

H2020 SELMA Project

1 January 2021

Stream Learning for Multilingual Knowledge Transfer The internet contains vast amounts of data and information in various languages, both written and audiovisual. There’s an increasing need to leverage this largely untapped resource. The SELMA project, funded by the EU, focuses on ingesting and monitoring large quantities of data. It systematically trains machine learning models to perform tasks in natural language and utilizes these models to monitor data streams, aiming to enhance multilingual media monitoring and real-time content production. Ultimately, the project will advance cutting-edge techniques in language modeling, automatic translation, speech recognition, and synthesis. Project Coordinator: Deutsche Welle, DE Scientific Lead for LIA: Yannick ESTEVE Start Date: 01/01/2021 End Date: 30/12/2023 More

ANR muDialBot Project

1 January 2021

MUlti-party perceptually-active situated DIALog for human-roBOT interaction In muDialBot, our ambition is to proactively incorporate human-like behavioral traits in human-robot spoken communication. We aim to reach a new stage in harnessing the rich information provided by audio and visual data streams from humans. In particular, extracting verbal and non-verbal events should enhance the decision-making abilities of robots to manage turns of speech more naturally and also switch from group interactions to face-to-face dialogues according to the situation. There has been growing interest recently in companion robots capable of assisting individuals in their daily lives and effectively communicating with them. These robots are perceived as social entities, and their relevance to health and psychological well-being has been highlighted in studies. Patients, their families, and healthcare professionals will better appreciate the potential of these robots as certain limitations are quickly overcome, such as their ability to move, see, and listen to communicate naturally with humans, beyond what touchscreen displays and voice commands already enable. The scientific and technological outcomes of the project will be implemented on a commercial social robot and tested and validated with multiple use cases in the context of a day hospital unit. Large-scale data collection will complement in-situ Plus d'infos

H2020 ESPERANTO Project

1 January 2021

Exchanges for SPEech ReseArch aNd TechnOlogies Speech processing technologies are crucial for numerous commercial applications. The ESPERANTO project, funded by the EU, aims to make the next generation of AI algorithms used in speech processing applications more accessible. For instance, they should consider human involvement and be interpretable to allow sensitive applications and safeguard personal data. ESPERANTO envisions disseminating these technologies across European SMEs, expanding and ensuring their implementation for forensic, healthcare, and educational purposes. The project will support the development of freely accessible tools, conduct seminars on various speech processing themes to assist new students, researchers, and engineers working in speech AI, and contribute to the collection and sharing of linguistic and speech-related resources. Project Coordinator: University of Le Mans, FR Scientific Manager for LIA: Jean-François Bonastre Start Date: 01/01/2021 End Date: 30/06/2025 More

1 2