Conference day – 16/02/2024

13 February 2024

À l’occasion de la journée internationale des femmes en sciences, qui a lieu cette année le dimanche 11 février, le comité égalité-diversité du LIA organise une journée de conférences le vendredi 16 février 2024. Ces conférences seront menées par Cécile Favre, de l’université Lyon 2. Conférence 1: (de 10h à 11h30) Lieu : Amphi Ada Lovelace Titre : La « science des données » au prisme des études de genre : objet de recherche et source de questionnements méthodologiques pour la scientométrie. Le cas du défi EGC 2020. Résumé : Cette communication s’appuie sur le défi lancé par l’association EGC (Extraction et Gestion des Connaissances) qui rassemble des chercheuses et des chercheurs travaillant au croisement de l’informatique et des statistiques, en « science des données ». Pour la 20ème édition de sa conférence annuelle, l’association a mis à disposition des matériaux la concernant en vue de leur analyse. Les matériaux empiriques proposés sont notamment les actes publiés, l’ensemble des emails envoyés sur sa liste de diffusion. Nous avons complété ces matériaux avec des éléments récoltés sur le site Web concernant l’organisation des 19 éditions de la conférence tels que les conférences invitées, les comités de lecture, les comités d’organisation, etc. Plus d'infos

Cornet Seminar – 31/01/2024

26 January 2024

The next seminar of the Cornet team will take place on January 31, 2024, at 11:35 am in S3 and will consist of two parts. Firstly, Felipe Albuquerque (LIA) will present his thesis topic on ‘The p-Median Problem with Coverage Constraints: New Resolution Methods and Application to the Design of Public Services.’ Following that, Luca Dini and Pierre Jourlin will present their ongoing work on the theme of ‘Hybrid Methods for Cognitive Attitudes Detection.’ Summary: In this seminar, we will present ongoing work on the transformation of a keyword spotting system into a concept-based labeling engine. We will highlight four major axes of this work:

SLG Seminar – Ryan Whetten – 01/02/2024

25 January 2024

The next SLG meeting will take place in room S5 on Thursday, February 1st, from 12:00 PM to 1:00 PM. Ryan Whetten will present his work, and you can find a brief introduction below. ——————————————————————— Open Implementation and Study of BEST-RQ for Speech Processing Abstract: Self-Supervised Learning (SSL) has proven to be useful in various speech tasks. However, these methods are generally very demanding in terms of data, memory, and computational resources. Recently, Google came out with a model called BEST-RQ (BERT-based Speech pre-Training with Random-projection Quantizer). Despite BEST-RQ’s great performance and simplicity, details are lacking in the original paper and there is no official easy-to-use open-source implementation. Furthermore, BEST-RQ has not been evaluated on other downstream tasks aside from ASR. In this presentation, we will discuss the details of my implementation of BEST-RQ and then see results from our preliminary study on four downstream tasks. Results show that a random projection quantizer can achieve similar downstream performance as wav2vec 2.0 while decreasing training time by over a factor of two.

SLG Seminar – Paul Gauthier Noé – 18/01/2024

10 January 2024

On 18 January from 12 am, we will host a talk from Dr. Paul Gauthier Noé on « Explaining probabilistic predictions … ». The presentation will be hosted on room S6.    More details will follow   Bio: Paul Gauthier Noe just received a PhD in Computer Science in Avignon Université under the supervision of Prof. Jean-François Bonastre and Dr. Driss Matrouf. He was working for the international JST-ANR VoicePersonae project and his main research interests are Speaker verification, Bayesian decision theory, Calibration of probabilities and Privacy in Speech.

PhD defense of Noé Cécillon – 18 January 2024

8 January 2024

Date: Thursday, January 18, 2023 at 14:00. Place: University of Avignon, Campus JH Fabre, Ada Lovelace amphitheater Jury : Title: Combining Graph and Text to Model Conversations: An Application to Online Abuse Detection. Abstract: Online abusive behaviors can have devastating consequences on individuals and communities. With the global expansion of internet and the social networks, anyone can be confronted with these behaviors. Over the past few years, laws and regulations have been established to regulate this kind of abuse but the responsibility ultimately lies with the platforms that host online communications. They are asked to monitor their users in order to prevent the proliferation of abusive content. Timely detection and moderation is a key factor to reduce the quantity and impact of abusive behaviors. However, due to the sheer quantity of online messages posted every day, platforms struggle to provide adequate resources. Since this implies high human and financial costs, companies have a keen interest in automating this process. Although it may seem a relatively simple task, it turns out to be quite complex. Indeed, malicious users have developed numerous techniques to bypass the standard automated methods. Allusions or implied meaning are other examples of strategies that automatic methods struggle Plus d'infos

SLG Seminar – Fenna Poletiek – 12/01/2024

8 January 2024

On 12 January from 12 am, we will host a virtual talk from Dr. Fenna Poletiek from Institute of Psychology at Leiden University on « Language learning in the lab ».   The presentation will be hosted on room S6.   Abstract: Language learning in the lab Language learning skills have been considered a defining feature of humanness. In this view language cannot be acquired by mere associative or statistical learning processes, only, like many other skills are learned by human and nonhuman primates during development. Indeed, the high (recursive) complexity of human grammars have been shown to make them impossible to learn by exposure to language exemplars only. Some research suggests, however, that at least some statistical learning is recruited in language acquisition (Perruchet & Pacton, 2006). And primates have been shown to mimic complex grammatical patterns after being trained on a sequence of stimulus responses (Rey et al., 2012). We performed series of studies with artificial languages in the lab, to investigate associative and statistical learning processes that support language learning. The results thus far suggest a fine tuned cooperation between three crucial features of the natural language learning process: first, learning proceeds ‘starting small’ with short simple sentences growing in complexity Plus d'infos

PhD defense of Julio Perez-Garcia – 18 December 2023

14 December 2023

Place: University of Avignon, Campus Hannah Arendt, Salle des ThèsesDate: Monday, December 18, 2023 at 14:00. Title: Contribution to security and privacy in the Blockchain-based Internet of Things: Robustness, Reliability, and Scalability. Abstract: The Internet of Things (IoT) is a diverse network of objects or ”things” typically interconnected via the Internet. Given the sensitivity of the information exchanged in IoT applications, it is essential to guarantee security and privacy. This problem is aggravated by the open nature of wireless communications, and the power and computing resource limitations of most IoT devices. At the same time, existing IoT security solutions are based on centralized architectures, which raises scalability issues and the single point of failure problem, making them susceptible to denial-of-service attacks and technical failures. Blockchain has emerged as an attractive solution to IoT security and centralization issues. Blockchains replicate a permanent, append-only record of all transactions occurring on a network across multiple devices, keeping them synchronized through a consensus protocol. Blockchain implementation may involve high computational and energy costs for devices. Consequently, solutions based on Fog/Edge computing have been considered in the integration with IoT. This approach shifts the higher computational load and higher energy consumption to the devices with higher Plus d'infos

SLG Meeting – St Germes Bengono Obiang – 21/12/2023

12 December 2023

The next SLG meeting will be held in room S1 on Thursday, December 21st, from 12:00 PM to 1:00 PM.    We will have the pleasure of hosting St Germes BENGONO OBIANG, a PhD student in speech processing, focusing on tone recognition in under-resourced languages. He is supervised by Norbert TSOPZE and Paulin MELATAGIA from the University of Yaoundé 1, as well as by Jean-François BONASTRE and Tania JIMENEZ from LIA.   Abstract: Many sub-Saharan African languages are categorized as tone languages and for the most part, they are classified as low resource languages due to the limited resources and tools available to process these languages. Identifying the tone associated with a syllable is therefore a key challenge for speech recognition in these languages. We propose models that automate the recognition of tones in continuous speech that can easily be incorporated into a speech recognition pipeline for these languages. We have investigated different neural architectures as well as several features extraction algorithms in speech (Filter banks, Leaf, Cestrogram, MFCC). In the context of low-resource languages, we also evaluated Wav2vec models for this task. In this work, we use a public speech recognition dataset on Yoruba. As for the results, using the Plus d'infos

PhD defense of Anais Chanclu – 11 December 2023

11 December 2023

Thesis defense of Anais Chanclu Date: Monday 11 December 2023 at 14:30  Location: Thesis room, Hannah Arendt campus. Title: Recognizing individuals by their voice: defining a scientific framework to ensure the reliability of voice comparison results in forensic contexts Jury: Abstract: In police investigations or criminal trials, voice recordings are often collected for comparison purposes with the voice of suspects. Typically, these recordings, referred to as ‘traces’, come from phone taps, emergency service calls, or voicemail messages. Recordings of suspects, known as ‘comparison pieces’, are usually obtained by law enforcement through voice sampling. Since the traces and comparison pieces were not recorded under the same conditions, and the recording conditions of the traces are often poorly known or entirely unknown, the variability between the recordings being compared cannot be quantified. Numerous factors come into play, including audio file characteristics, linguistic content, the recording environment, and the speaker(s). Voice comparison practices have evolved throughout history without conforming to a scientific framework. This has led to questioning the reliability of voice expertise (as in the Trayvon Martin case) and the use of fallacious practices (as in the Élodie Kulik case), potentially leading to judicial errors. Nowadays, the French Scientific Police (SNPS) and the Plus d'infos

1 2 3 4 5 9