PhD thesis defense of Arthur Amalvy – 12/09/2024

28 November 2024

Thesis title: Natural Language Processing for the Representation of Narrative Texts through Character Networks Date: 12/09/2024 – 9 AM Place: CERI’s Ada Lovelace amphitheater. Abstract: A character network represents characters as vertices in a graph, and their relationships as edges between them. In the case of literary works, they model a whole narrative using a single mathematical object. Depending on the needs, their edges can represent different types of interactions between characters: co-occurrence, conversation, direct action… Additionally, the temporal changes in the relationships between characters can be modeled with dynamic networks. Thanks to this flexibility, character networks have been used to tackle a number of tasks, such as literary genre classification, story segmentation, recommendation or summarization. Manually extracting these networks is costly, which is why many researchers interested in automating the process. This, in turn, requires solving different Natural Language Processing (NLP) tasks such as Named Entity Recognition (NER), coreference resolution or speaker attribution. In this thesis, we present contributions to this automatic extraction process in the case of novels, as well as to character network applications. Inspired by the 2019 survey of Labatut and Bost that summarizes existing extraction efforts in a generic extraction framework, we propose Renard, a Plus d'infos

PhD defense of Willie KOUAM – 03/12/2024

26 November 2024

Title: Network Centrality Game for Cyber Deception against Network Epidemic Propagation Date: Tuesday, December 3, 2024 – 3 pm Place: thesis room (salle des thèses) at the Hannah Arendt campus. Abstract: The rise in data breaches and service disruptions increasingly threatens internal security, with potentially devastating consequences for individuals and organizations. As a result, users of information and communication technologies must adopt tools that are both effective and efficient in combating the spread of malware. The term “users” encompasses a wide range of actors, including individuals, businesses, governmental and non-governmental organizations, and states, in short anyone who communicates through modern technologies.  Among the most pressing threats they face are lateral movement and widespread epidemic propagation through the covert recruitment of unsuspecting users into botnets, the cyber-terrorist armies capable of inflicting significant damage, such as crippling businesses whose services are used by the same users. In these scenarios, as in many others, users, deceived by skilled experts known as attackers, unknowingly contribute to cyberattacks, with deception serving as the primary attack vector. Cybercriminals, unlike defenders, frequently violate privacy rules, allowing them to be better informed, sometimes unilaterally, about the level of compromise of each user.  In their efforts to control multiple Plus d'infos

Cornet seminar – Felipe Albuquerque – 11/07/2024

4 November 2024

Title: The Capacitated p-Location Problem with Territorial Coverage Constraints Date: 11/07/2024 – 11:35 AM Room: S6 Résumé : In spatial planning, the efficient location and allocation of services pose complex challenges across diverse contexts. Our research focuses on the capacitated p-location problem, which aims to select p facilities from a set of potential locations to minimize allocation costs between facilities and consumers with specific demand weights, while respecting capacity constraints. To better model real-world applications, we extended this problem by introducing territorial coverage constraints. We examined the adapted formulation of this expanded problem and developed a heuristic approach to handle larger instances effectively. A case study in France’s PACA (Provence-Alpes-Côte d’Azur) region illustrates the impact of these coverage constraints.

SLG seminar – Ana Montalvo – 11/06/2024

4 November 2024

Title: Exploring Short-Duration Spoken Language Recognition: Insights from CENATAV Date: 11/06/2024 – 11AM Room: S4 Abstract : This presentation will introduce the Advanced Technologies Application Center (CENATAV), outlining its core mission and research areas, with a focus on the work of its Voice Processing Group. We will discuss the challenges of conducting research with limited access to high-performance computing resources and large datasets, emphasizing our recent work on spoken language recognition in very short-duration audio signals. Language: English

PhD Defense of Timothée Dhaussy – 10/21/2024

18 October 2024

Date: 21th of october 2024 at 2PM Place: Thesis room, at the Hannah Arendt campus of Avignon Université. The videoconference link is the following: https://bbb.univ-avignon.fr/rooms/vtj-xje-xex-gyw/join .  The jury will be composed of: Dr Aurélie Clodic, LAAS-CNRS,  RapporteurePr Julien Pinquier, Université de Toulouse, IRIT, RapporteurPr Laurence Devillers, Sorbonne Université, LISN-CNRS, ExaminatricePr Olivier Alata, Université Jean Monnet, Laboratoire Hubert Curien, ExaminateurPr Fabrice Lefèvre, Avignon Université, LIA, Directeur de thèseDr Bassam Jabaian, Avignon Université, LIA, Co-encadrant Title: Proactive multimodal human-robot interaction in a hospital In this thesis, we focus on creating a proactive multimodal system for the social robot Pepper, designed for a hospital waiting room. To achieve this, we developed a cognitive human-robot interaction architecture, based on a continuous loop of perceptions, representation, and decision-making. The flow of perceptions is divided into two steps: first, retrieving data from the robot’s sensors, and then enriching it through refining modules. A speaker diarization refining module, based on a Bayesian model of fusion of audio and visual perceptions through spatial coincidence, was integrated. To enable proactive action, we designed a model analyzing the users’ availability for interaction in a waiting room. The refined perceptions are then organized and aligned to create a constantly updated representation of Plus d'infos

PhD defense of Lucas Druart – 24/10/2024

16 October 2024

Date:  Jeudi 24 octobre à 15h Lieu: salle des thèses sur le campus Hannah Arendt. Vous pouvez également y assister à distance si vous le souhaitez grâce au lien suivant : https://v-au.univ-avignon.fr/live/bbb-soutenance-these-l-druart-24-octobre-2024/. Title : Towards Contextual and Structured Spoken Task-Oriented Dialogue Understanding Abstract : Accurately understanding users’ requests is key to provide smooth interactions with spoken Task-Oriented Dialogue (TOD) systems. Traditionally such systems adopt cascade approaches which combine an Automatic Speech Recognition (ASR) component with a Natural Language Understanding (NLU) one. Yet, those systems still have trouble to accurately map complex user’s request with their internal representation. Recent work highlights potential directions to improve those systems. On the one hand, end-to-end approaches have successfully enhanced Spoken Language Understanding (SLU) system’s performance. Indeed, they provide more robust and accurate predictions by leveraging joint optimization and paralinguistic information. On the other hand, textual datasets propose fine-grained semantic representations. Such representations seem more adequate to represent user’s complex requests. This thesis explores both directions towards contextual and structured spoken task-oriented dialogue understanding. We first conduct a preliminary study dedicated to getting the grips of SLU in the context of TOD. We designed a cascade approach to perform spoken Dialogue State Tracking (DST) on MultiWOZ. Our approach ranked first in Plus d'infos

Cornet seminar – Léo Joubert – 24/09/2024

17 September 2024

Le prochain séminaire de l’équipe CORNET se tiendra le 24 septembre à 11h35 en salle C057 (ancienne BU). Nous aurons le plaisir d’accueillir Léo Joubert, Maître de Conférences du laboratoire Dysolab à l’Université de Rouen-Normandie, qui nous présentera : _________________________________________________________ De quoi est faite Wikipédia ? Quelques pistes pour une analyse de sa régulation au prisme des effacements réciproques. Résumé : Vieille de 20 ans et aujourd’hui largement utilisée, l’encyclopédie Wikipédia continue à interroger par la manière dont elle est produite. Une communauté de « wikipédien.nes » volontaires remplace les habituels expert.es mobilisés par les comités de rédaction. Alors qu’au fil du temps, des règles encadrant la rédaction acquéraient leur légitimité, le volume du corpus augmentait en même temps que son hétérogénéité. Ce séminaire vise à catégoriser cette diversité, dans le but de saisir comment Wikipédia parvient à faire cohabiter différents modes de régulation ajustée à la pluralité des contextes. Nous mobiliserons pour cela une base de données compilant l’ensemble des ajouts et effacements sur l’ensemble des pages wikipédiennes. À partir d’une comparaison de plusieurs sous-corpus, l’analyse statistique constituera le point de départ d’une sociologie de la régulation des communs numériques de masse au regard de la pluralité des formes de coécriture. _________________________________________________________

PhD defense of Gaelle Laperrière – 09/09/2024

3 September 2024

Date: 9th of Septembre 2024 Time : 3PM  Place: Ada Lovelace CERI’s amphitheater, at the Jean-Henri Fabre campus of Avignon Université.   The jury will be composed of:   Alexandre Allauzen, PR at Université Paris Dauphine-PSL, LAMSADE – Rapporteur Benoit Favre, PR at Aix-Marseille Université, LIS – Rapporteur Marco Dinarelli, CR at CNRS, LIG – Examiner Nathalie Camelin, MCF at Le Mans Université, LIUM – Examiner Philippe Langlais, PR at Université de Montréal, DIRO, RALI – Examiner Fabrice Lefèvre, PR at Avignon Université, LIA – Examiner Yannick Estève, PR at Avignon Université, LIA – Thesis director   Sahar Ghannay, MCF at Université Paris-Saclay, LISN, CNRS – Thesis co-supervisor Bassam Jabaian, MCF at Avignon Université, LIA – Thesis co-supervisor Title: Spoken Language Understanding in a multilingual context This thesis falls within the scope of Deep Learning applied to Spoken Language Understanding. Its primary objective is to leverage existing data of large resourced annotated languages for speech semantics to develop effective understanding systems in low resourced languages. In recent years, significant advances were made in the field of automatic speech translation through new approaches that converge audio and textual modalities, the latter benefiting from vast amounts of data. By visualizing spoken language understanding as a translation task from a natural Plus d'infos

Cornet Seminar – Anna Melnykova – 10/09/2024

2 September 2024

First CORNET seminar, Tuesday 10th september 2024 11:00, room C057. Anna Melnykova from Mathematics Lab of Avignon (LMA). Abstract: Hawkes processes is a versatile probabilistic tool which permits to model a variety of real-world phenomena: earthquakes, stock markets, population dynamics. In this talk we will focus on its use in neuroscience, where they permit to model interactions between different group of neurons (or individual neurons). Furthermore, due to memory property, models based on Hawkes processes naturally embed the refractory (inter-spiking) period for each individual neuron. The focus of this talk is on numerical and statistical challenges associated with Hawkes processes (simulation, parametric and non-parametric inference, causality).

ANR PANTAGRUEL Project

21 August 2024

Modèles de langue multimodaux et inclusifs pour le français général et clinique Le projet Pantagruel (ANR 23-IAS1-0001) ambitionne de développer et évaluer des modèles linguistiques multimodaux (écrit, oral, pictogrammes) inclusifs pour le français. Il mobilise des chercheurs de diverses disciplines telles que l’informatique, le traitement du signal, la sociologie et la linguistique pour assurer des résultats fiables et variés. Liste des partenaires : Responsable Scientifique pour le LIA : Yannick Estève (Equipe SLG) Date Début : 2023-11-20 Date Fin : 2026-11-20 En Savoir Plus

1 2 3