SLG seminar – Yanis Labrak – 27/03/2025

17 March 2025

The next SLG team meeting will take place on Thursday, 27 March, in room S4 from 12:00 to 13:00. Title: Text-Speech Language Models with Improved Cross-Modal Transfer by Aligning Abstraction Levels. Abstract: Text-Speech Language Models (TSLMs), language models trained to jointly process and generate text and speech, aim to enable cross-modal knowledge transfer to overcome the scaling limitations of unimodal speech LMs. The predominant approach to TSLM training expands the vocabulary of a pre-trained text LM by appending new embeddings and linear projections for speech, followed by fine-tuning on speech data. We hypothesize that this method limits cross-modal transfer by neglecting feature compositionality, preventing text-learned functions from being fully leveraged at appropriate abstraction levels. To address this, we propose augmenting vocabulary expansion with modules that better align abstraction levels between speech and text across the model’s layers. Representation analyses and improved multimodal performance suggest that our method enhances cross-modal transfer, even surpassing or rivaling state-of-the-art TSLMs trained using orders of magnitude more compute.

Cornet seminar – Giuseppe Di Molfetta – 10/03/2025

6 March 2025

As part of the CORNET team seminars, we will have the pleasure of welcoming Mr. Giuseppe DI MOLFETTA this Monday, 10/03, at 12:00 in Room 6, CERI. Title: Quantum Computing: a gentle introduction. Abstract: A short, self-contained one-hour seminar introducing quantum computing and some simple applications in algorithmics in an informal way. No prerequisites are required; the presentation will cater to a potentially heterogeneous audience.

PhD thesis defense of Thibault Bañeras-Roux – 17/01/2025

15 January 2025

Title: Analysis and understanding of the evaluation of automatic speech recognition systems: towards metrics integrating human perception. Date: Friday, January 17 at 2:00 pm. Place: Amphithéâtre du bâtiment 34, LS2N, Campus Lombarderie, 2 chemin de la Houssinière, 44000 Nantes. The defense will be presented in French. Abstract: Today, word error rate remains the most widely used metric for evaluating automatic speech recognition (ASR) systems. However, this metric has limitations in terms of correlation with human perception and focuses only on spelling preservation. In this thesis, we propose alternative metrics that can evaluate spelling, but also grammar, semantics or phonetics. To analyze the ability of these metrics to reflect transcript quality from the user’s point of view, we built a dataset named HATS, annotated by 143 French-speaking subjects. Each annotator examined 50 triplets, made up of a manual reference transcription and two hypotheses from different ASR systems, to determine which hypothesis was, in their opinion, the most faithful. By calculating the number of times a metric agrees with the annotators’ choices, we obtain a measure of its correlation with human perception. This corpus can thus be used to rank different metrics according to the judgment of a human reader. Our results show that SemDist, a metric based on BERT’s semantic representations for comparing More info

PhD thesis defense of Alix Dupont – 25/02/2025

15 January 2025

Date: 25 February 2025 at 2 pm. Place: EDF Lab Paris-Saclay, amphitheater 1 (site address: 7 Bd Gaspard Monge, 91120 Palaiseau, France). Title: Stratégies des Opérateurs pour la Recharge des Véhicules Électriques en Espaces Publics avec un Comportement Piloté par l’Utilisateur (Operator Strategies for Electric Vehicle Charging in Public Spaces with User-Driven Behavior). This thesis was carried out in the SYSTEME department, within the R4T group, as part of the smart charging project. It was also supervised by the Laboratoire d’Informatique d’Avignon (LIA) at Avignon University. Supervision: Abstract: Electric vehicles are seen as an essential solution for reducing carbon emissions in the transport sector. However, current charging infrastructure, such as public charging stations, has limited capacity. Increasing the available power or installing many stations entails high costs, both for the electrical grid and for operators. This creates an environment in which charging must often be managed under conditions of high demand and congestion, which can reduce the quality of service for users. This thesis explores strategies to help operators optimize EV charging in this context. A decentralized approach is adopted: each user makes charging decisions individually, based on their own More info

Cornet seminar – Rita Safi – 15/01/2025

13 January 2025

The CORNET team kicks off its 2025 seminars next week with a presentation by Rita SAFI, a PhD student with CORNET and EDF, supervised by Yezekael HAYEL and Tania JIMENEZ. Note: this seminar will exceptionally take place on a Wednesday. Join us on Wednesday, 15 January at 11:35, in room C057 (former university library). Title: Smart charging and optimization of personalized flexibility services for electric vehicles’ users. Rita SAFI, PhD student, CORNET-LIA & EDF. Abstract: The growing number of electric vehicles (EVs) presents new challenges for charging point operators (CPOs) because of the rising charging demand. However, it also creates opportunities to influence the flexibility of EV users. In this work, we consider a CPO that offers a price menu to EV users. Each option in the menu represents a pair of charging times to satisfy the EV charging demand and the corresponding charging price. The goal of the price menu is to encourage EV users to be flexible in their charging time. The price menu design problem can be formulated as a bilevel optimization problem in which the upper level determines the charging prices of the price menu and the optimal allocation of power among EVs to maximize the profit of the CPO, while the More info

Rita Safi: Master's Prize

18 December 2024

Rita SAFI has been selected by ROADEF as one of the 3 finalists for the Prix de Master RO/AD 2024 (Master's Prize) in the “Theoretical Contributions/Applications” category. Her Master's internship, co-supervised by T. Jiménez (LIA/CORNET), Y. Hayel (LIA/CORNET) and R. Payen (EDF R&D), focused on optimizing electric vehicle charging with a bilevel model.

PhD thesis defense of Arthur Amalvy – 12/09/2024

28 November 2024

Thesis title: Natural Language Processing for the Representation of Narrative Texts through Character Networks. Date: 12/09/2024 – 9 AM. Place: CERI’s Ada Lovelace amphitheater. Abstract: A character network represents characters as vertices in a graph, and their relationships as edges between them. In the case of literary works, such a network models a whole narrative using a single mathematical object. Depending on the needs, its edges can represent different types of interactions between characters: co-occurrence, conversation, direct action… Additionally, the temporal changes in the relationships between characters can be modeled with dynamic networks. Thanks to this flexibility, character networks have been used to tackle a number of tasks, such as literary genre classification, story segmentation, recommendation or summarization. Manually extracting these networks is costly, which is why many researchers are interested in automating the process. This, in turn, requires solving different Natural Language Processing (NLP) tasks such as Named Entity Recognition (NER), coreference resolution or speaker attribution. In this thesis, we present contributions to this automatic extraction process in the case of novels, as well as to character network applications. Inspired by the 2019 survey of Labatut and Bost that summarizes existing extraction efforts in a generic extraction framework, we propose Renard, a More info

PhD thesis defense of Willie Kouam – 03/12/2024

26 November 2024

Title: Network Centrality Game for Cyber Deception against Network Epidemic Propagation. Date: Tuesday, December 3, 2024 – 3 pm. Place: thesis room (salle des thèses) at the Hannah Arendt campus. Abstract: The rise in data breaches and service disruptions increasingly threatens internal security, with potentially devastating consequences for individuals and organizations. As a result, users of information and communication technologies must adopt tools that are both effective and efficient in combating the spread of malware. The term “users” encompasses a wide range of actors, including individuals, businesses, governmental and non-governmental organizations, and states; in short, anyone who communicates through modern technologies. Among the most pressing threats they face are lateral movement and widespread epidemic propagation through the covert recruitment of unsuspecting users into botnets, the cyber-terrorist armies capable of inflicting significant damage, such as crippling businesses whose services are used by the same users. In these scenarios, as in many others, users, deceived by skilled experts known as attackers, unknowingly contribute to cyberattacks, with deception serving as the primary attack vector. Cybercriminals, unlike defenders, frequently violate privacy rules, allowing them to be better informed, sometimes unilaterally, about the level of compromise of each user. In their efforts to control multiple More info

Cornet seminar – Felipe Albuquerque – 11/07/2024

4 November 2024

Title: The Capacitated p-Location Problem with Territorial Coverage Constraints. Date: 11/07/2024 – 11:35 AM. Room: S6. Abstract: In spatial planning, the efficient location and allocation of services pose complex challenges across diverse contexts. Our research focuses on the capacitated p-location problem, which aims to select p facilities from a set of potential locations to minimize allocation costs between facilities and consumers with specific demand weights, while respecting capacity constraints. To better model real-world applications, we extended this problem by introducing territorial coverage constraints. We examined the adapted formulation of this expanded problem and developed a heuristic approach to handle larger instances effectively. A case study in France’s PACA (Provence-Alpes-Côte d’Azur) region illustrates the impact of these coverage constraints.

SLG seminar – Ana Montalvo – 11/06/2024

4 November 2024

Title: Exploring Short-Duration Spoken Language Recognition: Insights from CENATAV. Date: 11/06/2024 – 11 AM. Room: S4. Abstract: This presentation will introduce the Advanced Technologies Application Center (CENATAV), outlining its core mission and research areas, with a focus on the work of its Voice Processing Group. We will discuss the challenges of conducting research with limited access to high-performance computing resources and large datasets, emphasizing our recent work on spoken language recognition in very short-duration audio signals. Language: English
