HDRs 2024
Pascal Andre, Modélisation rigoureuse au service du logiciel et des systèmes ►
Modélisation rigoureuse au service du logiciel et des systèmesRester modeste et avancer ensemble
Author : Pascal Andre
Keywords : Software EngineeringModelsSystemsVerificationFormal MethodsInformation SystemsSkills
Abstract
Software presents the paradox of opening up an huge field of potentials while remaining with empirical and craft development. No doubts this is the only good-service with such a lucrative business maintenance and so costly failures. The work discussed in this document deals with software engineering and its application. We embrace a model driven vision with a virtuous construction cycle (modelling, verifying properties, exploitation) to produce qualitatively quality software. The verification of properties, e.g. operational safety or security, is fundamental to establish solid software service contract. Software being a living object in a changing context, its construction must continuously follow and adapt. We propose both technology adaptation and requirements evolution. In practice, we are apply to business information systems, a complex ecosystem that weaves a plethora of interacting software, and to industrial production systems, which adds on the cyber physical system complexity. In such cases, a systemic vision is required to manage complexity. The document ends with a reflection on the skills needed to conduct research.
Defense date : 01-07-2024
Jury president : Catherine Da Cunha
Jury :
- Agnès Front [Rapporteure]
- Antoine Beugnard [Rapporteur]
- Pascal Poizat [Rapporteur]
- Stéphane Ducasse
- Mireille Blay-Fornarino
- Christian Attiogbé
- Benoit Delahaye
Carito Guziolowski, Modeling Biological Networks as Logic Programs ►
Keywords : Logic programmingRegulatory networksBiological systems modeling
Abstract
In this manuscript it is proposed to explore two representations of a biological system using computational modeling. These representations both gave birth to several methodological publications, and in some cases research projects in close collaboration with biologists. One is done through the sign-consistency modeling. In this approach a regulatory network (signed directed graph) is combined with a dataset of gene expression observations, using a logic program. This logic program, written in Answer Set Programming, expresses a rule that has to be valid for each species in the network, which relates the sign of a network species with its direct predecessors influences and signs. This rule is tested in a global way, through all the network species by using an efficient solver, clasp. The sign-consistency modeling framework we proposed is named Iggy. Iggy performs as well automatic and optimal correction of sign inconsistencies. The sign-consistency modeling framework has been applied to different biological case studies. For example, the signaling pathway of Hepatocyte Growth Factor, where some of the computational predictions of our model were validated experimentally. A case-study well described in this manuscript is the modeling of Multiple Myeloma patients gene expression data. Our main results on this system was to propose Multiple Myeloma markers, that is, species in the network, coupled with our computational predictions, that allow to identify patients having a better survival. Iggy has inspired MajS, our last sign-consistency modeling framework contribution. In this on-going research project we plan to integrate gene regulatory and metabolic network modeling. A second modeling approach is learning Boolean network families. In this framework, given a regulatory network (also called Prior Knowledge Network, PKN) and a set of network species observations upon multiple perturbations over the system, our framework learns a family of Boolean Networks (BNs), compatible with the PKN topology, that fits the perturbation data with minimal error. The first system we conceived is named caspo. It is also implemented using Answer Set Programming. An extension of caspo was implemented, so that new experimental designs (i.e. new experimental perturbations) can be proposed to decrease the number of learned BNs. Later, we proposed a system named caspo-ts, which deals with perturbation time-series data, and the output of this system is a family of dynamic BNs. caspo-ts has been applied to the data of HPN-DREAM challenge, concerning Breast cancer cell lines. Our objective was to identify the different BNs underlying the four Breast Cancer cell lines considered. In parallel, since multiple perturbation data, essential for \emph{caspo} or \emph{caspo-ts}, is sometimes hard to obtain in Human systems because of ethical reasons; we have begun a research subject towards the extraction of multiple pseudo-perturbations from non perturbed datasets, such as proteomics or RNA-Seq datasets. This method has been applied to discriminate Acute Myeloid Leukemia patients having different treatment prognosis. Currently, we are exploring to extract multiple pseudo-perturbations from single cell data, in the study of Human embryo development.
Defense date : 25-01-2024
Jury president : Anne Siegel
Jury :
- Elisabeth Remy [Rapporteure]
- Pedro Monteiro [Rapporteur]
- Mohamed Elati [Rapporteur]
- Jérémie Bourdon
- Damien Eveillard
- Marie-France Sagot