Legrand, Damien (2024) Generative models for adaptative immunity PRE - Projet de recherche, ENSTA.

Fichier(s) associé(s) à ce document :

[img]PDF
Restricted to Accès restreint

621Kb

Résumé

This raport investigates the generation of protein sequences using Long Short-Term Memory (LSTM) networks and Variational Autoencoders (VAE). The study compares these models based on their ability to produce biologically meaningful and diverse protein sequences. The goal is to find relevant metrics to assess these generative models. To do so, I am going to evaluate a range of metrics over real data before using them to assess LSTM and VAE generations. Unfortunately, it was not possible to generate sequences with these models, thus I evaluate the metrics over mutated sequences to determine at what point the sequences lose the grammar of immune sequences due to mutations, which act as a proxy.

Type de document:Rapport ou mémoire (PRE - Projet de recherche)
Mots-clés libres:Protein sequences, Generative models, Synthetic data, Metrics
Sujets:Sciences et technologies de l'information et de la communication
Code ID :10119
Déposé par :Damien LEGRAND
Déposé le :28 août 2024 18:45
Dernière modification:28 août 2024 18:45

Modifier les métadonnées de ce document.