Legrand, Damien (2024) Generative models for adaptative immunity PRE - Projet de recherche, ENSTA.
Fichier(s) associé(s) à ce document :
PDF Restricted to Accès restreint 621Kb |
Résumé
This raport investigates the generation of protein sequences using Long Short-Term Memory (LSTM) networks and Variational Autoencoders (VAE). The study compares these models based on their ability to produce biologically meaningful and diverse protein sequences. The goal is to find relevant metrics to assess these generative models. To do so, I am going to evaluate a range of metrics over real data before using them to assess LSTM and VAE generations. Unfortunately, it was not possible to generate sequences with these models, thus I evaluate the metrics over mutated sequences to determine at what point the sequences lose the grammar of immune sequences due to mutations, which act as a proxy.
Type de document: | Rapport ou mémoire (PRE - Projet de recherche) |
---|---|
Mots-clés libres: | Protein sequences, Generative models, Synthetic data, Metrics |
Sujets: | Sciences et technologies de l'information et de la communication |
Code ID : | 10119 |
Déposé par : | Damien LEGRAND |
Déposé le : | 28 août 2024 18:45 |
Dernière modification: | 28 août 2024 18:45 |