Clustering problems in gene data analysis
The information obtained by monitoring gene expression levels in different developmental stages, tissue types, clinical conditions and different organism can help in understanding gene function and gene networks, assist in the diagnostic of disease conditions and reveal the effects of medical treatments.A key step in the analysis of gene expression data is the identification of groups of genes that manifest similar expression patterns. This translates to the algorithmic problem of clustering gene expression data.The challenges relate to the large quantity of data to be analyzed, which makes the clustering algorithms extremely computational expensive. Also, there is no assumption on the number of clusters or their structure, which complicates furthermore the problem.The goal of this project is to modify and adapt a category of graph theoretical methods (NCUT) to be used in clustering gene data. Ways to improve the speed of the algorithms are also to be investigated.
Etudiant: Hugo Torres
Année: 2004
Département: TIC
Filière: Informatique et systèmes de communication (anciennement Informatique)
Partenaire externe: EIVD et University of British Columbia, Canada
Enseignant responsable: Laura Elena Raileanu
Téléchargement:
- Télécharger l'affiche