1

Single Document Keyphrase Extraction Using Sentence Clustering and Latent Dirichlet Allocation

This paper describes the design of a system for extracting keyphrases from a single document The principle of the algorithm is to cluster sentences of the documents in order to highlight parts of text that are semantically related. The clusters of …

Mining Association Rule Bases from Integrated Genomic Data and Annotations

During the last decade, several clustering and association rule mining techniques have been applied to identify groups of co-regulated genes in gene expression data. Nowadays, integrating biolog- ical knowledge and gene expression data into a single …

GenMiner: Mining Informative Association Rules from Genomic Data

GENMINER is a smart adaptation of closed itemsets based association rules extraction to genomic data. It takes advantage of the novel NORDI discretization method and of the CLOSE [27] algorithm to efficiently generate min- imal non-redundant …

Interpreting Microarray Experiments via Co-Expressed Gene Groups Analysis

Microarray technology produces vast amounts of data by measuring simultaneously the expression levels of thousands of genes under hundreds of biological conditions. Nowadays, one of the principal challenges in bioinfor- matics is the interpretation …

Analyse des groupes de gènes co-exprimés (AGGC): un outil automatique pour l'interprétation des expériences de biopuces

La technologie des biopuces permet de mesurer les niveaux d’expression de milliers de gènes dans différentes conditions biologiques générant ainsi des masses de données à analyser. De nos jours, l’interprétation de ces volumineux jeux de données à la …

Exploratory Analysis of Cancer SAGE Data

Using several analyse techniques for the hierarchical clustering of a SAGE expression dataset of 822 tags from 74 tissue samples (normal and cancer) we show that cleaning the dataset (tags and experiments) is critical and that attribution of a tag to …

Analysis of Microarray Data with THEA

Microarray technology makes it possible to measure thousands of variables and to compare their values under hundreds of conditions. Once microarray data are quantified, normalized and classified, the analysis phase is essentially a manual and …

Distributed BLAST with ProActive

Protein and DNA sequence comparison is one of the most important tool of molecular biologists, but sequence databases are growing at an exponential rate, and sequence comparison is becoming increasingly computationally intensive. We propose to …

Aspect and XML-oriented Semantic Framework Generator : Smarttools

SmartTools is a semantic framework generator, based on XML and object technologies. Thanks to a process of automatic generation from specifications, SmartTools makes it possible to quickly develop environments dedicated to domain-specific and …

DAM-BIO: Bioinformatics Internet Workbench for Protein Analysis. New Modules and Applications to Biological Problems

Computational analysis of protein sequences and structures is critical for the exploitation of the massive information contained in sequenced genomes. Direct access to bioinformatics tools through the Internet promotes their efficient use by …