Benchmarking RNA secondary structure comparison algorithms

authors

  • Allali Julien
  • d'Aubenton-Carafa Yves
  • Chauve Cedric
  • Denise Alain
  • Drevet Christine
  • Ferraro Pascal
  • Gautheret Daniel
  • Herrbach Claire
  • Leclerc Fabrice
  • de Monte Antoine
  • Ouangraoua Aïda
  • Sagot Marie-France
  • Saule C.
  • Termier Michel
  • Thermes Claude
  • Touzet Helene

document type

COMM

abstract

In the last ten years, several tools have been proposed for RNA secondary structure pairwise comparison. These tools use different models (ordered tree or forest, arc annotated sequence, multi-level tree) and methods (edit distance, alignment). We present a first benchmark for comparing these tools. For various RNA families, we built two sets of secondary structures. The first, called the reference set, is composed of a small number of RNAs with their known structures. The second is composed of sequences folded using Mfold and RNAshapes. Some of these sequences correspond to structural RNAs of the same families (true events), other correspond to noise. We studied the ability of each tool to find the true events using the reference set. In particular we analysed the results in terms of sensibility/specificity, distribution and spread of the scores, and computation time.

more information