Role of Galaxy in a bioinformatic plant breeding platform

TitleRole of Galaxy in a bioinformatic plant breeding platform
Publication TypeConference Paper
Year of Publication2012
AuthorsMaillol V, Bacilieri R, Bocs S, Boursiquot J-M, Carrier G, Dereeper A, Droc G, Fleury C, Larmande P, Lecunff L, Péros J-P, Pitollat B, Ruiz M, Sarah G, Sempéré G, Summo M, This P, Dufayard JF
Conference NameGalaxy Community Conference
Date Published07/2012
Conference LocationChicago
Full Text


With NGS development, bioinformatics has become central in plant breeding laboratories, and researchers are in need of some autonomy in its use. The Southgreen platform (CIRAD, IRD, INRA) performs bioinformatics analyses for many plant breeding research teams in Montpellier (France), and offers many systems to users: for example GNPAnnot (automatic genomic sequence annotation), Greenphyl (phylogenetic orthology prediction), ESTtik (EST annotation) or the Bacchus analysis pipeline. Most of these systems have been translated into Galaxy workflows.

As for the Bacchus pipeline, it has been created at INRA Montpellier (France) to investigate clonal diversity in grapevine genomes. For this task, many softwares have been wrapped in the Galaxy framework. Bacchus can be decomposed in three steps: i) Genome reconstruction, ii) test of reconstruction results, and iii) diversity analysis. This last step is done using SNP's and structural variations. To detect SNP's, the latest Freebayes version was used, while the IDfixe software was developped for structural variation detection. Some of the softwares developped for this pipeline are now used in the international project Grapereseq.

Today, the Galaxy framework is widely used by Southgreen plateform users as an alternative to the command line system. In this context, dozens of users have already been trained in Galaxy-using bioinformatics. During weekly collective pair-programming sessions, platform engineers and interested scientists integrate new tools and functionalities.Thus, Galaxy is now a core component of the plant breeding community around the Southgreen platform, and the main access portal for non- bioinformatics specialists to our computing clusters.