Nispero: a cloud-computing based Scala tool specially suited for bioinformatics data processing

Evdokim Kovach, Alexey Alekhin, Marina Manrique, Pablo Pareja-Tobes, Eduardo Pareja, Raquel Tobes and Eduardo Pareja-Tobes*

Oh no sequences! Research Group. Era7 bioinformatics
*eparejatobes@

Abstract. It is now widely accepted that bioinformatics data analysis is a real bottleneck in many research activities related to the life sciences. High-throughput technologies like Next Generation Sequencing (NGS) have completely reshaped the biology and bioinformatics landscape. NGS has undoubtedly allowed important progress in many life-sciences related fields, but it has also presented interesting challenges in terms of computation capabilities and algorithms. Many kinds of tasks related to NGS data analysis, as well as other bioinformatics data analysis, can be computed in a parallel, independent way; taking maximum advantage of this can obviously help in relieving the analysis bottleneck. Given the way NGS data is generated, scalability also plays an important role in its analysis. NGS data is not generated in a continuous fashion but in batches, so the computation needs can be dramatically different at different points. Cloud computing provides a perfect framework for systems with these