文档介绍:Biological Information Integration Toolkit
Jeremy Praissman, Dawei Lin, John Rose, Bi-Cheng Wang
Motivation
Calculate simple properties of genomes and group genes according to these properties
Tie together and integrate data and analyses (Blast results, annotation etc.)
plish the above in a way that provides additional support for implementing new bioinformatics algorithms
Genomes & Genes
Loads data and initializes other data structures
Exports most of the functionality of the Strand data structure (described below)
Contains “Gene” objects which are wrappers for BioPython SeqFeatures
Calculate upstream/downstream intergenic distance (IGD)
Easily generate subsequences relative to gene location
Genome Data Structure
Gene Data Structure
Strands
A list of Genes - represents a strand of biological information
Implicit intergenic regions
Functionality
Base object puting statistics
Number of sequence features (total, overlapping etc..)
Number of bases (in overlapping areas of features, in features etc..)
Generate new Strands based on IGD
Filter and map operations for obtaining “Sets”
Sets
Unordered collection of objects
Supports:
Membership testing
Union
Intersection
Other set operations
Example: set_1 = all Genes in a Genome with start codon GTG
set_2 = all Genes in a Genome with upstream igd < 20
set_3 = (set_2) is the set cont