BioInformatics (3): Computational Issues

Data Warehousing: organising biological information into a structured entity (the world's largest distributed DB)

Function Analysis (Numerical Analysis)
    Gene Expression Analysis: applying sophisticated data mining and visualisation to understand gene activities within an environment (Clustering)

Integrated Genomic Study: relating structural analysis to functional analysis

Structure Analysis (Symbolic Analysis)
    Sequence Alignment: analysing a sequence with comparative methods against existing databases to develop hypotheses concerning relatives (ics) and functions (Dynamic Programming and HMM)
    Structure Prediction: predicting the 3D structure of a protein from its sequence (Inductive LP)

Data Warehousing: Mapping Biologic into Data Logic

Structure Analysis: Alignments & Scores
    Global (e.g. haplotype)
        ACCACACA
        ::xx::x:
        ATA
        Score = 5(+1) + 3(-1) = 2
    Suffix (shotgun assembly)
        ACCACACA
        :::
        ATA
        Score = 3(+1) = 3
    Local (motif)
        ACCACACA
        ::::
        ATA
        Score = 4(+1) = 4

Comparison of the homology search and the motif search for functional interpretation of sequence information.
    [Figure labels: Homology Search; Motif Search; New sequence; Retrieval; Similar sequence; Expert knowledge; Sequence interpretation; Sequence database (Primary data); Knowledge acquisition; Motif library (Empirical rules); Expert knowledge; New sequence; Inference; Sequence interpretation]

Search and learning problems in sequence analysis (Whole genome)

Gene Expression Analysis: Quantitative Analysis of Gene Activities (Transcription Profiles)

Gene Expression
    Biotinylated RNA from experiment
    GeneChip expression analysis probe array
    Image of hybridized probe array
    Each probe cell contains millions of copies of a specific oligonucleotide probe
    Streptavidin-phycoerythrin conjugate

(Sub)cellular inhomogeneity (see figure)
    Cell-cycle differences in expression
    XIST RNA localized on inactive X-chromosome

Cluster Analysis
    Protein/complex
    Genes
    DNA regulatory elements
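
The three scoring modes listed under "Alignments & Scores" (global, suffix, local) can be illustrated with a minimal Python sketch. It assumes the slide's scoring of +1 per match and -1 per mismatch and, as a simplification, allows no gaps; the dynamic-programming methods named earlier would be needed to handle gaps. The function names are hypothetical, and because the second sequence is truncated in the source, the test strings below (such as ACACCATA) are stand-ins chosen for illustration, not the slide's original data.

```python
MATCH, MISMATCH = 1, -1  # scoring taken from the slide: +1 match, -1 mismatch


def pair_score(x, y):
    """Score one aligned pair of characters."""
    return MATCH if x == y else MISMATCH


def global_score(a, b):
    """Ungapped global score: every position of a is paired with b."""
    if len(a) != len(b):
        raise ValueError("ungapped global alignment needs equal lengths")
    return sum(pair_score(x, y) for x, y in zip(a, b))


def suffix_score(a, b):
    """Best ungapped overlap of a suffix of a with a prefix of b."""
    longest = min(len(a), len(b))
    return max(
        (global_score(a[-k:], b[:k]) for k in range(1, longest + 1)),
        default=0,
    )


def local_score(a, b):
    """Best-scoring ungapped segment pair over all relative shifts."""
    best = 0
    for shift in range(-(len(b) - 1), len(a)):
        run = 0
        # a[i] is compared with b[i - shift] along one diagonal
        for i in range(max(0, shift), min(len(a), len(b) + shift)):
            # restart the segment whenever the running score drops below 0
            run = max(0, run + pair_score(a[i], b[i - shift]))
            best = max(best, run)
    return best


if __name__ == "__main__":
    # ACACCATA is a hypothetical stand-in that yields the pattern ::xx::x:
    print(global_score("ACCACACA", "ACACCATA"))
    print(suffix_score("GGGACA", "ACATTT"))
    print(local_score("ACCA", "GGACCAGG"))
```

The three functions differ only in which part of each sequence must take part in the alignment: all of both (global), the end of one and the start of the other (suffix, as in shotgun assembly), or any segment of each (local, as in motif search).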