1 / 47
文档名称:

A Study of Smoothing Methods for Language Models Applied to ....ppt

格式:ppt   页数:47页
下载后只包含 1 个 PPT 格式的文档,没有任何的图纸或源代码,查看文件列表

如果您已付费下载过本站文档,您可以点这里二次下载

A Study of Smoothing Methods for Language Models Applied to ....ppt

上传人:中国课件站 2011/12/4 文件大小:0 KB

下载得到文件列表

A Study of Smoothing Methods for Language Models Applied to ....ppt

文档介绍

文档介绍:Pattern Discovery in Biological Sequences: A Review
ChengXiang Zhai
Language Technologies Institiute
School puter Science
Carnegie Mellon University
Presentation at the Biological Language Modeling Seminar, June17, 2002
Outline
Biology
Computer
Science
Basic Concepts (“Common Language”)
Pattern
Discovery
Motivation
Formalization
Algorithm
Application
Basic Concepts
Alphabet & Language
Alphabet = set of symbols, ., ={A, T, G, C} is the nucleotide alphabet
String/Sequence (over an alphabet) = finite seq. of symbols, ., w=AGCTGC ( How many different nucleotide strings of length 3 are there?)
Language (over an alphabet) = set of strings, ., L={AAA, AAT, ATA, AGC, …, AGG} all nucleotide triplets starting with A.
Example:“Essential AA Language”
The language (set) of “essential” amino acids
on the alphabet {A, U, C, G}
L={CAC, CAU, …, UAC, UAU}
The ic Code
Questions to Ask about a Language (L)
Syntax & Semantics
How do we describe L and interpret L?
Recognition
Is sequence s in L or not?
Learning
Given example sequences in L and not in L, how do we learn L? What if given sequences that either match or do not match a sub-sequence in L ?
Syntax & Semantics of Language
Syntax: description of the form of sequences
“Surface” description: enumeration
“Deep” description: a concise decision rule or a characterizing pattern, .,
L contains all the triplets ending with “A”, or
L contains all sequences that match “AGGGGGA”
Semantics: meaning of sequences
Functional description of a amino acid sequence
Gene regulation of a nucleotide sequence
Recognizing Sequences in L
Recognizer (for L): given a sequence s, it tells us if s is in L or not. An operational way of describing L!
L (G-receptors)
* ( all protein sequences)
Is the sequence “SNASCTTNAP…TGAK” a G-receptor?
Algorithm
(G-rec. Recgonizer)
0 (no)
1 (yes)
More than “recognizing”...
Can the recognizer explain why a sequence is a “G-receptor”? Is the explanation biologically meaningf

最近更新

2024年塔吊项目资金需求报告代可行性研究报告.. 72页

2024年数控冲床项目资金筹措计划书代可行性研.. 66页

2024年兽用药品项目资金筹措计划书代可行性研.. 62页

2024年石油钻井泥浆固控设备项目资金筹措计划.. 66页

244.甲型肝炎临床路径 8页

45道几何题(初一)及答案 15页

40T梁预制专项施工方案 25页

30进制计数器 12页

4-5岁幼儿年龄特点 6页

6个转型升级成功案例 18页

3状语从句讲解及专项练习(附答案) 41页

3.5-探索与表达规律-练习题 11页

幼儿园大班礼仪学会分享教案(7篇) 17页

下半年四川泸州市纳溪区事业单位考试招聘工作.. 15页

教案《称象》第二课时教学设计 18页

板栗的作文(10篇) 11页

学前班学期工作计划秋季5篇 14页

感恩老师的句子或一段话33条 14页

中班数学教案《十五只老鼠送礼物》及教学反思.. 15页

小学英语教师下学期工作计划【四篇】 13页

大班语言教案及教学反思《猪先生去野餐》 17页

有关小学数学教师个人工作计划系列 93页

农村小学教导主任述职报告 86页

2022-2023学年全国初中八年级上物理人教版同步.. 13页

2024年日历(A4打印版)中英文Word 6页

脚手架和操作平台减员控员专项施工方案 13页

ICP备案授权书范例 1页

生物酶辅助提取菊粉的方法 10页

(精校版)2023年浙江英语高考试题文档版(含.. 11页

煤矿井下防爆电气设备检查标准 5页