DNA Translation

Description

Deoxyribonucleic acid (DNA) is composed of a sequence of nucleotide bases paired together to form a double-stranded helix structure. Through a series of complex biochemical processes the nucleotide sequences in an organism's DNA are translated into the proteins it requires for life. The object of this problem is to write a computer program which accepts a DNA strand and reports the protein generated, if any, from the DNA strand.

The nucleotide bases from which DNA is built are adenine, cytosine, guanine, and thymine (hereafter referred to as A, C, G, and T, respectively). These bases bond together in a chain to form half of a DNA strand. The other half of the DNA strand is a similar chain, but each nucleotide is replaced by its complementary base. The bases A and T are complementary, as are the bases C and G. These two "half-strands" of DNA are then bonded by the pairing of the complementary bases to form a strand of DNA.

Typically a DNA strand is listed by simply writing down the bases which form the primary strand (the complementary strand can always be created by writing the complements of the bases in the primary strand). For example, the sequence TACTCGTAATTCACT represents a DNA strand whose complement would be ATGAGCATTAAGTGA. Note that A is always paired with T, and C is always paired with G.

From a primary strand of DNA, a strand of ribonucleic acid (RNA) known as messenger RNA (mRNA for short) is produced in a process known as transcription. The transcribed mRNA is identical to the complementary DNA strand with the exception that thymine is replaced by a nucleotide known as uracil (hereafter referred to as U). For example, the mRNA strand for the DNA in the previous paragraph would be AUGAGCAUUAAGUGA.

It is the sequence of bases in the mRNA which determines the protein that will be synthesized. The bases in the mRNA can be viewed as a collection of codons, each codon having exactly three bases. The codon AUG marks the start of a protein sequence, and any of the codons UAA, UAG, or UGA marks the end of the sequence. The one or more codons between the start and termination codons represent the sequence of amino acids to be synthesized to form a protein. For example, the mRNA codon AGC corresponds to the amino acid serine (Ser), AUU corresponds to isoleucine (Ile), and AAG corresponds to lysine (Lys). So, the protein formed from the example mRNA in the previous paragraph is, in its abbreviated form, Ser-Ile-Lys.

The complete genetic code from which codons are translated into amino acids is shown in the table below (note that only the amino acid abbreviations are shown). It should also be noted that the sequence AUG, which has already been identified as the start sequence, can also correspond to the amino acid methionine (Met). So, the first AUG in a mRNA strand is the start sequence, but subsequent AUG codons are translated normally into the Met amino acid.
First base in codon Second base in codon Third base in codon
U C A G
U Phe Ser Tyr Cys U
Phe Ser Tyr Cys C
Leu Ser --- --- A
Leu Ser --- Trp G
C Leu Pro His Arg U
Leu Pro His Arg C
Leu Pro Gln Arg A
Leu Pro Gln Arg G
A Ile Thr Asn Ser U
Ile Thr Asn Ser C
Ile Thr Lys Arg A
Met Thr Lys Arg G
G Val Ala Asp Gly U
Val Ala Asp Gly C
Val Ala Glu Gly A
Val Ala Glu Gly G
Input

The input for this program consists of strands of DNA sequences, one strand per line, from which the protein it generates, if any, should be determined and output. The given DNA strand may be either the primary or the complementary DNA strand, and it may appear in either forward or reverse order, and the start and termination sequences do not necessarily appear at the ends of the strand. For example, a given input DNA strand to form the protein Ser-Ile-Lys could be any of ATACTCGTAATTCACTCC, CCTCACTTAATGCTCATA, TATGAGCATTAAGTGAGG, or GGAGTGAATTACGAGTAT. The input will be terminated by a line containing a single asterisk character.
Output

You may assume the input to contain only valid, upper-case, DNA nucleotide base letters (A, C, G, and T). No input line will exceed 255 characters in length. There will be no blank lines or spaces in the input. Some sequences, though valid DNA strands, do not produce valid protein sequences; the string "*** No translatable DNA found ***" should be output when an input DNA strand does not translate into a valid protein.
Sample Input

ATACTCGTAATTCACTCC
CACCTGTACACAGAGGTAACTTAG
TTAATACGACATAATTAT
GCCTTGATATGGAGAACTCATTAGATA
AAGTGTATGTTGAATTATATAAAACGGGCATGA
ATGATGATGGCTTGA
*
Sample Output

Ser-Ile-Lys
Cys-Leu-His
Ser-Tyr
*** No translatable DNA found ***
Leu-Asn-Tyr-Ile-Lys-Arg-Ala
Met-Met-Ala

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
threenewbee 2017-11-12 14:19
关注
https://en.wikipedia.org/wiki/DNA

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

DNA Translation
2017-10-27 07:52

回答 1 已采纳 https://en.wikipedia.org/wiki/DNA
DNA repair
2017-11-26 04:40

回答 2 已采纳 https://www.2cto.com/kf/201502/375077.html
DNA Sequence
2016-12-31 08:30

回答 1 已采纳 http://blog.csdn.net/xing634325131/article/details/8806730?locationNum=1&fps=1
『杭电1600』DNA Translation
2020-09-07 07:24

漠宸离若的博客 Deoxyribonucleic acid (DNA) is composed of a sequence of nucleotide bases paired together to form a double-stranded helix structure. Through a series of complex biochemical processes the nucleotide ...
Copying DNA
2017-10-02 00:38

回答 1 已采纳 https://wenku.baidu.com/view/34c3cd620b1c59eef8c7b4ea.html
python2.7 实现 DNA反向互补 python
2018-05-12 11:48

回答 3 已采纳 ``` def DNA_complement(sequence): sequence = sequence.upper() sequence = sequence.re
DNA Sorting
2017-07-25 10:01

回答 2 已采纳 http://www.2cto.com/kf/201409/333419.html
Python-DNA-Tool:Python 中用于 DNA 翻译、RNA 转录、GC 含量计算、组成百分比和 ATCG 碱基计数计算的脚本
2021-05-29 00:58

DNA_translation - 返回互补序列 RNA_transcription - 返回 RNA 序列 nucleotate_count-返回特定的基本计数 total_nucleotide_count-返回包含ATCG基本计数值的字典 nucleotide_composition-返回特定碱基百分比组成 ...
python2.7 实现 DNA反向互补（新） python
2018-05-12 15:22

回答 5 已采纳 https://ask.csdn.net/questions/688856 程序已经给你程序非网上粘贴，而且调试通过有问题请追问。
求DNA对比的程序设计 c++ c语言有问必答
2021-07-08 13:33

回答 2 已采纳那DNA的数据是什么样子的呢？
wasm-dna-transcription-translation:锈菌中DNA的转录和翻译
2021-04-01 11:55

wasm-dna-transcription-translation 锈菌中DNA的转录和翻译
DNA_Translation-using-python:在这个资料库中，我研究如何将长的DNA序列翻译成蛋白质序列
2021-04-25 04:06

DNA_Translation-using-python 在此存储库中，我研究如何将长的DNA序列翻译成蛋白质序列。使用的Daatabase 国家生物技术信息中心使用的数据 DNA序列 ...
seqviz:DNA序列查看器，支持自定义，GenBank，FASTA，NCBI登录和iGEM输入
2021-04-30 16:05

DNA序列查看器，支持自定义，GenBank，FASTA，NCBI登录和iGEM输入特征SeqViz目标是成为具有简单API和易定制性的DNA序列查看器。目前提供：多种输入格式顺序加入（NCBI或iGEM）文件（FASTA，GenBank，SBOL，...
分子生物学第四章 DNA的生物合成
2023-04-13 19:40

丸丸丸子w的博客文章目录第四章 DNA的生物合成第一节 DNA复制的一般特征 1 DNA的半保留复制 2 DNA的双向复制 3 DNA的半不连续复制第二节 DNA复制的酶学 1 DNA聚合酶 1.1 原核生物DNA pol 1.1.1 DNA pol I 1.2 DNA pol II 1.3 DNA...
dnastar拼接反向互补序列_DNAstar 教程
2020-12-22 11:33

weixin_39768444的博客生物信息基因序列分析软件DNAStar简介郑伟文，林营志，刘波，曹宜，苏明星，朱育菁，蓝江林，车建美，郑斯平，陈坚(福建省农科院生物技术中心)1．设计公司SequenceAnalysisSoftwareforMacintoshandWindows，GETTING...
关于DNA 碱基序列检验的JAVA代码
2016-03-11 21:20

earthquake_aaa的博客 This assignment focuses on arrays and file/text processing. Turn in a file named DNA.java. You will also need the two input files dna.txt and ecoli.txt from the course web site. Save these files in
DNA-蛋白翻译过程的Python实现
2021-04-08 15:40

EmmettPeng的博客最近为了给平台上加上一个将DNA序列翻译为蛋白序列的工具，写了一个任何生信玩家初学时都会写的代码。看了一些别人的翻译工具，我也想尽量把代码写的完整一点，在这个过程中首次接触并使用了BioPython，目前看起来...
Nature news: 未来40年，DNA测序将走向何方？
2021-07-26 18:33

wangchuang2017的博客 Nature news: 未来40年，DNA测序将走向何方？ 2017-10-14 00:00 40年前,Sanger测序技术诞生,让DNA片段的测序成为现实.自此,DNA测序技术以惊人的速度发展,越过一座又一座的里程碑.那么,未来40年,DNA测序又将变成...
BEC listen and translation exercise 48
2018-02-06 10:19

weixin_33709609的博客 It's not publicly known who the kidnappers were. Because they are not eating such lovely food since they left home. Drafts leaked to the Greek media suggest the proposals broadly fall into 3 catego...
没有解决我的问题, 去提问

悬赏问题

¥15 如何实验stm32主通道和互补通道独立输出
¥30 这是哪个作者做的宝宝起名网站
¥60 版本过低apk如何修改可以兼容新的安卓系统
¥25 由IPR导致的DRIVER_POWER_STATE_FAILURE蓝屏
¥50 有数据，怎么建立模型求影响全要素生产率的因素
¥50 有数据，怎么用matlab求全要素生产率
¥15 TI的insta-spin例程
¥15 完成下列问题完成下列问题
¥15 C#算法问题, 不知道怎么处理这个数据的转换
¥15 YoloV5 第三方库的版本对照问题

DNA Translation

1条回答 默认 最新

悬赏问题

1条回答默认最新