DNA Translation DNA序列的问题

Description

Deoxyribonucleic acid (DNA) is composed of a sequence of nucleotide bases paired together to form a double-stranded helix structure. Through a series of complex biochemical processes the nucleotide sequences in an organism's DNA are translated into the proteins it requires for life. The object of this problem is to write a computer program which accepts a DNA strand and reports the protein generated, if any, from the DNA strand.

The nucleotide bases from which DNA is built are adenine, cytosine, guanine, and thymine (hereafter referred to as A, C, G, and T, respectively). These bases bond together in a chain to form half of a DNA strand. The other half of the DNA strand is a similar chain, but each nucleotide is replaced by its complementary base. The bases A and T are complementary, as are the bases C and G. These two "half-strands" of DNA are then bonded by the pairing of the complementary bases to form a strand of DNA.

Typically a DNA strand is listed by simply writing down the bases which form the primary strand (the complementary strand can always be created by writing the complements of the bases in the primary strand). For example, the sequence TACTCGTAATTCACT represents a DNA strand whose complement would be ATGAGCATTAAGTGA. Note that A is always paired with T, and C is always paired with G.

From a primary strand of DNA, a strand of ribonucleic acid (RNA) known as messenger RNA (mRNA for short) is produced in a process known as transcription. The transcribed mRNA is identical to the complementary DNA strand with the exception that thymine is replaced by a nucleotide known as uracil (hereafter referred to as U). For example, the mRNA strand for the DNA in the previous paragraph would be AUGAGCAUUAAGUGA.

It is the sequence of bases in the mRNA which determines the protein that will be synthesized. The bases in the mRNA can be viewed as a collection of codons, each codon having exactly three bases. The codon AUG marks the start of a protein sequence, and any of the codons UAA, UAG, or UGA marks the end of the sequence. The one or more codons between the start and termination codons represent the sequence of amino acids to be synthesized to form a protein. For example, the mRNA codon AGC corresponds to the amino acid serine (Ser), AUU corresponds to isoleucine (Ile), and AAG corresponds to lysine (Lys). So, the protein formed from the example mRNA in the previous paragraph is, in its abbreviated form, Ser-Ile-Lys.

The complete genetic code from which codons are translated into amino acids is shown in the table below (note that only the amino acid abbreviations are shown). It should also be noted that the sequence AUG, which has already been identified as the start sequence, can also correspond to the amino acid methionine (Met). So, the first AUG in a mRNA strand is the start sequence, but subsequent AUG codons are translated normally into the Met amino acid.
First base in codon Second base in codon Third base in codon
U C A G
U Phe Ser Tyr Cys U
Phe Ser Tyr Cys C
Leu Ser --- --- A
Leu Ser --- Trp G
C Leu Pro His Arg U
Leu Pro His Arg C
Leu Pro Gln Arg A
Leu Pro Gln Arg G
A Ile Thr Asn Ser U
Ile Thr Asn Ser C
Ile Thr Lys Arg A
Met Thr Lys Arg G
G Val Ala Asp Gly U
Val Ala Asp Gly C
Val Ala Glu Gly A
Val Ala Glu Gly G
Input

The input for this program consists of strands of DNA sequences, one strand per line, from which the protein it generates, if any, should be determined and output. The given DNA strand may be either the primary or the complementary DNA strand, and it may appear in either forward or reverse order, and the start and termination sequences do not necessarily appear at the ends of the strand. For example, a given input DNA strand to form the protein Ser-Ile-Lys could be any of ATACTCGTAATTCACTCC, CCTCACTTAATGCTCATA, TATGAGCATTAAGTGAGG, or GGAGTGAATTACGAGTAT. The input will be terminated by a line containing a single asterisk character.
Output

You may assume the input to contain only valid, upper-case, DNA nucleotide base letters (A, C, G, and T). No input line will exceed 255 characters in length. There will be no blank lines or spaces in the input. Some sequences, though valid DNA strands, do not produce valid protein sequences; the string "*** No translatable DNA found ***" should be output when an input DNA strand does not translate into a valid protein.
Sample Input

ATACTCGTAATTCACTCC
CACCTGTACACAGAGGTAACTTAG
TTAATACGACATAATTAT
GCCTTGATATGGAGAACTCATTAGATA
AAGTGTATGTTGAATTATATAAAACGGGCATGA
ATGATGATGGCTTGA
*
Sample Output

Ser-Ile-Lys
Cys-Leu-His
Ser-Tyr
*** No translatable DNA found ***
Leu-Asn-Tyr-Ile-Lys-Arg-Ala
Met-Met-Ala

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

报告相同问题？

关注问题

DNA Translation
2017-04-07 04:27

回答 1 已采纳 http://www.acmerblog.com/hdu-1600-DNA-Translation-2161.html
python2.7 实现 DNA反向互补 python
2018-05-12 11:48

回答 3 已采纳 ``` def DNA_complement(sequence): sequence = sequence.upper() sequence = sequence.re
python2.7 实现 DNA反向互补（新） python
2018-05-12 15:22

回答 5 已采纳 https://ask.csdn.net/questions/688856 程序已经给你程序非网上粘贴，而且调试通过有问题请追问。
Python-DNA-Tool:Python 中用于 DNA 翻译、RNA 转录、GC 含量计算、组成百分比和 ATCG 碱基计数计算的脚本
2021-05-29 00:58

DNA_translation - 返回互补序列 RNA_transcription - 返回 RNA 序列 nucleotate_count-返回特定的基本计数 total_nucleotide_count-返回包含ATCG基本计数值的字典 nucleotide_composition-返回特定碱基百分比组成 ...
求DNA对比的程序设计 c++ c语言有问必答
2021-07-08 13:33

回答 2 已采纳那DNA的数据是什么样子的呢？
如何用python将指定基因的DNA序列从序列文档中提取出来？ python
2020-05-27 19:30

回答 1 已采纳没太绕明白，是要从2里提取1里有的基因名对应的序列吗？如果是，逐行读2，每行用空格分隔字符，放到字典里，然后用1里的关键字就能提取2的序列了。
python 创建类似下列程序实现将输入DNA序列转化为互补链序列 python
2021-12-13 09:38

回答 1 已采纳怎么样的dna序列？怎么样的互补链序列？
seqviz:DNA序列查看器，支持自定义，GenBank，FASTA，NCBI登录和iGEM输入
2021-04-30 16:05

DNA序列查看器，支持自定义，GenBank，FASTA，NCBI登录和iGEM输入特征SeqViz目标是成为具有简单API和易定制性的DNA序列查看器。目前提供：多种输入格式顺序加入（NCBI或iGEM）文件（FASTA，GenBank，SBOL，...
神农架野人问题（输出序列） c++ 蓝桥杯
2022-03-16 16:55

回答 1 已采纳 #include<bits/stdc++.h> using namespace std; int dp[1001][1001]; int main() { int t,i,j,
从键盘输入DNA序列字符串，统计并输出A T C G出现的次数 JAVA java
2021-12-21 15:45

回答 1 已采纳 public static void main(String[] args) { Scanner sc = new Scanner(System.in); String s = sc.
DNA repair
2017-11-26 04:40

回答 2 已采纳 https://www.2cto.com/kf/201502/375077.html
dnastar拼接反向互补序列_DNAstar 教程
2020-12-22 11:33

weixin_39768444的博客生物信息基因序列分析软件DNAStar简介郑伟文，林营志，刘波，曹宜，苏明星，朱育菁，蓝江林，车建美，郑斯平，陈坚(福建省农科院生物技术中心)1．设计公司SequenceAnalysisSoftwareforMacintoshandWindows，GETTING...
DNA_Translation-using-python:在这个资料库中，我研究如何将长的DNA序列翻译成蛋白质序列
2021-04-25 04:06

在此存储库中，我研究如何将长的DNA序列翻译成蛋白质序列。使用的Daatabase 国家生物技术信息中心使用的数据 DNA序列 GGTCAGAAAAAGCCCTCTCCATGTCTACTCACGATACATCCCTGAAAACCACTGAGGAAGTGGCTTTTCA ...
分子生物学第四章 DNA的生物合成
2023-04-13 19:40

丸丸丸子w的博客文章目录第四章 DNA的生物合成第一节 DNA复制的一般特征 1 DNA的半保留复制 2 DNA的双向复制 3 DNA的半不连续复制第二节 DNA复制的酶学 1 DNA聚合酶 1.1 原核生物DNA pol 1.1.1 DNA pol I 1.2 DNA pol II 1.3 DNA...
关于DNA 碱基序列检验的JAVA代码
2016-03-11 21:20

earthquake_aaa的博客 This assignment focuses on arrays and file/text processing. Turn in a file named DNA.java. You will also need the two input files dna.txt and ecoli.txt from the course web site. Save these files in
一次探索：基于香农熵预测DNA中编码序列，python实现。
2020-05-15 22:49

隔壁王同学啊的博客在实验室里，我们得到一段DNA序列后，如果想知道编码出的蛋白质序列等，我们还得对它的mRNA进行测序，但是mRNA降解很快，半衰期极短。所以我们能否绕过测序mRNA，直接由DNA序列预测编码序列。要想做到这一点...
DNA-蛋白翻译过程的Python实现
2021-04-08 15:40

EmmettPeng的博客最近为了给平台上加上一个将DNA序列翻译为蛋白序列的工具，写了一个任何生信玩家初学时都会写的代码。看了一些别人的翻译工具，我也想尽量把代码写的完整一点，在这个过程中首次接触并使用了BioPython，目前看起来...
Python的用途是什么？ Python编程语言有10多种编码用途。
2020-08-17 03:33

cumi7754的博客 ????欢迎 (???? Welcome) Hi! Please take a moment ... 请花一点时间考虑这个问题： How is Python applied in real-world scenarios? Python如何在实际场景中应用？ If you are learning Python and you want to...
多序列比对的c语言程序,学会正确选择多序列比对（coding-sequences）软件
2021-05-22 12:51

小喵汪的博客原本以为可以快速地进行下一步的选择压力分析，没想到却在多序列比对这一环节出现了棘手的问题。以前，我都是经过PRANK软件进行多序列比对，然后再使用Gblocks软件对数据进行过滤的。现在，由于师弟师妹在拼接CDS...
没有解决我的问题, 去提问

悬赏问题

¥20 matlab yalmip kkt 双层优化问题
¥15 如何在3D高斯飞溅的渲染的场景中获得一个可控的旋转物体
¥88 实在没有想法，需要个思路
¥15 MATLAB报错输入参数太多
¥15 python中合并修改日期相同的CSV文件并按照修改日期的名字命名文件
¥15 有赏，i卡绘世画不出
¥15 如何用stata画出文献中常见的安慰剂检验图
¥15 c语言链表结构体数据插入
¥40 使用MATLAB解答线性代数问题
¥15 COCOS的问题COCOS的问题

码龄粉丝数原力等级 --

DNA Translation DNA序列的问题

0条回答默认最新

悬赏问题

DNA Translation DNA序列的问题

0条回答 默认 最新

悬赏问题

0条回答默认最新