A Perl program to simulate RNA synthesis

Reposted by: 行者123. Last updated: 2023-12-05 00:45:42

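I am looking for advice on how to approach my Perl programming homework, which is to write an RNA synthesis program. I have summarized and outlined the program below. Specifically, I am looking for feedback on the steps below (I have numbered them for ease of reference). I have read chapter 6 of Andrew Johnson's Elements of Programming with Perl (great book). I have also read the perlfunc and perlop pod pages, and nothing there jumped out as a place to start.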

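Program description: the program should read an input file from the command line, translate it into RNA, and then transcribe the RNA into a sequence of uppercase one-letter amino acid names.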

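1. Accept a file named on the command line

here I will use the <> operator

2. Check to make sure the file only contains acgt or die

if ( <> ne [acgt] ) { die "usage: file must only contain nucleotides \n"; }

3. Transcribe the DNA to RNA (every A replaced by U, T replaced by A, C replaced by G, G replaced by C)

not sure how to do this

4. Take this transcription and break it into 3-character "codons" starting at the first occurrence of "AUG"

not sure but I'm thinking this is where I will start a %hash variables?

5. Take the 3-character "codons" and give them a single-letter symbol (an uppercase one-letter amino acid name)

Assign a key a value using (there are 70 possibilities here so I'm not sure where to store or how to access)

6. If a gap is encountered, a new line is started and the process is repeated

not sure but we can assume that gaps are multiples of threes.

7. Am I approaching this the right way? Is there a Perl function that I'm overlooking that can simplify the main program?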

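Notes

It must be a self-contained program (the codon names and symbols are stored in the program itself).

Whenever the program reads a codon that has no symbol, this is a gap in the RNA: it should start a new line of output and resume at the next occurrence of "AUG". For simplicity, we can assume that gaps are always multiples of three.

Before I spend any additional hours on research, I would like confirmation that I am taking the right approach. Thank you for taking the time to read this and for sharing your expertise!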

Best answer

1. here I will use the <> operator

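OK, so your plan is to read the file line by line. Don't forget to chomp each line, or you will end up with newline characters in your sequence.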

2. Check to make sure the file only contains acgt or die

if ( <> ne [acgt] ) { die "usage: file must only contain nucleotides \n"; }

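In a while loop, the <> operator puts the line it reads into the special variable $_, unless you assign it explicitly (my $line = <>).

In the code above, you are reading one line from the file and discarding it. You need to save that line.

Also, the ne operator compares two strings, not a string and a regular expression. You need the !~ operator here (or =~ with a negated character class, [^acgt]). If you need the test to be case-insensitive, look into the i flag for regular expression matching.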
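A minimal sketch of the two tests described above. The helper names `is_dna` and `has_invalid_base` are hypothetical, and treating an empty line as invalid is an assumption rather than something the assignment specifies.

```perl
use strict;
use warnings;

# Hypothetical helper: true if $line consists only of a, c, g or t.
# The /i flag makes the match case-insensitive; an empty line is rejected.
sub is_dna {
    my ($line) = @_;
    return $line =~ /\A[acgt]+\z/i ? 1 : 0;
}

# Hypothetical helper: the same idea with a negated character class.
# True as soon as any character other than a, c, g or t appears.
sub has_invalid_base {
    my ($line) = @_;
    return $line =~ /[^acgt]/i ? 1 : 0;
}

# Possible use inside the reading loop (assumes input comes from <>):
#   while (my $line = <>) {
#       chomp $line;
#       die "usage: file must only contain nucleotides\n"
#           if has_invalid_base($line);
#   }
```

Note that the line is saved in `$line` and chomped before it is tested, so a trailing newline is not mistaken for an invalid base.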

3. Transcribe the DNA to RNA (Every A replaced by U, T replaced by A, C replaced by G, G replaced by C).

As GWW says, check your biology. T->U is the only step in transcription. You will find the tr (transliteration) operator helpful here.
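As a sketch, the T->U step with tr might look like this. The sub name `dna_to_rna` is hypothetical, and uppercasing the input first is an assumption made so that lowercase files produce uppercase codons.

```perl
use strict;
use warnings;

# Hypothetical helper: returns an uppercase RNA copy of a DNA string.
# Only T is changed (to U); A, C and G pass through untouched.
sub dna_to_rna {
    my ($dna) = @_;
    (my $rna = uc $dna) =~ tr/T/U/;   # tr works in place on the copy
    return $rna;
}
```

For example, `dna_to_rna("atgcat")` gives `AUGCAU`.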

4. Take this transcription & break it into 3 character 'codons' starting at the first occurance of "AUG"

not sure but I'm thinking this is where I will start a %hash variables?

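I would use a buffer here. Define a scalar outside the while(<>) loop. Use index to search for "AUG". If you don't find it, keep the last two bases in that scalar (you can use substr $line, -2, 2). On the next iteration of the loop, append the new line to those two bases (using .=) and test for "AUG" again. If you get a hit, you know where it is, so you can mark the spot and start translating.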
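The buffering idea can be sketched as follows. To keep the example self-contained, the hypothetical sub `from_first_start` takes the (already chomped and transcribed) lines as a list instead of reading them with <>; the variable names are assumptions as well.

```perl
use strict;
use warnings;

# Hypothetical helper: scans RNA lines in order and returns everything from
# the first "AUG" onward, joined across lines, or undef if there is no "AUG".
sub from_first_start {
    my (@lines) = @_;
    my $carry = '';   # the buffer, defined outside the loop
    my $found;        # sequence from the start codon onward, once located

    for my $line (@lines) {
        if (defined $found) {          # already past the start codon
            $found .= $line;
            next;
        }
        my $chunk = $carry . $line;    # leftover bases plus the new line
        my $pos   = index $chunk, 'AUG';
        if ($pos >= 0) {
            $found = substr $chunk, $pos;
        }
        else {
            # Keep at most the last two bases: an "AUG" that straddles the
            # line break can only start in those two positions.
            $carry = length($chunk) > 2 ? substr($chunk, -2) : $chunk;
        }
    }
    return $found;
}
```

With the lines "CCA", "UGG" and "CU", the start codon straddles the first line break, and the sub returns "AUGGCU".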

5. Take the 3 character "codons" and give them a single letter Symbol (an uppercase one-letter amino acid name)

Assign a key a value using (there are 70 possibilities here so I'm not sure where to store or how to access)

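Again, as GWW says, build a hash table:

%codons = ( AUG => 'M', ... );

You could then use (for example) split to build an array from the line you are currently examining, assemble codons three elements at a time, and look up the correct amino acid code in the hash table.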
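A minimal sketch of the lookup. The table below is deliberately partial (six entries chosen for illustration; a real program needs all 64 codons), the sub name `translate_frame` is hypothetical, and a global regex match is swapped in for split to cut the string into triplets. Printing '?' for an unknown codon is an assumption used only to make misses visible.

```perl
use strict;
use warnings;
use 5.010;   # for the // (defined-or) operator

# Partial codon table, for illustration only.
my %codons = (
    AUG => 'M', UUU => 'F', GGC => 'G',
    GCU => 'A', UGG => 'W', AAA => 'K',
);

# Hypothetical helper: cuts $rna into 3-character codons from position 0 and
# maps each one through %codons. Trailing 1-2 bases are ignored.
sub translate_frame {
    my ($rna) = @_;
    my @triplets = $rna =~ /(.{3})/g;
    return join '', map { $codons{$_} // '?' } @triplets;
}
```

For example, `translate_frame("AUGUUUGGC")` gives `MFG`.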

6. If a gap is encountered a new line is started and process is repeated

not sure but we can assume that gaps are multiples of threes.

See above. You can test for a gap with exists $codons{$current_codon}.
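One way the gap rule could be sketched with exists. The sub name `translate_with_gaps` and the four-entry table are hypothetical; with such a partial table every codon missing from it counts as a gap, which is only reasonable once the table is complete. The sketch also assumes the codons arrive already cut into triplets and that gaps are multiples of three, so the reading frame is preserved.

```perl
use strict;
use warnings;

# Partial codon table, for illustration only.
my %codons = ( AUG => 'M', UUU => 'F', GGC => 'G', AAA => 'K' );

# Hypothetical helper: returns one string of amino acids per output line.
# A codon with no entry in %codons ends the current line; translation
# resumes at the next "AUG".
sub translate_with_gaps {
    my (@triplets) = @_;
    my @lines;
    my $current = '';
    my $in_gene = 0;

    for my $t (@triplets) {
        if ( !exists $codons{$t} ) {                  # gap
            push @lines, $current if length $current;
            $current = '';
            $in_gene = 0;
            next;
        }
        $in_gene = 1 if $t eq 'AUG';                  # (re)start at AUG
        $current .= $codons{$t} if $in_gene;
    }
    push @lines, $current if length $current;
    return @lines;
}
```

Called with `qw(AUG UUU --- --- GGC AUG AAA)`, it returns the two lines `MF` and `MK`: the GGC after the gap is skipped because no new AUG has been seen yet.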

7. Am I approaching this the right way? Is there a Perl function that I'm overlooking that can simplify the main program?

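You know, looking at the above, it seems way too complex. I came up with a few building blocks, the subroutines read_codon and translate. I think they help the logic of the program immensely.

I know this is a homework assignment, but I figure it might help you to see other possible approaches: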

use warnings; use strict;
use feature 'state';


# read_codon relies on the state feature introduced in Perl 5.10.
# Both @buffer and $handle are 'state' of this function: together they
# let the caller read codons without caring that the file is processed
# line by line.
# Both are set up the first time read_codon is called.
# Since $handle is a state variable and the file is opened only once,
# the current file position is never reset. Similarly, @buffer always
# holds whatever was left over from the previous call.
# While @buffer contains fewer than 3 bp, we read a new line, remove the
# "\n" character, split it and push the resulting list onto the end of
# @buffer.
# If we hit EOF on $handle, the file is exhausted (1-2 leftover bases
# cannot form a codon), so we return undef.
# Otherwise we take the first 3 bp of @buffer, join them into a string,
# transcribe it and return it.

sub read_codon {
    my ($file) = @_;

    state @buffer;
    state $handle;

    # Open the file on the first call only; reopening would rewind it.
    if (!defined $handle) {
        open $handle, '<', $file or die "Cannot open '$file': $!";
    }

    # Loop rather than test once, so blank or very short lines are handled.
    while (@buffer < 3) {
        defined(my $new_line = <$handle>) or return;
        chomp $new_line;
        push @buffer, split //, $new_line;
    }

    return transcribe( join '', splice @buffer, 0, 3 );
}

sub transcribe {
    my ($codon) = @_;
    $codon =~ tr/T/U/;
    return $codon;
}


# translate also relies on the state feature introduced in Perl 5.10.
# The $TRANSLATE state is initialized to 0 only once, the first time
# the sub is called.
# As codons are passed in, the sub updates the state according to start
# and stop codons.
# If the current state is 'translating', the sub returns the matching
# amino acid from the %codes table, if any.
# This gives the caller a logical way to decide whether to print an
# amino acid: if not, the sub returns undef.
# %codes could also be a state variable, but since it is not really 'state',
# it is initialized once, in a block visible from the sub but separate
# from the rest of the program, since it is 'private' to the sub.

{
    my %codes = (
        AUG => 'M',
        # ... (remaining codon entries are elided in the original)
    );

    sub translate {
        my ($codon) = @_;
        return unless defined $codon;    # nothing to translate
        $codon = uc $codon;              # lookups are case-insensitive

        state $TRANSLATE = 0;

        $TRANSLATE = 1 if $codon eq 'AUG';                   # start codon
        $TRANSLATE = 0 if $codon =~ m/\AU(?:AA|GA|AG)\z/;    # stop codons

        return $codes{$codon} if $TRANSLATE;
        return;                          # explicit undef when not translating
    }
}

Regarding the Perl program to simulate RNA synthesis, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/4112003/
