perl - Creating a hierarchy file with Perl

Reposted by: 行者123 Updated: 2023-12-04 17:13:15

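My task is to create a parent-child hierarchy file using Perl.

Sample input file (tab-delimited). Each record is a parent followed by a child. Records appear in the file in random order, so a "parent" may appear after its "child".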

 S5 S3
 S5 S8
 ROOT   S1
 S1 S7
 S2 S5
 S3 S4
 S1 S2
 S4 77
 S2 S9
 S3 88

Sample output file (tab-delimited)

ROOT    S1  S2  S5  S3  S4  77
ROOT    S1  S2  S5  S3  88
ROOT    S1  S7
ROOT    S1  S2  S5  S8
ROOT    S1  S2  S9

Code that produces the output file above

use strict;

# usage: perl parent_child_generator.pl input.txt output.txt

my $input0=$ARGV[0] or die "must provide input.txt as the first argument\n";
my $output1=$ARGV[1] or die "must provide output.txt as the second argument\n";

open(IN0,"<",$input0) || die "Cannot open $input0 for reading: $!";
open(OUT1,">",$output1) || die "Cannot open $output1 for writing: $!";

sub trim
{
    my $string=shift;
    $string=~s/\r?\n$//;
    $string=~s/^\s+//;
    $string=~s/\s+$//;
    return $string;
}

sub connectByPrior
{
    my $in_child=$_[0];
    my %in_hash=%{$_[1]};
    my @anscestor_arr;

    for (sort keys %in_hash)
    {
        my $key=$_;
        my @key_arr=split(/\t/,$key);
        my $parent=$key_arr[0];
        my $child=$key_arr[1];

        if ($in_child eq $child)
        {
            push (@anscestor_arr,$parent);
            @anscestor_arr=(@{connectByPrior($parent,\%in_hash)},@anscestor_arr);
            last;
        }
    }
    return \@anscestor_arr;
}

my %parent_hash;
my %child_hash;
my %unsorted_hash;
while(<IN0>)
{
    my @cols=split(/\t/);
    for (my $i=0; $i < scalar(@cols); $i++)
    {
        $cols[$i]= trim($cols[$i]);
    }

    my $parent=$cols[0];
    my $child=$cols[1];
    my $parent_child="$parent\t$child";

    $parent_hash{$parent}=1;
    $child_hash{$child}=1;
    $unsorted_hash{$parent_child}=1;
}
close(IN0);

my @lev0_arr;
for (sort keys %child_hash)
{
    my $rec=$_;
    if (!exists($parent_hash{$rec}))
    {
        push (@lev0_arr,$rec);
    }
}

for (@lev0_arr)
{
    my $child=$_;
    my @anscestor_arr=@{connectByPrior($child,\%unsorted_hash)};
    push (@anscestor_arr,$child);
    print OUT1 join("\t",@anscestor_arr)."\n";
}

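Question: the code works fine as long as the input file is not too large. The actual input file contains more than 200k lines, and the code takes far too long to produce the output. What improvements or changes would you suggest so that processing does not take so long?

Best answer

It looks like you are trying to build and pretty-print a directed graph: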
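One visible cost in the code above is that connectByPrior copies the whole hash (`my %in_hash=%{$_[1]};`) and re-sorts all of its keys on every recursive call, so each single ancestor lookup touches every record. A minimal sketch of one possible alternative follows: index the records once by child, then walk upward from each leaf with one hash lookup per level. It assumes every child has exactly one parent (as in the sample data) and that fields contain no embedded whitespace. The sub names `build_parent_of`, `path_to_root`, and `leaf_paths` are hypothetical and not part of the original code.

```perl
use strict;
use warnings;

# Build a child => parent lookup from "parent child" lines.
# Assumption: each child has exactly one parent; a later duplicate overwrites.
sub build_parent_of {
    my @lines = @_;
    my %parent_of;
    for my $line (@lines) {
        my ($parent, $child) = split ' ', $line;   # also strips leading/trailing whitespace
        next unless defined $child;                # skip blank or malformed lines
        $parent_of{$child} = $parent;
    }
    return \%parent_of;
}

# Walk from a node up to its root: one hash lookup per level.
sub path_to_root {
    my ($node, $parent_of) = @_;
    my @path = ($node);
    my %seen = ($node => 1);
    while (defined(my $p = $parent_of->{$node})) {
        die "cycle detected at $p\n" if $seen{$p}++;
        unshift @path, $p;
        $node = $p;
    }
    return @path;
}

# Leaves are children that never appear as a parent.
sub leaf_paths {
    my ($parent_of) = @_;
    my %is_parent = map { $_ => 1 } values %$parent_of;
    return map  { join "\t", path_to_root($_, $parent_of) }
           grep { !$is_parent{$_} } sort keys %$parent_of;
}

# Demo with the sample records from the question.
my @sample = ("S5\tS3", "S5\tS8", "ROOT\tS1", "S1\tS7", "S2\tS5",
              "S3\tS4", "S1\tS2", "S4\t77", "S2\tS9", "S3\t88");
print "$_\n" for leaf_paths(build_parent_of(@sample));
```

For a real run, the demo lines would be replaced by reading the input file line by line into `build_parent_of`. The work is then roughly proportional to the number of records plus the total length of the printed paths, instead of a full sorted scan per ancestor.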

#!/usr/bin/perl

use strict; use warnings;
use Graph::Directed;
use Graph::TransitiveClosure::Matrix;

my $g = Graph::Directed->new;

while ( my $line = <DATA> ) {
    next unless my ($x, $y) = split ' ', $line;
    $g->add_edge($x, $y);
}

my @start = $g->source_vertices;
my @end   = $g->sink_vertices;

my $tcm = Graph::TransitiveClosure::Matrix->new( $g,
    path_vertices => 1,
);

for my $s ( @start ) {
    for my $e ( @end ) {
        next unless $tcm->is_reachable($s, $e);
        print join("\t", $tcm->path_vertices($s, $e)), "\n";
    }
}

__DATA__
S5 S3
S5 S8
ROOT   S1
S1 S7
S2 S5
S3 S4
S1 S2
S4 77
S2 S9
S3 88

Output:

ROOT S1 S2 S9
ROOT S1 S2 S5 S8
ROOT S1 S2 S5 S3 S4 77
ROOT S1 S2 S5 S3 88
ROOT S1 S7

I am not sure whether the memory overhead of using Graph and computing the transitive closure matrix would be prohibitive in your case.
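If that memory overhead is a concern, the same root-to-leaf paths can probably be enumerated without Graph, using a plain depth-first walk over a parent-to-children hash. The following is a minimal sketch under stated assumptions, not a drop-in replacement: it assumes the input is acyclic (a cycle reachable from a root would loop forever) and that fields contain no embedded whitespace. It also copes with a node that has several parents. The sub names `build_children_of` and `root_to_leaf_paths` are hypothetical. An explicit stack is used instead of recursion so that very deep hierarchies do not trigger Perl's deep-recursion warning.

```perl
use strict;
use warnings;

# Build parent => [children] and the set of all children from "parent child" lines.
sub build_children_of {
    my @lines = @_;
    my (%children_of, %is_child);
    for my $line (@lines) {
        my ($parent, $child) = split ' ', $line;
        next unless defined $child;                # skip blank or malformed lines
        push @{ $children_of{$parent} }, $child;
        $is_child{$child} = 1;
    }
    return (\%children_of, \%is_child);
}

# Enumerate every root-to-leaf path with an iterative depth-first search.
# Assumption: the graph is acyclic.
sub root_to_leaf_paths {
    my ($children_of, $is_child) = @_;
    # Roots are parents that never appear as a child.
    my @roots = grep { !$is_child->{$_} } sort keys %$children_of;
    my @stack = map { [$_] } reverse @roots;       # each entry is a partial path
    my @paths;
    while (my $path = pop @stack) {
        my $kids = $children_of->{ $path->[-1] };
        if ($kids) {
            # Extend the path by each child; reverse so the smallest is popped first.
            push @stack, map { [@$path, $_] } reverse sort @$kids;
        }
        else {
            push @paths, join("\t", @$path);       # no children: this node is a leaf
        }
    }
    return @paths;
}

# Demo with the sample records from the question.
my @sample = ("S5 S3", "S5 S8", "ROOT S1", "S1 S7", "S2 S5",
              "S3 S4", "S1 S2", "S4 77", "S2 S9", "S3 88");
print "$_\n" for root_to_leaf_paths(build_children_of(@sample));
```

Each stack entry holds its own copy of the partial path, which keeps the sketch simple at the cost of some extra memory. For very wide inputs, printing each path as soon as a leaf is reached, rather than collecting them in `@paths`, would reduce that further.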

This question and answer come from a similar thread on Stack Overflow: https://stackoverflow.com/questions/3862217/
