bash - AWK - 通过 for 循环和条件检查处理多个文件-6ren

bash - AWK - 通过 for 循环和条件检查处理多个文件

转载作者：行者123 更新时间：2023-11-29 09:36:01

25

4

文件 1:我的文件名_WEEK.csv

w27_2018,257,1,26.20,0.00,24.26
w28_2018,257,1,7.97,0.00,24.26
w29_2018,257,1,34.86,0.00,24.26
w30_2018,257,1,3.29,0.00,24.26

文件 2:myfilename_MONTH.csv

m07_2018,257,1,94.78,0.00,121.31
m08_2018,257,1,719.60,0.00,262.47
m09_2018,257,1,14925.60,0.00,13903.24
m10_2018,257,1,51099.66,0.00,81600.69

文件 3:myfilename_HALF.csv

h02_2018,257,1,155345.19,480029.21,235802.91
h01_2019,257,1,273961.84,552545.36,140706.27
h02_2018,258,1,3250552.06,1299785.91,3697749.57
h01_2019,258,1,3582585.66,2670427.72,4009391.28

日历文件:

20180805,08/05/2018,w27_2018,WK27 2018,m07_2018,AUG 2018,q03_2018,Q03 2018,h02_2018,H02 2018,a2018,FY2018,27,WEEK 27,01,SUNDAY
20180806,08/06/2018,w27_2018,WK27 2018,m07_2018,AUG 2018,q03_2018,Q03 2018,h02_2018,H02 2018,a2018,FY2018,27,WEEK 27,02,MONDAY
...
20180811,08/11/2018,w27_2018,WK27 2018,m07_2018,AUG 2018,q03_2018,Q03 2018,h02_2018,H02 2018,a2018,FY2018,27,WEEK 27,07,SATURDAY
20180812,08/12/2018,w28_2018,WK28 2018,m07_2018,AUG 2018,q03_2018,Q03 2018,h02_2018,H02 2018,a2018,FY2018,28,WEEK 28,01,SUNDAY
..
20180816,08/16/2018,w28_2018,WK28 2018,m07_2018,AUG 2018,q03_2018,Q03 2018,h02_2018,H02 2018,a2018,FY2018,28,WEEK 28,05,THURSDAY

预期输出(为便于阅读而添加的换行符):

2018,w27_2018,WK27 2018,257,1,26.20,0.00,24.26
2018,w27_2018,WK27 2018,258,1,97192.07,9028.38,52130.32
2018,w27_2018,WK27 2018,300,1,181.44,0.00,-69.72

2018,m07_2018,AUG 2018,257,1,94.78,0.00,121.31
2018,m07_2018,AUG 2018,258,1,509253.46,45141.91,399648.71
2018,m07_2018,AUG 2018,300,1,409.10,0.00,-348.60

2018,h02_2018,H02 2018,257,1,155345.19,480029.21,235802.91
2018,h02_2018,H02 2018,258,1,3250552.06,1299785.91,3697749.57
2018,h02_2018,H02 2018,300,1,1112.93,0.00,-1164.35

我想加入所有 myfilename_* 使用 calendar_file 添加标签和财政年度:

个别命令是:

awk -F, 'NR==FNR {a[$3]=substr($12,3,4) FS $3 FS $4; next} {print a[$1] FS $2 FS $3 FS $4 FS $5 FS $6}' calendar_file myfilename_WEEK.csv >> my_report.csv

awk -F, 'NR==FNR {a[$5]=substr($12,3,4) FS $5 FS $6; next} {print a[$1] FS $2 FS $3 FS $4 FS $5 FS $6}' calendar_file myfilename_MONTH.csv >> my_report.csv

awk -F, 'NR==FNR {a[$9]=substr($12,3,4) FS $9 FS $10; next} {print a[$1] FS $2 FS $3 FS $4 FS $5 FS $6}' calendar_file myfilename_HALF.csv >> my_report.csv

我正在尝试将所有这些连接到一个循环中:

我已经尝试了以下但它不起作用:

    for exp_file in `ls myfilename_*.csv`
     do
     awk -F, '\
     { \
        if(NR==FNR && FILENAME ~ /WEEK/) {a[$3]=substr($12,3,4) FS $3 FS $4; next} ;\
        if(NR==FNR && FILENAME ~ /MONTH/) {a[$5]=substr($12,3,4) FS $5 FS $6; next} ;\
        if(NR==FNR && FILENAME ~ /HALF/) {a[$9]=substr($12,3,4) FS $9 FS $10; next} ;\
       {print a[$1] FS $2 FS $3 FS $4 FS $5 FS $6} \
     }' calendar_file $exp_file >> my_report.csv
     done

我怎样才能做到这一点？提前感谢您的帮助!

最佳答案

第一种方式(GNU awk，如果你没有GNU awk请留言):

awk -F, 'NR==FNR{y=substr($12,3,4); a[$3]=y FS $3 FS $4; b[$5]=y FS $5 FS $6; c[$9]=y FS $9 FS $10; next} FNR==1{printf nl;nl="\n"} match(FILENAME, /myfilename_([A-Z]*)/, f){NF=6;switch(f[1]){case "WEEK": $1=a[$1];break; case "MONTH": $1=b[$1];break; case "HALF": $1=c[$1];}}1' OFS=, calendar_file myfilename_{WEEK,MONTH,HALF}.csv

多行以提高可读性:

awk -F, '
NR==FNR{
    y=substr($12,3,4); 
    a[$3]=y FS $3 FS $4; 
    b[$5]=y FS $5 FS $6; 
    c[$9]=y FS $9 FS $10; 
    next
} 
FNR==1{printf nl;nl=ORS} ## The newlines between sectors, if you do not need those newlines then remove this line.
match(FILENAME, /myfilename_([A-Z]*)/, f){
    NF=6;  ## To limit results for 6 columns only, can remove it here.
    switch(f[1]){
    case "WEEK": 
        $1=a[$1];
        break; 
    case "MONTH": 
        $1=b[$1];
        break; 
    case "HALF": 
        $1=c[$1];
    }
}1' OFS=, calendar_file myfilename_{WEEK,MONTH,HALF}.csv

对其的更新:

awk -F, '
NR==FNR{
    y=substr($12,3,4); 
    a[$3]=y FS $3 FS $4; 
    b[$5]=y FS $5 FS $6; 
    c[$9]=y FS $9 FS $10; 
    next
} 
FNR==1{printf nl;nl=ORS} ## The newlines between sectors, if you do not need those newlines then remove this line.
match(FILENAME, /myfilename_([A-Z]*)/, f){
    NF=6;  ## To limit results for 6 columns only, can remove in your case.
    $1 = f[1]=="WEEK" ? a[$1] : ( f[1]=="MONTH" ? b[$1] : (f[1]=="HALF" ? c[$1] : $1) )
}1' OFS=, calendar_file myfilename_{WEEK,MONTH,HALF}.csv

第二种方式，更简洁且不使用switch(也是GNU awk):

awk -F, '
NR==FNR{
    y=substr($12,3,4); 
    a[$3 "WEEK"]=y FS $3 FS $4; 
    a[$5 "MONTH"]=y FS $5 FS $6; 
    a[$9 "HALF"]=y FS $9 FS $10; 
    next
} 
FNR==1{printf nl;nl=ORS} ## The newlines between sectors, if you do not need those newlines then remove this line.
match(FILENAME, /myfilename_([A-Z]*)/, f){
    $1=a[$1 f[1]];
}1' OFS=, calendar_file myfilename_{WEEK,MONTH,HALF}.csv

第三种方式:如果您的数据都与它们的文件名相对应，就像您在示例中展示的那样，还有第三种方式可以消除 match 的需要，所以它可以在其他 awk 上工作:

awk -F, '
NR==FNR{
    y=substr($12,3,4); 
    a[$3 "w"]=y FS $3 FS $4; 
    a[$5 "m"]=y FS $5 FS $6; 
    a[$9 "h"]=y FS $9 FS $10; 
    next
} 
FNR==1{printf nl;nl=ORS} ## The newlines between sectors, if you do not need those newlines then remove this line.
$1~/^([wmh])[0-9]{2}_[0-9]{4}/{   ## Check first fields to make sure it matches, the checking is optional if your data is all like you showed.
    $1=a[$1 substr($1,1,1)]
    print
}' OFS=, calendar_file myfilename_{WEEK,MONTH,HALF}.csv

再考虑一下，根据您的数据文件名关系，实际上没有必要检查第一个字母(也没有文件名):

awk -F, '
NR==FNR{
    y=substr($12,3,4); 
    a[$3]=y FS $3 FS $4; 
    a[$5]=y FS $5 FS $6; 
    a[$9]=y FS $9 FS $10; 
    next
} 
FNR==1{printf nl;nl=ORS} ## The newlines between sectors, if you do not need those newlines then remove this line.
{   ## Add  $1~/^([wmh])[0-9]{2}_[0-9]{4}/  to the beginning of this line if  you want to check and make sure first column.
    $1=a[$1]
}1' OFS=, calendar_file myfilename_{WEEK,MONTH,HALF}.csv

关于bash - AWK - 通过 for 循环和条件检查处理多个文件，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/55546075/

25

4

0

文章推荐： bash - 将终端中同一位置的不同短语大写

文章推荐： php - WordPress 更新查询不起作用，返回 false

文章推荐： java - Groovy:与 Java 相比，多线程性能差，计算速度慢？

文章推荐： bash - 不能将 $@ 的所有参数与 sh -c 一起使用

bash - IntelliJ : System bash and IntelliJ bash are not the same
我用 IntelliJ IDEA 2021.1 CE 在流行!_OS 20.04 与 bash 5.0.17 . 问题造句:我将IntelliJ终端设置为/bin/bash通过 IntelliJ 设
bash - 是否有 bash 命令来显示 bash 快捷方式？
给定如下命令: bash --shortcuts 我想显示一个快捷方式列表，就像在这个页面上一样: http://www.skorks.com/2009/09/bash-shortcuts-for-m
bash - 如何将带空格的参数从 bash 脚本传递到 bash 脚本？
我有一个脚本可以操作数据、创建参数并将它们发送到第二个脚本。其中一个参数包含一个空格。脚本1.sh: args=() args+=("A") args+=("1 2") args+=("B") .
bash - 从 bash 脚本到无限循环中的 bash 脚本
我的脚本的“只运行一次”版本的一个非常简单的示例: ./myscript.sh var1 "var2 with spaces" var3 #!/bin/bash echo $1 #output: va
bash - bash 中数字的表示和 bash 中十六进制数的 printf
我想了解数字( double )在 bash 中是如何表示的，以及当我在 bash 中以十六进制格式打印数字时会发生什么。根据 IEEE 754 标准，double 应由 64 位表示:52 位(1
bash - bash -c ""中的源 bash 脚本
我试图在 bash -c "..." 命令中获取 bash 脚本，但它不起作用。如果我在 bash -c "..." 之外运行命令，它会起作用。我需要使用 bash -c "..." 因为我想确保
bash - 检测 bash 中是否存在 Bash 补全
如何检测我的 bash shell 中是否加载了 bash 补全包？从 bash-completion 的 2.1 版(包含在 Debian 8 中)开始，除了 BASH_COMPLETION_COM
bash - 如何在 bash 脚本中使用 bash 配置文件中定义的函数？
我的 bash_profile 中有一个投影函数。现在我试图从 bash 脚本中调用这个函数，但是我得到了一个未找到的错误。如何使投影函数对 bash 脚本可见？最佳答案必须导出函数 export
bash - 通过 bash 脚本将参数传递给/bin/bash
我正在编写一个 bash 脚本，它接受许多命令行参数(可能包括空格)并通过登录 shell 将它们全部传递给程序 (/bin/some_program)。从 bash 脚本调用的登录 shell 将取
bash - 在新的 bash 中更改 bash 提示符
当我创建一个新的 bash 进程时，提示符默认为一个非常简单的提示符。我知道我可以编辑 .bashrc 等来更改它，但是有没有办法使用 bash 命令传递提示？谢谢! 最佳答案提示由 PS1、PS
bash - Bash shell 和 Bash 终端之间的区别？
好的，我希望这个问题有一定道理，但是 bash shell 和 bash 终端之间有什么区别？例子。当我第一次打开终端时，会提示我当前的目录和用户名。在终端窗口标题中显示 -bash- ，当我键入 e
bash - SBCL:从 bash 运行并退出回到 bash
我是 SBCL 的新手，我正在尝试从 bash 终端运行存储在文本文件中的 Lisp 脚本。这是我在文件开头写的内容 http://www.sbcl.org/manual/#Running-from
bash - Bash 中的十六进制到十进制
我知道我们可以在 bash 中使用将十六进制转换为十进制 #!/bin/bash echo "Type a hex number" read hexNum echo $(( 16#$hexNum ))
bash - bash 脚本中的自动完成
我正在尝试在 bash 脚本中自动完成文件夹名称。如果我输入完整的文件夹名称，一切正常，但我不知道如何自动完成名称。有什么想法吗？ repo() { cd ~/Desktop/_REPOS/$1 }
bash - 如何将多个命令通过管道传递给 bash？
我想检查远程网站上的一些文件。这里是bash命令生成计算文件md5的命令 [root]# head -n 3 zrcpathAll | awk '{print $3}' | xargs -I {}
bash - 获取给定日期后的下一个星期日 (bash)
是否有任何内置函数可以使用 bash shell 脚本从给定日期获取下周日(下周一、下周二等)？例如，2014 年 9 月 1 日之后的第一个星期日是什么时候？我预计 2014 年 9 月 7 日。
bash - 在循环中重命名匹配模式的文件 - Bash
我一直在尝试根据表格重命名一些特定文件，但没有成功。它要么重命名所有文件，要么给出错误。该目录包含数百个以长条形码命名的文件，我只想重命名包含模式 _1_ 的文件。例子 barcode_1_bar
bash - bash 中有没有办法用变量的内容替换文本文件中的占位符？
bash 中有没有办法用变量的内容替换文本文件中的占位符？例如，我想发送一封电子邮件通知，如下所示: Dear Foo, Alert: blah blah blah blah blah blah
bash - bash 脚本执行中出现的坏字符
我有一个 bash 脚本，它在某些字符串上附加了一个重音字符，导致它失败，我找不到这些字符在哪里或如何进入那里。这是一些示例输出: mv: cannot move â/tmp/myapp.zipâ
bash - bash 可以向终端输入写入命令吗？
这个问题在这里已经有了答案: How do I place stdout on edit line? (1 个回答) Can a bash script prepopulate the prompt

首页

博学

6Ren·AI

商城

bash - AWK - 通过 for 循环和条件检查处理多个文件