gpt4 book ai didi

bash - 存在字符串时如何在两个定界符之间提取文本

转载 作者:行者123 更新时间:2023-11-29 09:37:32 25 4
gpt4 key购买 nike

我有一个大数据文件,如下所示:

//
ID 1.1.1.258
DE 6-hydroxyhexanoate dehydrogenase.
CA 6-hydroxyhexanoate + NAD(+) = 6-oxohexanoate + NADH.
CC -!- Involved in the cyclohexanol degradation pathway in Acinetobacter
CC NCIB 9871.
//
ID 1.1.1.259
DE 3-hydroxypimeloyl-CoA dehydrogenase.
CA 3-hydroxypimeloyl-CoA + NAD(+) = 3-oxopimeloyl-CoA + NADH.
CC -!- Involved in the anaerobic pathway of benzoate degradation in
CC bacteria.
//
ID 1.1.1.260
DE Sulcatone reductase.
CA Sulcatol + NAD(+) = sulcatone + NADH.
CC -!- Studies on the effects of growth-stage and nutrient supply on the
CC stereochemistry of sulcatone reduction in Clostridia pasteurianum,
CC C.tyrobutyricum and Lactobacillus brevis suggest that there may be at
CC least two sulcatone reductases with different stereospecificities.
//

我想提取此文件中包含作品 anaerobic 的部分。我特别想要 ID 行。

有没有办法在 ID 和//之间搜索文件以找到 anaerobic 并将输出打印到新文件?如果打印了整个部分,那很好,我可以在之后将其 grep 出来。

预期的结果应该是

ID   1.1.1.259

ID   1.1.1.259
DE 3-hydroxypimeloyl-CoA dehydrogenase.
CA 3-hydroxypimeloyl-CoA + NAD(+) = 3-oxopimeloyl-CoA + NADH.
CC -!- Involved in the anaerobic pathway of benzoate degradation in
CC bacteria.
//

最佳答案

用awk很简单

awk '/anaerobic/' RS='//\n' ORS='\n//' ./file.txt

关于bash - 存在字符串时如何在两个定界符之间提取文本,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/48206207/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com