gpt4 book ai didi

regex - Linux筛选器大日志文件用于报告

转载 作者:太空宇宙 更新时间:2023-11-04 06:00:54 24 4
gpt4 key购买 nike

我有超过26000个文件的大日志,每个文件的内容如下所示。我需要排除所有带有404和JSON的行。在以下情况下,我需要获取最后一行,因为那是包含404而非JSON的内容。对编写过滤器正则表达式有帮助吗?感谢Linux Gurus的帮助。

-错误pbmzjYvLFIlLeth6mN2Yox9DH4vap1hcFHuJgNosd0XHVSxGdRcrWw == pdl.astro.com.my http 151 0.004---错误
2015-07-28 11:34:55 SIN3 659 14.192.213.22 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.jpg 404

版本:1.0

字段:日期时间x-edge-location sc-bytes c-ip cs-method cs(Host)cs-uri-stem sc-status cs(Referer)cs(User-Agent)cs-uri-query cs(Cookie)x -edge-result类型的x-edge-request-id x-host-header cs-protocol ss-protocol ssl-protocol ssl-cipher x-edge-response-result-type所花的时间x

2015-07-28 11:34:57 MAD50 658 124.13.170.152 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误tdlmnsfrOCxOelbe82y3kIp_QfbBF7S3dDCn4rHR65JOMkOtZu4dzA == pdl.astro.com.my http 151 0.004---错误
2015-07-28 11:34:53 SIN3 659 14.192.214.93 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误5r0xsHnxLY5TePeJ6ZfKvuHrhQnbd2lbWtDQosEXLj4Z7TZ5N68ZhA == pdl.astro.com.my http 151 0.002---错误
2015-07-28 11:34:53 SIN3 659 14.192.213.198 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误koGGTK2mc2dDS3XvABS0zAeqheH52toNmJgIqAh5A0TYKIZL6qsgRw == pdl.astro.com.my http 151 0.001---错误
2015-07-28 11:34:54 SIN3 659 14.192.208.27 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误bvLIe540oNMCeZ0QpOmX1OKoClgNgvSWppGuOmgVS85WnAXKJ1ryDg == pdl.astro.com.my http 151 0.002---错误
2015-07-28 11:34:54 SIN3 659 210.19.26.33 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误6Wl5xeCZArNN3WGaIGOA6XjUqZHEiENbWOmChiMZPayefDuLtC8WrA == pdl.astro.com.my http 151 0.001---错误
2015-07-28 11:34:54 SIN3 659 121.121.62.92 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误WLn7heBO3PvvVW1vt365EVXqoD440Byy6Qh6RYYazSyPBZUxwsS0Jg == pdl.astro.com.my http 151 0.001---错误
2015-07-28 11:34:54 SIN3 659 14.192.213.9 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误hTbk9HE5nyFSla1DmeC1D1jhuMtoUY6E7QQvyf0v1YyJ1GBp-I40bw == pdl.astro.com.my http 151 0.001---错误
2015-07-28 11:34:55 SIN3 659 14.192.213.250 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; HD)--错误avWgysZyGeGXdVxZHLfP5uLJ4ie5Hx8pa6ZJC5GHXfvOkyEXXp8o0g == pdl.astro.com.my http 151 0.001---错误
2015-07-28 11:34:55 SIN3 659 14.192.211.78 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误wBepjCn58o9AiTifvtrCprkjdAdg--zsLTsjDpUBkxnEU5tahmJxxQ == pdl.astro.com.my http 151 0.004---错误
2015-07-28 11:34:55 SIN3 659 121.121.101.4 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误YZ07B5vu7L4I3aoTcBXF5rcH8Dwrv5a77xRqqelkQqvQhYLDnkrKWg == pdl.astro.com.my http 151 0.001---错误
2015-07-28 11:34:55 SIN3 659 14.192.208.156 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404-NDS%2520VM%2520Engine / 002%2520Apr%252004%25202014%2520(OSD:%252032% 2520; SD)--错误pbmzjYvLFIlLeth6mN2Yox9DH4vap1hcFHuJgNosd0XHVSxGdRcrWw == pdl.astro.com.my http 151 0.004---错误
2015-07-28 11:34:55 SIN3 659 14.192.213.22 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.json 404


错误pbmzjYvLFIlLeth6mN2Yox9DH4vap1hcFHuJgNosd0XHVSxGdRcrWw == pdl.astro.com.my http 151 0.004---错误
2015-07-28 11:34:55 SIN3 659 14.192.213.22 GET d2v2sjgehuhalt.cloudfront.net /thumbnail/mediaInfo_211.jpg 404

最佳答案

请阅读how to ask,您的问题是off topic,您不是provide code;它与编码无关,在serverfault上可能会更好。

如果要解析大型HTTP日志,则应使用visitors,并且如果要JSON输出,那么由于此社区是关于编码的,因此可以对其进行扩展。

否则,对于您的原始问题,这是使用awk的方法:

awk '$NF == 404 && $(NF -1) ~ /\.json$/ { next; } {print}' /path/to/yourfile.log

$NF == 404 # the last field is 404
$(NF -1) # the field before the last
~ /\.json$/ # ends with .json
{ next; } # skip this line
{ print } # print anything else

关于regex - Linux筛选器大日志文件用于报告,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/31696137/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com