Python - 使用 readlines() 处理第 n 行跃点-6ren

Python - 使用 readlines() 处理第 n 行跃点

转载作者：行者123 更新时间：2023-11-30 23:31:02

25

4

我正在尝试修复一个我想在 Github 上使用的损坏的库。

我已在本地“修复”了该问题。但我不认为这是一个非常干净的方法......

我正在通过互联网存档查看 WARC 库，特别是 arc.py 部分 ( https://github.com/internetarchive/warc/blob/master/warc/arc.py )。

自从编写 lib 以来，制作 ARC 文件的工具已经发生了一些变化，因此，内置解析器失败，因为它不希望看到文件中的某些元数据。

我的本地修复如下所示:

    if header.startswith("<arcmetadata"):
        while not header.endswith("</arcmetadata>\n"):
            header = self.fileobj.readline()
        header = self.fileobj.readline()
        header = self.fileobj.readline()

而且我不确定我两次调用 readlines() 来删除接下来的两个空行(包含 "/n" 是最干净的方法推进文件对象。

这是好Python吗？或者有更好的方法吗？

最佳答案

该代码看起来像是复制/粘贴错误。使用 .readline() 没有任何问题，只需记录您正在做的事情即可:

# skip metadata
if header.startswith("<arcmetadata"):
    while not header.endswith("</arcmetadata>\n"):
        header = self.fileobj.readline()
    #NOTE: header ends with `"</arc..."` here i.e., it is not blank

# skip blank lines
while not header.strip():
    header = self.fileobj.readline()

顺便说一句，如果文件包含 xml，则使用 xml 解析器来解析它。不要用手做。

关于Python - 使用 readlines() 处理第 n 行跃点，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/20204757/

25

4

0

文章推荐： python - 如何从简单的 Python 服务器打印到 html 页面？

文章推荐： c# - 抓取和放下物体

文章推荐： c# - 在 Kinect One (v2) 中设置红外灯的亮度

需要 Readline - 您不能创建这种类型的实例 (Readline)
这个问题在这里已经有了答案: What could be the reason that `require` doesn't work in some places? (3 个回答) 6 个月前关闭。
java - .readLine()/readLine 的替代方案仅返回列表
我正在使用读取行从维基百科获取一些文本。但读取行仅返回列表，而不是我想要的文本。有什么方法可以使用替代方案或解决我的问题吗？ public class mediawiki { public s
Python readline 和 readlines 行为
我正在编写一小段代码，其中涉及使用子进程运行一个脚本来监听一些实时数据这是我的代码: def subscriber(): try: sub = subprocess.Pope
c - 'readline/readline.h' 文件未找到
我已包括: #include "stdio.h" #include #include 我的编译器包含标志 -lreadline 但我仍然收到错误消息: fatal error: 'readl
perl - 使用 Term::Readline-readline 停止无限循环的正确方法是什么？
使用 Term::Readline::readline 停止无限循环的正确方法是什么？ ? 这样我一个都看不懂 0 #!/usr/bin/env perl use warnings; use stri
readline - 使用 GNU Readline；如何在同一程序中添加 ncurses？
标题比我的实际目标更具体: 我有一个使用 GNU Readline 的命令行程序，主要用于命令历史记录(即使用向上箭头检索以前的命令)和其他一些细节。现在，程序的输出似乎散布在用户的输入中，有时是可以
readline - ipython:按 'esc' 键会中断 readline
在 ipython 中，如果我按“esc”，然后按“enter”(可能还有其他字符？)，读行会中断。我无法再使用“向上”键搜索命令历史记录，并且某些命令(例如 control-K)失败。有没有办法在
python - 使用python打开文件对象: readlines() and readline() does not return any value
我在使用 readlines() 和 readline() 返回值时遇到问题，但在使用 read() 时却没有。任何人都知道这是怎么发生的？欣赏一下 with open('seatninger.txt
readline - 使用 GNU Readline；如何在同一程序中添加 ncurses？
标题比我的实际目标更具体: 我有一个使用 GNU Readline 的命令行程序，主要用于命令历史记录(即使用向上箭头检索以前的命令)和其他一些细节。现在，程序的输出似乎散布在用户的输入中，有时是可以
c - 停止 readline、printf，然后恢复 readline
我正在编写一个聊天客户端，它必须在接收用户输入的同时输出接收到的消息。到目前为止，我已经 fork 成两个独立的进程，其中一个继续监听套接字连接并用 printf 写出接收到的字符串。另一个使用 r
C# - 为什么 StreamReader ReadLine 在调用 ReadLine 之前读取数据？
我在 NetworkStream 上使用 StreamReader，我只想读取一行或多行，而另一个数据是 byte array(如文件数据)我不想在 StreamReader 中读取该文件数据，例如我
c# - Console.ReadLine 和 Console.In.ReadLine 之间的区别
我遇到了这两个 API，用于在 C# 的简单控制台应用程序中读取用户的输入: System.Console.ReadLine() System.Console.In.ReadLine() 这是一个我试
bash - yum 显示已安装 readline 但 readline 命令不起作用
yum 我的系统显示已安装 readline rlwrap-0.41]$ sudo yum install readline Loaded plugins: fastestmirror, presto
readline - 将 readline 接口(interface)到 Rust
我尝试做 this tutorial在 Rust 中，到目前为止，我在将 C 库连接到 Rust 时遇到了很多问题。 C 等效代码: #include #include #include #in
python - Python 中 read()、readline() 和 readlines() 的区别
我正在寻找 web Python的标题中提到的命令及其区别；但是，我并不满足于对这些命令有完整的基本理解。假设我的文件只有以下内容。 This is the first time I am posi
f# - 为什么 Console.Readline 不起作用但 Console.Readline() 起作用？
你如何在 F# 中使用 Console.Readline？与 Console.Writeline 不同，当我调用它时，它并没有受到尊重。最佳答案如果你使用 let s = Console.Read
python - 为什么 readline() 比 Python 中的 readlines() 慢得多？
在一次面试中，面试官问我为什么 readline() 比 Python 中的 readlines() 慢很多？我回答的是readlines()需要多次读取，需要更多的开销。不知道我的回答对不对。
readline - 在 OSX Lion 上使用 readline pip 安装 ipython
要在 OSX Lion 上完全运行 ipython 需要什么？我试图让 ipython 与 readline 一起工作，但没有成功。我的做法: (在虚拟环境中) pip install ipytho
javascript - 为什么我不能在 Nodejs v10 中读取 "import * as readline from ' readline'"？
在 Nodejs 文档中，我看到: import EventEmitter from 'events'; import { readFile } from 'fs'; import fs, { rea
c - 为什么 readline 库中的 readline() 不接受 UNICODE？ ANSI C语言
我写了一个简单的应用程序: #include #include #include #include int main() { char *user_input; while(u

首页

博学

6Ren·AI

商城

Python - 使用 readlines() 处理第 n 行跃点