gpt4 book ai didi

Python将字符串拆分为下一个句号标点符号

转载 作者:行者123 更新时间:2023-12-01 00:56:17 24 4
gpt4 key购买 nike

每 8 个单词后分割字符串。如果第 8 个单词没有(. 或 !),移至下一个出现的单词。

我可以从字符串中拆分单词。

with open("file.txt") as c:
for line in c:
text = line.split()
n = 8
listword = [' '.join(text[i:i+n]) for i in range(0,len(text),n)]
for lsb in listword:
print(lsb)

预期输出应该是

I'm going to the mall for breakfast, Please meet me there for lunch. 
The duration of the next. He figured I was only joking!
I brought back the time.

这就是我得到的

I'm going to the mall for breakfast, Please
meet me there for lunch. The duration of
the next. He figured I was only joking!
I brought back the time.

最佳答案

您正在向单词序列添加换行符。换行的主要条件是最后一个单词以 .! 结尾。另外还有一个关于最小长度的次要条件(8 个字或更多)。以下代码收集缓冲区中的单词,直到满足打印行的条件。

with open("file.txt") as c:
out = []
for line in c:
for word in line.split():
out.append(word)
if word.endswith(('.', '!')) and len(out) >= 8:
print(' '.join(out))
out.clear()
# don't forget to flush the buffer
if out:
print(' '.join(out))

关于Python将字符串拆分为下一个句号标点符号,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/56226602/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com