pdf - PyPdf:将每页一分为二，用空格填充-6ren

pdf - PyPdf:将每页一分为二，用空格填充

转载作者：行者123 更新时间：2023-12-01 22:27:35

27

4

我有一个 PDF 文件(A4，纵向布局)，我想将其中的每一页分成一半高度。输出文件也应该是A4和纵向布局，但每页的下半部分需要是空白的。

我看到了https://stackoverflow.com/a/15743413/822789但不明白如何使用 mediaBox 添加空格。

最佳答案

我不太了解 PyPDF2，但我是 pdfrw 的作者如果我理解你的问题，pdfrw 当然可以很容易地做你想做的事。我需要更好地记录它，但我有一个预先存在的 unspread.py左右拆分页面的示例，将小报页面切成原始页面。这是该示例的修改版本。此版本将顶部和底部拆分页面，并更改输出页面的大小，使其与输入页面相匹配:

#!/usr/bin/env python

'''
usage:   splitv.py my.pdf

Creates splitv.my.pdf

This is similar to unspread.py, in that it creates
a new file that has twice the pages of the old file.

It is different in two ways:

1) It splits pages top and bottom rather than left and right
2) The destination pages are the same size as the source pages,
   and the output is placed at the top.
'''

import sys
import os

from pdfrw import PdfReader, PdfWriter, PageMerge


def splitpage(src):
    ''' Split a page into two (top and bottom)
    '''
    # Yield a result for each half of the page
    for y_pos in (0, 0.5):

        # Create a blank, unsized destination page.
        page = PageMerge()

        # add a portion of the source page to it as
        # a Form XObject.
        page.add(src, viewrect=(0, y_pos, 1, 0.5))

        # By default, the object we created will be
        # at coordinates (0, 0), which is the lower
        # left corner.  To move it up on the page
        # to the top, we simply use its height
        # (which is half the source page height) as
        # its y value.
        page[0].y = page[0].h

        # When we render the page, the media box will
        # encompass (0, 0) and all the objects we have
        # placed on the page, which means the output
        # page will be the same size as the input page.
        yield page.render()


inpfn, = sys.argv[1:]
outfn = 'splitv.' + os.path.basename(inpfn)
writer = PdfWriter()
for page in PdfReader(inpfn).pages:
    writer.addpages(splitpage(page))
writer.write(outfn)

关于pdf - PyPdf:将每页一分为二，用空格填充，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/31557909/

27

4

0

文章推荐： groovy - 如何防止使用默认构造函数？

文章推荐： java - 将带有整数的 ArrayList 转换为 UTF-16 字符串

文章推荐： java - 将局部变量的值传递给字段

文章推荐： sql - 如何在数字的最后两位数字之前放置小数点

python - 我无法安装 pyPDF 包根本找不到 pyPdf 的发行版
我尝试安装这个包... $ pip search pyPdf PyPDFLite - Simple PDF Writer. pypdfocr
python - PyPDF 合并和写入问题
使用它时出现意外错误。第一部分来自我在网上找到的脚本，我试图用它来提取 PDF 大纲中标识的特定部分。一切正常，除了在 output.write(outputfile1)它说: PdfReadErro
python - pyPdf 如何理解文档边界？
Here我找到了用于拆分 pdf 页面的代码。 #!/usr/bin/env python import copy, sys from pyPdf import PdfFileWriter, PdfF
python - pypdf python工具
使用 pypdf python 模块如何读取以下 pdf 文件 http://www.envis-icpe.com/pointcounterpointbook/Hindi_Book.pdf # -*-
python - pyPdf 无法写入文件？
我是Python新手。我尝试打开pdf文件并将其内容写入新的文本文件。文本文件名称由 pdf 名称生成。到目前为止我已经尝试过，但它没有达到我的预期。我怎样才能实现它 import glob,
python - pyPdf 错误参数无效
我实际上是在使用 pyPdf 打开、读取和写入 PDF 文件的内容。为此，我使用了这些代码行: from pyPdf import PdfFileWriter, PdfFileReader pdf
python - 用于间接对象提取的 pyPdf
按照这个例子，我可以将所有元素列成一个pdf文件 import pyPdf pdf = pyPdf.PdfFileReader(open("pdffile.pdf")) list(pdf.pages)
python - pypdf 将多个pdf文件合并为一个pdf
如果我有 1000 多个 pdf 文件需要合并为一个 pdf， from PyPDF2 import PdfReader, PdfWriter writer = PdfWriter() for i i
pdf - PyPdf:将每页一分为二，用空格填充
我有一个 PDF 文件(A4，纵向布局)，我想将其中的每一页分成一半高度。输出文件也应该是A4和纵向布局，但每页的下半部分需要是空白的。我看到了https://stackoverflow.com/a
pdf - PyPdf:将每页一分为二，用空格填充
我有一个 PDF 文件(A4，纵向布局)，我想将其中的每一页分成一半高度。输出文件也应该是A4和纵向布局，但每页的下半部分需要是空白的。我看到了https://stackoverflow.com/a
python - 为什么我不能使用切片表示法通过 pyPDF 迭代页面
我正在运行以下代码来创建一个新的 PDF 文件，其中包含源 PDF 的除第一页之外的所有页面: import os from pyPdf import PdfFileReader, PdfFileWr
python - pyPdf IndirectObject in/Rotate
这个问题在这里已经有了答案: How can I rotate a page with PyPDF2? (2 个答案) 关闭 12 个月前。我们有一个简单的脚本来读取传入的 PDF 文件。如果是横
python 和 pyPdf - 如何从页面中提取文本以便行与行之间有空格
目前，如果我使用 pyPdf 和 extractText() 创建一个 pdf 页面的页面对象，会发生什么行被连接在一起。例如，如果页面的第 1 行说“hello”而第 2 行说“world”，则从
python - 使用 pyPDF 从文档中检索自定义页面标签
目前我正在研究将 PDF 与 pyPdf 合并，但有时输入的顺序不正确，所以我正在研究抓取每一页的页码以确定它应该进入的顺序(例如，如果有人将一本书分成 20 份 10 页的 PDF，而我想将它们重新
python - Python 中 Pypdf 包中的断言错误
我在 Windows 平台上使用 Python 2.4 和 PyPdf 1.13。我正在尝试使用以下代码将列表中的 PDF 文件合并为一个文件: import os from pyPdf import
python - 使用 pyPdf 合并非标准 PDF
我想将几个 PDF 文件合并为一个 PDF 文档。事实证明，输入文件并不完全符合标准。 EOF 标记后跟一些附加信息: >> startxref 1994481 %%EOF %%PPIRoute: 4
python - pyPdf PdfFileReader 与 PdfFileWriter
我有以下代码: import os from pyPdf import PdfFileReader, PdfFileWriter path = "C:/Real Python/Course mater
python - pyPdf 忽略 PDF 文件中的换行符
我正在尝试将 PDF 的每一页提取为字符串: import pyPdf pages = [] pdf = pyPdf.PdfFileReader(file('g-reg-101.pdf', 'rb')
python - pyPdf 无法从我的 PDF 中的某些页面中提取文本
我正在尝试使用 pyPdf 从多页 PDF 中提取和打印页面。问题是，文本不是从某些页面中提取的。我在这里放了一个示例文件: http://www.4shared.com/document/kmJF6
使用 pyPDF 删除空白页的 Python 脚本
我正在尝试使用 pyPDF 编写几个 python 脚本，将 PDF 页面拆分为六个单独的页面，正确排序它们(通常正面和背面打印，因此每个其他页面都需要以不同方式排序)，并删除结果输出文档末尾的空白页

首页

博学

6Ren·AI

商城

pdf - PyPdf:将每页一分为二，用空格填充