puppeteer - Puppeter 的 page.pdf API 中的页眉和页脚打印是如何工作的？-6ren

puppeteer - Puppeter 的 page.pdf API 中的页眉和页脚打印是如何工作的？

转载作者：行者123 更新时间：2023-12-04 16:28:28

39

4

我在尝试使用 headerTemplate 时发现了一些不一致的地方。和 footerTemplate page.pdf 的选项:

页眉和页脚的 DPI 似乎较低(我认为主体为 72 对 96)。所以如果我想匹配边距，我必须按比例缩放。

样式不与主体共享，因此我必须将它们包含在模板中。

如果我尝试使用本地存储的字体，即使我在页眉/页脚模板中包含相同的 CSS，它也适用于主体，但不适用于页眉/页脚。

我怀疑这是因为页眉和页脚被视为单独的文档并分别转换为图像/pdf( https://cs.chromium.org/chromium/src/components/printing/resources/print_header_footer_template_page.html 也意味着类似的东西)。熟悉实现的人可以解释它的实际工作原理吗？谢谢!

最佳答案

简短的回答:

Puppeteer 通过 DevTools Protocol 控制 Chrome 或 Chromium .

Chrome 使用 Skia用于 PDF 生成。

Skia 分别处理页眉、对象集和页脚。

详细答案:

来自 Puppeteer Documentation :

page.pdf(options)

options <Object> Options object which might have the following properties:

headerTemplate <string> HTML template for the print header. Should be valid HTML markup with following classes used to inject printing values into them:

date formatted print date

title document title

url document location

pageNumber current page number

totalPages total pages in the document

footerTemplate <string> HTML template for the print footer. Should use the same format as the headerTemplate.

returns: <Promise<Buffer>> Promise which resolves with PDF buffer.

NOTE Generating a pdf is currently only supported in Chrome headless.

NOTE headerTemplate and footerTemplate markup have the following limitations:

Script tags inside templates are not evaluated.

Page styles are not visible inside templates.

我们可以借鉴 Puppeteer source code for page.pdf() 那:

Chrome DevTools 协议(protocol)方法Page.printToPDF (连同 headerTemplate 和 footerTemplate 参数)被发送到 page._client .

page._client是 page.target().createCDPSession() 的一个实例(Chrome DevTools 协议(protocol) session )。

来自 Chrome DevTools Protocol Viewer ，我们可以看到 Page.printToPDF包含参数 headerTemplate和 footerTemplate :

Page.printToPDF

Print page as PDF.

PARAMETERS

headerTemplate string (optional)

HTML template for the print header. Should be valid HTML markup with following classes used to inject printing values into them:

date: formatted print date

title: document title

url: document location

pageNumber: current page number

totalPages: total pages in the document

For example, <span class=title></span> would generate span containing the title.

footerTemplate string (optional)

HTML template for the print footer. Should use the same format as the headerTemplate.

RETURN OBJECT

data string

Base64-encoded pdf data.

Chromium source code for Page.printToPDF 向我们展示:

Page.printToPDF参数传递给 sendDevToolsMessage 函数，它发出 DevTools 协议(protocol)命令并返回结果的 promise 。

经过进一步挖掘，我们可以看到 Chromium 有一个具体的 implementation of a class called SkDocument 创建 PDF 文件。

SkDocument 来自 Skia Graphics Library , 其中 Chromium uses for PDF generation .

Skia PDF Theory of Operation , 在 PDF Objects and Document Structure部分，指出:

Background: The PDF file format has a header, a set of objects and then a footer that contains a table of contents for all of the objects in the document (the cross-reference table). The table of contents lists the specific byte position for each object. The objects may have references to other objects and the ASCII size of those references is dependent on the object number assigned to the referenced object; therefore we can’t calculate the table of contents until the size of objects is known, which requires assignment of object numbers. The document uses SkWStream::bytesWritten() to query the offsets of each object and build the cross-reference table.

该文件进一步解释:

The PDF backend requires all indirect objects used in a PDF to be added to the SkPDFObjNumMap of the SkPDFDocument. The catalog is responsible for assigning object numbers and generating the table of contents required at the end of PDF files. In some sense, generating a PDF is a three step process. In the first step all the objects and references among them are created (mostly done by SkPDFDevice). In the second step, SkPDFObjNumMap assigns and remembers object numbers. Finally, in the third step, the header is printed, each object is printed, and then the table of contents and trailer are printed. SkPDFDocument takes care of collecting all the objects from the various SkPDFDevice instances, adding them to an SkPDFObjNumMap, iterating through the objects once to set their file positions, and iterating again to generate the final PDF.

关于puppeteer - Puppeter 的 page.pdf API 中的页眉和页脚打印是如何工作的？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/51458286/

39

4

0

文章推荐： puppeteer - 使用 Puppeteer 时如何获取 ElementHandle 的类名？

文章推荐： ionic-framework - Ionic 4 滚动时隐藏工具栏

文章推荐： terminal - 在 VS 代码中运行终端命令的快捷方式

api - Azure API 管理 - API 端点域与实际 API URL
我已经设置了 Azure API 管理服务，并在自定义域上配置了它。在 Azure 门户中 API 管理服务的配置部分下，我设置了以下内容: 因为这是一个客户端系统，我必须屏蔽细节，但以下是基础知识:
api - 使用 API key 获取 API(Twitter API)
我是一名习惯 React Native 的新程序员。我最近开始学习 Fetch API 及其工作原理。我的问题是，我找不到人们使用 API key 在他们的获取语句中访问信息的示例(我很难清楚地表达有
api - 插件 API 与类库 API
这里有很多关于 API 是什么的东西，但是我找不到我需要的关于插件 API 和类库 API 之间的区别。反正我不明白。在 Documenting APIs 一书中，我读到:插件 API 和类库 AP
api - 谷歌博客搜索 API 的替代 API
关闭。这个问题不满足Stack Overflow guidelines .它目前不接受答案。想改善这个问题吗？更新问题，使其成为 on-topic对于堆栈溢出。 7年前关闭。 Improve thi
api - 在现有 API 中使用多个第三方 API 的最佳实践
我正在尝试找出设计以下场景的最佳方法。假设我已经有了一个 REST API 实现，它将从不同的供应商那里获取书籍并将它们返回给我自己的客户端。每个供应商都提供单独的 API 来向其消费者提供图书。
api - REST API 和 API key
请有人向我解释如何使用 api key 以及它有什么用处。我对此进行了很多搜索，但得到了不同且相互矛盾的答案。有人说 API key 是保密的，它从不作为通信的一部分发送，而其他人则将它发送给客户端
api - Flickr api 与 Picasa api
关闭。这个问题是opinion-based .它目前不接受答案。想改进这个问题？更新问题，以便 editing this post 可以用事实和引用来回答它. 4年前关闭。 Improve this
api - WSO2 API Manager API 认证失败
谁能告诉我为什么 WSo2 API 管理器不进行身份验证？我已经设置了两个 WSo2 API Manager 1.8.0 实例并创建了一个 api。它作为原型(prototype) api 工作正常。
api - Fluent API 与其他 API 有何不同？
我在学习 DSL 的过程中遇到了 Fluent API。我在流利的 API 上搜索了很多……我可以得出的基本结论是，流利的 API 使用方法链来使代码流利。但我无法理解——在面向对象的语言中，我们
api - WSO2 API 管理器是否支持 API 联合？
基本上，我感兴趣的是在多个区域设置 WSO2 API 管理器；例如亚洲、美国和欧洲。一些 API 将部署在每个区域的数据中心内，而其他 API 将仅部署在特定区域内。理想情况下，我想要的是一个单一的
api - 使用 API key 保护我的 API
我正在构建自己的 API，供以下用户使用: 1) 安卓应用 2) 桌面应用我的网址之一是:http://api.chatapp.info/order_api/files/getbeers.php我的
api - 如何通过 API Key 授权谷歌分析 API
我需要向所有用户显示我的站点的分析，但使用 OAuth 它显示为登录用户配置的站点的分析。如何使用嵌入 API 实现仪表板但仅显示我的网站分析？我能想到的最好的可能性是使用 API key 而不是客
api - 提供 API 的公司是否在其 API 之前使用填充程序或代理？
我正在研究大公司如何管理其公共(public) API。我想到的是拥有成熟 API 的公司，例如 Google、Facebook、Twitter 和 Amazon。这些公司向公众公开了许多不同的 A
api - 显式 API 方法与广义的基于参数的 API 方法
在定义客户可访问的 API 时，以下是首选的行业惯例: a) 定义一组显式 API 方法，每个方法都有非常狭窄和特定的目的，例如: SetUserName SetUserAge Se
api - GAE API 资源管理器不显示 API，似乎卡在加载中
这在本地 deserver 和部署时都会发生。我成功地能够通过留言簿教程使用 API 资源管理器，但现在我已经创建了自己的项目并尝试访问我编写的第一个 API，它从未出现过。搜索栏旁边的黄色“正在加载
api - 尝试查询 API，但 api 响应为空
我正在尝试使用 http://ip-api.com/ api通过我的ip地址获取经度和纬度。当我访问 http://ip-api.com/json从我的浏览器或使用 curl，它以 json 格式返回
api - 流式 API 与 Rest API？
这里的典型示例是 Twitter 的 API。我从概念上理解 REST API 的工作原理，本质上它只是针对您的特定请求向他们的服务器查询，然后您会在其中收到响应(JSON、XML 等)，很棒。但是
api - 如何让其他 API 与您的 API 对话，而您的 API 又与 Twitter 对话？
我能想到的最好的标题，但要澄清的是，情况是这样的: 我正在开发一种类似短 url 的服务，该服务允许用户使用他们的 Twitter 帐户“登录”并发布内容。现在这项服务可以包含在 Tweetdeck
api - 平面与嵌套 API
我正在设计用于管理评论和讨论线程的 API 方案。我想有一个点 /discussions/:discussionId 当您GET 时，它会返回一组评论和一些元数据。评论也许可以单独访问 /discus
api - 后端和 API 是一样的吗？什么是后端 Web API？
关闭。这个问题需要更多focused .它目前不接受答案。想改进这个问题吗？更新问题，使其只关注一个问题 editing this post . 关闭去年。 Improve this quest

首页

博学

6Ren·AI

商城

puppeteer - Puppeter 的 page.pdf API 中的页眉和页脚打印是如何工作的？

page.pdf(options)

Page.printToPDF