python - 使用 nbconvert 执行包含内联 Markdown 的 Jupyter 笔记本-6ren

python - 使用 nbconvert 执行包含内联 Markdown 的 Jupyter 笔记本

转载作者：行者123 更新时间：2023-12-01 00:44:11

我有一个 Jupyter 笔记本，它在 Markdown 单元格中包含 python 变量，如下所示:

代码单元格:

x = 10

Markdown 单元格:

The value of x is {{x}}.

IPython-notebook-extension Python Markdown如果我在笔记本中使用 shift-enter 执行 Markdown 单元格，则允许我动态显示这些变量。

Markdown 单元格:

The value of x is 10.

我想以编程方式执行笔记本中的所有单元格，并使用以下内容将它们保存到新笔记本中:

import nbformat
from nbconvert.preprocessors import ExecutePreprocessor

with open('report.ipynb') as f:
    nb = nbformat.read(f, as_version=4)
        ep = ExecutePreprocessor(timeout=600, kernel_name='python3')
        ep.preprocess(nb, {})
with open('report_executed.ipynb', 'wt') as f:
    nbformat.write(nb, f)

这将执行代码单元而不是 Markdown 单元。它们仍然是这样的:

The value of x is {{x}}.

我认为问题在于笔记本不受信任。有没有办法告诉 ExecutePreprocessor 信任笔记本？是否有另一种方法以编程方式执行笔记本，包括 Markdown 单元格中的 python 变量？

最佳答案

执行预处理器 only looks at code cells ，因此您的 Markdown 单元格完全没有受到影响。如您所说，要进行 Markdown 处理，您需要 Python Markdown 预处理器。

不幸的是，Python Markdown 预处理器系统只在实时笔记本中执行代码，它是由 modifying the javascript involved with rendering cells 完成的。 .修改将执行代码片段的结果存储在单元元数据中。
PyMarkdownPreprocessor class(在 pre_pymarkdown.py 中)旨在与 nbconvert 在笔记本电脑上运行一起使用，这些笔记本电脑首先在实时笔记本设置中呈现。它处理 Markdown 单元格，替换 {{}}具有存储在元数据中的值的模式。

但是，在您的情况下，您没有实时笔记本元数据。我有一个类似的问题，我通过编写自己的执行预处理器来解决它，该预处理器还包括处理 Markdown 单元的逻辑:

from nbconvert.preprocessors import ExecutePreprocessor, Preprocessor
import nbformat, nbconvert
from textwrap import dedent

class ExecuteCodeMarkdownPreprocessor(ExecutePreprocessor):

    def __init__(self, **kw):
        self.sections = {'default': True} # maps section ID to true or false
        self.EmptyCell = nbformat.v4.nbbase.new_raw_cell("")

        return super().__init__(**kw)

    def preprocess_cell(self, cell, resources, cell_index):
        """
        Executes a single code cell. See base.py for details.
        To execute all cells see :meth:`preprocess`.
        """

        if cell.cell_type not in ['code','markdown']:
            return cell, resources

        if cell.cell_type == 'code':
            # Do code stuff
            return self.preprocess_code_cell(cell, resources, cell_index)

        elif cell.cell_type == 'markdown':
            # Do markdown stuff
            return self.preprocess_markdown_cell(cell, resources, cell_index)
        else:
            # Don't do anything
            return cell, resources

    def preprocess_code_cell(self, cell, resources, cell_index):
        ''' Process code cell.
        '''
        outputs = self.run_cell(cell)
        cell.outputs = outputs

        if not self.allow_errors:
            for out in outputs:
                if out.output_type == 'error':
                    pattern = u"""\
                        An error occurred while executing the following cell:
                        ------------------
                        {cell.source}
                        ------------------
                        {out.ename}: {out.evalue}
                        """
                    msg = dedent(pattern).format(out=out, cell=cell)
                    raise nbconvert.preprocessors.execute.CellExecutionError(msg)

        return cell, resources

    def preprocess_markdown_cell(self, cell, resources, cell_index):
        # Find and execute snippets of code
        cell['metadata']['variables'] = {}
        for m in re.finditer("{{(.*?)}}", cell.source):
            # Execute code
            fakecell = nbformat.v4.nbbase.new_code_cell(m.group(1))
            fakecell, resources = self.preprocess_code_cell(fakecell, resources, cell_index)

            # Output found in cell.outputs
            # Put output in cell['metadata']['variables']
            for output in fakecell.outputs:
                html = self.convert_output_to_html(output)
                if html is not None:
                    cell['metadata']['variables'][fakecell.source] = html
                    break
        return cell, resources

    def convert_output_to_html(self, output):
        '''Convert IOpub output to HTML

        See https://github.com/ipython-contrib/IPython-notebook-extensions/blob/master/nbextensions/usability/python-markdown/main.js
        '''
        if output['output_type'] == 'error':
            text = '**' + output.ename + '**: ' + output.evalue; 
            return text
        elif output.output_type == 'execute_result' or output.output_type == 'display_data':
            data = output.data
            if 'text/latex' in data:
                html = data['text/latex']
                return html
            elif 'image/svg+xml' in data:
                # Not supported
                #var svg = ul['image/svg+xml'];
                #/* embed SVG in an <img> tag, still get eaten by sanitizer... */
                #svg = btoa(svg);
                #html = '<img src="data:image/svg+xml;base64,' + svg + '"/>';
                return None
            elif 'image/jpeg' in data:
                jpeg = data['image/jpeg']
                html = '<img src="data:image/jpeg;base64,' + jpeg + '"/>'
                return html
            elif 'image/png' in data:
                png = data['image/png']
                html = '<img src="data:image/png;base64,' + png + '"/>'
                return html
            elif 'text/markdown' in data:
                text = data['text/markdown']
                return text
            elif 'text/html' in data:
                html = data['text/html']
                return html
            elif 'text/plain' in data:
                text = data['text/plain']
                # Strip <p> and </p> tags
                # Strip quotes
                # html.match(/<p>([\s\S]*?)<\/p>/)[1]
                text = re.sub(r'<p>([\s\S]*?)<\/p>', r'\1', text)
                text = re.sub(r"'([\s\S]*?)'",r'\1', text)
                return text
            else:
            # Some tag we don't support
                return None
        else:
            return None

然后，您可以使用类似于您发布的代码的逻辑处理您的笔记本:

import nbformat
from nbconvert.preprocessors import ExecutePreprocessor
import ExecuteCodeMarkdownPreprocessor # from wherever you put it
import PyMarkdownPreprocessor # from pre_pymarkdown.py

with open('report.ipynb') as f:
    nb = nbformat.read(f, as_version=4)
    ep = ExecuteCodeMarkdownPreprocessor(timeout=600, kernel_name='python3')
    ep.preprocess(nb, {})
    pymk = PyMarkdownPreprocessor()
    pymk.preprocess(nb, {})

with open('report_executed.ipynb', 'wt') as f:
    nbformat.write(nb, f)

请注意，通过包含 Python Markdown 预处理，您生成的笔记本文件将不再具有 {{}} Markdown 单元格中的语法 - Markdown 将具有静态内容。如果结果笔记本的接收者更改了代码并再次执行，则 Markdown 将不会更新。但是，如果您要导出为不同的格式(例如 HTML)，那么您确实需要替换 {{}}具有静态内容的语法。

关于python - 使用 nbconvert 执行包含内联 Markdown 的 Jupyter 笔记本，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/35805121/

文章推荐： jquery:单选按钮输入检查属性未定义

文章推荐： python - 每 1000 的倍数重置 Pandas Cumsum

文章推荐： jquery - 如何添加和删除CSS类

jupyter-notebook - pip install jupyter 后找不到 jupyter
在尝试安装 jupyter 的许多不同方法之后，它似乎没有正确安装。根据我最近遇到的 MacOS 系统 python 问题，可能与 MacOS 相关 pip install jupyter --us
jupyter - Jupyter 笔记本的更好命名约定
命名 Jupyter Notebook 时，如果使用空格，即 This is my notebook.ipynb 然后，当使用网络浏览器打开时，它的渲染效果非常好。然而，空格在命令行环境中是有害的。但
jupyter-notebook - `jupyter notebook` 和 `jupyter server` 有什么区别？
运行 jupyter notebook和 jupyter server给我非常相似的结果，描述也非常相似。 ❯ jupyter notebook -h The Jupyter HTML Noteboo
jupyter-notebook - Jupyter Lab 中的 Jupyter Notebook 扩展
是否可以在 Jupyter Lab 中使用笔记本扩展 (nbextensions)？我认为我的扩展已启用...我调用 jupyter nbextension enable事先在他们身上。但我没有在
python - Jupyter 命令仅适用于破折号(例如 jupyter-kernelspec 而不是 jupyter kernelspec)
我将 Jupyter 与 Anaconda3 结合使用。我的 Anaconda3\ 和 Anaconda3\Scripts\ 文件夹已添加到 %PATH% 变量中。尽管 jupyter.exe 位
jupyter - 在页面刷新时运行 Jupyter 单元格
我将 Jupyter 用于我公司的分析。我想制作显示一些漂亮图表的实时页面。我将在大厅的大显示器上显示此页面，我希望它自动刷新。有什么方法可以通过刷新浏览器页面来触发“运行所有单元格”吗？或者，是否
jupyter - %stored Jupyter 文件位于何处？
%store 魔术功能可以保存大型 python 对象，供您在不同 session 之间使用，但我想找出文件的实际位置，以便我也可以在不同 session 之间传输它们不同的电脑。我正在使用 Wind
jupyter-notebook - Jupyter 笔记本上的文本从右到左对齐
我在 Windows 10 的本地 Ubunto 机器上运行 Jupyter notebook。问题是所有文本都与屏幕右侧对齐，包括菜单 - 例如"file"选项卡位于最右侧。以这种方式阅读文本非常困
jupyter-notebook - Jupyter Notebook取消注释的快捷方式
Closed. This question is not reproducible or was caused by typos。它当前不接受答案。想要改善这个问题吗？更新问题，以便将其作为on-t
jupyter-notebook - Jupyter 链接到部分
我尝试使用以下内容创建指向 jupyter 笔记本中标题的内部链接。 SO上的各种答案，例如here在我的笔记本中似乎没有按预期工作。下面的代码创建了一个链接，但在尝试访问该链接时没有任何 react
jupyter - 如何共享 Jupyter 笔记本
这个问题在这里已经有了答案: How can I share Jupyter notebooks with non-programmers? [closed] (6 个回答) 5年前关闭。我安装了一
jupyter-notebook - Jupyter 笔记本自动完成显示太多
我试图在 jupyter notebook 中自动完成路径。按“tab”后，它显示的不仅仅是文件夹或文件。我认为这些是python的内置功能。有没有办法在自动完成路径时只显示路径和文件？谢谢! 最佳
jupyter - 从控制台执行 Jupyter notebook
我在 Jupyter notebook 中组合了一些数据分析步骤。随着数据的变化，我希望能够重新运行所有单元格(以考虑新数据) 转换为html以供查看我知道我可以通过 jupyter nbcon
jupyter-notebook - jupyter 实验室中的自动单元格执行计时
在 jupyter 中笔记本 , 我可以用 nbextensions 配置自动单元计时，结果是这样的: 我怎样才能在 jupyter 中做到这一点实验室 ?我没有找到任何做类似事情的扩展。观察:
jupyter-notebook - 无法在受限数据处理设置中安装 JUPYTER
我正在寻找一个带有 Jupter 可选组件的数据处理集群。 gcloud beta dataproc clusters create cluster-1ea3 --enable-component-g
jupyter-notebook - Jupyter 笔记本自动完成显示重复选项
Jupyter Notebook 的自动完成功能似乎有效，但不知何故它会显示该方法的重复选项。例如下面: 对于每个可能的选项，下拉菜单将显示 2 个相同的选项。为什么会发生这种情况以及如何解决？最佳
jupyter-notebook - Jupyter 实验室不断开启简单模式
每当我将 jupyter 实验室窗口推到一边时，“简单模式”就会被激活。这导致只显示一个选项卡，而其他选项卡被隐藏。这非常烦人，我不敢相信这是一个标准功能(只是再次安装了 jupyter 实验室)。
jupyter-notebook - Jupyter Notebook新电池类型默认
当我使用jupyter notebook时，我希望新单元格的类型为“markdown”。默认情况下，新单元格的类型为“代码”。我应该修改哪个配置文件以及应该更改哪个变量？最佳答案转到您的pyt
jupyter - Jupyter 下的 Haskell
奇怪的是，互联网上没有关于如何在 Jupyter 环境下运行 Haskell 的说明。任何引导您实现此目的的文档都需要您从源代码编译，但是当尝试这样做时，会发生各种构建错误。奇怪的是，一个肯定会给
jupyter-notebook - Jupyter 中的非阻塞单元执行
在带有 ipython 内核的 Jupyter 中，是否有以非阻塞方式执行单元格的规范方法？理想情况下，我希望能够运行一个单元格 %%background time.sleep(10) print(

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - 使用 nbconvert 执行包含内联 Markdown 的 Jupyter 笔记本