php - Preserving line breaks when using PHP's DomDocument appendChild


Reposted by: 行者123 · Updated: 2023-12-04 06:10:31


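I'm trying to use PHP's DOMDocument to add to and parse content in an HTML document. From what I've read, setting formatOutput to true and preserveWhiteSpace to false should keep tabs and newlines in order, but it doesn't seem to apply to newly created or appended nodes.

Here is the code: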

$dom = new \DOMDocument;
$dom->formatOutput = true;
$dom->preserveWhiteSpace = false;
$dom->loadHTMLFile($htmlsource);
$tables = $dom->getElementsByTagName('table');
foreach($tables as $table)
{
    $table->setAttribute('class', 'tborder');
    $div = $dom->createElement('div');
    $div->setAttribute('class', 'm2x');
    $table->parentNode->insertBefore($div, $table);
    $div->appendChild($table);
}
$dom->saveHTMLFile($html);

This is what the HTML looks like:

<table>
    <tr>
        <td></td>
    </tr>
</table>

This is what I want:

<div class="m2x">
    <table class="tborder">
        <tr>
            <td></td>
        </tr>
    </table>
</div>

This is what I get:

<div class="m2x"><table class="tborder"><tr>
<td></td>
        </tr></table></div>

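Am I doing something wrong? I've searched Google in as many different ways as I could think of, with no luck.

Best answer

Unfortunately, you will probably need to write a function that indents the output the way you want. I wrote a small function that you may find useful.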

function indentContent($content, $tab="\t")
{               

        // add marker linefeeds to aid the pretty-tokeniser (adds a linefeed between all tag-end boundaries)
        $content = preg_replace('/(>)(<)(\/*)/', "$1\n$2$3", $content);

        // now indent the tags
        $token = strtok($content, "\n");
        $result = ''; // holds formatted version as it is built
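        $pad = 0; // current indent depth
        $indent = 0; // change applied to $pad after the current line (initialised so case 2 never reads an undefined variable)
        $matches = array(); // filled by preg_match()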

        // scan each line and adjust indent based on opening/closing tags
        while ($token !== false) 
        {
                $token = trim($token);
                // test for the various tag states

                // 1. open and closing tags on same line - no change
                if (preg_match('/.+<\/\w[^>]*>$/', $token, $matches)) $indent=0;
                // 2. closing tag - outdent now
                elseif (preg_match('/^<\/\w/', $token, $matches))
                {
                        $pad--;
                        if($indent>0) $indent=0;
                }
                // 3. opening tag - don't pad this one, only subsequent tags
                elseif (preg_match('/^<\w[^>]*[^\/]>.*$/', $token, $matches)) $indent=1;
                // 4. no indentation needed
                else $indent = 0;

                // prefix the line with $pad copies of $tab (also works when $tab is longer than one character)
                $line = str_repeat($tab, max(0, $pad)) . $token;
                $result .= $line."\n"; // add to the cumulative result, with linefeed
                $token = strtok("\n"); // get the next token
                $pad += $indent; // update the pad size for subsequent lines    
        }       

        return $result;
}

indentContent($dom->saveHTML()) will return:

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">
<html>
    <body>
        <div class="m2x">
            <table class="tborder">
                <tr>
                    <td>
                    </td>
                </tr>
            </table>
        </div>
    </body>
</html>
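
Note that `<td>` and `</td>` end up on separate lines in the output above. That follows from the first step of indentContent, which puts a linefeed at every `><` boundary before any indenting happens. A minimal sketch of that step in isolation (the sample string and the `$lines` variable are illustrative and not part of the original answer):

```php
<?php
// Sample markup on a single line (illustrative input, assumed for this sketch).
$content = '<div class="m2x"><table class="tborder"><tr><td></td></tr></table></div>';

// Same pattern as in indentContent(): break between every ">" and "<".
$content = preg_replace('/(>)(<)(\/*)/', "$1\n$2$3", $content);

// Each tag now sits on its own line; the indenting loop walks these lines.
$lines = explode("\n", $content);
print_r($lines);
```

Because the split is purely textual, an empty element such as `<td></td>` is treated like any other pair of adjacent tags.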

I based this function on this one.
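
As an alternative to post-processing the serialized string, the indentation can also be written into the tree itself as whitespace text nodes. The following is only a sketch under stated assumptions, not the answer's method: `indentNode` is a hypothetical helper, it leaves any element containing real text or comments untouched, and the example uses loadXML() and saveXML() because saveXML() writes text nodes as they are in the tree, whereas saveHTML() may apply additional formatting of its own.

```php
<?php
// Hypothetical helper (not from the original answer): indents element-only
// content by inserting whitespace text nodes directly into the DOM.
function indentNode(DOMNode $node, int $depth = 0, string $tab = '    '): void
{
    // Snapshot the children; the live childNodes list changes while we edit.
    $children = iterator_to_array($node->childNodes);
    $elements = [];
    foreach ($children as $child) {
        if ($child instanceof DOMElement) {
            $elements[] = $child;
        } elseif (!($child instanceof DOMText) || trim($child->nodeValue) !== '') {
            return; // real text, comments, etc.: leave this subtree as it is
        }
    }
    if ($elements === []) {
        return; // nothing to indent inside an empty element
    }
    // Drop existing whitespace-only text so old and new indentation do not mix.
    foreach ($children as $child) {
        if ($child instanceof DOMText) {
            $node->removeChild($child);
        }
    }
    $doc = $node->ownerDocument;
    foreach ($elements as $element) {
        // Newline plus one indent level deeper than the parent, before each child.
        $node->insertBefore($doc->createTextNode("\n" . str_repeat($tab, $depth + 1)), $element);
        indentNode($element, $depth + 1, $tab);
    }
    // Newline plus the parent's own indent, so the closing tag lines up.
    $node->appendChild($doc->createTextNode("\n" . str_repeat($tab, $depth)));
}

$dom = new DOMDocument();
$dom->loadXML('<div class="m2x"><table class="tborder"><tr><td>cell</td></tr></table></div>');
indentNode($dom->documentElement);
$out = $dom->saveXML($dom->documentElement);
echo $out, "\n";
```

The trade-off is that the tree now contains extra text nodes, so later code that walks childNodes has to skip them.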

Regarding "php - Preserving line breaks when using PHP's DomDocument appendChild", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/7838929/
