xml - XSLT - 重命名节点和删除空格等等-6ren

xml - XSLT - 重命名节点和删除空格等等

转载作者：行者123 更新时间：2023-12-04 05:19:37

这是一个非常复杂的问题，超出了我对 XSLT 的了解 - 我仍在学习，无论我阅读了多少 O'Reilly 的 XSLT 书籍，我都处于头脑中。

我有一个多方面的问题，我已经为它生成了一个输入 XML 文件，之后我也将尝试解释需求。

输入

<roottag>
<body>
    <header>
        <r>
            <c>
                <d>Header Tag</d><!-- This can include spaces-->
                <e>System generated trash</e>
            </c>
        </r>
        <r>
            <c>
                <d>Sub Header Tag A</d>
                <e>System generated trash</e>
            </c>
            <c>
                <d>Sub Header Value A</d>
                <e>System generated trash</e>
            </c>
        </r>
        <r>
            <c>
                <d>Sub Header Tag B</d>
                <e>System generated trash</e>
            </c>
            <c>
                <d>Sub Header Value B</d>
                <e>System generated trash</e>
            </c>
        </r>
        <r>
            <c>
                <d>Sub Header Tag C</d>
                <e>System generated trash</e>
            </c>
            <c>
                <d>Sub Header Value C</d>
                <e>System generated trash</e>
            </c>
        </r>
    </header>
    <information>
        <r>Body of document</r>
        <r>Appears here but have an XSLT that deals with this</r>
    </informtaion>
    <footer>
        <r>
            <c>
                <d>Footer Tag</d><!-- This can include spaces-->
                <e>System generated trash</e>
            </c>
        </r>
        <r>
            <c>
                <d>Sub Footer Tag A</d>
                <e>System generated trash</e>
            </c>
            <c>
                <d>Sub Footer Value A</d>
                <e>System generated trash</e>
            </c>
        </r>
        <r>
            <c>
                <d>Sub Footer Tag B</d>
                <e>System generated trash</e>
            </c>
            <c>
                <d>Sub Footer Value B</d>
                <e>System generated trash</e>
            </c>
        </r>
        <r>
            <c>
                <d>Sub Footer Tag C</d>
                <e>System generated trash</e>
            </c>
            <c>
                <d>Sub Footer Value C</d>
                <e>System generated trash</e>
            </c>
        </r>
    </footer>
</body>
</roottag>

输出

<?xml version="1.0" encoding="utf-8"?>
<roottag>
  <body>
    <header>
      <HeaderTag>
        <!-- without spaces -->
        <HeaderName>Header Tag</HeaderName>
        <!-- This needs to preserve spaces-->
      </HeaderTag>
      <SubHeaderTagA>
        <!-- without spaces -->
        <HeaderName>Sub Header Tag A</HeaderName>
        <!-- This needs to preserve spaces-->
        <HeaderValue>Sub Header Value A</HeaderValue>
      </SubHeaderTagA>
      <SubHeaderTagB>
        <HeaderName>Sub Header Tag B</HeaderName>
        <HeaderValue>Sub Header Value B</HeaderValue>
      </SubHeaderTagB>
      <SubHeaderTagC>
        <HeaderName>Sub Header Tag C</HeaderName>
        <HeaderValue>Sub Header Value C</HeaderValue>
      </SubHeaderTagC>
    </header>
    <information>
      <r>Body of document</r>
      <r>Appears here but have an XSLT that deals with this</r>
      </information>
      <footer>
        <FooterTag>
          <FooterName>Footer Tag</FooterName>
        </FooterTag>
        <SubFooterTagA>
          <FooterName>Sub Footer Tag A</FooterName>
          <FooterValue>Sub Footer Value A</FooterValue>
        </SubFooterTagA>
        <SubFooterTagB>
          <FooterName>Sub Footer Tag B</FooterName>
          <FooterValue>Sub Footer Value B</FooterValue>
        </SubFooterTagB>
        <SubFooterTagC>
          <FooterName>Sub Footer Tag C</FooterName>
          <FooterValue>Sub Footer Value C</FooterValue>
        </SubFooterTagC>
      </footer>
    </body>
</roottag>

所以要解释我所看到的问题，以及我所面临的问题。

删除空格:
roottag/body/header/r/c/d 中保存的值可以并且通常包含空格，因此我需要一种方法来删除它，这是我从网站 [addLink] 上的问题中找到的，但是这也会替换该值，因此当我在稍后的过程中使用数据，它没有输出所需的空格。

仅用第一个值替换 R:
我不知道如何做到这一点，我尝试和研究的一切似乎都使用第二个值作为值。我在我的 wiks 结束了。

页眉或页脚名称/值:
同样，我的知识是有限的，如果这甚至是可能的，或者每个标签都需要单独匹配？

页眉和页脚标签的移动:
我没有将它包含在我需要的输出中，但我想我可能需要这样做 - 是否可以将页眉标签和页脚标签移到正文标签之外？所以 XML 将是:roottag-header-body-information-/body-footer/-/rt

如果您需要更多说明，请告诉我。

最佳答案

从源中的元素内容派生元素名称通常是一个坏主意 - 尽管您可以删除空格，但始终存在其他特殊字符的可能性，即使您将它们全部删除，最终也可能会出现意外的重复项。包含 1 Tag 的两个元素和 2 Tag例如，两者都需要精简为 Tag .

但是，这样的事情应该可以完成这项工作:

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:output method="xml" indent="yes"/>
  <xsl:strip-space elements="*" />

  <xsl:variable name="allowed">ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvxyz_</xsl:variable>

  <xsl:template match="r[c/d]">
    <xsl:variable name="elemName" select="translate(c/d,translate(c/d,$allowed,''),'')" />
    <xsl:element name="{$elemName}">
      <xsl:apply-templates />
    </xsl:element>
  </xsl:template>

  <xsl:template match="c">
    <xsl:apply-templates select="@* | node()"/>
  </xsl:template>

  <xsl:template match="c[1]/d">
    <HeaderName>
      <xsl:apply-templates />
    </HeaderName>
  </xsl:template>

  <xsl:template match="c[2]/d">
    <HeaderValue>
      <xsl:apply-templates />
    </HeaderValue>
  </xsl:template>

  <xsl:template match="e" />

  <xsl:template match="@* | node()">
    <xsl:copy>
      <xsl:apply-templates select="@* | node()"/>
    </xsl:copy>
  </xsl:template>
</xsl:stylesheet>

这使用“双重翻译”方法从字符串中去除所有不需要的字符。 translate函数可用于从列表中删除所有字符，方法是指定一个空字符串将这些字符转换为。您可以使用它从字符串中删除所有 VALID 字符，留下仅包含 INVALID 字符的字符串。然后您再次使用 translate 从原始字符串中删除所有这些 INVALID 字符。

如果您确实需要正文之外的页眉/页脚，请添加以下模板:

<xsl:template match="roottag">
  <xsl:copy>
    <xsl:apply-templates select="body/header" />
    <xsl:apply-templates select="body" />
    <xsl:apply-templates select="body/footer" />
  </xsl:copy>
</xsl:template>

<xsl:template match="body">
  <xsl:copy>
    <xsl:apply-templates select="information" />
  </xsl:copy>
</xsl:template>

忽略 r节点所在 c\d包含一个等号，在匹配“ r[c/d]”的下面添加这个模板:

<xsl:template match="r[contains(c/d,'=')]" />

关于xml - XSLT - 重命名节点和删除空格等等，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/13799148/

文章推荐： ajax - JSF 2.0 中的文本框渲染

文章推荐： scala - 如何使用 CanBuildFrom 为 Traversable 定义隐式类？

java - 正则表达式 30 个数字 + 空格 + 连字符 + 空格 1 个数字
我有这个代码来查找这个模式:201409250200131738007947036000 - 1，在文本内 final String patternStr = "(\\d{
正则表达式删除方括号/空格
我正在尝试使用正则表达式清除一些用户输入，以删除 [ 和 ] 并删除任何大于 1 个空格的空格。但我似乎无法实现我想要的效果。这是我第一次使用正则表达式，所以我对如何写出来有点困惑。 (preg_re
Java正则表达式匹配单词+空格
我正在尝试构建这个简单的正则表达式来匹配 Java 中的单词+空格，但我在尝试解决它时感到困惑。该网站上有很多类似的示例，但答案大多给出了正则表达式本身，而没有解释它是如何构造的。我正在寻找的是形成
Python删除行之间的输入/空格
好吧，我已经阅读了很多建议如何消除多余空间的帖子，但无论出于何种原因，我似乎无法将这些建议应用到我的系统中，所以我在这里寻求您的帮助。这些是我代码的最后几行: for line in rli
javascript - 如何删除某些空的新行/空格
所以我正在我的测试存储上学习网页抓取，但我不确定如何正确地从“sizes”数组中删除空的新行。 const $ = cheerio.load(body) $('div.lis
javascript - 输入表单中不允许有空白字符/空格
这个问题已经有答案了: How to prevent invalid characters from being typed into input fields (8 个回答) 已关闭 9 年前。是
java - 忽略空格、空格
有人知道如何让扫描仪忽略空间吗？我想输入名字和第二个名字，但扫描仪不让我输入，我想保存全名 String name; System.out.print("Enter name: "); name =
VIM:空格/制表符缩进
这个问题在这里已经有了答案: Make Vim show ALL white spaces as a character (23 个回答) 关闭 8 年前。 VIM(使用 Solarized Dar
java - 流标记器、空格
我想使用 StreamTokenizer 从 java 文件中提取名称。我已将空格设置为逗号 inputTokenizer.whitespaceChars(',', ','); 但是，
Java:读取txt文件并将其保存在字符串数组中但不带反斜杠(空格)？
我正在使用此代码逐行读取 txt 文件。 // Open the file that is the first command line parameter FileInputStream fstre
Java 正则表达式 - 空格
我似乎无法弄清楚我需要的正则表达式。这就是我想要实现的目标: {ANY CHAR} + @javax.persistence.Column(name = "{ANY 30 CHARS}") + {AN
StyleCop 和 = 空格
我正在运行 StyleCop(顺便说一句，如果你想提供高质量的代码，我完全推荐它)... 我有这条线 [System.Xml.Serialization.XmlRootAttribute(Namesp
PhpStorm 在每次保存时删除制表符/空格
我刚刚更新到 PhpStorm 2016，我突然注意到，每次我按 Ctrl + S 保存文件时，它都会删除我在测试这段代码后按下以继续编写的空格/制表符。请帮忙，这对我来说很烦人，因为我在每一行代码
c - 输入名称(空格)
关闭。此题需要details or clarity 。目前不接受答案。想要改进这个问题吗？通过 editing this post 添加详细信息并澄清问题. 已关闭 7 年前。 Improve th
c - 删除c程序中的制表符/空格
已关闭。此问题不符合Stack Overflow guidelines 。目前不接受答案。要求提供代码的问题必须表现出对所解决问题的最低限度的了解。包括尝试的解决方案、为什么它们不起作用以及预期结果
路径中的 C# 空格
我已经看过几十个关于这个主题的问题和答案，但我仍然无法解决我的问题。我在我的代码中使用了一个外部 ffmpeg 转换器，我将文件路径作为参数传递，如下所示: OutputPackage oo = c
c - 空格、特殊字符和转义序列
谁能详细解释一下它们是什么以及它们之间的区别。提前致谢。最佳答案转义序列是代表其他内容的字符序列。例如(“\n” = 新行，“\?” = 问号等)。有关更详细的列表，请检查:https://en.
javascript - 从数组中删除换行符/空格
我无法从我的 javascript 文本中删除换行符。这是我正在处理的数据示例: 0: "Christian Pulisic" 1: "↵" 2: "From Wikipedia, the free
java - 从字符串Java的开头和结尾删除新行/空格
我有一个问题 - 我似乎无法从字符串的开头/结尾删除新行/空格。我在正则表达式的开头和结尾使用 \s ，甚至在获取字符串后使用 .trim() ，但无济于事。 public void extractI
用于超链接的变量中的 PHP 空格
我是 php 的新手，我正在尝试将一系列变量添加到 html 超链接中。但是，任何返回空格的变量都会弄乱超链接。 Grants Test

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

xml - XSLT - 重命名节点和删除空格等等