xslt - 根据属性序列比较2个节点集-6ren

xslt - 根据属性序列比较2个节点集

转载作者：行者123 更新时间：2023-12-02 02:24:14

31

4

我正在尝试构建一种 XML 库，比较各种节点并将它们组合起来以供以后重用。逻辑应该很简单，如果给定语言的 tag_XX 属性值序列等于另一种语言的 tag_YY 属性值序列，则节点可以合并。请参阅下面的 XML 示例

<Book>
<Section>
    <GB>
        <Para tag_GB="L1">
            <Content_GB>string_1</Content_GB>
        </Para>
        <Para tag_GB="Illanc">
            <Content_GB>string_2</Content_GB>
        </Para>
        <Para tag_GB="|PLB">
            <Content_GB>string_3</Content_GB>
        </Para>
        <Para tag_GB="L1">
            <Content_GB>string_4</Content_GB>
        </Para>
        <Para tag_GB="Sub">
            <Content_GB>string_5</Content_GB>
        </Para>
        <Para tag_GB="L3">
            <Content_GB>string_6</Content_GB>
        </Para>
        <Para tag_GB="Subbull">
            <Content_GB>string_7</Content_GB>
        </Para>
    </GB>
    <!-- German translations - OK because same attribute sequence -->
    <DE>
        <Para tag_DE="L1">
            <Content_DE>German_translation of_string_1</Content_DE>
        </Para>
        <Para tag_DE="Illanc">
            <Content_DE>German_translation of_string_2</Content_DE>
        </Para>
        <Para tag_DE="|PLB">
            <Content_DE>German_translation of_string_3</Content_DE>
        </Para>
        <Para tag_DE="L1">
            <Content_DE>German_translation of_string_4</Content_DE>
        </Para>
        <Para tag_DE="Sub">
            <Content_DE>German_translation of_string_5</Content_DE>
        </Para>
        <Para tag_DE="L3">
            <Content_DE>German_translation of_string_6</Content_DE>
        </Para>
        <Para tag_DE="Subbull">
            <Content_DE>German_translation of_string_7</Content_DE>
        </Para>
    </DE>
    <!-- Danish translations - NG because not same attribute sequence -->
    <DK>
        <Para tag_DK="L1">
            <Content_DK>Partial_Danish_translation_of_string_1</Content_DK>
        </Para>
        <Para tag_DK="L1_sub">
            <Content_DK>Partial_Danish_translation_of_string_1</Content_DK>
        </Para>
        <Para tag_DK="Illanc">
            <Content_DK>Danish_translation_of_string_2</Content_DK>
        </Para>
        <Para tag_DK="L1">
            <Content_DK>Danish_translation_of_string_4</Content_DK>
        </Para>
        <Para tag_DK="|PLB">
            <Content_DK>Danish_translation_of_string_3</Content_DK>
        </Para>
        <Para tag_DK="L3">
            <Content_DK>Danish_translation_of_string_6</Content_DK>
        </Para>
        <Para tag_DK="Sub">
            <Content_DK>Danish_translation_of_string_5</Content_DK>
        </Para>
        <Para tag_DK="Subbull">
            <Content_DK>Danish_translation_of_string_7</Content_DK>
        </Para>
    </DK>
</Section>
</Book>

所以

GB tag_GB value sequence = L1 -> Illanc -> ... -> SubBul

DE tag_DE value sequence = L1 -> Illanc -> ... -> SubBul(与 GB 相同所以没问题)

DK tag_DK value sequence = L1 -> L1.sub -> 糟糕，预期的 Illanc 意味着这个序列与 GB 不同，locale 可以忽略

由于德语和英语节点集具有相同的属性序列，我喜欢将它们组合如下:

<Book>
<Dictionary>
    <Para tag="L1">
        <Content_GB>string_1</Content_GB>
        <Content_DE>German_translation of_string_1</Content_DE>
    </Para>
    <Para tag="Illanc">
        <Content_GB>string_2</Content_GB>
        <Content_DE>German_translation of_string_2</Content_DE>
    </Para>
    <Para tag="|PLB">
        <Content_GB>string_3</Content_GB>
        <Content_DE>German_translation of_string_3</Content_DE>
    </Para>
    <Para tag="L1">
        <Content_GB>string_4</Content_GB>
        <Content_DE>German_translation of_string_4</Content_DE>
    </Para>
    <Para tag="Sub">
        <Content_GB>string_5</Content_GB>
        <Content_DE>German_translation of_string_5</Content_DE>
    </Para>
    <Para tag="L3">
        <Content_GB>string_6</Content_GB>
        <Content_DE>German_translation of_string_6</Content_DE>
    </Para>
    <Para tag="Subbull">
        <Content_GB>string_7</Content_GB>
        <Content_DE>German_translation of_string_7</Content_DE>
    </Para>
</Dictionary>
</Book>

我使用的样式表如下:

<xsl:stylesheet version="2.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:output method="xml" version="1.0" xmlns="http://www.w3.org/1999/xhtml" encoding="UTF-8" indent="yes"/>
<xsl:output omit-xml-declaration="yes" indent="yes"/>
<xsl:template match="/">
    <xsl:copy>
        <xsl:apply-templates select="@* | node()"/>
    </xsl:copy>
</xsl:template>
<xsl:template match="@* | node()">
    <xsl:copy>
        <xsl:apply-templates select="@* | node()"/>
    </xsl:copy>
</xsl:template>
<xsl:template match="text()">
    <xsl:value-of select="normalize-space(.)"/>
</xsl:template>
<xsl:template match="Section">
    <!-- store reference tag list -->
    <xsl:variable name="Ref_tagList" select="GB/Para/attribute()[1]"/>
    <Dictionary>
        <xsl:for-each select="GB/Para">
            <xsl:variable name="pos" select="position()"/>
            <Para tag="{@tag_GB}">
                <!-- Copy English Master -->
                <xsl:apply-templates select="element()[1]"/>
                <xsl:for-each select="//Book/Section/element()[not(self::GB)]">
                    <!-- store current locale tag list -->
                    <xsl:variable name="Curr_tagList" select="Para/attribute()[1]"/>
                    <xsl:if test="$Ref_tagList = $Curr_tagList">
                        <!-- Copy current locale is current tag list equals reference tag list -->
                        <xsl:apply-templates select="Para[position()=$pos]/element()[1]"/>
                    </xsl:if>
                </xsl:for-each>
            </Para>
        </xsl:for-each>
    </Dictionary>
</xsl:template>
</xsl:stylesheet>

除了可能不是执行此操作的最有效方法(我是 xslt 游戏的新手...)，它也不起作用。我想到的逻辑是采用英语母版的属性集，如果任何其他语言环境的属性集相等，我就复制，否则我忽略。但出于某种原因，具有不同属性序列的节点集也会被愉快地复制(如下所示)。有人能告诉我我的逻辑在哪里与现实冲突吗？提前致谢!

当前输出包括应该被忽略的丹麦语......

<Book>
<Dictionary>
    <Para tag="L1">
        <Content_GB>string_1</Content_GB>
        <Content_DE>German_translation of_string_1</Content_DE>
        <Content_DK>Partial_Danish_translation_of_string_1</Content_DK>
    </Para>
    <Para tag="Illanc">
        <Content_GB>string_2</Content_GB>
        <Content_DE>German_translation of_string_2</Content_DE>
        <Content_DK>Partial_Danish_translation_of_string_1</Content_DK>
    </Para>
    <Para tag="|PLB">
        <Content_GB>string_3</Content_GB>
        <Content_DE>German_translation of_string_3</Content_DE>
        <Content_DK>Danish_translation_of_string_2</Content_DK>
    </Para>
    <Para tag="L1">
        <Content_GB>string_4</Content_GB>
        <Content_DE>German_translation of_string_4</Content_DE>
        <Content_DK>Danish_translation_of_string_4</Content_DK>
    </Para>
    <Para tag="Sub">
        <Content_GB>string_5</Content_GB>
        <Content_DE>German_translation of_string_5</Content_DE>
        <Content_DK>Danish_translation_of_string_3</Content_DK>
    </Para>
    <Para tag="L3">
        <Content_GB>string_6</Content_GB>
        <Content_DE>German_translation of_string_6</Content_DE>
        <Content_DK>Danish_translation_of_string_6</Content_DK>
    </Para>
    <Para tag="Subbull">
        <Content_GB>string_7</Content_GB>
        <Content_DE>German_translation of_string_7</Content_DE>
        <Content_DK>Danish_translation_of_string_5</Content_DK>
    </Para>
</Dictionary>
</Book>

最佳答案

这可能不是最佳解决方案。我使用了以下 XSLT 2.0 功能:

我使用 string-join() 比较了属性序列。
我已经利用了使用 RTF 变量的可能性

可能有更多 XSLT 2.0 工具可以解决您的问题。但我认为这里的大问题是您的输入文档。

很抱歉没有查看您当前的转换。刚刚从头开始实现。希望对您有所帮助:

<xsl:stylesheet version="2.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
    <xsl:output indent="yes"/>
    <xsl:strip-space elements="*"/>

    <xsl:template match="GB">
        <Book>
            <Dictionary>

                <xsl:variable name="matches">
                    <xsl:for-each select="following-sibling::*
                        [string-join(Para/@*,'-')
                        = string-join(current()/Para/@*,'-')]">
                        <match><xsl:copy-of select="Para/*"/></match>
                    </xsl:for-each>
                </xsl:variable>

                <xsl:apply-templates select="Para">
                    <xsl:with-param name="matches" select="$matches"/>
                </xsl:apply-templates>

            </Dictionary>
        </Book>
    </xsl:template>

    <xsl:template match="Para[parent::GB]">
        <xsl:param name="matches"/>
        <xsl:variable name="pos" select="position()"/>
        <Para tag="{@tag_GB}">
            <xsl:copy-of select="Content_GB"/>
            <xsl:copy-of select="$matches/match/*[position()=$pos]"/>
        </Para>
    </xsl:template>

    <xsl:template match="text()"/>

</xsl:stylesheet>

当应用于问题中提供的输入文档时，会产生以下输出:

<Book>
   <Dictionary>
      <Para tag="L1">
         <Content_GB>string_1</Content_GB>
         <Content_DE>German_translation of_string_1</Content_DE>
      </Para>
      <Para tag="Illanc">
         <Content_GB>string_2</Content_GB>
         <Content_DE>German_translation of_string_2</Content_DE>
      </Para>
      <Para tag="|PLB">
         <Content_GB>string_3</Content_GB>
         <Content_DE>German_translation of_string_3</Content_DE>
      </Para>
      <Para tag="L1">
         <Content_GB>string_4</Content_GB>
         <Content_DE>German_translation of_string_4</Content_DE>
      </Para>
      <Para tag="Sub">
         <Content_GB>string_5</Content_GB>
         <Content_DE>German_translation of_string_5</Content_DE>
      </Para>
      <Para tag="L3">
         <Content_GB>string_6</Content_GB>
         <Content_DE>German_translation of_string_6</Content_DE>
      </Para>
      <Para tag="Subbull">
         <Content_GB>string_7</Content_GB>
         <Content_DE>German_translation of_string_7</Content_DE>
      </Para>
   </Dictionary>
</Book>

关于xslt - 根据属性序列比较2个节点集，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/6694308/

31

4

0

文章推荐： perl - 系统::信息 - 问题

文章推荐： asp.net-mvc-3 - 使用来自服务器的默认数据初始化 View 模型

文章推荐： network-protocols - protobuf-net v2 类型元

xslt - 使用 XSLT 从 XSLT 样式表中删除命名空间声明
我有一个 XSLT 样式表，如下所示: 我想使用第二个 XSLT 样式表来转换此样式表，以删除与 XQHead
xslt - 一个大的 xslt 优于更小、更细粒度的 xslt
我们有一个大型 xslt，可以呈现整个商店区域，包括产品、制造商，并根据价格和类别进行过滤。我使用 sitecore 作为 CMS，但遇到缓存问题。我有大约 9000 个项目，有些页面需要长达 20
xslt - XSLT:是否应用带有条件参数的模板？
我想根据条件的结果应用具有不同参数的模板。像这样： Attribute no. 1
xslt - 循环 XSLT
我有一些看起来像这样的 XML Foo Details Bar Details Baz Details Foo Blah Bar BlahBlah Baz BlahBlahBl
xslt - XSLT 中的矩阵转置
我试图从这种输入出发: a b c d e f g ... 使用 XSLT 的 HTML 输出: one two a e b f
xslt - xslt 中的第一个子节点名称
我想知道如何在 xslt 中找到特定节点的第一个子节点名称。我有一个 xml: some text 我可以使用 body/
xslt - XSLT 中上个月的最后一天
是否可以在 XSLT 中获取上个月的最后一天？我找到了这个函数:http://www.xsltfunctions.com/xsl/functx_last-day-of-month.html但我不确定如
xslt - xslt 中匹配命名空间的问题
具有特定节点的匹配元素存在问题。 xml: description of profile PhoneKeyPad S
xslt - XSLT 中的动态变量
我将一堆键值对作为参数传递给 XSL(日期 ->“1 月 20 日”，作者 ->“Dominic Rodger”，...)。我正在解析的一些 XML 中引用了这些 - XML 如下所示: 目前，除
xslt - xslt 中最后一个字符后的子字符串
我找不到这个问题的确切答案，所以我希望有人能在这里帮助我。我有一个字符串，我想在最后一个 '.' 之后获取子字符串。我正在使用 xslt 1.0。这是怎么做的？这是我的代码。
xslt - XSLT 中的变量范围
我在尝试找出 xslt 上的 var 范围时遇到问题。我实际上想要做的是忽略具有重复“旅游代码”的“旅行”标签。示例 XML: X1 Budapest X1 Budapest X
xslt - XSLT 中的动态排序？
我有一些数据在 xslt 的 for-each 循环中输出。我对列表进行了分页，但没有对排序选择器进行分页。用户应该能够对 2 个值(创建的数据和每个项目的数字字段)进行排序。默认的排序方法是创建日
xslt - XSLT 的奇怪排序要求
我有一个奇怪的要求。我在 xslt 中有一个包含月份的变量，带有它们的 id (1-12) 问题是我需要全部显示它们，但从一月(1)以外的月份开始。目前我有以下 JAN
xslt - 模块化 xslt？
如何在 xslt 转换中模块化一组重复的输出？例如，我有如下内容(伪代码)。并
xslt - XSLT 中的位置字符串拆分
我得到一个像这样的字符串。 13091711111100222222003333330044444400 字符串的模式是这样的 13 - 09 - 17 - 11111 - 100 - 22222 -
xslt - XSLT 中的设计和编码模式
我是 XSLT 的新手，有一个一般性问题。为了区分具有不同属性的两个元素，最好(也是为了性能)使用和而不是在一个模板中。据我所知，这就是 XSLT 中应该“思考”的方式。但在我看来，这有一个缺点
xslt - 如何从字符串中删除连字符 +xslt
如何从“19650512-0065”到“196505120065”这样的字符串中删除连字符使用这个模板:传递 theID =
xslt - XSLT 中的填充零
是否有任何功能可以在左侧填充零？我正在尝试做的要求是: 我们不知道即将到来的输入字符串长度。如果小于 20，我们必须在左侧填充零。如果输入字符串长度为 10，那么我们必须在左侧填充 10 个零。
xslt - XSLT 应用模板的默认选择是什么？
身份模板如下所示: 是否选择多于，或者身份模板可能是这样的？当我执行以下操作时，究竟选择了什么？最佳答案
xslt - XSLT 模板中的超链接
我正在尝试使用 XML 信息和 XSLT 模板创建超链接。这是 XML 源代码。 Among individual stocks, the top percentage gainers in the

首页

博学

6Ren·AI

商城

xslt - 根据属性序列比较2个节点集