gpt4 book ai didi

xml - XSLT:将同级文本节点移动到选定的节点中以进行XLIFF修复

转载 作者:行者123 更新时间:2023-12-03 16:19:17 24 4
gpt4 key购买 nike

在对XSLT进行了几个小时的研究之后,我承认失败了!我需要修复因未命名的翻译工具而退回给我们的大量.xlf XLIFF翻译文件。理想情况下,我将使用批处理工具将XSL转换应用于它们。

以下是其中一个XLIFF文件的摘要:

<body>
<trans-unit id="1" phase-name="pretrans" restype="x-h3">
<source>Adding, Deleting or Modifying Notes in the Call Description</source>
<seg-source>Adding, Deleting or Modifying Notes in the Call Description</seg-source>
<target state="final">Добавление, удаление и изменение примечаний в описании звонка</target>
</trans-unit>
<trans-unit id="2" phase-name="pretrans" restype="x-p">
<source>Description of Fields on RHS</source>
<seg-source>Description of Fields on RHS</seg-source>
<target state="final">Поле описания в правой части</target>
</trans-unit>
<trans-unit id="3" phase-name="pretrans" restype="x-p">
<source>You can add descriptive text notes to a call recording, if you have the appropriate privileges to do so. These notes are visible to all users who have access to the call recording. It is recommended that each user add their initials to the notes to avoid potential confusion.</source>
<seg-source>
<mrk mtype="seg" mid="1">You can add descriptive text notes to a call recording, if you have the appropriate privileges to do so.</mrk>
<mrk mtype="seg" mid="2">These notes are visible to all users who have access to the call recording.</mrk>
<mrk mtype="seg" mid="3">It is recommended that each user add their initials to the notes to avoid potential confusion.</mrk>
</seg-source>
<target state="final">
<mrk mtype="seg" mid="1" /><ph ctype="" id="1">&lt;MadCap:variable name="zoom_userdocs_variables.var_product_name" xmlns:MadCap="http://www.madcapsoftware.com/Schemas/MadCap.xsd" /&gt;</ph> позволяет находить телефонные взаимодействия, содержащие или не содержащие определенные фразы.
<mrk mtype="seg" mid="2" />Каждая речевая метка содержит одну или несколько таких фраз.
<mrk mtype="seg" mid="3" />Ядро <ph ctype="" id="3">&lt;MadCap:variable name="zoom_userdocs_variables.var_product_name" xmlns:MadCap="http://www.madcapsoftware.com/Schemas/MadCap.xsd" /&gt;</ph> индексирует медиафайлы и помечает места вхождения фразы (добавляет к ним метки).
<mrk mtype="seg" mid="4" />Затем нужные медиафайлы можно искать по связанным с ними меткам.
</target>
</trans-unit>
<trans-unit id="4" phase-name="pretrans" restype="x-p">
<source>To add, delete, or modify text in the description field, click inside the description field.</source>
<seg-source>To add, delete, or modify text in the description field, click inside the description field.</seg-source>
<target state="final">Чтобы добавить, удалить или изменить текст в поле описания, щелкните это поле.</target>
</trans-unit>
</body>


注意第三个 target节点中的 trans-unit标记。 mrk标记应包含现在已经成为同级的文本节点(与早期的 seg-source标记相比,它仍然是正确的),从而弄乱了结构。

因此,我试图识别不包含文本节点的所有 mrk标记,并将以下文本节点移回它们中。

这是期望的结果:

<body>
<trans-unit id="1" phase-name="pretrans" restype="x-h3">
<source>Adding, Deleting or Modifying Notes in the Call Description</source>
<seg-source>Adding, Deleting or Modifying Notes in the Call Description</seg-source>
<target state="final">Добавление, удаление и изменение примечаний в описании звонка</target>
</trans-unit>
<trans-unit id="2" phase-name="pretrans" restype="x-p">
<source>Description of Fields on RHS</source>
<seg-source>Description of Fields on RHS</seg-source>
<target state="final">Поле описания в правой части</target>
</trans-unit>
<trans-unit id="3" phase-name="pretrans" restype="x-p">
<source>You can add descriptive text notes to a call recording, if you have the appropriate privileges to do so. These notes are visible to all users who have access to the call recording. It is recommended that each user add their initials to the notes to avoid potential confusion.</source>
<seg-source>
<mrk mtype="seg" mid="1">You can add descriptive text notes to a call recording, if you have the appropriate privileges to do so.</mrk>
<mrk mtype="seg" mid="2">These notes are visible to all users who have access to the call recording.</mrk>
<mrk mtype="seg" mid="3">It is recommended that each user add their initials to the notes to avoid potential confusion.</mrk>
</seg-source>
<target state="final">
<mrk mtype="seg" mid="1"><ph ctype="" id="1">&lt;MadCap:variable name="zoom_userdocs_variables.var_product_name" xmlns:MadCap="http://www.madcapsoftware.com/Schemas/MadCap.xsd" /&gt;</ph> позволяет находить телефонные взаимодействия, содержащие или не содержащие определенные фразы.</mrk>
<mrk mtype="seg" mid="2">Каждая речевая метка содержит одну или несколько таких фраз.</mrk>
<mrk mtype="seg" mid="3">Ядро <ph ctype="" id="3">&lt;MadCap:variable name="zoom_userdocs_variables.var_product_name" xmlns:MadCap="http://www.madcapsoftware.com/Schemas/MadCap.xsd" /&gt;</ph> индексирует медиафайлы и помечает места вхождения фразы (добавляет к ним метки).</mrk>
<mrk mtype="seg" mid="4">Затем нужные медиафайлы можно искать по связанным с ними меткам.</mrk>
</target>
</trans-unit>
<trans-unit id="4" phase-name="pretrans" restype="x-p">
<source>To add, delete, or modify text in the description field, click inside the description field.</source>
<seg-source>To add, delete, or modify text in the description field, click inside the description field.</seg-source>
<target state="final">Чтобы добавить, удалить или изменить текст в поле описания, щелкните это поле.</target>
</trans-unit>
</body>


我通常会在Perl中使用LibXML或类似工具来执行此操作,但是我确信这对于XSLT是简单的任务。我一直在寻找类似的解决方案,但找不到我可以做的任何事情。

需要注意的另一点-尽管这里的“漂亮印刷”最终的 body节点定义全部在一行上。

谢谢!我期待学习新的东西!

编辑:上面更新的源以显示必须保留的 <target>元素内的其他子标记。
编辑2:添加所需的结果。

最佳答案

试试这个XSLT:

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:output method="xml" indent="yes"/>

<xsl:template match="@* | node()">
<xsl:copy>
<xsl:apply-templates select="@* | node()"/>
</xsl:copy>
</xsl:template>

<xsl:template match="trans-unit/target/mrk[following-sibling::text()]">
<xsl:copy>
<xsl:apply-templates select="@* | node()"/>
<xsl:value-of select="following-sibling::text()"/>
</xsl:copy>
</xsl:template>

<xsl:template match="trans-unit/target/text()"/>

</xsl:stylesheet>


可能会产生预期的结果:

<body>
<trans-unit id="1" phase-name="pretrans" restype="x-h3">
<source>Adding, Deleting or Modifying Notes in the Call Description</source>
<seg-source>Adding, Deleting or Modifying Notes in the Call Description</seg-source>
<target state="final" />
</trans-unit>
<trans-unit id="2" phase-name="pretrans" restype="x-p">
<source>Description of Fields on RHS</source>
<seg-source>Description of Fields on RHS</seg-source>
<target state="final" />
</trans-unit>
<trans-unit id="3" phase-name="pretrans" restype="x-p">
<source>You can add descriptive text notes to a call recording, if you have the appropriate privileges to do so. These notes are visible to all users who have access to the call recording. It is recommended that each user add their initials to the notes to avoid potential confusion.</source>
<seg-source>
<mrk mtype="seg" mid="1">You can add descriptive text notes to a call recording, if you have the appropriate privileges to do so.</mrk>
<mrk mtype="seg" mid="2">These notes are visible to all users who have access to the call recording.</mrk>
<mrk mtype="seg" mid="3">It is recommended that each user add their initials to the notes to avoid potential confusion.</mrk>
</seg-source>
<target state="final"><mrk mtype="seg" mid="1">При наличии соответствующих прав можно добавить описательные текстовые примечания к записи звонка.
</mrk><mrk mtype="seg" mid="2">Эти примечания видны для всех пользователей, которые имеют доступ к записи звонка.
</mrk><mrk mtype="seg" mid="3">Во избежание возможной путаницы каждому пользователю рекомендуется к примечаниям добавлять свои инициалы.
</mrk></target>
</trans-unit>
<trans-unit id="4" phase-name="pretrans" restype="x-p">
<source>To add, delete, or modify text in the description field, click inside the description field.</source>
<seg-source>To add, delete, or modify text in the description field, click inside the description field.</seg-source>
<target state="final" />
</trans-unit>
</body>

关于xml - XSLT:将同级文本节点移动到选定的节点中以进行XLIFF修复,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/14226315/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com