git - 由 'recursive' 策略进行的 merge-6ren

git - 由 'recursive' 策略进行的 merge

转载作者：行者123 更新时间：2023-12-02 23:48:27

我知道当有多个共同祖先时 git merge recursive 实际上会发生，它会创建一个虚拟提交来 merge 这些共同祖先，然后再继续 merge 最近的提交(对不起，我不确定是否应该有一个术语这)。

但我一直在尝试找到更多关于 git merge 递归策略如何实际工作的详细信息，但找不到太多信息。

任何人都可以详细解释 git merge recursive 的实际执行情况，并通过示例和可能的流程图来帮助更好地可视化吗？

最佳答案

您可以找到 description here (另见 part 2):

When is merge recursive needed?

(Git 2.30，2020 年第一季度，将有 new merge-ort strategy )

What if we find "two common ancestors"? The branch explorer view below shows an alternative in which there are two possible "common ancestors".

Please note: the example is a little bit forced since there's not a good reason – initially – for the developer merging from changeset 11 into 16 instead of merging from changeset 15 (the latest from the branch main at the point of the merge).
But let's assume it has to be done for a reason, let's say, changeset 11 was stable and 13 and 15 weren't at the time, for instance.

The point is: between 15 and 16 there's not a single unique ancestor, but rather, two ancestors at the same "distance": 12 and 11.

虽然这不会经常发生，但它确实很可能发生在长期存在的分支或复杂的分支拓扑中。 (上面描述的案例是解决“多祖先”问题的最短案例，但它也可能发生在“交叉” merge 之间的多个变更集和分支中)。
一种解决方案是“选择”其中一个祖先作为 merge 的有效祖先(这是 Mercurial 采用的选项)，但它有许多缺点。

How merge recursive works?

When more than one valid ancestor is found, the recursive-merge strategy will create a new unique "virtual ancestor" merging the ones initially found.

The following image depicts the algorithm:

A new ancestor 2 will be used as "ancestor" to merge the "src" and "dst".

The "merge recursive strategy" is able to find a better solution than just "selecting one of the two" as I'll describe below.

注意: merge 递归策略最初是 merge “fredrik”策略(参见 commit e4cf17c，2005 年 9 月，Git v0.99.7a)，在 之后。弗雷德里克·库维宁 .
这是一个 python script , 起始于 commit 720d150 ，它说明了原始算法。
有关详细信息，请参阅第 17 页的“ Current Concepts in Version Control Systems from Petr Baudiˇs 2009-09-11”。
|B| = 1 : b(B) = B0
|B| = 2 : b(B) = M(LCA(B0, B1), B0, B1)
M(B, x, y) = ∆−1
(b(B), x ∪ y)
m(x, y) = M(LCA(x, y), x, y)
(是的，我也不知道怎么读)

In case of conflict, the main idea of the algorithm is to simply leave the conflict markers in place when using the result as a base for further merges.
This means that earlier conflicts are properly propagated as well as conflicting changes in newer revisions.

这是指 revctrl.org/CrissCrossMerge ，它描述了 中递归 merge 的上下文交叉 merge .
A criss-cross merge is an ancestry graph in which minimal common ancestors are not unique.
The simplest example with scalars is something like:
  a
 / \
b1  c1
|\ /|
| X |
|/ \|
b2  c2
The story one can tell here is that Bob and Claire made some change independently, then each merged the changes together.
They conflicted, and Bob (of course) decided his change was better, while Claire (typically) picked her version.
Now, we need to merge again. This should be a conflict.

Note that this can happen equally well with a textual merger -- they have each edited the same place in the file, and when resolving the conflict they each choose to make the resulting text identical to their original version (i.e., they don't munge the two edits together somehow, they just pick one to win).
所以:

Another possible solution is to first merge 'b1' and 'c1' to a temporary node (basically, imagine that the 'X' in the diagram is actually a revision, not just edges crossing) and then use that as a base for merging 'b2' and 'c2'.

The interesting part is when merging 'b1' and 'c1' results in conflicts - the trick is that in that case, 'X' is included with the conflicts recorded inside (e.g. using the classical conflict markers).

Since both 'b2' and 'c2' had to resolve the same conflict, in the case they resolved it the same way they both remove the conflicts from 'X' in the same way and a clean merge results; if they resolved it in different ways, the conflicts from 'X' get propagated to the final merge result.

这就是 torek "git merge: how did I get a conflict in BASE file?" 中描述作为“不对称结果”:

"These asymmetric results were harmless, except for the time bomb itself plus the fact that you later ran a recursive merge.
You get to see the conflict. It's up to you to resolve it — again — but this time there's no easy ours/theirs trick, if that worked for persons C and D."

从 revctrl.org/CrissCrossMerge 恢复:

If a merge would result in more than two bases ('b1', 'c1, 'd1'), they are merged consecutively - first 'b1' with 'c1' and then the result with 'd1'.

This is what "Git"'s "recursive merge" strategy does.

使用 Git 2.29(2020 年第四季度)，为准备新的 merge 策略后端，确实提供了对冲突和递归 merge 策略角色的良好描述:
(同样，Git 2.30，2020 年第一季度，将有一个 new merge-ort strategy )
见 commit 1f3c9ba , commit e8eb99d , commit 2a7c16c , commit 1cb5887 , commit 6c74948 , commit a1d8b01 , commit a0601b2 , commit 3df4e3b , commit 3b6eb15 , commit bc29dff , commit 919df31 (2020 年 8 月 10 日)作者: Elijah Newren ( newren ) .
(由 Junio C Hamano -- gitster -- 在 commit 36d225c 中 merge ，2020 年 8 月 19 日)

t6425: be more flexible with rename/delete conflict messages

^{Signed-off-by: Elijah Newren}

First, there's a basic conflict type known as modify/delete, which is a content conflict.
It occurs when one side deletes a file, but the other modifies it.

There is also a path conflict known as a rename/delete.
This occurs when one side deletes a path, and the other renames it.
This is not a content conflict, it is a path conflict.
It will often occur in combination with a content conflict, though, namely a modify/delete.
As such, these two were often combined.

Another type of conflict that can exist is a directory/file conflict.For example, one side adds a new file at some path, and the other side of history adds a directory at the same path.
The path that was "added" could have been put there by a rename, though.
Thus, we have the possibility of a single path being affected by a modify/delete, a rename/delete, and a directory/file conflict.

In part, this was a natural by-product of merge-recursive's design.
Since it was doing a four way merge with the contents of the working tree being the fourth factor it had to consider, it had working tree handling spread all over the code.
It also had directory/file conflict handling spread everywhere through all the other types of conflicts.

A natural outgrowth of this kind of structure is conflict messages that combine all the different types that the current codepath is considering.

However, if we want to make the different conflict types orthogonal and avoid repeating ourselves and getting very brittle code, then we need to split the messages from these different conflict types apart.
Besides, trying to determine all possible permutations is a royal mess.
The code to handle the rename/delete/directory/file conflict output is already somewhat hard to parse, and is somewhat brittle.
But if we really wanted to go that route, then we'd have to have special handling for the following types of combinations:

rename/add/delete: on side of history that didn't rename the given file, remove the file instead and place an unrelated file in the way of the rename

rename/rename(2to1)/mode conflict/delete/delete: two different files, one executable and the other not, are renamed to the same location, each side deletes the source file that the other side renames

rename/rename(1to2)/add/add: file renamed differently on each side of history, with each side placing an unrelated file in the way of the other

rename/rename(1to2)/content conflict/file location/(D/F)/(D/F)/: both sides modify a file in conflicting way, both rename that file but to different paths, one side renames the directory which the other side had renamed that file into causing it to possibly need a transitive rename, and each side puts a directory in the way of the other's path.

Let's back away from this path of insanity, and allow the different types of conflicts to be handled by separate pieces of non-repeated code by allowing the conflict messages to be split into their separate types. (If multiple conflict types affect a single path, the conflict messages can be printed sequentially.) Start this path with a simple change: modify this test to be more flexible and accept the output either merge backend (recursive or the new ort) will produce.

请注意，Git 2.22(2019 年第二季度)将改进递归 merge 策略，因为 git merge-recursive"后端最近(Git 2.18)学习了一种新的启发式方法
根据同一目录中其他文件的方式推断文件移动
感动。
由于这本质上不如基于文件本身的内容相似性(而不是基于其邻居正在做什么)的启发式方法，它有时会给出最终用户意想不到的结果。这已被调低以将重命名的路径留在索引中较高/冲突的阶段，因此
用户可以检查并确认结果。
见 commit 8c8e5bd , commit e62d112 , commit 6d169fd , commit e0612a1 , commit 8daec1d , commit e2d563d , commit c336ab8 , commit 3f9c92e , commit e9cd1b5 , commit 967d6be , commit 043622b , commit 93a02c5 , commit e3de888 , commit 259ccb6 , commit 5ec1e72 (2019 年 4 月 5 日)作者: Elijah Newren ( newren ) .
(由 Junio C Hamano -- gitster -- 在 commit 96379f0 中 merge ，2019 年 5 月 8 日)

merge-recursive: switch directory rename detection default

When all of x/a, x/b, and x/c have moved to z/a, z/b, and z/c on onebranch, there is a question about whether x/d added on a different branch should remain at x/d or appear at z/d when the two branches are merged.
There are different possible viewpoints here:

A) The file was placed at x/d; it's unrelated to the other files in x/ so it doesn't matter that all the files from x/ moved to z/ on one branch; x/d should still remain at x/d.

B) x/d is related to the other files in x/, and x/ was renamed to z/; therefore x/d should be moved to z/d.

由于之前无法检测目录重命名
Git 2.18，用户体验 (A)不管上下文。
选择 (B)在 Git 2.18 中实现，没有返回 (A) 的选项，并从那时起一直在使用。
但是，一位用户报告说 merge 结果不符合他们的预期，这使得更改默认值存在问题，特别是因为目录重命名检测移动文件时没有打印通知。
请注意，这里还有第三种可能性:

C) There are different answers depending on the context and content that cannot be determined by Git, so this is a conflict.
Use a higher stage in the index to record the conflict and notify the user of the potential issue instead of silently selecting a resolution for them.

为用户添加一个选项来指定他们是否使用的偏好
目录重命名检测，默认为 (C) .
即使打开了目录重命名检测，也会添加有关移入新目录的文件的通知消息。

在 Git 2.31(2021 年第一季度)中，“ORT” merge 策略(即我 presented here)会影响旧的递归策略。
见 commit c5a6f65 , commit e2e9dc0 , commit 04af187 , commit 43c1dcc , commit 1c7873c , commit 101bc5b , commit 6784574 (2020 年 12 月 3 日)作者 Elijah Newren ( newren ) .
(由 Junio C Hamano -- gitster -- 在 commit 85cf82f 中 merge ，2021 年 1 月 6 日)

merge-ort: add modify/delete handling and delayed output processing

^{Signed-off-by: Elijah Newren}

The focus here is on adding a path_msg() which will queue up warning/conflict/notice messages about the merge for later processing, storing these in a pathname -> strbuf map.
It might seem like a big change, but it really just is:

declaration of necessary map with some comments

initialization and recording of data

a bunch of code to iterate over the map at print/free time

at least one caller in order to avoid an error about having an unused function (which we provide in the form of implementing modify/delete conflict handling).

At this stage, it is probably not clear why I am opting for delayed output processing.
There are multiple reasons:

Merges are supposed to abort if they would overwrite dirty changesin the working tree.
We cannot correctly determine whether changes would be overwritten until both rename detection has occurred and full processing of entries with the renames has finalized.
Warning/conflict/notice messages come up at intermediate codepaths along the way, so unless we want spurious conflict/warning messages being printed when the merge will be aborted anyway, we need tosave these messages and only print them when relevant.

There can be multiple messages for a single path, and we want all messages for a give path to appear together instead of having them grouped by conflict/warning type.
This was a problem already with merge-recursive.c but became even more important due to the splitting apart of conflict types as discussed in the commit message for 1f3c9ba707 ("t6425: be more flexible with rename/delete conflict messages", 2020-08-10, Git 2.29)

Some callers might want to avoid showing the output in certain cases, such as if the end result is a clean merge.
Rebases have typically done this.

Some callers might not want the output to go to stdout or even stderr, but might want to do something else with it entirely.
For example, a --remerge-diff option to git show or git log -p that remerges on the fly and diffs merge commits against the remerged version would benefit from stdout/stderr not being written to in the standard form.

关于git - 由 'recursive' 策略进行的 merge ，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/55998614/

文章推荐： azure - 在 azure DocumentDb 中向前搜索文档字段

文章推荐： Azure - VM 泛化

三种缓存策略：Cache Aside 策略、Read/Write Through 策略、Write Back 策略
作者：小林coding 计算机八股文网站：https://xiaolincoding.com 大家好，我是小林。今天跟大家聊聊，常见的缓存更新策略。 Cache Aside（旁路缓存）策略； Rea
git - Mercurial merge 策略 vs Git merge 策略
我使用 git 多年，最近为了一个项目改用 mercurial。在过去的 6 个月里，我已经学会了如何通过命令行很好地使用 Mercurial。这可能是我的想象，但在我看来，mercurial 在
oauth-2.0 - Passport & JWT & Google/Facebook 策略 - 我如何结合 JWT 和 Google/Facebook 策略？
这个问题适合任何熟悉的人 Node.js express Passport 带有 Passport 的 JWT 身份验证(JSON Web token ) Facebook OAuth2.0 或谷歌
记录平等的 Coq 策略？
在 Coq 中，当试图证明记录的相等性时，是否有一种策略可以将其分解为所有字段的相等性？例如， Record R := {x:nat;y:nat}. Variables a b c d : nat.
javascript Bootstrap 策略
我正在处理的项目目前只有一个 Bootstrap 文件，用于初始化应用程序中的所有 javascript 对象。类似于下面的代码 if(document.getElementById('nav'))
ubuntu - OpenLDAP 策略
我正在考虑使用 OpenLDAP 在首次登录时添加密码到期和强制更改密码。似乎使用 ppolicy 覆盖来实现这一点。当我在 ppolicy.schema 中看到这个时，我开始使用 ppolicy
避免显卡敏感性的 WPF 策略？
这基本上是我昨天问的一个问题的重新陈述，因为我得到的一个答案似乎没有理解我的问题，所以我一定是不清楚。我的错。因为 WPF 依赖于 DirectX，所以它对卡和驱动程序的内部非常敏感。我有一个案例，
SAML NameId 策略
我是单点登录(SSO)概念的新手。我开始知道 SAML 请求和响应是实现 SSO 流程的最佳方式。然后我开始阅读有关 SAML2.0 的信息。我来了一个术语 NameIdPolicy 在 saml1.
新雪豹用户的 Perl 策略？
关闭。这个问题需要更多 focused .它目前不接受答案。想改进这个问题？更新问题，使其仅关注一个问题 editing this post . 5年前关闭。 Improve this questi
.NET 混淆工具/策略
关闭。这个问题是opinion-based 。目前不接受答案。想要改进这个问题吗？更新问题，以便 editing this post 可以用事实和引文来回答它。 . 已关闭 9 年前。 Improv
用于限制数据库扩展的 Azure 策略
在 Azure 上创建新的 SQL 数据库时，它将“计算+存储”选项设置为“2 vCore + 32GB 数据最大大小”作为默认配置，但我不想使用 vCore，我可以更改它。但问题是，是否可以通过策略
Azure 策略 - 防止在未经身份验证的情况下创建应用程序服务
我希望创建一项策略，防止在未启用身份验证的情况下创建应用服务(仅审核它们是不够的)。以下策略可以正确识别未启用身份验证的现有资源: { "mode": "All", "policyRule"
Azure 策略创建一个deployifnotexists 策略
我正在尝试从现有 AuditIfNotExists 策略创建 DeployIfNotExists 策略。部署时不会出错，但会错误提示“没有相关资源与策略定义中的效果详细信息匹配”。当评估政策时。当我将
Azure 策略创建一个deployifnotexists 策略
我正在尝试从现有 AuditIfNotExists 策略创建 DeployIfNotExists 策略。部署时不会出错，但会错误提示“没有相关资源与策略定义中的效果详细信息匹配”。当评估政策时。当我将
Django json 策略？
我正在使用 wunderground 的 json api 来查询我网站上的天气状况。 api 为我提供了一个包含所有必要数据的漂亮 json 对象，但我每天只能进行多次调用。存储这些数据的首选方式是
Java OOP 策略
我有一个名为可视化数据结构的项目。我有这样的 OOP 设计。 Class VisualDataStructures extends JFrame Class ControlPanel extends
javascript "include"策略
这个问题在这里已经有了答案: 关闭 14 年前。副本: Use javascript to inject script references as needed? Javascript 没有任何指
Android StrictMode 策略
Android 应用程序遇到了一些 ANR 问题，因此我实现了 StrictMode 策略。以前从未使用过这个，所以希望有人可以帮助解释以下内容: 为什么日志显示 2 个看似相似的违规行为，除了前 4
algorithm - 寻找酒店谜语的算法/策略
我目前正在尝试解决一个问题。假设我们在路上行驶，我们知道路上有 10 家酒店。每家酒店都有 0 到 6 星。我的问题是:找到选择星级酒店的最佳解决方案。唯一的问题是:您不能回头去参观您已经决定不去的酒
安卓 MVP 策略
我正在将我的应用程序迁移到 MVP。从这个 konmik 中获得了有关静态演示者模式的提示这是我的简要 MVP 策略。为简洁起见，删除了大部分样板和 MVP 监听器。这个策略帮助我改变了方向，证明了

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

git - 由 'recursive' 策略进行的 merge

When is merge recursive needed?

How merge recursive works?

`t6425`: be more flexible with rename/delete conflict messages

`merge-recursive`: switch directory rename detection default

`merge-ort`: add modify/delete handling and delayed output processing

首页

博学

6Ren·AI

商城

git - 由 'recursive' 策略进行的 merge

When is merge recursive needed?

How merge recursive works?

t6425: be more flexible with rename/delete conflict messages

merge-recursive: switch directory rename detection default

merge-ort: add modify/delete handling and delayed output processing

`t6425`: be more flexible with rename/delete conflict messages

`merge-recursive`: switch directory rename detection default

`merge-ort`: add modify/delete handling and delayed output processing