- html - 出于某种原因,IE8 对我的 Sass 文件中继承的 html5 CSS 不友好?
- JMeter 在响应断言中使用 span 标签的问题
- html - 在 :hover and :active? 上具有不同效果的 CSS 动画
- html - 相对于居中的 html 内容固定的 CSS 重复背景?
我正在尝试 git blame
以下文件(在我的本地机器上运行)因为它太慢而无法生成 GitHub 的错误:
https://github.com/Homebrew/homebrew-core/blob/master/Formula/sqlite.rb
但是在本地运行也很慢,在我的机器上运行一分钟以上
time git --no-pager blame Formula/sqlite.rb > /dev/null
git blame
命令?
最佳答案
在 Git 2.27(2020 年第二季度)中,“git blame
”学会利用“changed-paths
” Bloom filter 存储在提交图文件中,和 introduced with git log
.
见 commit 1b4c57f , commit 24b7d1e , commit fe88f9f (2020 年 4 月 23 日)来自 Jeff King ( peff
) .
见 commit 0906ac2 , commit b23ea97 , commit 8918e37 (2020 年 4 月 16 日) 来自 Derrick Stolee ( derrickstolee
) .
(由 Junio C Hamano -- gitster
-- merge 在 commit 6d56d4c 中,2020 年 5 月 1 日)
blame
: usechanged-path
Bloom filtersSigned-off-by: Derrick Stolee
The
changed-path
Bloom filters help reduce the amount of tree parsing required during history queries.Before calculating a diff, we can ask the filter if a path changed between a commit and its first parent.
- If the filter says "no" then we can move on without parsing trees.
- If the filter says "maybe" then we parse trees to discover if the answer is actually "yes" or "no".
When computing a blame, there is a section in
find_origin()
that computes a diff between a commit and one of its parents.
When this is the first parent, we can check the Bloom filters before callingdiff_tree_oid()
.In order to make this work with the blame machinery, we need to initialize a struct
bloom_key
with the initial path. But also, we need to add more keys to a list if a rename is detected. We then check to see if any of these keys answer "maybe" in the diff.If a user requests copy detection using "
git blame -C
", then there are more places where the set of "important" files can expand. I do not know enough about how this happens in the blame machinery.
Thus, the Bloom filter integration is explicitly disabled in this mode.
A later change could expand thebloom_key
data with an appropriate call (or calls) toadd_bloom_key()
.Generally, this is a performance enhancement and should not change the behavior of '
git blame
' in any way.
If a repo has a commit-graph file with computed changed-path Bloom filters, then they should notice improved performance for their 'git blame
' commands.Here are some example timings that I found by blaming some paths in the Linux kernel repository:
git blame arch/x86/kernel/topology.c
>/dev/null`Before: 0.83s
After: 0.24s
git blame kernel/time/time.c >/dev/null
Before: 0.72s
After: 0.24s
git blame tools/perf/ui/stdio/hist.c >/dev/null
Before: 0.27s
After: 0.11sI specifically looked for "deep" paths that were also edited many times.
As a counterpoint, theMAINTAINERS
file was edited many times but is located in the root tree.
This means that the cost of computing a diff relative to the pathspec is very small. Here are the timings for that command:
git blame MAINTAINERS >/dev/null
Before: 20.1s
After: 18.0sThese timings are the best of five.
The worst-case runs were on the order of 2.5 minutes for both cases.
Note that theMAINTAINERS
file has 18,740 lines across 17,000+ commits. This happens to be one of the cases where this change provides the least improvement.The lack of improvement for the
MAINTAINERS
file and the relatively modest improvement for the other examples can be easily explained.
The blame machinery needs to compute line-level diffs to determine which lines were changed by each commit. That makes up a large proportion of the computation time, and this change does not attempt to improve on that section of the algorithm.
TheMAINTAINERS
file is large and changed often, so it takes time to determine which lines were updated by which commit. In contrast, the code files are much smaller, and it takes longer to compute the line-by-line diff for a single patch on the Linux mailing lists.Outside of the "
-C
" integration, I believe there is little more to gain from the changed-path Bloom filters for 'git blame
' after this patch.
eantoranz
) .
gitster
-- merge 于
commit e1dd499 ,2020 年 9 月 18 日)
blame.c
: replace instance of!oidcmp
foroideq
Signed-off-by: Edmundo Carmona Antoranz
0906ac2b ("
blame
: use changed-path Bloom filters", 2020-04-16, Git v2.27.0-rc0 -- merge listed in batch #6) introduced a call to oidcmp() that should have beenoideq()
, which was introduced in 14438c44 ("introducehasheq()
andoideq()
", 2018-08-28, Git v2.20.0-rc0 -- merge listed in batch #1).
git commit-graph
(
man)写”学会了限制从头开始计算的布隆过滤器的数量
--max-new-filters
选项。
git blame
.
ttaylorr
) .
derrickstolee
) .
gitster
-- merge 于
commit 288ed98 ,2020 年 9 月 29 日)
builtin/commit-graph.c
: introduce '--max-new-filters='Helped-by: Junio C Hamano
Signed-off-by: Taylor Blau
Introduce a command-line flag to specify the maximum number of new Bloom filters that a '
git commit-graph write
'(man) is willing to compute from scratch.Prior to this patch, a commit-graph write with '
--changed-paths
' would compute Bloom filters for all selected commits which haven't already been computed (i.e., by a previous commit-graph write with '--split
' such that a roll-up or replacement is performed).This behavior can cause prohibitively-long commit-graph writes for a variety of reasons:
- There may be lots of filters whose diffs take a long time to generate (for example, they have close to the maximum number of changes, diffing itself takes a long time, etc).
- Old-style commit-graphs (which encode filters with too many entries as not having been computed at all) cause us to waste time recomputing filters that appear to have not been computed only to discover that they are too-large.
This can make the upper-bound of the time it takes for '
git commit-graph write --changed-paths
'(man) to be rather unpredictable.To make this command behave more predictably, introduce '
--max-new-filters=<n>
' to allow computing at most '<n>
' Bloom filters from scratch.
This lets "computing" already-known filters proceed quickly, while bounding the number of slow tasks that Git is willing to do.
git commit-graph
现在包含在其
man page 中:
With the
--max-new-filters=<n>
option, generate at mostn
new Bloomfilters (if--changed-paths
is specified).
Ifn
is-1
, no limit is enforced.
Only commits present in the new layer count against this limit.
To retroactively compute Bloom filters over earlier layers, it is advised to use--split=replace
.
git blame
”(
man)
raffs
) .
gitster
-- merge 于
commit 18decfd ,2021 年 2 月 25 日)
blame
: remove unnecessary use ofget_commit_info()
Signed-off-by: Rafael Silva
Reviewed-by: Taylor Blau
When
git blame
(man)--color-by-age
, thedetermine_line_heat()
is called to select how to color the output based on the commit's author date.
It uses theget_commit_info()
to parse the information into acommit_info
structure, however, this is actually unnecessary because thedetermine_line_heat()
caller also does the same.Instead, let's change the
determine_line_heat()
to take acommit_info
structure and remove the internal call toget_commit_info()
thus cleaning up and optimizing the code path.Enabling Git's trace2 API in order to record the execution time for every call to
determine_line_heat()
function:+ trace2_region_enter("blame", "determine_line_heat", the_repository);
determine_line_heat(ent, &default_color);
+ trace2_region_enter("blame", "determine_line_heat", the_repository);Then, running
git blame
for "kernel/fork.c
" in linux.git and summing all the execution time for every call (around 1.3k calls) resulted in 2.6x faster execution (best out 3):git built from 328c109303 (The eighth batch, 2021-02-12) = 42ms
git built from 328c109303 + this change = 16ms
关于git - 加速 `git blame` 在有很多提交的存储库上,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/57837986/
我时不时地输入“git”,然后想到别的东西,然后输入例如“git checkout master”。当然,这给我留下了 $ git git checkout master git: 'git' is
我做到了 git 克隆 git://foo.git 光盘富 ...编辑文件.. 现在我想重新开始。我不在乎我已经做出的任何改变,但我不想再次克隆整个巨型 foo.git,只是丢失我所有的更改。我怎
我在我的电脑上开发代码,我的计算节点很少。 为了让我的程序保持同步,我决定使用 git。以前,我以一种单向模式使用它来“下推”从 PC 到计算节点的更改。但是时不时遇到计算节点特有的小bug,现场修复
虽然它似乎什么也没做,但它没有给出任何警告或错误消息。有什么想法吗? 最佳答案 来自 Git 源的注释: /* * Read a directory tree. We currently ignor
我知道如何为这样的 HTTPS 请求提供用户名和密码: git clone https://username:password@remote 但我想知道如何像这样向 Remote 提供用户名和密码:
Git GUI、Git Bash 和 Git CMD 之间有什么区别?我是初学者,为了进行安装,我发现自己通常同时使用 git bash 和 git CMD 最佳答案 Git CMD 就像使用 git
有人能告诉我git中文件索引被删除是什么意思吗?这些文件在我的 VS Code 中标记为红色,但我仍然可以修改文件并将更改推送到将反射(reflect)这些更改的远程存储库。我认为这一切都是在我使用命
我通过 git 子树将 GLFV 库添加到项目中,但出现此警告“看起来您的 git 安装或您的 git-subtree 安装已损坏”。还描述了几个原因,为什么这可能是: 如 git --exec-pa
我有需要外部 git 项目的 repo,但我不想使用子模块,因为我想在 github 上存档所有文件,所以我认为我只是将具有 git repo 的整个目录添加到 git 但它不t 添加里面的 .git
我有需要外部 git 项目的 repo,但我不想使用子模块,因为我想在 github 上存档所有文件,所以我认为我只是将具有 git repo 的整个目录添加到 git 但它不t 添加里面的 .git
我一直在阅读一篇文章,作者在其中指示:在现有存储库中创建一个新存储库,并想知道这是否是他忽略的错误。稍后我会与他核实。 这些是我要检查的条件: 将现有目录制作成仓库的条件,并且已经 checkin 主
我确实在不同的计算机上处理相同的项目,我想知道是否有一种方法可以跟踪该 .git 文件夹,这样我就不必在所有本地文件中重新配置配置文件。 我将所有工作推送到 bitbucket。 最佳答案 不,没
这个问题在这里已经有了答案: How does git store files? (3 个答案) 关闭 9 年前。 我为我的许多项目创建了一个远程存储库,所以它是我的push 的目的地。与 git
应该如何在 git 中查看文件内容的完整历史记录? 一个文件在 git 中的历史很短,存储库通过 git-svn 同步,但在 svn 中的历史很长。 git 中的历史记录到达文件移动的位置。要查看历史
我是confused here ... 如何对修改后的文件进行git commit,以及如何对新文件进行git commit? 还有,你如何在git中单独提交文件? 最佳答案 git 提交过程分为两个
正在搜索 throw SO 来寻找答案。遇到这个似乎没有给出任何答案的旧线程。重新触发此线程,希望有人知道! 有人能告诉我 git subtree 和 git filter-branch 的区别吗?为
我想知道是否有一种方法可以避免在每个 Git 命令的开头键入单词 git。 如果有一种方法可以在打开命令提示符进入 “Git 模式” 后只使用一次 git 命令就好了。 例如: git> 之后,我们键
当您修改工作目录中的文件时,git 会告诉您使用“git add”暂存。 当您向工作目录添加新文件时,git 会告诉您使用“git add”开始跟踪。 我对这两个概念有点困惑,因为我假设跟踪文件的更改
为什么 git://有效 $ git clone git://github.com/schacon/grit.git Cloning into 'grit'... ... Checking conne
我在以下沙箱中练习 git:https://learngitbranching.js.org/?NODEMO 我在两个单独的 session 中运行了两组命令。第一组命令顺序如下: git clone
我是一名优秀的程序员,十分优秀!