algorithm - 数组保持不变的概率是多少？-6ren

algorithm - 数组保持不变的概率是多少？

转载作者：行者123 更新时间：2023-12-01 16:36:51

25

4

这个问题在微软面试中被问到了。非常想知道为什么这些人会问这么奇怪的概率问题？

给定一个 rand(N)，一个随机生成器，它生成从 0 到 N-1 的随机数。

int A[N]; // An array of size N
for(i = 0; i < N; i++)
{
    int m = rand(N);
    int n = rand(N);
    swap(A[m],A[n]);
}

编辑:请注意，种子不是固定的。

数组 A 保持不变的概率是多少？
假设数组包含唯一元素。

最佳答案

好吧，我对这个有点乐趣。当我第一次阅读这个问题时，我想到的第一件事是群论(特别是对称群 Sn)。 for 循环通过在每次迭代中组合转置(即交换)来简单地在 Sn 中构建置换 σ。我的数学不是那么出色，而且我有点生疏，所以如果我的符号不合适，请忍受我。

概述

让 A是我们的数组在排列后不变的事件。我们最终被要求找到事件的概率 A , Pr(A) .

我的解决方案尝试遵循以下过程:

考虑所有可能的排列(即我们数组的重新排序)

根据它们包含的所谓身份转换的数量，将这些排列划分为不相交的集合。这有助于将问题减少到仅偶数排列。

给定排列是偶数(并且具有特定长度)，确定获得身份排列的概率。

将这些概率相加以获得数组不变的总体概率。

1) 可能的结果

请注意，for 循环的每次迭代都会创建一个交换(或转置)，导致以下两种情况之一(但不会同时出现):

交换了两个元素。

元素与自身交换。就我们的意图和目的而言，数组没有改变。

我们标记第二种情况。让我们定义一个身份转换如下:

An identity transposition occurs when a number is swapped with itself. That is, when n == m in the above for loop.

对于所列代码的任何给定运行，我们编写 N换位。可以有 0, 1, 2, ... , N出现在这个“链”中的身份转换。

例如，考虑一个 N = 3案例:

Given our input [0, 1, 2].
Swap (0 1) and get [1, 0, 2].
Swap (1 1) and get [1, 0, 2]. ** Here is an identity **
Swap (2 2) and get [1, 0, 2]. ** And another **

请注意，有奇数个非同一性转置 (1) 且数组已更改。

2)基于身份换位次数的分区

让 K_i是事件 i身份换位出现在给定的排列中。请注意，这形成了所有可能结果的详尽划分:

任何排列都不能同时具有两个不同数量的身份转换，并且

所有可能的排列必须介于 0 之间和 N身份转换。

因此我们可以应用 Law of Total Probability :

Now we can finally take advantage of the the partition. Note that when the number of non-identity transpositions is odd, there is no way the array can go unchanged*. Thus:

*_{From group theory, a permutation is even or odd but never both. Therefore an odd permutation cannot be the identity permutation (since the identity permutation is even).}

3) Determining Probabilities

So we now must determine two probabilities for N-i even:

The First Term

The first term, Pr(K_i) , represents the probability of obtaining a permutation with i identity transpositions. This turns out to be binomial since for each iteration of the for loop:

The outcome is independent of the results before it, and
The probability of creating an identity transposition is the same, namely 1/N.

Thus for N trials, the probability of obtaining i identity transpositions is:

The Second Term

So if you've made it this far, we have reduced the problem to finding Pr(A|K_i) for N - i even. This represents the probability of obtaining an identity permutation given i of the transpositions are identities. I use a naive counting approach to determine the number of ways of achieving the identity permutation over the number of possible permutations.

First consider the permutations (n, m) and (m, n) equivalent. Then, let M be the number of non-identity permutations possible. We will use this quantity frequently.

The goal here is to determine the number of ways a collections of transpositions can be combined to form the identity permutation. I will try to construct the general solution along side an example of N = 4.

Let's consider the N = 4 case with all identity transpositions (i.e. i = N = 4). Let X represent an identity transposition. For each X, there are N possibilities (they are: n = m = 0, 1, 2, ... , N - 1). Thus there are N^i = 4^4 possibilities for achieving the identity permutation. For completeness, we add the binomial coefficient, C(N, i), to consider ordering of the identity transpositions (here it just equals 1). I've tried to depict this below with the physical layout of elements above and the number of possibilities below:

I  =  _X_   _X_   _X_   _X_
       N  *  N  *  N  *  N  * C(4, 4) => N^N * C(N, N) possibilities

现在没有明确替换 N = 4和 i = 4 ，我们可以看看一般情况。结合上面的分母，我们发现:

This is intuitive. In fact, any other value other than 1 should probably alarm you. Think about it: we are given the situation in which all N transpositions are said to be identities. What's the probably that the array is unchanged in this situation? Clearly, 1.

Now, again for N = 4, let's consider 2 identity transpositions (i.e. i = N - 2 = 2). As a convention, we will place the two identities at the end (and account for ordering later). We know now that we need to pick two transpositions which, when composed, will become the identity permutation. Let's place any element in the first location, call it t1. As stated above, there are M possibilities supposing t1 is not an identity (it can't be as we have already placed two).

I  =  _t1_   ___   _X_   _X_
       M   *  ?  *  N  *  N

唯一可能排在第二位的元素是 t1 的倒数。，实际上是 t1 (这是唯一的逆唯一性)。我们再次包含二项式系数:在这种情况下，我们有 4 个开放位置，我们希望放置 2 个身份排列。我们有多少种方法可以做到这一点？ 4 选择 2。

I  =  _t1_   _t1_   _X_   _X_ 
       M   *  1   *  N  *  N  * C(4, 2) => C(N, N-2) * M * N^(N-2) possibilities

再看看一般情况，这一切都对应于:

Finally we do the N = 4 case with no identity transpositions (i.e. i = N - 4 = 0). Since there are a lot of possibilities, it starts to get tricky and we must be careful not to double count. We start similarly by placing a single element in the first spot and working out possible combinations. Take the easiest first: the same transposition 4 times.

I  =  _t1_   _t1_   _t1_   _t1_ 
       M   *  1   *  1   *  1   => M possibilities

现在让我们考虑两个独特的元素 t1和 t2 .有 M t1 的可能性并且只有 M-1 t2 的可能性(因为 t2 不能等于 t1 )。如果我们穷尽所有的安排，我们会留下以下模式:

I  =  _t1_   _t1_   _t2_   _t2_ 
       M   *  1   *  M-1 *  1   => M * (M - 1) possibilities   (1)st

   =  _t1_   _t2_   _t1_   _t2_
       M   *  M-1 *  1   *  1   => M * (M - 1) possibilities   (2)nd

   =  _t1_   _t2_   _t2_   _t1_
       M   *  M-1 *  1   *  1   => M * (M - 1) possibilities   (3)rd

现在让我们考虑三个独特的元素， t1 , t2 , t3 .让我们放置 t1先然后 t2 .像往常一样，我们有:

I  =  _t1_   _t2_   ___   ___ 
       M   *  ?   *  ?  *  ?

我们还不能说有多少可能 t2 s 可能还有，我们将在一分钟内看到原因。

我们现在放置 t1在第三个位置。通知， t1必须去那里，因为如果要去最后一个地方，我们只会重新创建 (3)rd上面的安排。重复计算是不好的!这留下了第三个唯一元素 t3到最终位置。

I  =  _t1_   _t2_   _t1_   _t3_ 
       M   *  ?   *  1  *   ?

那么为什么我们要花一点时间来考虑 t2的数量？更接近？换位 t1和 t2 不能是不相交的排列(即它们必须共享其 n 或 m 中的一个(并且只有一个，因为它们也不能相等)。这样做的原因是因为如果它们不相交，我们可以交换排列顺序。这意味着我们将重复计算 (1)st安排。

说 t1 = (n, m) . t2必须是形式 (n, x)或 (y, m)一些 x和 y为了不相交。请注意 x可能不是 n或 m和 y许多不是 n或 m .因此，可能的排列数 t2可能实际上是 2 * (N - 2) .

所以，回到我们的布局:

I  =  _t1_    _t2_    _t1_   _t3_ 
       M   * 2(N-2) *  1   *  ?

现在 t3必须是 t1 t2 t1 的组合的倒数.让我们手动完成:

(n, m)(n, x)(n, m) = (m, x)

因此 t3必须是 (m, x) .注意这是不是与 t1 不相交并且不等于 t1或 t2所以在这种情况下没有重复计算。

I  =  _t1_    _t2_    _t1_   _t3_ 
       M   * 2(N-2) *  1  *   1    => M * 2(N - 2) possibilities

最后，将所有这些放在一起:

4) 把它们放在一起

所以就是这样。向后工作，将我们发现的结果代入步骤 2 中给出的原始总和。我计算了 N = 4 的答案。下面的情况。它与另一个答案中的经验数字非常匹配!

N = 4
M = 6 _________ _________ _________
| Pr(K_i) | Pr(A | K_i) |产品 |
_________|_________|_____________|_________|
| | | | |
|我 = 0 | 0.316 | 120/1296 | 0.029 |
|_________|_________|_____________|_________|
| | | | |
|我 = 2 | 0.211 | 6/36 | 0.035 |
|_________|_________|_____________|_________|
| | | | |
|我 = 4 | 0.004 | 1/1 | 0.004 |
|_________|_________|_____________|_________|
| | |
|总和: | 0.068 |
|_________|_________|

正确性

如果群论中有一个结果可以应用在这里会很酷——也许有!它肯定会有助于使所有这些繁琐的计数完全消失(并将问题缩短为更优雅的问题)。我在 N = 4 停止工作.对于 N > 5 ，给出的只是一个近似值(有多好，我不确定)。如果你仔细想想，很清楚为什么会这样:例如，给定 N = 8换位，显然有四种方法可以用上面没有说明的四个独特元素来创建身份。随着排列变长(据我所知......)，方式的数量似乎变得更加难以计算。

反正我绝对不能在采访范围内做这种事。如果幸运的话，我会走到分母一步。除此之外，它似乎很讨厌。

关于algorithm - 数组保持不变的概率是多少？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/11872190/

25

4

0

文章推荐： ios - 有什么方法可以经常使用 3G/4G/LTE 从后台上传位置？

文章推荐： java - isEven 在另一种方法中

文章推荐： ios - iTunes Connect没有退回任何产品

文章推荐： java抛出检查异常？

Python，概率
接下来是我的代码: with open("test.txt") as f_in: for line in f_in: for char in line:
python 概率
我们有一个六面骰子，面编号为 1 到 6。随着 n 的增加，在第 n 卷中第一次看到 1 的概率降低。我想找到最小的卷数，使得这个概率小于某个给定的限制。 def probTest(limit):
python - Numpy 概率
我只是想知道为什么运行下面的代码时出现错误。我正在尝试使用 numpy 为基于文本的游戏计算概率。下面的代码不是游戏本身的代码。这仅用于测试目的和学习。感谢您提前的答复，请对我宽容一点。 from n
sockets - UDP丢包模拟&概率
我目前正在创建一个与多个arduino板通信的服务器软件。由于硬件原因，我使用UDP协议(protocol)。我有一个非常简单的机制，在大多数情况下，当包裹丢失时，它会重新发送包裹。我现在有两个问题:
Android onfling 概率
我想在 LinearLayout 上添加一个 fling Action 。为此，我使用了以下代码。 public class NewsActivity extends Activity { .
Facebook 拼图(概率)
下面是其中一个 facebook 谜题:我无法理解如何进行此操作。你有 C 个容器、B 个黑球和无限数量的白球。您希望以一种方式在容器之间分配球，即每个容器至少包含一个球，并且选择白球的概率大于或等
c# - 概率。关于希伯来语编码
我有一个希伯来语文本，就像 "×گض¸×¨ض´×™×،ض°×ک×•ض¹×ں"，我想将它转换为可读的 unicode 希伯来语字符。我试过这段代码: const string Str = "×گض¸×
Java Random.nextDouble() 概率
我正在尝试使用 Random.nextDouble() 获取 1.0 和 10.0 之间的随机双数: double number = 1.0 + (10.0-1.0) * Random.nextDou
python - 概率 SVM、回归
我目前已经为二进制类实现了概率(至少我这么认为)。现在我想扩展这种回归方法，并尝试将其用于波士顿数据集。不幸的是，我的算法似乎被卡住了，我当前运行的代码如下所示: from sklearn impor
statistics - K 最近邻分类的“概率”
我在 2D 空间中有一小组数据点(大约 10 个)，每个数据点都有一个类别标签。我希望根据现有数据点标签对新数据点进行分类，并关联属于任何特定标签类别的“概率”。基于最近邻的标签来标记新点是否合适(
python - 如何计算给定输入和预期输出的 ctc 概率？
我正在做我的第一个 tensorflow 项目。我需要获得给定输入和预期序列的 ctc 概率(不是 ctc 损失)。在 python 或 c++ 中是否有任何 api 或方法可以做到这一点？我更
python - 如何向量化多维矩阵的 Softmax 概率
我正在尝试通过 assignment 1斯坦福 cs244n 类(class)。问题 1b 强烈建议对 Softmax 函数进行优化。我设法得到了N维向量的Softmax。我还得到了 MxN 维矩阵的
需要算法帮助! [概率、分布、序列分析。]
我有一个预测算法的想法，该算法可以根据所选项目先前出现的顺序准确预测随机值，并分析模式以提高准确性。基本上是一种接受两个参数的算法，一个是一组可能的选择；另一个是这些数字的历史，分析该模式并预测序列
java - 为什么此代码适用于此 TopCoder 概率？
自 HOURS 以来，我一直在努力思考这个 TopCoder 问题，但无法找到一个完美的解决方案，并找到了下面给出的一个使用得非常漂亮的解决方案! 我想弄清楚这个解决方案如何适用于给定的问题？而我当初
c# - 生成随机 boolean 概率
我只知道如何生成随机 boolean 值(真/假)。默认概率为 50:50 但是我怎样才能用我自己的概率生成真假值呢？假设它以 40:60 或 20:80 等的概率返回 true... 最佳答案一种
julia - 使用 z 分数计算百分位数/概率
对于以下示例，我如何计算 julia 中的百分位数/概率值/尾部区域 Example : N(1100, 200) #Normally distributed with mean 1100 & st
machine-learning - 概率 kNN 和朴素贝叶斯之间的区别
我正在尝试修改标准 kNN 算法来获取属于某个类别的概率，而不仅仅是通常的分类。我还没有找到太多关于概率 kNN 的信息，但据我了解，它的工作原理与 kNN 类似，不同之处在于它计算给定半径内每个类的
PostgreSQL 概率 : EXPLAIN on CREATE INDEX
我正在使用 PostgreSQL 为我所有数据中的变量对计算经验概率密度函数。我试图确定在计算 PDF 之前索引是否/何时更有效。我像这样运行 EXPLAIN CREATE INDEX， EXPLAI
mysql - 概率。使用 tquery.requeSTLive
有谁知道当查询有偏移时如何在 MySql 中请求“实时结果集”(例如:select * from table limit 10 offset 20;)。它正在经历类似的错误 'invalid use
c - 我试图获得 2 个数字的组合(概率)
unsigned long long int first( int b , int c){ int h=b; //int k; for(int k=b-1;k>c;k--){ b=b*k;

首页

博学

6Ren·AI

商城