Fórmula para tirar dados (fuerza no bruta)

En primer lugar, no estoy seguro de dónde se debe publicar esta pregunta. Estoy preguntando si un problema de estadística es NP-Complete y si no es para resolverlo mediante programación. Lo estoy publicando aquí porque el problema de las estadísticas es el punto central.

Estoy tratando de encontrar una mejor fórmula para resolver un problema. El problema es: si tengo 4d6 (4 dados comunes de 6 lados) y los lanzo todos a la vez, elimino un dado con el número más bajo (llamado "caída"), luego sumo los 3 restantes, ¿cuál es la probabilidad de cada posible resultado? ? Sé que la respuesta es esta:

Sum (Frequency): Probability
3   (1):         0.0007716049
4   (4):         0.0030864198
5   (10):        0.0077160494
6   (21):        0.0162037037
7   (38):        0.0293209877
8   (62):        0.0478395062
9   (91):        0.0702160494
10  (122):       0.0941358025
11  (148):       0.1141975309
12  (167):       0.1288580247
13  (172):       0.1327160494
14  (160):       0.1234567901
15  (131):       0.1010802469
16  (94):        0.0725308642
17  (54):        0.0416666667
18  (21):        0.0162037037

El promedio es 12.24 y la desviación estándar es 2.847.

Encontré la respuesta anterior por fuerza bruta y no sé cómo o si hay una fórmula para ello. Sospecho que este problema es NP-Complete y, por lo tanto, solo se puede resolver con la fuerza bruta. Es posible obtener todas las probabilidades de 3d6 (3 dados normales de 6 lados) y luego sesgar cada uno de ellos hacia arriba. Esto sería más rápido que la fuerza bruta porque tengo una fórmula rápida cuando se guardan todos los dados.

Programé la fórmula para mantener todos los dados en la universidad. Le pregunté a mi profesor de estadística al respecto y encontró esta página , que luego me explicó. Hay una gran diferencia de rendimiento entre esta fórmula y la fuerza bruta: 50d6 tardó 20 segundos pero 8d6 dejó caer los bloqueos más bajos después de 40 segundos (el cromo se queda sin memoria).

¿Es este problema NP-Complete? En caso afirmativo, proporcione una prueba, en caso negativo, proporcione una fórmula de fuerza no bruta para resolverlo.

Tenga en cuenta que no sé mucho sobre NP-Complete, por lo que podría estar pensando en NP, NP-Hard u otra cosa. La prueba de NP-Completeness es inútil para mí, la única razón por la que la pido es para evitar que la gente adivine. Y por favor, acépteme, ya que ha pasado mucho tiempo desde que trabajé en esto: no recuerdo las estadísticas tan bien como podría necesitar resolver esto.

Idealmente, estoy buscando una fórmula más genérica para el número X de dados con lados Y cuando N de ellos se caen, pero estoy comenzando con algo mucho más simple.

Editar:

También preferiría la fórmula a las frecuencias de salida, pero es aceptable solo generar probabilidades.

Para aquellos interesados, he programado la respuesta de whuber en JavaScript en mi GitHub (en este commit solo las pruebas realmente usan las funciones definidas).

dice np

— SkySpiral7
fuente

Esta es una pregunta interesante. Creo que debería estar en el tema aquí. Gracias por su consideración.

— gung - Restablece a Monica

Aunque la configuración es interesante, aún no ha formulado una pregunta que responda: la idea de la completitud NP depende de tener una clase de problemas, mientras que ha descrito solo uno. Exactamente, ¿cómo quieres que se generalice? Si bien insinúa que el número de dados podría variar, son posibles varias opciones adicionales y pueden dar diferentes respuestas: puede cambiar el número de caras, los valores en las caras, el número de dados y el número de dados que se cayeron, todo de varias maneras con varias relaciones entre ellos.

— whuber

@whuber Ella no conoce ninguna teoría de la complejidad, pero creo que está claro que pregunta por la familia de problemas generados al cambiar el número de dados. También creo que tengo un algoritmo eficiente para ello.

— Andy Jones

@Andy veo que al final está pidiendo "una fórmula más genérica para X número de dados con lados Y cuando N de ellos se caen".

— whuber

@whuber Hah! Aparentemente no es tan claro como pensaba entonces. Perdón, es mi culpa.

— Andy Jones

Respuestas:

Solución

Sea dados cada uno, lo que da las mismas oportunidades a los resultados . Deje que sea el mínimo de los valores cuando todos los dados son lanzados de manera independiente. $n=4$ $1, 2, \ldots, d=6$ $K$ $n$

Tenga en cuenta la distribución de la suma de todos los valores condicionales en . Deje ser esta suma. La función generadora para la cantidad de formas de formar cualquier valor dado de , dado que el mínimo es al menos , es $n$ $K$ $X$ $X$ $k$

\begin{matrix} (1) & f_{(n, d, k)} (x) = x^{k} + x^{k + 1} + \dots + x^{d} = x^{k} \frac{1 - x^{d - k + 1}}{1 - x} . \end{matrix}

$f_{(n,d,k)}(x) = x^k+x^{k+1} + \cdots + x^d = x^k\frac{1-x^{d-k+1}}{1-x}.\tag{1}$

Dado que los dados son independientes, la función generadora para la cantidad de formas de formar valores de donde todos los dados muestran valores de o mayores es $X$ $n$ $k$

\begin{matrix} (2) & f_{(n, d, k)} (x)^{n} = x^{k n} {(\frac{1 - x^{d - k + 1}}{1 - x})}^{n} . \end{matrix}

$f_{(n,d,k)}(x)^n = x^{kn}\left(\frac{1-x^{d-k+1}}{1-x}\right)^n.\tag{2}$

Esta función de generación incluye términos para los eventos donde excede , por lo que debemos restarlos. Por lo tanto, la función generadora para la cantidad de formas de formar valores de , dado , es $K$ $k$ $X$ $K=k$

\begin{matrix} (3) & f_{(n, d, k)} (x)^{n} - f_{(n, d, k + 1)} (x)^{n} . \end{matrix}

$f_{(n,d,k)}(x)^n - f_{(n,d,k+1)}(x)^n.\tag{3}$

Tomando nota de que la suma de los valores más altos es la suma de todos los valores menos la más pequeña, igual a . Por tanto, la función de generación necesita ser dividido por . Se convierte en una función generadora de probabilidad al multiplicar por la posibilidad común de cualquier combinación de dados, : $n-1$ $X-K$ $k$ $(1/d)^n$

\begin{matrix} (4) & d^{- n} \sum_{k = 1}^{d} x^{- k} (f_{(n, d, k)} (x)^{n} - f_{(n, d, k + 1)} (x)^{n}) . \end{matrix}

$d^{-n}\sum_{k=1}^dx^{-k}\left(f_{(n,d,k)}(x)^n - f_{(n,d,k+1)}(x)^n\right).\tag{4}$

Dado que todos los productos y potencias polinomiales pueden calcularse en operaciones (son convoluciones y, por lo tanto, pueden llevarse a cabo con la Transformada rápida de Fourier discreta), el esfuerzo computacional total es $O(n\log n)$ . En particular,es un algoritmo de tiempo polinómico. $O(k\,n\log n)$

Ejemplo

El trabajo de Let través del ejemplo en la pregunta con y . $n=4$ $d=6$

La fórmula para el PGF de condicional en da $(1)$ $X$ $K\ge k$

\begin{aligned} f_{(4, 6, 1)} (x) & = x + x^{2} + x^{3} + x^{4} + x^{5} + x^{6} \\ f_{(4, 6, 2)} (x) & = x^{2} + x^{3} + x^{4} + x^{5} + x^{6} \\ \dots \\ f_{(4, 6, 5)} (x) & = x^{5} + x^{6} \\ f_{(4, 6, 6)} (x) & = x^{6} \\ f_{(4, 6, 7)} (x) & = 0. \end{aligned}

$\eqalign{ f_{(4,6,1)}(x) &= x+x^2+x^3+x^4+x^5+x^6 \\ f_{(4,6,2)}(x) &= x^2+x^3+x^4+x^5+x^6 \\ \ldots \\ f_{(4,6,5)}(x) &= x^5+x^6 \\ f_{(4,6,6)}(x) &= x^6 \\ f_{(4,6,7)}(x) &= 0. }$

Elevándolos a la potencia como en la fórmula produce $n=4$ $(2)$

\begin{aligned} f_{(4, 6, 1)} (x)^{4} & = x^{4} + 4 x^{5} + 10 x^{6} + \dots + 4 x^{23} + x^{24} \\ f_{(4, 6, 2)} (x)^{4} & = x^{8} + 4 x^{9} + 10 x^{10} + \dots + 4 x^{23} + x^{24} \\ \dots \\ f_{(4, 6, 5)} (x)^{4} & = x^{20} + 4 x^{21} + 6 x^{22} + 4 x^{23} + x^{24} \\ f_{(4, 6, 6)} (x)^{4} & = x^{24} \\ f_{(4, 6, 7)} (x)^{4} & = 0 \end{aligned}

$\eqalign{ f_{(4,6,1)}(x)^4 &= x^4 + 4x^5 + 10 x^6 + \cdots + 4x^{23} + x^{24} \\ f_{(4,6,2)}(x)^4 &= x^8 + 4x^9 + 10x^{10}+ \cdots + 4x^{23} + x^{24} \\ \ldots \\ f_{(4,6,5)}(x)^4 &=x^{20} + 4 x^{21} + 6 x^{22} + 4x^{23} +x^{24}\\ f_{(4,6,6)}(x)^4 &= x^{24}\\ f_{(4,6,7)}(x)^4 &= 0 }$

Sus sucesivas diferencias en la fórmula son $(3)$

\begin{aligned} f_{(4, 6, 1)} (x)^{4} - f_{(4, 6, 2)} (x)^{4} & = x^{4} + 4 x^{5} + 10 x^{6} + \dots + 12 x^{18} + 4 x^{19} \\ f_{(4, 6, 2)} (x)^{4} - f_{(4, 6, 3)} (x)^{4} & = x^{8} + 4 x^{9} + 10 x^{10} + \dots + 4 x^{20} \\ \dots \\ f_{(4, 6, 5)} (x)^{4} - f_{(4, 6, 6)} (x)^{4} & = x^{20} + 4 x^{21} + 6 x^{22} + 4 x^{23} \\ f_{(4, 6, 6)} (x)^{4} - f_{(4, 6, 7)} (x)^{4} & = x^{24} . \end{aligned}

$\eqalign{ f_{(4,6,1)}(x)^4 - f_{(4,6,2)}(x)^4 &= x^4 + 4x^5 + 10 x^6 + \cdots + 12 x^{18} + 4x^{19} \\ f_{(4,6,2)}(x)^4 - f_{(4,6,3)}(x)^4 &= x^8 + 4x^9 + 10x^{10} + \cdots + 4 x^{20} \\ \ldots \\ f_{(4,6,5)}(x)^4 - f_{(4,6,6)}(x)^4 &=x^{20} + 4 x^{21} + 6 x^{22} + 4x^{23} \\ f_{(4,6,6)}(x)^4 - f_{(4,6,7)}(x)^4 &= x^{24}. }$

La suma resultante en la fórmula es $(4)$

6^{- 4} (x^{3} + 4 x^{4} + 10 x^{5} + 21 x^{6} + 38 x^{7} + 62 x^{8} + 91 x^{9} + 122 x^{10} + 148 x^{11} + 167 x^{12} + 172 x^{13} + 160 x^{14} + 131 x^{15} + 94 x^{16} + 54 x^{17} + 21 x^{18}) .

$6^{-4}\left(x^3 + 4x^4 + 10x^5 + 21x^6 + 38x^7 + 62x^8 + 91x^9 + 122x^{10} + 148x^{11} + \\167x^{12} + 172x^{13} + 160x^{14} + 131x^{15} + 94x^{16} + 54x^{17} + 21x^{18}\right).$

Por ejemplo, la probabilidad de que los tres dados superiores sumen es el coeficiente de , igual a $14$ $x^{14}$

6^{- 4} \times 160 = 10 / 81 = 0.123 456 790 123 456 \dots .

$6^{-4}\times 160 = 10/81 = 0.123\,456\,790\,123\,456\,\ldots.$

Está en perfecto acuerdo con las probabilidades citadas en la pregunta.

Por cierto, la media (calculada a partir de este resultado) es y la desviación estándar es $15869/1296 \approx 12.244598765\ldots$ . $\sqrt{13\,612\,487/1\,679\,616}\approx 2.8468444$

Un cálculo similar (no optimizado) para dados en lugar de tomó menos de medio segundo, lo que respalda la afirmación de que este no es un algoritmo computacionalmente exigente. Aquí hay una trama de la parte principal de la distribución: $n=400$ $n=4$

Dado que el mínimo es muy probable que ser igual a y la suma será extremadamente cerca de tener una normal de distribución (cuya media es y la desviación estándar es de aproximadamente ), el la media debe ser extremadamente cercana a y la desviación estándar extremadamente cercana a . Esto describe muy bien la trama, indicando que es probable que sea correcta. De hecho, el cálculo exacto da una media de alrededor $K$ $1$ $X$ $(400\times 7/2, 400\times 35/12)$ $1400$ $34.1565$ $1400-1=1399$ $34.16$ $2.13\times 10^{-32}$ greater than $1399$ and a standard deviation around $1.24\times 10^{-31}$ less than $\sqrt{400\times 35/12}$ .

— whuber
fuente

Su respuesta es rápida y correcta, así que la marqué como la respuesta. También en una edición dije que también sería bueno tener frecuencias si es posible. Para eso no necesita editar su respuesta ya que puedo ver que el 6^-4multiplicador se usa para convertir de frecuencia a probabilidad.

— SkySpiral7

Editar: @SkySpiral ha tenido problemas para que funcione la siguiente fórmula. Actualmente no tengo tiempo para resolver cuál es el problema, así que si estás leyendo esto, es mejor proceder bajo el supuesto de que es incorrecto.

No estoy seguro sobre el problema general con diferentes números de dados, lados y caídas, pero creo que puedo ver un algoritmo eficiente para el caso de la caída de 1. El calificador es que no estoy completamente seguro de que sea correcto, pero en este momento no puedo ver ningún defecto.

$X_n$ represents the $n$ th die, and suppose $Y_n$ represents the sum of $n$ dice. Then

p (Y_{n} = a) = \sum_{k} p (Y_{n - 1} = a - k) p (X_{n} = k)

$p(Y_n = a) = \sum_k p(Y_{n-1} = a - k)p(X_n=k)$

Now suppose $Z_n$ is the sum of $n$ dice when one die is dropped. Then

p (Z_{n} = a) = p (n th die is the smallest) p (Y_{n - 1} = a) + p (n th die is not the smallest) \sum_{k} p (Z_{n - 1} = a - k) p (X_{n} = k)

$p(Z_n = a) = p(\text{$n$th die is the smallest})p(Y_{n-1} = a) + \\ p(\text{$n$th die is not the smallest})\sum_k p(Z_{n-1} = a - k)p(X_n=k)$

If we define $M_n$ to be distribution of the minimum of $n$ dies, then

p (Z_{n} = a) = p (X_{n} \leq M_{n - 1}) p (Y_{n - 1} = a | X_{n} \leq M_{n - 1}) + p (X_{n} > M_{n - 1}) \sum_{k} p (Z_{n - 1} = a - k) p (X_{n} = k | X_{n} > M_{n - 1})

$p(Z_n = a) = p(X_n \leq M_{n-1})p(Y_{n-1} = a | X_n \leq M_{n-1}) + \\ p(X_n > M_{n-1})\sum_k p(Z_{n-1} = a - k)p(X_n=k | X_n > M_{n-1})$

and we can calculate $M_n$ using

p (M_{n} = a) = p (X_{n} \leq M_{n - 1}) p (X_{n} = a | X_{n} \leq M_{n - 1}) + p (X_{n} > M_{n - 1}) p (M_{n - 1} = a | X_{n} > M_{n - 1})

$p(M_n = a) = p(X_n \leq M_{n-1})p(X_n = a |X_n \leq M_{n-1}) + p(X_n > M_{n-1})p(M_{n-1} = a|X_n > M_{n-1})$

Anyway, together this all suggests a dynamic programming algorithm based on $Y_n, Z_n$ and $M_n$ . Should be quadratic in $n$ .

edit: A comment has been raised on how to calculate $p(X_n \leq M_{n-1})$ . Since $X_n, M_{n-1}$ can each only take on one of six values, we can just sum over all possibilities:

p (X_{n} \leq M_{n - 1}) = \sum_{a, b} p (X_{n} = a, M_{n - 1} = b, a \leq b)

$p(X_n \leq M_{n-1}) = \sum_{a,b} p(X_n = a, M_{n-1} = b, a \leq b)$

Similarly, $p(X_n = k | X_n > M_{n-1})$ can be calculated by applying Bayes rule then summing over the possible values of $X_n, M_{n-1}$ .

— Andy Jones
fuente

+1 This looks correct and you said that's it's quadratic. But it's been a few years since I took statistics (I'm primarily a programmer). So I'd like to fully understand this before marking it as the answer. Also I see you have p(nth is the smallest die) does this include if nth is tied with the smallest? Such as rolling all 3s.

— SkySpiral7

Good catch. If the

n

$n$ th die rolled is the same as the current minimum, we can regard that die as the one to be dropped. In which case the distribution is

Y_{n - 1}

$Y_{n-1}$ . I've swapped some

(<)

$(<)$ s for

(\leq)

$(\leq)$ s to reflect this.

— Andy Jones

Thank you. If I understand this correctly I think your formulas are the answer. However I don't know how to calculate p(X(n) > M(n-1)) (or the negation of it) or p(X(n)=k|X(n) > M(n-1)) so I can't use this answer yet. I'll mark this as the answer but I'd like more information. Can you edit your answer to explain these or should I post it as another question?

— SkySpiral7

Edited my answer.

— Andy Jones

Sorry I know it's been a year and a half but I've finally gotten around to implementing this formula into code. However the p(Z(n)=a) formula appears incorrect. Suppose 2 dice with 2 sides (drop lowest), what are the chances of the result being 1? The chance of X(n) being the smallest or tied is 3/4 and p(Y(n-1)=1) is 1/2 so that Z(n) returns at least 3/8 even though the correct answer is 1/4. The Z formula looks correct to me and I don't know how to fix it. So if it's not too much to ask: what do you think?

— SkySpiral7

I have a reasonably efficient algorithm for this that, on testing, seems to match results of pure brute force while relying less heavily on enumerating all possibilities. It's actually more generalized than the above problem of 4d6, drop 1.

Some notation first: Let $X_NdY$ indicate that you are rolling $X$ dice with $Y$ faces (integer values $1$ to $Y$ ), and considering only the highest $N$ dice rolled. The output is a sequence of dice values, e.g. $4_3d6$ yields $3, 4, 5$ if you rolled $1, 3, 4, 5$ on the four dice. (Note that I'm calling it a "sequence," but the order is not important here, particularly since all we care about in the end is the sum of the sequence.)

The probability $P(X_NdY = S)$ (or more specifically, $P(4_3d6 = S)$ ) is a simplified version of the original problem, where we are only considering a specific set of dice, and not all possible sets that add up to a given sum.

Suppose $S$ has $k$ distinct values, $s_0, s_1, ..., s_k$ , such that $s_i > s_{i+1}$ , and each $s_i$ has a count of $c_i$ . For example, if $S = 3, 4, 4, 5$ , then $(s_0,c_0) = (5,1)$ , $(s_1,c_1) = (4,2)$ , and $(s_2,c_2) = (3,1)$ .

You can calculate $P(X_NdY = S)$ in the following way:

P (X_{N} d Y = S) = \frac{(\prod_{i = 0}^{k - 1} (\binom{X - \sum_{h = 0}^{i - 1} c_{h}}{c_{i}})) (\sum_{j = 0}^{X - N} (\binom{c_{k} + X - N}{c_{k} + X - N - j}) (s_{k} - 1)^{j})}{Y^{X}}

$P(X_NdY = S) = \frac{ \left( \prod_{i=0}^{k-1} {X - \sum_{h=0}^{i-1} c_h \choose c_i} \right) \left( \sum_{j=0}^{X-N} { c_k+X-N \choose c_k+X-N-j} (s_k-1)^j \right)}{ Y^X }$

That's pretty messy, I know.

The product expression $\prod_{i=0}^{k-1}$ is iterating through all but the lowest of the values in $S$ , and calculating all the ways those values may be distributed among the dice. For $s_0$ , that's just $X \choose c_i$ , but for $s_1$ , we have to remove the $c_0$ dice that have already been set aside for $s_0$ , and likewise for $s_i$ you must remove $\sum_{h=0}^{i-1}c_h$ .

The sum expression $\sum_{j=0}^{X-N}$ is iterating through all the possibilities of how many of the dropped dice were equal to $s_k$ , since that affects the possible combinations for the un-dropped dice with $s_k$ as their value.

By example, let's consider $P[4_3d6=(5,4,4)]$ :

(s_{1}, c_{1}) = (5, 1)

$(s_1, c_1) = (5, 1)$

(s_{2}, c_{2}) = (4, 2)

$(s_2, c_2) = (4, 2)$

So using the formula above:

P [4_{3} d 6 = (5, 4, 4)] = \frac{(\binom{4}{1}) ((\binom{3}{3}) \cdot 3^{0} + (\binom{3}{2}) \cdot 3^{1})}{6^{4}} = \frac{5}{162} = 0.0 \bar{308641975}

$P[4_3d6=(5,4,4)] \\ = \frac{ {4 \choose 1} \left( {3 \choose 3} \cdot 3^0 + {3 \choose 2} \cdot 3^1 \right) }{ 6^4 } \\ = \frac{5}{162} = 0.0\overline{308641975}$

The formula breaks down on a domain issue when $s_k=1$ and $j=0$ in the summation, leading to a first term of $0^0$ , which is indeterminate and needs to be treated as $1$ . In such a case, a summation is not actually necessary at all, and can be omitted, since all the dropped dice will also have a value of $s_k = 1$ .

Now here's where I do need to rely on some brute force. The original problem was to calculate the probability of the sum being some value, and $X_NdY$ represents the individual dice left after dropping. This means you must add up the probabilities for all possible sequences $S$ (ignoring ordering) whose sum is the given value. Perhaps there is a formula to calculate this across all such values of $S$ at once, but I haven't even tried broaching that yet.

I've implemented this in Python first, and the above is an attempt to express it mathematically. My Python algorithm is accurate and reasonably efficient. There are some optimizations that could be made for the case of calculating the entire distribution of $\sum X_NdY$ , and maybe I'll do that later.

— Riley John Gibbs
fuente

As a programmer it might be easier for me to understand your Python code (although I've never used Python so it might be the same). Posting the code here is off topic but you could post a link to github etc.

— SkySpiral7

Your answer may be correct and it seems to reduce the complexity from O(Y^X) to O((Y+X-1)!/(X!*(Y-1)!)) but it still isn't as efficient as whuber's answer of O(c*X*log(X)). Thanks for your answer though +1.

— SkySpiral7