Encontrar testigos en la suma de enteros minkowski

Sean $A$ y $B$ subconjuntos de $\{0,\ldots,n\}$ . Estamos interesados en encontrar la suma de Minkowski $A+B=\{a+b~|~a\in A,b\in B\}$ .

$\chi_X:\{0,\ldots,2n\}\to \{0,1\}$ es una función característica de $X$ si

χ X (x) = {1 if x \in X 0 otherwise

$\chi_X(x) = \begin{cases} 1 \text{ if } x\in X\\ 0 \text{ otherwise}\end{cases}$

Sea $f$ la convolución discreta de $\chi_A$ y $\chi_B$ , luego $x\in A+B$ si y solo si $f(x)> 0$ . Por lo tanto, $A+B$ se puede calcular en tiempo $O(n\log n)$ por convolución discreta a través de FFT.

A veces es importante averiguar el par real $a\in A$ y $b\in B$ que suma a $x$ . $a\in A$ se llama testigo de $x$ , si existe $b\in B$ tal que $a+b=x$ . Una función $w:A+B\to A$ se llama función testigo si $w(x)$ es testigo de $x$ .

¿Es posible calcular una función testigo en el tiempo ? $O(n\log n)$

convolution fft

— Chao Xu
fuente

no es especialmente difícil. O(npolylogn) $O( n polylog n )$

— Sariel Har-Peled

Puedes usar la búsqueda binaria. por ejemplo, dividir

en dos conjuntos de tamaños aproximadamente iguales

y calcular

; compruebe en cuál de esos

está; y recurse. Esto te dará algo como

. A $A$

AL,AR $A_L,A_R$

AL+B $A_L+B$

AR+B $A_R+B$

x $x$

O(nlg2n) $O(n \lg^2 n)$

— DW

@DW Esto sólo se puede encontrar un testigo para un solo

, pero queremos un testigo para cada elemento de

. (mi redacción parece no estar clara, así que acabo de actualizar la pregunta)x $x$

A+B $A+B$

— Chao Xu

Pero, ¿está interesado en la solución O (n polylog n)?

— Sariel Har-Peled

@ SarielHar-Peled sí, también estoy interesado en el algoritmo determinista

. O(npolylogn) $O(n polylog n)$

— Chao Xu

Respuestas:

Aquí estoy explicando cómo obtener el tiempo de ejecución aleatorio . Necesitamos una secuencia de observaciones: $O(n *\mathrm{polylog} n)$

Un testigo de un valor es un par de números tal que . Deje que y se definan de manera análoga. Observe que el coeficiente de en $v$ $(a,b) \in A \times B$ $a+b=v$ $P_A(x) = \sum_{i \in A} x^i$ $P_B(x)$ $x^v$ es el número de testigos que hay para el valor . $P_A(x) * P_B(x)$ $v$
Assume $v$ has a single witness $(a,b) \in A \times B$ , and consider the the polynomial $Q_A(x) = \sum_{i \in A} i*x^i$ . Clearly, the coefficient of $x^v$ in $Q_A(x)*P_B(x)$ is $a$ , and as such we now know the pair $(a,v-a)$ and we are done.
Entonces, hemos terminado con el caso de que hay un solo testigo. Así que considere el caso de que tiene testigos . Sea $v$ $k$ $(a_1, b_1),\ldots, (a_k,b_k)$ . Observe que $i(k) = \lceil{\lg \sqrt{k}}\rceil$ . Luego, dejemos que, para, parasean muestras aleatorias, de modo que cada elemento dese elija en con probabilidad . La probabilidad de que $2^{i(k)-1} \leq \sqrt{k} \leq 2^{i(k)}$ $R_j = (A_j, B_j)$ $j=1,\ldots, m$ $m=O(\log n)$ $A$ $A_i$ $p = 1/2^{i(k)}$ $v$ has a single witness in $R_j$ is $\alpha = \binom{k}{1}p^2 (1-p^2)^{k-1}$ , since the witness are disjoint pairs of numbers (since the sum of each pair is $v$ ). It is easy to verify that $\alpha$ is a constant in $(0,1)$ independent of the value of $k$ . As such, it must be, with high probability, that $v$ has a single witness in one of the samples $R_1, \ldots, R_{m}$ . As such, by computing the two polynomials associated with with such sample, as described above, in $O(n \log n)$ time (per sample), using FFT, we can decide this in constant time.
We are almost done. Compute the above random samples for resolutions $i=1,\ldots, \lceil\lg n\rceil$ . For each such resolution compute the random samples and associated polynomials. Also, compute the associated polynomial for $A$ and $B$ . This preprocessing naively takes $O(n \log^3 n)$ , but I suspect that being slightly more careful a $\log n$ factor should be removable.
The algorithm: For every value $v$ , compute how many witness, say k, it has in constant time, by consulting the polynomial $Q_A(x)*P_B(x)$ . Next, go to the relevant data-structure for $i(k)$ . Then, it finds the random sample that has it as a single witness, and it extract the pair that is this witness in constant time.
Strangely enough, the preprocessing time is $O(n \log^3 n)$ , but the expected time to find the witness themselves take only $O(n)$ time, since one can stop the search as soon as one find a witness. This suggests that this algorithm should be improveable. In particular, for $i(k) \ll \lg n$ , the polynomials generated are very sparse, and one should be able to do much faster FFT.

— Sariel Har-Peled
fuente

Ok, I've been holding off since really Sariel should get credit for an answer, but I'm tired of waiting, so here is my cut at a near-linear randomized algorithm.

By choosing samples of $n(1-\epsilon)^i$ points, $i=0,1,\dots$ , you can get a logarithmic number of subproblems such that each sum from the original problem has constant probability of being represented uniquely in one of the subproblems (the one where the sampling cuts down the expected number of representations to near 1).
By repeating the sampling process a logarithmic number of times you can get all sums to have unique representations with high probability.
If you have a partition of $A$ and $B$ into two subsets, then by multiplying the numbers by four, adding 2 to the numbers in one of the subsets in $A$ , and adding 1 to the numbers in one of the subsets in $B$ , you can read off from the mod-4 values of the achievable sums which of the two subsets their summands come from.
By repeating the partition process a logarithmic number of times, using each bit position of the binary representations of the values or indices in the subproblems to select the partitions in each step, you can uniquely identify the summands of every uniquely-represented sum.

This blows up the running time by three logarithmic factors; probably that can be reduced.

— David Eppstein
fuente

Ha ha ;). I was in the middle of writing it, and then went to lunch...

— Sariel Har-Peled

This answer gives a determinstic $O(n~\mathrm{polylog} n)$ algorithm.

It appears that Sariel and David's algorithm can be derandomized through an approach similar to this paper. [2] While going through the process I found there is a more general problem that implies this result.

The $k$ -reconstruction problem

There are hidden sets $S_1,\ldots,S_n \subset \{1,\ldots,m\}$ , we have two oracles $Size$ and $Sum$ that take a query set $Q$ .

$Size(Q)$ returns $(|S_1\cap Q|,|S_2\cap Q|,\ldots,|S_n\cap Q|)$ , the size of each intersection.

$Sum(Q)$ returns $(\sum_{s\in S_1\cap Q} s,\sum_{s\in S_2\cap Q} s,\ldots,\sum_{s\in S_n\cap Q} s)$ , the sum of elements in each intersection.

The $k$ -reconstruction problem asks one to find $n$ subsets $S_1',\ldots,S_n'$ such that $S_i'\subset S_i$ and $|S_i'|=\min(k,|S_i|)$ for all $i$ .

Let $f$ be the running time of calling the oracles, and assume $f=\Omega(m+n)$ , then one can find the sets in deterministic $O(f k \log n~\mathrm{polylog}(m))$ time. [1]

Now we can reduce the finding witness problem to $1$ -reconstruction problem. Here $S_1,\ldots,S_{2n}\subset \{1,\ldots,2n\}$ where $S_i = \{a|a+b = i, a\in A, b\in B\}$ .

Define the polynomials $\chi_Q(x) = \sum_{i \in Q} x^i$ , $I_Q(x) = \sum_{i \in Q} i x^i$

The coefficient for $x^i$ in $\chi_Q\chi_B(x)$ is $|S_i\cap Q|$ and in $I_Q\chi_B(x)$ is $\sum_{s\in S_i\cap Q} s$ . Hence the oracles take $O(n\log n)$ time per call.

This gives us an $O(n~\mathrm{polylog}(n))$ time deterministic algorithm.

[1] Yonatan Aumann, Moshe Lewenstein, Noa Lewenstein, Dekel Tsur: Finding witnesses by peeling. ACM Transactions on Algorithms 7(2): 24 (2011)

[2] Noga Alon, Moni Naor: Derandomization, witnesses for Boolean matrix multiplication and construction of perfect hash functions. Algorithmica 16(4-5) (1996)

— Chao Xu
fuente