Finding the optimal sequence of questions to minimize total student time

Suppose there is a tutorial session at a university. We have a set of $k$ questions $Q = \{ q_1 \ldots q_k \}$ and a set of $n$ students $S = \{ s_1 \ldots s_n \}$. Each student has a doubt in a certain subset of the questions, i.e. for each student $s_j$, let $Q_j \subseteq Q$ be the set of questions in which that student has a doubt. Assume that $\forall 1 \leq j \leq n: Q_j \neq \emptyset$ and $\bigcup_{1\leq j\leq n}Q_j = Q$.

All students enter the tutorial session at the beginning (at $t = 0$). A student leaves the tutorial session as soon as all the questions in which he or she has a doubt have been discussed. Suppose that the time taken to discuss each question is equal, say 1 unit$^*$. Let $t_j$ be the time spent by $s_j$ in the tutorial session. We want to find an optimal permutation $\sigma$ in which the questions are discussed, $(q_{\sigma(1)} \ldots q_{\sigma(k)})$, such that the quantity $T_\sigma = \Sigma_{1\leq j \leq n}t_j$ is minimized.

I have not been able to design a polynomial-time algorithm or prove $\mathsf{NP}$-hardness.

We can define a decision version of the problem:

$\mathsf{TUT} = \{\langle k, n, \mathcal{F}_Q, C \rangle \mid \exists \sigma : T_{\sigma} \leq C\}$

where $\mathcal{F}_Q$ is the set of $Q_j$'s.

We can then find the minimum $T_\sigma$ by binary search on $C$, and find the optimal $\sigma$ by using partial assignments to $\sigma$, in polynomial time given an oracle for $\mathsf{TUT}$. Also, $\mathsf{TUT} \in \mathsf{NP}$, because the optimal $\sigma$ can be used as a certificate that we can easily verify in polynomial time.
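To make the binary search and the certificate check concrete, here is a minimal Python sketch. The names `total_time`, `verify_certificate`, `tut_oracle`, and `minimum_total_time` are hypothetical, and the oracle is only a brute-force stand-in (exponential in $k$), since no polynomial-time decision procedure is known here. The search range relies on $n \leq T_\sigma \leq n \cdot k$, which holds because every $t_j$ lies between 1 and $k$. Recovering $\sigma$ itself via partial assignments is not covered.

```python
from itertools import permutations

def total_time(order, doubts):
    """T_sigma: sum over students of the time their last doubt is discussed."""
    finish = {q: t for t, q in enumerate(order, start=1)}
    return sum(max(finish[q] for q in Q_j) for Q_j in doubts)

def verify_certificate(order, questions, doubts, C):
    """Polynomial-time check that `order` witnesses T_sigma <= C."""
    is_permutation = sorted(order) == sorted(questions)
    return is_permutation and total_time(order, doubts) <= C

def tut_oracle(questions, doubts, C):
    """Stand-in decision oracle (assumption): exhaustive search over orders."""
    return any(total_time(o, doubts) <= C for o in permutations(questions))

def minimum_total_time(questions, doubts, oracle=tut_oracle):
    """Binary search for the least C that the oracle accepts."""
    lo, hi = len(doubts), len(doubts) * len(questions)  # n <= T <= n * k
    while lo < hi:
        mid = (lo + hi) // 2
        if oracle(questions, doubts, mid):
            hi = mid          # some order achieves T <= mid
        else:
            lo = mid + 1      # no order achieves T <= mid
    return lo

# Small instance: two students, three questions (labels are 1, 2, 3).
doubts = [{3}, {1, 2, 3}]
print(minimum_total_time([1, 2, 3], doubts))
```

The binary search makes $O(\log(nk))$ oracle calls, so the overall cost is dominated by whatever the oracle costs.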

My question: Is $\mathsf{TUT}$ $\mathsf{NP}$-complete, or can we design a polynomial-time algorithm for it?

Side note: By the way, I thought of this question after an actual tutorial session, in which the TA discussed the questions in the normal order $q_1 \ldots q_n$, because of which many students had to wait until the end.

Example
Let $k=3$ and $n=2$, with $Q_1 = \{q_3\}$ and $Q_2 = \{q_1, q_2, q_3\}$. We can see that $\sigma = \langle 3, 1, 2 \rangle$ is optimal, because in that case $s_1$ leaves after $t_1 = 1$ and $s_2$ leaves after $t_2 = 3$, so the sum is 4. However, if we discuss the questions in the order $\langle 1, 2, 3\rangle$, then $s_1$ and $s_2$ both have to wait until the end and $t_1 = t_2 = 3$, so the sum is 6.
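The example can be checked mechanically. The following Python sketch uses the hypothetical helpers `total_time` and `brute_force_optimum`; it tries all $k!$ orders, so it is only meant for tiny instances like this one and says nothing about the complexity of the general problem.

```python
from itertools import permutations

def total_time(order, doubts):
    """Sum over students of the time at which their last doubt is discussed.

    order  -- tuple of question labels in discussion order (1 time unit each)
    doubts -- list of non-empty sets Q_j, one per student
    """
    finish = {q: t for t, q in enumerate(order, start=1)}
    return sum(max(finish[q] for q in Q_j) for Q_j in doubts)

def brute_force_optimum(questions, doubts):
    """Try every order of the questions and keep one with minimum total time."""
    return min(permutations(questions), key=lambda o: total_time(o, doubts))

doubts = [{3}, {1, 2, 3}]               # Q_1 = {q_3}, Q_2 = {q_1, q_2, q_3}
print(total_time((3, 1, 2), doubts))    # 4
print(total_time((1, 2, 3), doubts))    # 6
best = brute_force_optimum([1, 2, 3], doubts)
print(best, total_time(best, doubts))   # (3, 1, 2) 4
```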

$^*$ You are free to solve the more general case where each question $q_i$ takes $x_i$ units to discuss!

— skankhunt42

To be clear: do all students enter at the same time, or do they enter at the moment their first question is discussed?

— Discrete lizard

@Discretelizard All students enter at the same time, at the beginning (at $t = 0$).

— skankhunt42

In the current definition, the sets of questions are unique, i.e. a set of questions belongs to at most one student. This may be a reasonable simplification, but I doubt it is realistic (and I doubt that it matters much for the complexity of the problem).

— Discrete lizard

I suppose two students could have exactly the same set of questions, in which case that waiting time would count twice.

— gnasher729

I suspect the problem $\mathsf{TUT}$ is NP-hard. I will show how to transform the problem so that it is strongly related to a problem that is NP-hard. (Yes, this is all rather vague. I basically think my general approach is correct, but I am currently unable to proceed.)

First, note that the problem $\mathsf{TUT}$ can be reformulated as follows:

Given a set of questions $Q$ of size $k$, a set of $n$ subsets $\mathcal{F}_Q\subseteq \mathcal{P}(Q)$ and an integer $C$, does there exist a sequence $\Sigma : \langle S_1,\ldots, S_k\rangle$ such that, for all $i \in\{1,\ldots,k\}$:

1. $S_i\subseteq Q$ and $|S_i|=i$; and
2. $S_i \subset S_j$ for all $j>i$; and
3. $\sum_{i=1}^k |\{q\in\mathcal{F}_Q\mid q\not\subseteq S_i\}| \leq C$?

Note that the set $S_i$ represents the first $i$ questions that will be explained. Conditions 1 and 2 ensure that the subsets are well formed according to this interpretation. Condition 3 counts the number of students that have not yet left at every moment in time, so it sums to the total waiting time among all students (up to the additive constant $n$, since all $n$ students are present during the first unit of time, which the sum starting at $i=1$ does not count).
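The relation between the prefix-set sum and $T_\sigma$ can be checked with a short Python sketch. The helper names `prefix_sets` and `condition_3_sum` are hypothetical. In this sketch the sum of condition 3 comes out as $T_\sigma - n$ for every order, because the sum starts at $i=1$ and so skips the first time unit, during which all $n$ students are still present; the bound $C$ only needs to be shifted by that constant.

```python
from itertools import permutations

def total_time(order, doubts):
    """T_sigma: sum over students of the time their last doubt is discussed."""
    finish = {q: t for t, q in enumerate(order, start=1)}
    return sum(max(finish[q] for q in Q_j) for Q_j in doubts)

def prefix_sets(order):
    """S_1, ..., S_k, where S_i holds the first i questions of the order."""
    return [frozenset(order[:i]) for i in range(1, len(order) + 1)]

def condition_3_sum(order, doubts):
    """Sum over i of the number of sets Q_j that are not contained in S_i."""
    return sum(sum(1 for Q_j in doubts if not Q_j <= S_i)
               for S_i in prefix_sets(order))

doubts = [{3}, {1, 2, 3}]   # two students, three questions
n = len(doubts)
for order in permutations([1, 2, 3]):
    # The two quantities differ by exactly n for every order.
    print(order, total_time(order, doubts), condition_3_sum(order, doubts) + n)
```

Because the prefix sets are built from a single order, conditions 1 and 2 hold by construction.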

Now, we restrict the size of the subsets in $\mathcal{F}_Q$ to $2$, so we can represent these subsets as edges on a graph where the vertices are the elements of $Q$. (A hardness result for this special case is sufficient for hardness of the general problem.)

Now, the problem of minimizing $|\{q\in\mathcal{F}_Q\mid q\not\subseteq S_i\}|$ for a single $i$ (this essentially ignores condition 2) is equivalent to the following problem, which I dub 'Double max $k$-vertex-cover':

Given an undirected graph $G=(V,E)$ and integers $k$ and $t$, does there exist a set of vertices $V'\subseteq V$ of size at most $k$ such that the set $\{(u,v)\in E\mid u\in V' \wedge v \in V' \}$ has size at least $t$?

This problem is NP-hard, since $k$-clique is a special case of it, as this answer shows. However, this is not sufficient to prove $\mathsf{TUT}$ to be NP-hard, since we need to find the maximum for every $i$ while respecting condition 2. Condition 2 is not automatically satisfied by a sequence $\Sigma$ whose sets are each optimal for their own $i$ under conditions 1 and 3 alone: consider the graph on $7$ vertices consisting of two disjoint cycles, one of size $4$ and the other of size $3$. For $i=3$, selecting all vertices in the $3$-cycle gives the maximum, while selecting all vertices of the $4$-cycle is optimal for $i=4$.
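The two-cycle counterexample is small enough to check exhaustively. The Python sketch below uses hypothetical helpers `induced_edge_count` and `best_subsets`, and assumes a vertex numbering of my own choosing (0 to 2 for the $3$-cycle, 3 to 6 for the $4$-cycle). It enumerates sets of exactly the given size, which is equivalent to "at most" here because adding a vertex never removes an induced edge.

```python
from itertools import combinations

def induced_edge_count(vertices, edges):
    """Number of edges with both endpoints inside the chosen vertex set."""
    chosen = set(vertices)
    return sum(1 for u, v in edges if u in chosen and v in chosen)

def best_subsets(n_vertices, edges, size):
    """Maximum induced edge count for the given size, and all sets attaining it."""
    best, winners = -1, []
    for subset in combinations(range(n_vertices), size):
        count = induced_edge_count(subset, edges)
        if count > best:
            best, winners = count, [set(subset)]
        elif count == best:
            winners.append(set(subset))
    return best, winners

# Vertices 0-2 form the 3-cycle, vertices 3-6 form the 4-cycle.
edges = [(0, 1), (1, 2), (2, 0), (3, 4), (4, 5), (5, 6), (6, 3)]

best3, sets3 = best_subsets(7, edges, 3)
best4, sets4 = best_subsets(7, edges, 4)
# Is any optimal 3-set contained in any optimal 4-set?
nested = any(a <= b for a in sets3 for b in sets4)
print(best3, sets3)   # 3 [{0, 1, 2}]
print(best4, sets4)   # 4 [{3, 4, 5, 6}]
print(nested)         # False
```

The unique optimum for size 3 is the triangle and the unique optimum for size 4 is the $4$-cycle, so no sequence of per-size optima can be nested on this graph.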

It seems that condition 2 makes the problem even harder and most certainly not easier, which means $\mathsf{TUT}$ should be NP-hard, but I haven't seen a method to formally prove this.

So, to summarize, I have reduced the question to the following:

Is it possible to include condition 2 to complete the hardness proof for $\mathsf{TUT}$ ?

Side note: The formulation I gave makes it tempting to try an iterative algorithm that minimizes $|\{q\in\mathcal{F}_Q\mid q\not\subseteq S_i\}|$ under condition 2 for $i=1,\ldots,k$, by finding all maximum 'extensions' of all maximum sets found for $i-1$. This does not lead to an efficient algorithm, as the number of maximum sets at a single iteration may be exponential in $k$. Additionally, I have not seen a method to determine whether a subset for some $i$ would eventually become the 'global' maximum, which would be needed to avoid checking an exponential number of subsets.

— Discrete lizard