Improved error space for universal optimal predictor schemes

Vanessa Kosoy

We construct an error space which is smaller than $Δ_{a v g}^{2}$ but admits analogous existence theorems for optimal predictor schemes.

Results

Construction

Given $ϕ \in Φ$ we define $Δ_{ϕ}^{1}$ to be the set of functions $δ : N \to R^{\geq 0}$ s.t. $\exists ϵ > 0 : {lim}_{k \to \infty} ϕ (k)^{ϵ} δ (k) = 0$ . It is easily seen $Δ_{ϕ}^{1}$ is an error space.

Given $ϕ \in Φ$ , denote $t_{ϕ} (k) := ⌊ 2^{(log k)^{ϕ (k)}} ⌋$ . We define $Δ_{l l, ϕ}^{2}$ to be the set of bounded functions $δ : N^{2} \to R^{\geq 0}$ s.t. for any $ϕ^{'} \in Φ$ , if $ϕ^{'} \leq ϕ$ then

$\frac{t_{ϕ^{'}} (k) - 1 \sum j = 2 (log log (j + 1) - log log j) δ (k, j)}{log log t_{ϕ^{'}} (k)} \in Δ_{ϕ^{'}}^{1}$

We define $Δ_{l l}^{2} := ⋂_{ϕ \in Φ} Δ_{l l, ϕ}^{2}$ .

Proposition 1

$Δ_{l l, ϕ}^{2}$ is an error space for any $ϕ \in Φ$ . $Δ_{l l}^{2}$ is an error space.

Proposition 2

Consider a polynomial $q : N^{2} \to N$ . There is a function $λ_{q} : N^{3} \to [0, 1]$ s.t.

(i) $\forall k, j \in N : \sum i \in N λ_{q} (k, j, i) = 1$

(ii) For any function $ϵ : N^{2} \to [0, 1]$ we have

$ϵ (k, j) - \sum i \in N λ_{q} (k, j, i) ϵ (k, q (k, j) + i) \in Δ_{l l}^{2}$

The proofs of Propositions 1 and 2 are in the Appendix. The following are proved using exactly like the analogous statements for $Δ_{a v g}^{2}$ and we omit the proofs.

Lemma

Consider $(f, μ)$ a distributional estimation problem, $(P, r, a)$ , $(Q, s, b)$ $(p o l y, l o g)$ -predictor schemes. Suppose $p : N^{2} \to N$ a polynomial and $δ \in Δ_{l l}^{2}$ are s.t.

$\forall i, k, j \in N : E [(P^{k, p (k, j) + i} - f)^{2}] \leq E [(Q^{k j} - f)^{2}] + δ (k, j)$

Then $\exists δ^{'} \in Δ_{a v g}^{2}$ s.t.

$E [(P^{k j} - f)^{2}] \leq E [(Q^{k j} - f)^{2}] + δ^{'} (k, j)$

Theorem 1

Consider $(f, μ)$ a distributional estimation problem. Define $Υ : N^{2} \times {0, 1}^{*}^{3} a l g - \to [0, 1]$ by

$Υ^{k j} (x, y, Q) := β (e v^{j} (Q, x, y))$

Define $υ_{f, μ} : N^{2} \to {0, 1}^{*}$ by

$υ_{f, μ}^{k j} := a r g m i n | Q | \leq log j E_{μ^{k} \times U^{j}} [(Υ^{k j} (x, y, Q) - f (x))^{2}]$

Then, $(Υ, j, υ_{f, μ})$ is a $Δ_{l l}^{2} (p o l y, l o g)$ -optimal predictor scheme for $(f, μ)$ .

Theorem 2

There is an oracle machine $Λ$ that accepts an oracle of signature $S F : N \times {0, 1}^{*} \to {0, 1}^{*} \times [0, 1]$ and a polynomial $r : N \to N$ where the allowed oracle calls are $S F^{k} (x)$ for $| x | = r (k)$ and computes a function of signature $N^{2} \times {0, 1}^{*}^{2} \to [0, 1]$ s.t. for any $ϕ \in Φ$ , $(f, μ)$ a distributional estimation problem and $G := (S, F, r^{S}, a^{S})$ a corresponding $Δ_{ϕ}^{1} (l o g)$ -generator, $Λ [G]$ is a $Δ_{l l, ϕ}^{2} (p o l y, l o g)$ -optimal predictor scheme for $(f, μ)$ .

Appendix

Proof of Proposition 1

The only slightly non-obvious condition is (v). We have

$limsup k \to \infty ϕ (k)^{ϵ α} E_{κ_{ϕ}^{k}} [δ (k, j)^{α}] \leq limsup k \to \infty ϕ (k)^{ϵ α} E_{κ_{ϕ}^{k}} [δ (k, j)]^{α}$

$limsup k \to \infty ϕ (k)^{ϵ α} E_{κ_{ϕ}^{k}} [δ (k, j)^{α}] \leq (limsup k \to \infty ϕ (k)^{ϵ} E_{κ_{ϕ}^{k}} [δ (k, j)])^{α}$

$lim k \to \infty ϕ (k)^{ϵ α} E_{κ_{ϕ}^{k}} [δ (k, j)^{α}] = 0$

Proof of Proposition 2

Given functions $q_{1}, q_{2} : N^{2} \to N$ s.t. $q_{1} (k, j) \geq q_{2} (k, j)$ for $k, j ≫ 0$ , the proposition for $q_{1}$ implies the proposition for $q_{2}$ by setting

$λ_{q_{2}} (k, j, i) := {\begin{matrix} λ_{q_{1}} (k, j, i - q_{1} (k, j) + q_{2} (k, j)) & if i - q_{1} (k, j) + q_{2} (k, j) \geq 0 0 & if i - q_{1} (k, j) + q_{2} (k, j) < 0 \end{matrix}$

Therefore, it is enough to prove to proposition for functions of the form $q (k, j) = j^{m + n log k}$ for $m > 0$ .

Consider any $ϕ \in Φ$ . We have

$lim k \to \infty ϕ (k)^{- \frac{1}{2}} = 0$

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{log log k}{ϕ (k) log log k} = 0$

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{log (m + n log k)}{ϕ (k) log log k} = 0$

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{2^{m + n log k} \int x = 2 d (log log x)}{ϕ (k) log log k} = 0$

Since $ϵ$ takes values in $[0, 1]$

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{2^{m + n log k} \int x = 2 ϵ (k, ⌊ x ⌋) d (log log x)}{ϕ (k) log log k} = 0$

Similarly

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{t_{ϕ} (k)^{m + n log k} \int x = t_{ϕ} (k) ϵ (k, ⌊ x ⌋) d (log log x)}{ϕ (k) log log k} = 0$

The last two equations imply that

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{t_{ϕ} (k) \int x = 2 ϵ (k, ⌊ x ⌋) d (log log x) - t_{ϕ} (k)^{m + n log k} \int x = 2^{m + n log k} ϵ (k, ⌊ x ⌋) d (log log x)}{ϕ (k) log log k} = 0$

Raising $x$ to a power is equivalent to adding a constant to $log log x$ , therefore

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{t_{ϕ} (k) \int x = 2 ϵ (k, ⌊ x ⌋) d (log log x) - t_{ϕ} (k) \int x = 2 ϵ (k, ⌊ x^{m + n log k} ⌋) d (log log x)}{ϕ (k) log log k} = 0$

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{t_{ϕ} (k) \int x = 2 (ϵ (k, ⌊ x ⌋) - ϵ (k, ⌊ x^{m + n log k} ⌋)) d (log log x)}{ϕ (k) log log k} = 0$

Since $⌊ x^{m + n log k} ⌋ \geq ⌊ x ⌋^{m + n log k}$ we can choose $λ_{q}$ satisfying condition (i) so that

$j + 1 \int x = j ϵ (k, ⌊ x^{m + n log k} ⌋) d (log log x) = (log log (j + 1) - log log j) \sum i λ_{q} (k, j, i) ϵ (k, j^{m + n log k} + i)$

It follows that

$j + 1 \int x = j ϵ (k, ⌊ x^{m + n log k} ⌋) d (log log x) = j + 1 \int x = j \sum i λ_{q} (k, ⌊ x ⌋, i) ϵ (k, ⌊ x ⌋^{m + n log k} + i) d (log log x)$

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{t_{ϕ} (k) \int x = 2 (ϵ (k, ⌊ x ⌋) - \sum_{i} λ_{q} (k, ⌊ x ⌋, i) ϵ (k, ⌊ x ⌋^{m + n log k} + i)) d (log log x)}{ϕ (k) log log k} = 0$

$lim k \to \infty ϕ (k)^{\frac{1}{2}} \frac{\sum_{j = 2}^{t_{ϕ} (k) - 1} (log log (j + 1) - log log j) (ϵ (k, j) - \sum_{i} λ_{q} (k, j, i) ϵ (k, j^{m + n log k} + i))}{ϕ (k) log log k} = 0$

$ϵ (k, j) - \sum i \in N λ_{q} (k, j, i) ϵ (k, q (k, j) + i) \in Δ_{l l}^{2}$