Adaptive noise spreading

As discussed above, each atomic wiWw_i \in W has its distortion contribution at the spatial region kik_i on the rendered image. Since the rendering process contains multiple options to improve the quality of the output (noise reduction, anti-aliasing, etc.), it is severe if the region kik_i(s) fall into perceptually insignificant regions [7] of the image because the deliberate distortions raised by wiw_i(s) would be eliminated by the rendering enhancement. Hence, distortions are preferred to be placed in human perceptually significant regions. However, to keep the compromise between the fidelity of the rendered frames and the robustness of noise verification, the strength of the distortion of each wiw_i must be tuned so that its deviation from locally enclosed regions is within a predetermined bound.

We handle this problem using the adaptive noise spreading [16], [17], [18]. Research on the human visual perception agrees that the important information of images is located at high energy and low frequency spectral regions [10]. Then before embedding the object vector Ω\Omega, we render the scene GG at some low settings to get a sampling instance of the image. Next, we proceed both the spatial and spectral analysis on this instance to get perceptually significant spatial regions, called preferred regions. The atomic noises wiw_i(s) will be placed in these regions. Simultaneously, we tune the expectation value μ\mu used in generating atomic noises so that the energy (statistically given in proposition below) of high frequencies of the noises are sufficiently higher than the threshold used in the noise verification.

Proposition 2. (Convergence of energies)

Let {Xi,YiiN}\left\{ \mathcal{X}_i, \mathcal{Y}_i \mid i \in \mathbb{N} \right\} be a set of independent and identically distributed normal random variables XiYiN(μ,σ2)\mathcal{X}_i \sim \mathcal{Y}_i \sim \mathcal{N}\left(\mu, \sigma^2\right). Let [XiYi]t[X_i \quad Y_i]^t be a sample of the random vector [XiYi]t[\mathcal{X}_i \quad \mathcal{Y}_i]^t and wiw_i be the signal (x,y)Ae2iπ(xXi+yYi)\left(x,y\right) \mapsto Ae^{2i\pi\left(\frac{x}{X_i} + \frac{y}{Y_i}\right)} for any iNi \in \mathbb{N}. Then the average of discrete Fourier transforms Fn=1ni=1nFi\overline{F}_n = \frac{1}{n}\sum\limits_{i=1}^{n} F_i converges:

Fn(u,v)na.sAM×N(1e2iπMμ)(1e2iπNμ)(1e2iπ(1μ+uM))(1e2iπ(1μ+vN))\overline{F}_n \left(u,v\right) \xrightarrow[n \to \infty]{a.s} \frac{A}{M \times N} \frac{\left(1 - e^{2i\pi \frac{M}{\mu}}\right) \left(1 - e^{2i\pi \frac{N}{\mu}}\right)}{\left(1 - e^{2i\pi\left(\frac{1}{\mu} + \frac{u}{M}\right)}\right) \left(1 - e^{2i\pi\left(\frac{1}{\mu} + \frac{v}{N}\right)}\right)}

for all 0u<M, 0v<N0 \leq u < M, \ 0 \leq v < N.

Proof. Direct application of the continuous mapping theorem. For details, see Appendix A.

It is worth noting that the rendering work of GG generally contains multiple tasks, each requires to render multiple image frames. But the number of reference instances used for spectral analysis is much smaller, practically less than 1% the total number of rendered frames. While keeping the robustness of ANGV, we can adjust this ratio be even smaller by increasing the size nn of the noise vector WW.

Last updated