【PaperReading】RTNeRF & Instant3D

Before

Gatech 组的主要算法优化策略：

首先，选择当前的 SOTA 算法
对 SOTA 算法进行 Profiling，找到性能瓶颈
以增量式优化为主

RT-NeRF

Motivation

认为：目前 NeRF 效率低有两个主要原因：

The commonly used uniform point sampling method
- 朴素的采样方法）
The required dense accesses and computations for embeddings
- 密集的 Embedding 访问和计算

先验信息： Sparsities of pre-existing points

最终有效的采样点应该具有稀疏性

优化方法

Directly computing the geometry of pre-existing points based on the corresponding non-zero cubes of the occupancy grid
- 通过预计算已经存在于 Occupancy Grid 的几何元素，减少采样点数量
Leverages a coarse-grained view-dependent rendering ordering scheme to avoid processing invisible points
- 通过一个粗粒度的排序，减少对某些不可见点的运算
- Object Ordered 思想

Profiling

对 TensoRF 的 Rendering Pipeline 进行 Profiling。

Profiling

Locate the pre-existing points

All the candidate points are uniformly sampled along rays and then the existence of pre-existing points are identified via a query process based on the occupancy grid.

首先在光线上进行一次预采样，通过 Occupancy Grid 来查询点的存在与否

两个 Inefficiency:

The sparsity of the occupancy grid is not leveraged
- 没有利用 Occupancy Grid 的稀疏性先验
The DRAM accesses to the occupancy grid are irregular because the emitted rays can come from any direction, and thus the order of their accesses to the occupancy grid can not be predicted in advance.
- 由于 Ray 的方向并不能预知，Occupancy 的 DRAM-Access 很随机，Locality 差

Proposed Solution？

Directly computes the coordinates of pre-existing points by looping over the non-zero cubes of the occupancy grid.

按照 固定的顺序 访问 Occupancy Grid（也即所谓的“Cube”）

Efficient Rendering Pipeline

将 Occupancy Grid 中的每个 Non-zero cube 近似为一个球，以方便后续步骤的计算；
将上述的球投射到要渲染的图像上，成为一个椭圆（Oval）；
根据待渲染的图像中的 regular arrangement of points ，即一个点对应一个像素，确定椭圆内的点；
使用 Line-Sphere intersections 的解析解来计算出沿着光线射线并且在球内的点的 Geometries。

只有 Pre-exist points 会被包含在循环中。解决了：

Occupancy Grid 的 Sparsity 没有被充分利用
在 SOTA Rendering Pipeline 中， DRAM Access 的不规则性

Early Termination: Volume Rendering

在图形学中，Volume Rendering Integral 的离散化计算主要有两种：

Front-to-back composition: 从前向后积分
- $\begin{cases}\hat{C_i}=\hat{C}{i+1} + \hat{T{i+1}}C_i\\hat{T_i}=\hat{T}_{i+1}(1-\alpha_i)\end{cases}$
Back-to-front composition：从后向前积分
- $\begin{cases}\hat{C_i}=\hat{C_{i-1}}(1-\alpha_i)\\hat{T}i=\hat{T}{i-1}(1-\alpha_i)\end{cases}$