Hestia
1 Hestia: Hierarchical Continuous NBV Bridge
Primary source. Hestia: Voxel-Face-Aware Hierarchical Next-Best-View Acquisition for Efficient 3D Reconstruction [1].
Local source. main.tex and section sources under tex-src/arXiv-Hestia/sec/.
Related ARIA-NBV pages. GenNBV, RL planning, and roadmap.
1.1 Core contribution
Hestia is the most relevant continuous-policy bridge in the current corpus. It remains coverage-driven, but it improves the continuous NBV control structure by decomposing action selection into a look-at proposal and a feasible camera-position policy, and by tracking directional voxel-face visibility [1].
For ARIA-NBV, the transferable idea is not the coverage reward. It is the factorization:
\[ \text{target/look-at proposal} \;\longrightarrow\; \text{feasible camera pose}. \]
1.2 Verified paper signals
| signal | source-backed detail | ARIA-NBV relevance |
|---|---|---|
| Face-aware state | Hestia tracks directional visibility of voxel faces rather than only occupancy. | Inspires directional observation histograms for semi-dense points, EVL voxels, or target crops. |
| Hierarchical action | The controller predicts a look-at point and a 3-DoF position; yaw/pitch are derived from the look-at relation. | Future bridge from finite candidates to target-then-pose continuous control. |
| Feasibility projection | Predicted positions are adjusted to collision-free locations. | Mirrors ARIA-NBV’s need for hard validity masks and feasible candidate transitions. |
| Close-greedy reward | The paper uses immediate face-coverage gains with a small discount factor. | Supports short-horizon evidence gates before long-horizon RL claims. |
| Intermediate supervision | The look-at proposal is supervised by a target derived from uncaptured surface geometry. | Suggests supervised target/missing-surface proposal heads after target-RRI labels are trusted. |
Directional face visibility can be summarized as:
\[ F_t = f_t \lor F_{t-1}, \qquad f_t(v_i,j) = \mathbb{1}\!\left( \frac{a'_t-p_{v_i}}{\lVert a'_t-p_{v_i}\rVert} \cdot n_{i,j} > 0 \right). \]
1.3 ARIA-NBV adoption
- Stretch/bridge: use Hestia to design a later continuous target-then-pose actor, not as the first quantitative thesis baseline.
- Proposal/diagnostic: add directional observation features or incidence histograms to candidate reports.
- Validity: keep feasibility projection and invalid-action handling as hard constraints rather than reward-only penalties.
1.4 Do not adopt
- Do not replace RRI or target RRI with face coverage.
- Do not make continuous actor-critic training mandatory for the thesis result.
- Do not transfer object-centric Objaverse coverage results as evidence for ASE/EFM target-conditioned NBV.
1.5 Open risks / caveats
- Directional face coverage is still a proxy and may disagree with Chamfer-style quality.
- Continuous policies need simulator fidelity, collision handling, and reward throughput that ARIA-NBV does not yet have.
- Hestia is most useful after finite-candidate oracle rollouts and Q_H learning establish the quality-driven baseline.