|
NBV - Next-Best-View
|
Problem of selecting the next sensor viewpoint to improve an active reconstruction or inspection objective under a limited acquisition budget.
|
\(\mathcal{M}_{\mathrm{NBV}}\)rl.mdp_nbv
|
\(\mathcal{M}_{\mathrm{NBV}}=(\mathcal{S},\mathcal{A},T,r_e,\gamma,H)\)rl.nbv_mdp
|
|
RRI - Relative Reconstruction Improvement
|
Metric quantifying the relative reconstruction-quality improvement obtained by adding a candidate observation to the current reconstruction.
|
\(\mathrm{RRI}\)oracle.rri
|
\(\mathrm{RRI}(q)=\frac{D(\mathcal{P}_t,\mathcal{M}^{\mathrm{GT}})-D(\mathcal{P}_t\cup\mathcal{P}_q,\mathcal{M}^{\mathrm{GT}})}{D(\mathcal{P}_t,\mathcal{M}^{\mathrm{GT}})+\varepsilon}\)rri.rri
|
|
CD - Chamfer Distance
|
Historical bidirectional distance family used to compare reconstructed points against reference geometry.
|
\(D\)rri.cd_value \(\mathcal{P}\)oracle.points \(\mathcal{M}^{\mathrm{GT}}\)ase.mesh
|
\(D(\mathcal{P},\mathcal{M}^{\mathrm{GT}})=D_{P\to M}(\mathcal{P},\mathcal{M}^{\mathrm{GT}})+D_{M\to P}(\mathcal{P},\mathcal{M}^{\mathrm{GT}})\)rri.cd
|
|
target RRI - Target-Specific RRI
|
RRI computed only on the ground-truth and reconstructed geometry associated with a selected target of interest.
|
\(\mathrm{RRI}_e\)entity.rri_e \(\mathcal{M}_e^{\mathrm{GT}}\)ase.mesh_target
|
\(\mathrm{RRI}_e(q)=\frac{D(\mathcal{P}_t^e,\mathcal{M}_e^{\mathrm{GT}})-D(\mathcal{P}_t^e\cup\mathcal{P}_q^e,\mathcal{M}_e^{\mathrm{GT}})}{D(\mathcal{P}_t^e,\mathcal{M}_e^{\mathrm{GT}})+\varepsilon}\)rri.target_rri
|
|
target - Target of Interest
|
Selected entity, object crop, point, region, or surface-deficit hypothesis whose reconstruction quality should be improved.
|
\(e_t\)rl.target \(\mathcal{M}_e^{\mathrm{GT}}\)ase.mesh_target
|
-
|
|
PC - Point Cloud
|
Set of 3D points representing observed scene geometry.
|
\(\mathcal{P}\)oracle.points \(\mathcal{P}_q\)oracle.points_q
|
-
|
|
candidate - Candidate View
|
Proposed camera pose whose expected reconstruction utility is evaluated before selecting the next observation.
|
\(\mathcal{Q}_t\)oracle.candidates_t \(q_{t,i}\)oracle.candidate_qti
|
-
|
|
oracle RRI - Oracle RRI
|
RRI label computed with privileged ground-truth geometry, used for supervised training and evaluation.
|
\(\mathrm{RRI}\)oracle.rri \(\mathcal{M}^{\mathrm{GT}}\)ase.mesh
|
\(\mathrm{RRI}(q)=\frac{D(\mathcal{P}_t,\mathcal{M}^{\mathrm{GT}})-D(\mathcal{P}_t\cup\mathcal{P}_q,\mathcal{M}^{\mathrm{GT}})}{D(\mathcal{P}_t,\mathcal{M}^{\mathrm{GT}})+\varepsilon}\)rri.rri
|
|
target-conditioned scorer - Target-Conditioned Scorer
|
VIN-style candidate scorer that receives scene state, a candidate view, and an encoding of the target of interest.
|
\(\hat{r}\)vin.rri_hat
|
\(\rho=\operatorname{corr}(\operatorname{rank}(\hat{r}_i),\operatorname{rank}(r_i))\)metrics.spearman \(\mathrm{TopKAcc}(k)=\frac{1}{N}\sum_i\mathbb{1}[y_i\in\mathrm{TopK}(\boldsymbol{\pi}_i,k)]\)metrics.topk_acc
|
|
OBS-SEL - Observed Target Selection
|
Main thesis protocol component requiring target selection to use only actor-visible observed or predicted target evidence.
|
-
|
-
|
|
PRED-Q - Predicted-Target Q
|
Main thesis protocol component requiring scorer or Q_H inputs to use predicted or observed target descriptors.
|
\(Q_H\)rl.qh
|
\(Q_H(s_t^{\mathrm{cf0}},a_t)=\mathbb{E}\left[G_t^{(H)}\mid s_t=s_t^{\mathrm{cf0}},a_t\right]\)rl.q_h
|
|
GT-EVAL - Ground-Truth Target Evaluation
|
Main thesis protocol component using ground-truth OBBs and target mesh crops only for labels and evaluation.
|
\(\mathcal{M}_e^{\mathrm{GT}}\)ase.mesh_target
|
-
|
|
cost - Acquisition Cost
|
Budget consumed to acquire observations, measured by view count, path length, elapsed time, invalid-action rate, or a weighted combination.
|
\(C(\tau)\)rl.acquisition_cost
|
-
|
|
NBV MDP - Target-Conditioned NBV MDP
|
Finite-horizon MDP contract for target-conditioned ARIA-NBV rollouts and fitted Q_H training.
|
\(\mathcal{M}_{\mathrm{NBV}}\)rl.mdp_nbv \(s\)rl.s \(\mathcal{A}(s_t)\)rl.action_set \(T\)rl.transition \(r_t^e\)rl.reward_target \(\gamma\)rl.gamma \(H\)rl.H
|
\(\mathcal{M}_{\mathrm{NBV}}=(\mathcal{S},\mathcal{A},T,r_e,\gamma,H)\)rl.nbv_mdp
|
|
state - Rollout State
|
Rollout state family separating actor-visible state from oracle-only supervision.
|
\(s\)rl.s \(s_t^{\mathrm{hist}}\)rl.s_hist \(s_t^{\mathrm{off}}\)rl.s_off \(s_t^{\mathrm{cf0}}\)rl.s_cf0 \(s_t^{\mathrm{cf+}}\)rl.s_cf_geom \(s_t^{\mathrm{oracle}}\)rl.s_oracle \(\mathcal{P}\)oracle.points \(\mathcal{Q}_t\)oracle.candidates_t \(m_{t,i}\)rl.validity_mask \(\rho_{t,i}\)rl.invalid_reason \(e_t\)rl.target \(b_t\)rl.budget
|
\(s_t^{\mathrm{hist}}=(I_{1:t},T_{1:t},P_{1:t}^{\mathrm{semi}},V^{\mathrm{root}},e_t,b_t)\)rl.s_hist \(s_t^{\mathrm{off}}=(\mathrm{VinSnippetView},\mathcal{Q}_t,N_t,m_{t,i},\ell_{t,i})\)rl.s_off \(s_t^{\mathrm{cf0}}=(V^{\mathrm{root}},\mathcal{P}_t,\mathcal{Q}_t,m_{t,i},\rho_{t,i},e_t,b_t)\)rl.s_cf0 \(s_t^{\mathrm{cf+}}=(s_t^{\mathrm{cf0}},D_{1:t}^{\mathrm{sel}},P_{1:t}^{\mathrm{sel}},N_{1:t}^{\mathrm{sel}})\)rl.s_cf_geom \(s_t^{\mathrm{oracle}}=(s_t^{\mathrm{cf+}},\mathcal{M}^{\mathrm{GT}},\mathcal{M}_e^{\mathrm{GT}},\{D_{t,i}^{\mathrm{GT}},\mathcal{P}_{t,i}^{\mathrm{GT}},\mathrm{RRI}_{t,i}\}_{i=1}^{N_t})\)rl.s_oracle
|
|
historic state - Historic Snippet State
|
Raw actor-visible state from the logged ASE/Project Aria snippet trajectory.
|
\(s_t^{\mathrm{hist}}\)rl.s_hist
|
\(s_t^{\mathrm{hist}}=(I_{1:t},T_{1:t},P_{1:t}^{\mathrm{semi}},V^{\mathrm{root}},e_t,b_t)\)rl.s_hist
|
|
offline state - Persisted Offline Sample State
|
Compact persisted state used by VIN training and offline diagnostics.
|
\(s_t^{\mathrm{off}}\)rl.s_off
|
\(s_t^{\mathrm{off}}=(\mathrm{VinSnippetView},\mathcal{Q}_t,N_t,m_{t,i},\ell_{t,i})\)rl.s_off
|
|
CF0 state - Minimal Counterfactual Actor State
|
Main Q_H actor state for mesh-supervised counterfactual rollouts.
|
\(s_t^{\mathrm{cf0}}\)rl.s_cf0
|
\(s_t^{\mathrm{cf0}}=(V^{\mathrm{root}},\mathcal{P}_t,\mathcal{Q}_t,m_{t,i},\rho_{t,i},e_t,b_t)\)rl.s_cf0
|
|
CF+ state - Geometry-Rich Counterfactual State
|
Counterfactual ablation state with selected synthetic geometry observations.
|
\(s_t^{\mathrm{cf+}}\)rl.s_cf_geom
|
\(s_t^{\mathrm{cf+}}=(s_t^{\mathrm{cf0}},D_{1:t}^{\mathrm{sel}},P_{1:t}^{\mathrm{sel}},N_{1:t}^{\mathrm{sel}})\)rl.s_cf_geom
|
|
oracle state - Oracle Rollout State
|
Privileged rollout state for labels, upper bounds, and evaluation.
|
\(s_t^{\mathrm{oracle}}\)rl.s_oracle
|
\(s_t^{\mathrm{oracle}}=(s_t^{\mathrm{cf+}},\mathcal{M}^{\mathrm{GT}},\mathcal{M}_e^{\mathrm{GT}},\{D_{t,i}^{\mathrm{GT}},\mathcal{P}_{t,i}^{\mathrm{GT}},\mathrm{RRI}_{t,i}\}_{i=1}^{N_t})\)rl.s_oracle
|
|
action set - Finite Candidate Action Set
|
Masked finite action-index set over sampled candidate views.
|
\(\mathcal{A}(s_t)\)rl.action_set \(q_{t,i}\)oracle.candidate_qti \(\mathcal{Q}_t\)oracle.candidates_t \(m_{t,i}\)rl.validity_mask
|
\(\mathcal{Q}_t=\{q_{t,i}\}_{i=1}^{N_t},\quad \mathcal{A}(s_t)=\{i\in\{1,\ldots,N_t\}:m_{t,i}=1\},\quad q_t=q_{t,a_t}\)rl.finite_action_set
|
|
transition - Counterfactual Transition
|
Replayable state update after selecting a candidate index.
|
\(T\)rl.transition \(\mathcal{P}\)oracle.points \(\mathcal{P}_q\)oracle.points_q
|
\(\mathcal{P}_{t+1}=\mathcal{P}_t\cup\mathcal{P}_{q_t}\)rl.counterfactual_transition
|
|
reward - Target-RRI Reward
|
Quality-only immediate reward equal to root-normalized target gain for the selected candidate.
|
\(r_t^e\)rl.reward_target \(\mathrm{RRI}_e\)entity.rri_e \(\mathcal{P}\)oracle.points \(\mathcal{M}_e^{\mathrm{GT}}\)ase.mesh_target
|
\(r_t^e=\mathrm{RRI}_e(q_t\mid \mathcal{P}_t,\mathcal{M}_e^{\mathrm{GT}})\)rl.target_rri_reward
|
|
return - Finite-Horizon Return
|
H-step discounted return over root-normalized target-gain rewards.
|
\(G_t^{(H)}\)rl.return_h \(r_t^e\)rl.reward_target \(\gamma\)rl.gamma \(H\)rl.H
|
\(G_t^{(H)}=\sum_{k=0}^{H-1}\gamma^k r_{t+k}^e\)rl.finite_horizon_return
|
|
Q_H - Finite-Horizon Q Function
|
Finite-horizon candidate-value function for target-conditioned ARIA-NBV.
|
\(Q_H\)rl.qh \(G_t^{(H)}\)rl.return_h \(s_t^{\mathrm{cf0}}\)rl.s_cf0 \(a\)rl.a
|
\(Q_H(s_t^{\mathrm{cf0}},a_t)=\mathbb{E}\left[G_t^{(H)}\mid s_t=s_t^{\mathrm{cf0}},a_t\right]\)rl.q_h \(y_t^Q=r_t+\gamma V(s_{t+1})\)rl.q_backup
|
|
mask - Validity Mask
|
Hard mask that separates feasible candidate actions from invalid candidates.
|
\(m_{t,i}\)rl.validity_mask \(\rho_{t,i}\)rl.invalid_reason \(m\)vin.cand_valid
|
\(m_i=\mathbb{1}[\mathrm{finite}]\mathbb{1}[v_i>0]\mathbb{1}[v_i^{\mathrm{sem}}>0]\)metrics.candidate_validity \(\mathcal{Q}_t=\{q_{t,i}\}_{i=1}^{N_t},\quad \mathcal{A}(s_t)=\{i\in\{1,\ldots,N_t\}:m_{t,i}=1\},\quad q_t=q_{t,a_t}\)rl.finite_action_set
|
|
Project Aria
|
Egocentric research-device and tooling ecosystem for calibrated, time-aligned multimodal sensing.
|
-
|
-
|