Are More Data-Efficient and Robust to New Viewpoints and Environments
We reformulate robot actions as a per-pixel regression problem, enabling dramatically improved data efficiency and generalization compared to global coordinate regression, thanks to dense supervision and shift equivarianc
Same Viewpoint and Environment; Ours vs ACT.
97% Task Completion Rate
9% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
97% Task Completion Rate
11% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
95% Task Completion Rate
0% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
52% Task Completion Rate
0% Task Completion Rate
87% Task Completion Rate
4% Task Completion Rate
94% Task Completion Rate
0% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
—% Task Completion Rate
PARA (Ours) — % Completion Rate
ACT — % Completion Rate
PARA (Ours) — % Completion Rate
ACT — % Completion Rate
PARA (Ours) — % Completion Rate
ACT — % Completion Rate
Hand Demonstrations Curtained with PARA (Ours) vs ACT.
50% Task Completion Rate
80% Task Completion Rate
80% Task Completion Rate
100% Task Completion Rate
5% Task Completion Rate
10% Task Completion Rate
15% Task Completion Rate
20% Task Completion Rate
95% Task Completion Rate
87% Task Completion Rate
92% Task Completion Rate
94% Task Completion Rate