PLATO

Mastering Scene Rearrangement with Expert-assisted Curriculum Learning and Adaptive Trade-Off Tree-Search

IROS 2024

Zan Wang¹✶, Hanqing Wang¹✶, Wei Liang¹

✶ indicates equal contribution
¹Beijing Institute of Technology

Abstract

Scene Rearrangement Planning (SRP) has recently emerged as a crucial interior scene task; however, current approaches still face two primary issues. First, prior works define the action space of SRP using handcrafted coarse-grained actions, which are inflexible for transitioning between scene arrangements and impractical for real-world deployment. Second, the scarcity of realistic indoor scene rearrangement data hinders popular data-hungry learning approaches and quantitative evaluation. To tackle these issues, we propose a fine-grained action space definition and curate a large-scale scene rearrangement dataset to facilitate the training of learning approaches and comprehensive benchmarking. Building upon this dataset, we introduce a novel framework, PLATO, designed for efficient agent training and inference. Our approach features an exPert-assisted curriculum Learning (PL) paradigm that combines a Behavior Cloning (BC) curriculum and an offline Reinforcement Learning (RL) curriculum for agent training, along with an advanced tree-search-based planner enhanced by an Adaptive Trade-Off (ATO) strategy to further improve the expert agent's performance. We demonstrate the superior performance of our method over baseline agents through extensive experiments and provide a detailed analysis to elucidate its rationale.

Scene Rearrangement Planning

SRP aims to generate a feasible movement plan that transforms an initial scene layout into a target scene layout by moving furniture.
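To make the task definition concrete, the following is a minimal toy sketch of SRP as a search problem. It is not the PLATO planner: it assumes a small grid room in which every piece of furniture occupies exactly one cell, it uses unit-step moves as a stand-in for a fine-grained action space, and it finds a plan with plain breadth-first search. All names (`plan_rearrangement`, `apply_plan`, `MOVES`) are hypothetical and chosen only for illustration.

```python
from collections import deque

# Assumption: unit-step moves on a grid stand in for fine-grained actions.
MOVES = {"left": (-1, 0), "right": (1, 0), "up": (0, -1), "down": (0, 1)}


def plan_rearrangement(initial, target, width, height):
    """Breadth-first search for a shortest feasible plan.

    initial, target: dict mapping furniture name -> (x, y) grid cell.
    Returns a list of (name, direction) actions, or None if no plan exists.
    """
    names = sorted(initial)
    start = tuple(initial[n] for n in names)
    goal = tuple(target[n] for n in names)

    parent = {start: None}  # state -> (previous state, action taken)
    queue = deque([start])
    while queue:
        state = queue.popleft()
        if state == goal:
            # Walk back through parents to recover the action sequence.
            plan = []
            while parent[state] is not None:
                prev, action = parent[state]
                plan.append(action)
                state = prev
            plan.reverse()
            return plan

        occupied = set(state)
        for i, (x, y) in enumerate(state):
            for direction, (dx, dy) in MOVES.items():
                nx, ny = x + dx, y + dy
                # A move is feasible if it stays in the room and hits nothing.
                if not (0 <= nx < width and 0 <= ny < height):
                    continue
                if (nx, ny) in occupied:
                    continue
                nxt = state[:i] + ((nx, ny),) + state[i + 1:]
                if nxt not in parent:
                    parent[nxt] = (state, (names[i], direction))
                    queue.append(nxt)
    return None


def apply_plan(layout, plan):
    """Execute a plan on a copy of the layout and return the result."""
    result = dict(layout)
    for name, direction in plan:
        dx, dy = MOVES[direction]
        x, y = result[name]
        result[name] = (x + dx, y + dy)
    return result


# Two adjacent pieces must swap places in a 3x3 room.
initial = {"bed": (0, 0), "desk": (1, 0)}
target = {"bed": (1, 0), "desk": (0, 0)}
plan = plan_rearrangement(initial, target, width=3, height=3)
print(plan)
```

Exhaustive search of this kind only works for tiny rooms, because the number of layouts grows exponentially with the number of objects; this is one reason learned agents and guided tree search are of interest for SRP.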

Success Cases

[Figures: ten success cases, each showing the initial layout, the action sequence, and the target layout.]

Failure Cases

[Figures: five failure cases, each showing the initial layout, the action sequence, and the target layout.]

Citation

@inproceedings{wang2024mastering,
  title={Mastering Scene Rearrangement with Expert-assisted Curriculum Learning and Adaptive Trade-Off Tree-Search},
  author={Wang, Zan and Wang, Hanqing and Liang, Wei},
  booktitle={International Conference on Intelligent Robots and Systems (IROS)},
  year={2024}
}

Dataset

We curate a new large-scale dataset containing 13k layout pairs for benchmarking SRP.
