HDFlow

Hierarchical Diffusion-Flow Planning for Long-horizon Tasks

★ ICML 2026  ·  Spotlight  ·  Top 2%

Nandiraju Gireesh*1,2   Yuanliang Ju*3   Chaoyi Xu2   Weiheng Liu2   Yuxuan Wan1,2   He Wang1,2

Peking University 1 Peking University
Galbot 2 Galbot
University of Toronto 3 University of Toronto

* Equal Contribution

Abstract

Recent advances in generative models have shown promise in generating behavior plans for long-horizon, sparse reward tasks. While these approaches have achieved promising results, they often lack a principled framework for hierarchical decomposition and struggle with the computational demands of real-time execution, due to their iterative denoising process. In this work, we introduce Hierarchical Diffusion-Flow (HDFlow), a novel hierarchical planning framework that optimally leverages the strengths of diffusion and rectified flow models to overcome the limitations of single-paradigm generative planners. HDFlow employs a high-level diffusion planner to generate sequences of strategic subgoals in a learned latent space, capitalizing on diffusion's powerful exploratory capabilities. These subgoals then guide a low-level rectified flow planner that generates smooth and dense trajectories, exploiting the speed and efficiency of ordinary differential equation (ODE)-based trajectory generation. We evaluate HDFlow on four challenging furniture assembly tasks in both simulation and real-world, where it significantly outperforms state-of-the-art methods. Furthermore, we also showcase our method's generality on two long-horizon benchmarks comprising diverse locomotion and manipulation tasks.

Key Insight

The iterative denoising process of diffusion models is computationally expensive, making them ill-suited for the fast, low-level control required for real-time robotic interaction. Applying diffusion models naively at all levels of a hierarchy inherits this critical drawback, creating a bottleneck at the trajectory generation stage. This raises a fundamental question:

Is a single generative modeling paradigm optimal for all levels of a planning hierarchy?

We empirically show that the answer is no. The requirements for high-level strategic planning are fundamentally different from those of low-level trajectory generation. High-level planning demands exploration and multi-modal diversity to discover viable sequences of subgoals. In contrast, low-level planning demands speed, precision, and deterministic execution to translate a chosen subgoal into a smooth, dense trajectory.

Method

We introduce Hierarchical Diffusion-Flow (HDFlow), a novel hierarchical planning framework that optimally leverages the strengths of diffusion and rectified flow models. Our framework consists of two main stages: World Model Learning, where observations are encoded into a structured latent space, and Hierarchical Planner Training. The latter involves a High-Level diffusion planner generating sparse strategic subgoals with EBM guidance, and a Low-Level rectified flow planner synthesizing dense trajectories between subgoals using an ODE solver.

HDFlow pipeline

Simulation Results

Furniture assembly tasks across four environments and three difficulty levels.

One Leg
Low
Med
High
Lamp
Low
Med
High
Round Table
Low
Med
High
Cabinet
Low
Med

Real-world Results

Physical robot experiments across three furniture assembly tasks.

One Leg
Low
Med
Lamp
Low
Med
Round Table
Low
Med

RLBench Results

18 diverse manipulation tasks from the RLBench benchmark.

Close Jar
Drag Stick
Insert Peg
Meat off Grill
Open Drawer
Place Cups
Place Wine
Push Buttons
Put in Cupboard
Put in Drawer
Put in Safe
Screw Bulb
Slide Block
Sort Shape
Stack Blocks
Stack Cups
Sweep to Dustpan
Turn Tap

OGBench Results

Long-horizon locomotion and manipulation across diverse environments.

Antmaze
Medium
Large
Giant
Humanoidmaze
Medium
Large
Giant
Cube Manipulation
Single
Double
Triple
Scene
Scene
Puzzle
3×3
4×4
4×5
4×6

Authors

Nandiraju Gireesh Nandiraju Gireesh*1,2
Yuanliang Ju Yuanliang Ju*3
Chaoyi Xu Chaoyi Xu2
Weiheng Liu Weiheng Liu2
Yuxuan Wan Yuxuan Wan1,2
He Wang He Wang1,2

1 Peking University  ·  2 Galbot  ·  3 University of Toronto

Citation

@inproceedings{gireesh2026hdflow,
  title     = {{HDFlow}: Hierarchical Diffusion-Flow Planning for Long-horizon Tasks},
  author    = {Nandiraju Gireesh and Yuanliang Ju and Chaoyi Xu
               and Weiheng Liu and Yuxuan Wan and He Wang},
  booktitle = {Proceedings of the 43rd International Conference on Machine Learning},
  year      = {2026},
}

For questions, contact Nandiraju Gireesh at 2401112103@stu.pku.edu.cn or Yuanliang Ju at yuanliang.ju@mail.utoronto.ca.