Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition

IROS 2026

Benedict Quartey, Sebastian Castro, Eric Rosen, Wil Thomason, George Konidaris, Stefanie Tellex

Brown University, Robotics and AI Institute (RAI)

“Learned robot skills that predict both actions and their symbolic effects, enabling skill composition via symbolic planning.”

Abstract

Learning from Demonstration (LfD) enables robots to learn complex behaviors from expert examples, yet existing approaches often fail to generalize to new compositions of known skills without retraining. Modern generative policies model distri- butions over action trajectories alone, thus are unable to reason about the symbolic outcomes required for robust composition. We propose that skills should jointly model action trajectories and the symbolic outcomes they induce. To address this gap, we introduce Predicate-Action Skills (PACTS), a class of closed- loop visuomotor policies that model skills as a joint generative process over action and predicate belief trajectories, producing coherent action–outcome rollouts within a single model. Jointly generating actions and predicates enables PACTS to learn in- ternal representations that improve both action generation and predicate classification. Furthermore, we demonstrate zero-shot composition of learned skills via planning by leveraging online predicate predictions from PACTS as a symbolic interface for sequencing and monitoring execution

Predicate-Action Skills. Conditioned on current observations $\mathbf{o}$, we model skills as a joint generative process over an action trajectory $\mathbf{x}$ and a predicate-belief trajectory $\mathbf{z}$ by learning the coupled distribution $p(\mathbf{x},\mathbf{z}\mid \mathbf{o})$. Starting from noise ($\mathbf{x}_T$,$\mathbf{z}_T$), our model iteratively refines both modalities to produce temporally coherent action–outcome rollouts $(\mathbf{x}_0,\mathbf{z}_0)$. The resulting predicate-belief trajectory $\mathbf{z}_0$ provides an online symbolic interface for monitoring skill execution and planning-based skill composition using off-the-shelf planners.

BibTeX


      @article{quartey2026jointly,
        title={Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition},
        author={Quartey, Benedict and Castro, Sebastian and Rosen, Eric and Thomason, Wil and Konidaris, George and Tellex, Stefanie},
        journal={arXiv preprint arXiv:2605.20648},
        year={2026}
      }

Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition

Abstract

PushBarrier

Kitchen

Coffee Preparation

1. Long-horizon Task Demonstration

2. Keyframe-level Predicate Annotation

3. Generated Skill-Centric Dataset

BibTeX