Skip to content

executor_train: train_qwen3_4b_adpo.sh fails with Could not find 'data/legacy_data' #26

@benj617

Description

@benj617

Hi, thanks for open-sourcing Agent0.

I followed the README and completed the previous steps, but when I run:

bash examples/train/math_tir/train_qwen3_4b_adpo.sh

it fails with:

In 'ppo_trainer': Could not find 'data/legacy_data'

It looks like executor_train/verl_tool/trainer/config/ppo_trainer.yaml references data@data: legacy_data, but the corresponding config/data/legacy_data.yaml file is missing.

Could you please clarify:

  1. Is the executor_train directory complete in the current public release?
  2. Is there any missing config or preprocessing file for executor training?
  3. Could you provide more detailed instructions for executor data preparation and the exact training script/config expected for the Qwen3-4B ADPO setup?

I would also appreciate a clearer description of how the data produced in the data curation step should be converted or placed for executor_train.

Thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions