Clarification about memory baseline configurations (Mem0, MemP, SimpleMem) #13

Hello, thank you for releasing this interesting work and the accompanying code.

I am trying to pin down the exact configurations used for the memory-based baselines in the paper so that I can reproduce them. Could you clarify the following details?

### Mem0

1. Which **embedding model** was used for memory retrieval?
2. Which model was used for **fact extraction / memory writing**?

### MemP

1. Which **LLM** was used for procedural memory construction (e.g., trajectory distillation / reflection)?
2. Was the same model used for **build / update / reflection**, or were different models used for different stages?
3. Which **embedding model** was used for retrieval?

### SimpleMem

1. Which **LLM** was used for memory construction / compression?
2. Which **embedding model** was used for retrieval?

### Trajectories used for memory construction

For the trajectories used to generate memories in these baselines:

* Were they generated by the **teacher model (o3)** or by the **policy model (Qwen2.5-7B-Instruct)**?
* Approximately **how many trajectories** were used to construct the memory entries?

Understanding these details would help reproduce the experimental setup more faithfully.
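
To make the request concrete, here is a rough sketch of the fields I am hoping to fill in. The key names (`embedding_model`, `fact_extraction_model`, `build_model`, and so on) are my own placeholders and are not taken from the repository or from any of the baseline libraries; every value is left as `None` because I do not know it yet.

```python
from typing import Dict, List, Optional

# Hypothetical checklist of the settings asked about above.
# All key names are illustrative placeholders, not real config keys.
baseline_configs: Dict[str, Dict[str, Optional[str]]] = {
    "Mem0": {
        "embedding_model": None,        # used for memory retrieval
        "fact_extraction_model": None,  # used for fact extraction / memory writing
    },
    "MemP": {
        "build_model": None,            # procedural memory construction
        "update_model": None,           # same as build_model, or different?
        "reflection_model": None,       # same as build_model, or different?
        "embedding_model": None,        # used for retrieval
    },
    "SimpleMem": {
        "construction_model": None,     # memory construction / compression
        "embedding_model": None,        # used for retrieval
    },
}

# Settings shared by all baselines (also unknown to me).
trajectory_source: Dict[str, Optional[object]] = {
    "generator_model": None,   # teacher (o3) or policy (Qwen2.5-7B-Instruct)?
    "num_trajectories": None,  # approximate count used to build memories
}


def missing_fields(configs: Dict[str, Dict[str, Optional[str]]]) -> List[str]:
    """Return 'Baseline.field' for every setting that is still unset."""
    return [
        f"{baseline}.{key}"
        for baseline, settings in configs.items()
        for key, value in settings.items()
        if value is None
    ]


if __name__ == "__main__":
    for name in missing_fields(baseline_configs):
        print("unknown:", name)
```

If it is easier, replying with just the values for these fields would be enough for me.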

Thank you very much for your help!

