-
Notifications
You must be signed in to change notification settings - Fork 30
Open
Description
Hello, thank you for releasing this interesting work and the accompanying code.
I am trying to better understand the exact configurations used for the memory-based baselines in the paper, especially for reproducibility. I would greatly appreciate some clarification regarding the following details.
Mem0
- Which embedding model was used for memory retrieval?
- Which model was used for fact extraction / memory writing?
MemP
- Which LLM was used for the procedural memory construction (e.g., trajectory distillation / reflection)?
- Was the same model used for build / update / reflection, or were different models used for different stages?
- Which embedding model was used for retrieval?
SimpleMem
- Which LLM model was used for memory construction / compression?
- Which embedding model was used for retrieval?
Trajectories used for memory construction
For the trajectories used to generate memories in these baselines:
- Were they generated by the teacher model (o3) or by the policy model (Qwen2.5-7B-Instruct)?
- Approximately how many trajectories were used to construct the memory entries?
Understanding these details would help reproduce the experimental setup more faithfully.
Thank you very much for your help!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels