Skip to content

[BFCL] Multi-turn evaluation problem. #914

@lucenzhong

Description

@lucenzhong

For multi_turn_base_186 at turn 3, the user's question is

There has been a problem with my booking and I previously reached out to support without any feedback yet. Kindly contact customer support on my behalf, emphasizing the smooth facilitation of my travel arrangements.

The model is supposed to call the customer_support_message function with a specific user message. I didn't find any definition of the standard message, as I understand it, the meesage parameter is a generated by the model. The evaluation process considerd the only correct message as the ground truth. I'm wondering if my gernerated message is also correct.

My result:

"contact_customer_support(booking_id='3426812',message='I am experiencing an issue with my booking and require immediate assistance to ensure the smooth facilitation of my travel arrangements. Please address this matter urgently.')"

groud_truth:

"contact_customer_support(booking_id='3426812', message='No feedback yet on my inquiry regarding my flight arrangements. Please expedite the process.')"

Metadata

Metadata

Assignees

No one assigned

    Labels

    BFCL-DatasetBFCL Dataset-Related Issue

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions