
baseline score #84

Open
Lorraine-Kwok opened this issue Apr 23, 2024 · 3 comments

Comments

@Lorraine-Kwok

Having a reference point for the baseline model's scores would be incredibly beneficial for my team and me as we develop our approach and compare its performance against the baseline. If possible, could you please provide the baseline model's scores on the validation dataset, or direct us to where we can find this information?

@ChonghaoSima
Contributor

Please see here

@Lorraine-Kwok
Author

Thank you

@Lorraine-Kwok
Author

Hi,

I've been looking at the language scores reported in your results and noticed that the baseline's language score is much lower than GPT's. Additionally, GPT's scores on the sampled data and the test data are almost identical, leading to very similar final scores.

Could you shed some light on the following?

1. Why is there such a big difference in language scores between the baseline and GPT?
2. How can the GPT scores be so close for the sampled and test data?

It seems a bit odd, and I'm trying to make sense of it. Any clarification would be appreciated.
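For intuition on the second point, here is a hypothetical sketch (not the challenge's actual metric, and all names and data below are made up): language scores are typically n-gram overlap metrics, and such metrics can land on near-identical values across two disjoint splits whenever the prediction style is uniform.

```python
# Hypothetical illustration: a toy unigram-precision "language score".
# Two disjoint splits can score identically if predictions are stylistically
# uniform. All data and function names are made up for this example.

from collections import Counter


def unigram_precision(prediction: str, reference: str) -> float:
    """Fraction of predicted tokens that also appear in the reference
    (clipped, so repeated tokens can't over-count)."""
    pred = prediction.lower().split()
    ref = Counter(reference.lower().split())
    if not pred:
        return 0.0
    matched = 0
    for tok in pred:
        if ref[tok] > 0:
            ref[tok] -= 1
            matched += 1
    return matched / len(pred)


def corpus_score(pairs):
    """Average unigram precision over (prediction, reference) pairs."""
    return sum(unigram_precision(p, r) for p, r in pairs) / len(pairs)


# Two disjoint "splits" with stylistically similar predictions.
sampled = [
    ("the car is turning left", "the car turns left at the junction"),
    ("a pedestrian is crossing", "a pedestrian crosses the road"),
]
test = [
    ("the truck is turning right", "the truck turns right at the corner"),
    ("a cyclist is crossing", "a cyclist crosses the street"),
]

print(round(corpus_score(sampled), 3))  # 0.55
print(round(corpus_score(test), 3))     # 0.55
```

Both splits come out at 0.55 even though they share no examples, because the prediction/reference style is the same in each. Something similar could explain near-identical GPT scores on the sampled and test data, but that is only a guess.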

Thanks!

@Lorraine-Kwok Lorraine-Kwok reopened this Apr 24, 2024