Why is the performance so different from other papers? #4
Comments
Hi, The data splits used in the other papers are most likely different from the one used by us. Neither the MINER paper nor the one you referenced explicitly mentions which split of the MIND dataset they use, so I assume they used the test portion, for which the labels are not publicly available. In contrast, as explained in our paper (Section 2.5), we use the MINDdev portion of the dataset as our test split, and further split the MINDtrain dataset into training and validation portions.
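For clarity, the splitting scheme described above (MINDdev as the held-out test set, MINDtrain further divided into train and validation portions) can be sketched as follows. This is a minimal illustration, not the library's actual implementation; the validation fraction, seed, and function name are assumptions made for the example.

```python
import random

def split_behaviors(train_impressions, dev_impressions, val_frac=0.05, seed=42):
    """Illustrative split: MINDtrain -> train/val, MINDdev -> test.

    NOTE: val_frac, seed, and this interface are assumptions for the sketch;
    the actual split used by the paper may differ (see Section 2.5).
    """
    rng = random.Random(seed)
    shuffled = train_impressions[:]
    rng.shuffle(shuffled)
    n_val = int(len(shuffled) * val_frac)
    val = shuffled[:n_val]          # small held-out portion of MINDtrain
    train = shuffled[n_val:]        # remainder of MINDtrain
    test = dev_impressions          # MINDdev serves as the test split
    return train, val, test

# Usage with dummy impression IDs
train_ids = [f"imp_{i}" for i in range(100)]
dev_ids = [f"dev_{i}" for i in range(20)]
train, val, test = split_behaviors(train_ids, dev_ids)
print(len(train), len(val), len(test))  # 95 5 20
```

The key point is that the test labels here are fully available (they come from MINDdev), whereas papers evaluating on the official MIND test set submit predictions against hidden labels, so the two evaluation settings are not directly comparable.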
Hi, Yes, I understand that different data splits can lead to some variance, but a 10+ point AUC difference is too large. The dev and test sets come from the same dataset and should not show such a dramatic shift.
Hello, I ran into the same problem. I would like to know whether you later validated the reliability of this library by re-partitioning the training and test data, or by hyperparameter tuning using the hyperparameters from the original models such as MINER.
Hi Andreea,
I notice that the model performance reported in your paper is very different from the performance in the original papers.
For example, MINER (Li et al. 2019) got AUC=69.61 on the MIND-small dataset, but your reported performance is only AUC=51.2.
Compared to other work that reproduced the MINER model, this performance is much lower. For example, one paper reported that their reproduced MINER model achieved an AUC of 63.88.
In general, most GeneralRec models in your Table 1 got AUC < 52.00, which differs substantially from the performance reported in other papers.
Could you give any comments on this?