We release now the data collected for our paper: Transformers Go for the LOLs: Generating (Humourous) Titles from Scientific Abstracts End-to-End.
Abstract: We consider the end-to-end abstract-to-title generation problem, exploring seven recent transformer based models (including ChatGPT) fine-tuned on more than 30k abstract-title pairs from NLP and machine learning (ML) venues. As an extension, we also consider the harder problem of generating humorous paper titles. For the latter, we compile the first large-scale humor annotated dataset for scientific papers in the NLP/ML domains, comprising ∼2.6k titles. We evaluate all models using human and automatic metrics. Our human evaluation suggests that our best end-to-end system performs similarly to human authors (but arguably slightly worse). Generating funny titles is more difficult, however, and our automatic systems clearly underperform relative to humans and often learn dataset artefacts of humor. Finally, ChatGPT, without any fine-tuning, performs on the level of our best fine-tuned system.
The data for abstract-to-title generation is located in the eval_anno_data/
folder.
2638_humor_annotated_no_squibs.csv
: human annotations regarding humor levels of titles.32952_with_humor_label_no_squibs.csv
: all abstract-title pairs with human/automatic annotations for humor.train_no_squibs.csv
: train setdev_no_squibs.csv
: dev settest_no_squibs.csv
: test set
dataset_230samples.json
: human evaluation for title generation.eval_1_processed.csv
: human evaluation for humorous title generation (BARTXsum vs. BARTXsum-psuedo).eval_2_processed.csv
: human evaluation for humorous title generation (BARTXsum vs. ChatGPT).
The data for abstract+x-to-title generation is located in the long_input/
folder.
data/
: Titles paired with abstract, introduction and conclusion:train.json
: train setdev.json
: dev settest.json
: test setanno/
: human evaluation
models/
: training args for each model with predictions on the test set.
If you use the data from this work, please cite us!
@inproceedings{chen-eger-2023-transformers,
title = "Transformers Go for the {LOL}s: Generating (Humourous) Titles from Scientific Abstracts End-to-End",
author = "Chen, Yanran and
Eger, Steffen",
editor = {Deutsch, Daniel and
Dror, Rotem and
Eger, Steffen and
Gao, Yang and
Leiter, Christoph and
Opitz, Juri and
R{\"u}ckl{\'e}, Andreas},
booktitle = "Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems",
month = nov,
year = "2023",
address = "Bali, Indonesia",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.eval4nlp-1.6",
pages = "62--84",
abstract = "We consider the end-to-end abstract-to-title generation problem, exploring seven recent transformer based models (including ChatGPT) fine-tuned on more than 30k abstract-title pairs from NLP and machine learning (ML) venues. As an extension, we also consider the harder problem of generating humorous paper titles. For the latter, we compile the first large-scale humor annotated dataset for scientific papers in the NLP/ML domains, comprising 2.6k titles. We evaluate all models using human and automatic metrics. Our human evaluation suggests that our best end-to-end system per-forms similarly to human authors (but arguably slightly worse). Generating funny titles is more difficult, however, and our automatic systems clearly underperform relative to humans and often learn dataset artefacts of humor. Finally, ChatGPT, without any fine-tuning, performs on the level of our best fine-tuned system.",
}
If you're interested in the data that hasn't been uploaded, you could also drop me a line: [email protected].