@weissenh weissenh commented Dec 2, 2025

Step towards #6589: fix the effects of the reordering bug in the bulk metadata corrections from 2025-11-14.

Each change is annotated with a comment referencing the error-introducing commit (for comparison), the paper page, the initial metadata correction issue, etc.

Process

Went through the "Files changed" tab (https://github.com/acl-org/acl-anthology/pull/6469/files)
and noted all instances where a reordering affected the association of affiliations and ORCIDs with authors.
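
The manual check described above could in principle be supported by a small script. The following is only a minimal sketch, not the procedure actually used for this PR: the helper names `author_attrs` and `moved_attributes` are hypothetical, and the two snippets reuse the `<author>` lines from the 2025.arabicnlp-main.26 review comment, labeled neutrally as `version_a` and `version_b` because which one is the pre-fix state is not asserted here.

```python
import xml.etree.ElementTree as ET


def author_attrs(xml_fragment):
    """Map (first, last) -> (affiliation, orcid) for each <author> element."""
    # Wrap the fragment so that several sibling <author> elements parse as one tree.
    root = ET.fromstring(f"<paper>{xml_fragment}</paper>")
    result = {}
    for author in root.iter("author"):
        name = (author.findtext("first"), author.findtext("last"))
        result[name] = (author.findtext("affiliation"), author.get("orcid"))
    return result


def moved_attributes(version_a, version_b):
    """Return names present in both versions whose affiliation or ORCID differs."""
    a, b = author_attrs(version_a), author_attrs(version_b)
    return sorted(name for name in a.keys() & b.keys() if a[name] != b[name])


version_a = """
<author><first>Norah</first><last>Alshahrani</last><affiliation>University of Bisha</affiliation></author>
<author><first>Saied</first><last>Alshahrani</last><affiliation>ASAS AI</affiliation></author>
"""
version_b = """
<author><first>Norah</first><last>Alshahrani</last><affiliation>ASAS AI</affiliation></author>
<author><first>Saied</first><last>Alshahrani</last><affiliation>University of Bisha</affiliation></author>
"""

changed = moved_attributes(version_a, version_b)
print(changed)
```

Keying on the full name rather than on the position in the list means a pure reordering is not flagged, while an affiliation or ORCID that ends up on a different person is.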

Changes:

2024.acl-long.796

in data/xml/2024.acl.xml
Issue of metadata correction: #6328
2024.acl-long.796
Paper page: https://aclanthology.org/2024.acl-long.796.pdf
Commit showing the bug effect: f88a83e

Only the affiliations needed to be changed.
Note: the affiliations don't always match what is shown on the PDF.

2025.arabicnlp-main.26

in data/xml/2025.arabicnlp.xml
Issue of metadata correction: #6401
2025.arabicnlp-main.26
Paper page: https://aclanthology.org/2025.arabicnlp-main.26/
Commit showing the bug effect: db094a2

Only the affiliations needed to be changed.
Saied and Norah (same last name) were swapped.

2025.arabicnlp-sharedtasks.133

in data/xml/2025.arabicnlp.xml
Issue of metadata correction: #6335
2025.arabicnlp-sharedtasks.133
Paper page: https://aclanthology.org/2025.arabicnlp-sharedtasks.133/
Commit showing the bug effect: c8dfe7c

just affiliation (meaningless "NA")

Noted another name inconsistency: the last author's name is Ensaf Hussein, but the metadata gives the last name as Mohamed. The metadata correction issue only updated authors_new but not the authors list itself. This author should probably also have name variants recorded: there are currently 3 author pages (Ensaf Hussein Mohamed, Ensaf Mohamed, Ensaf H. Mohamed) that could probably be merged, and several inconsistencies between metadata and PDF.

2025.starsem-1.18

in data/xml/2025.starsem.xml
Issue of metadata correction: #6394
2025.starsem-1.18
Paper page: https://aclanthology.org/2025.starsem-1.18/
Commit showing the bug effect: ec5e183

The bug affected ORCID and affiliation.
Confirmed that the ORCID is now correct for Haw-Shiuan Chang: https://orcid.org/0000-0003-4607-936X

2025.wmt-1.85

in data/xml/2025.wmt.xml
Issue of metadata correction: #6448
2025.wmt-1.85
Paper page: https://aclanthology.org/2025.wmt-1.85/
Commit showing the bug effect: 53420bc

Bug only affected affiliation

2025.emnlp-main.1435

in data/xml/2025.emnlp.xml
Issue of metadata correction: #6422
2025.emnlp-main.1435
Paper page: https://aclanthology.org/2025.emnlp-main.1435/
Commit showing the bug effect: 7ba600b

The bug affected affiliation and ORCID.
Check that the ORCID belongs to the correct person (Ranathunga): https://orcid.org/0000-0003-0701-0204

Introduced by metadata corrections 2025-11-14
Note that the affiliations in XML don't always match with what is found on the PDF.
Introduced by metadata corrections 2025-11-14
Introduced by metadata corrections 2025-11-14
Pretty meaningless affiliation 'Institute', but nonetheless.
Noticed that the last author's name was inconsistent between PDF and metadata; fixed too.
Introduced by metadata corrections 2025-11-14
Affiliation and ORCID put back to correct author
@weissenh weissenh self-assigned this Dec 2, 2025

github-actions bot commented Dec 2, 2025

Introduced by metadata corrections 2025-11-14
Affiliations put back to correct author
Introduced by metadata corrections 2025-11-14
Affiliations and orcid put back to correct author
<author><first>Salsabil Maulana</first><last>Akbar</last><affiliation>Universitas Telkom</affiliation></author>
<author><first>Nuur</first><last>Shadieq</last><affiliation>Universitas Telkom</affiliation></author>
<author><first>Wawan</first><last>Cenggoro</last><affiliation>Binus University</affiliation></author>
<author><first>Salsabil Maulana</first><last>Akbar</last><affiliation>Institut Teknologi Bandung</affiliation></author>

Issue of metadata correction: #6328
2024.acl-long.796
Paper page: https://aclanthology.org/2024.acl-long.796.pdf
Commit showing the bug effect: f88a83e

just affiliations needed to be changed
note: affiliations don't always match with what is shown on PDF

<author><first>Norah</first><last>Alshahrani</last><affiliation>University of Bisha</affiliation></author>
<author><first>Saied</first><last>Alshahrani</last><affiliation>ASAS AI</affiliation></author>
<author><first>Norah</first><last>Alshahrani</last><affiliation>ASAS AI</affiliation></author>
<author><first>Saied</first><last>Alshahrani</last><affiliation>University of Bisha</affiliation></author>

Issue of metadata correction: #6401
2025.arabicnlp-main.26
Paper page: https://aclanthology.org/2025.arabicnlp-main.26/
Commit showing the bug effect: db094a2

Only the affiliations needed to be changed.
Saied and Norah (same last name) were swapped.

<author><first>Mohamed</first><last>Samy</last><affiliation>Institute</affiliation></author>
<author><first>Mayar</first><last>Boghdady</last></author>
<author><first>Mohamed</first><last>Samy</last></author>
<author><first>Mayar</first><last>Boghdady</last><affiliation>Institute</affiliation></author>

Issue of metadata correction: #6335
2025.arabicnlp-sharedtasks.133
Paper page: https://aclanthology.org/2025.arabicnlp-sharedtasks.133/
Commit showing the bug effect: c8dfe7c

just affiliation (meaningless "NA")

<author><first>Marwan</first><last>El Adawi</last></author>
<author><first>Mohamed</first><last>Nassar</last></author>
<author><first>Ensaf Hussein</first><last>Mohamed</last></author>
<author><first>Ensaf</first><last>Hussein</last></author>

Noted another name inconsistency: the last author's name is Ensaf Hussein [according to the PDF](https://aclanthology.org/2025.arabicnlp-sharedtasks.133.pdf), but the metadata gives the last name as Mohamed. The metadata correction issue only updated authors_new but not the authors list itself. This author should probably also have name variants recorded: there are currently 3 author pages (Ensaf Hussein Mohamed, Ensaf Mohamed, Ensaf H. Mohamed) that could probably be merged, and several inconsistencies between metadata and PDF. I haven't seen an author page request for this author yet.

<author orcid="0000-0003-0701-0204"><first>Charitha</first><last>Rathnayake</last><affiliation>Massey University</affiliation></author>
<author><first>Surangika</first><last>Ranathunga</last><affiliation>University of Moratuwa</affiliation></author>
<author><first>Charitha</first><last>Rathnayake</last><affiliation>University of Moratuwa</affiliation></author>
<author orcid="0000-0003-0701-0204"><first>Surangika</first><last>Ranathunga</last><affiliation>Massey University</affiliation></author>

Issue of metadata correction: #6422
2025.emnlp-main.1435
Paper page: https://aclanthology.org/2025.emnlp-main.1435/
Commit showing the bug effect: 7ba600b

The bug affected affiliation and ORCID.
Check that the ORCID belongs to the correct person (Ranathunga): https://orcid.org/0000-0003-0701-0204

<author><first>Chaitali</first><last>Agarwal</last></author>
<author><first>Sudharshan</first><last>Govindan</last></author>
<author><first>Haw-Shiuan</first><last>Chang</last></author>
<author orcid="0000-0003-4607-936X"><first>Haw-Shiuan</first><last>Chang</last><affiliation>Department of Computer Science, University of Massachusetts at Amherst</affiliation></author>

Issue of metadata correction: #6394
2025.starsem-1.18
Paper page: https://aclanthology.org/2025.starsem-1.18/
Commit showing the bug effect: ec5e183

The bug affected ORCID and affiliation.
Confirmed that the ORCID is now correct for Haw-Shiuan Chang: https://orcid.org/0000-0003-4607-936X

<author><first>Sina</first><last>Ahmadi</last><affiliation>University of Zurich</affiliation></author>
<author><first>Anthony</first><last>Munthali</last></author>
<author><first>Jonathan Mingfei</first><last>Liu</last><affiliation>Google</affiliation></author>
<author><first>Jonathan</first><last>Eng</last><affiliation>Google</affiliation></author>

Issue of metadata correction: #6448
2025.wmt-1.85
Paper page: https://aclanthology.org/2025.wmt-1.85/
Commit showing the bug effect: 53420bc

Bug only affected affiliation

  • affiliations of two consecutive authors were swapped
  • affiliations were off by one
  • last two authors need to get back their affiliations from newly introduced authors before them
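
The effects listed above are what one would expect if affiliations were carried over by list position rather than by author. The actual cause inside the bulk-correction tooling is not described in this PR, so the following is only an illustrative sketch under that assumption; the function names and the placeholder authors and universities are hypothetical.

```python
def attach_by_position(new_order, old_affiliations):
    """Assumed buggy variant: reuse the old affiliation list slot by slot."""
    return {
        name: old_affiliations[i] if i < len(old_affiliations) else None
        for i, name in enumerate(new_order)
    }


def attach_by_name(new_order, old_records):
    """Safer variant: look up each author's own record, None for new authors."""
    return {name: old_records.get(name) for name in new_order}


# Placeholder data, not taken from any real paper.
old_records = {"Author A": "University X", "Author B": "University Y"}
old_affiliations = list(old_records.values())

# Two consecutive authors change places: their affiliations end up swapped.
swapped = attach_by_position(["Author B", "Author A"], old_affiliations)

# A new author is inserted in front: every affiliation is off by one, and the
# new author receives an affiliation that belongs to someone else.
shifted = attach_by_position(["Author N", "Author A", "Author B"], old_affiliations)

# Keying on the name keeps each affiliation with its author.
keyed = attach_by_name(["Author N", "Author A", "Author B"], old_records)

print(swapped)
print(shifted)
print(keyed)
```

Looking records up by name leaves a newly introduced author without an affiliation instead of handing them a neighbour's, which is the state the fixes in this PR restore by hand.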

@weissenh weissenh marked this pull request as ready for review December 3, 2025 12:33
@weissenh weissenh changed the title from "Fix bug reordering bulk corrections from 2025 11 14" to "Fix effects of reordering bug introduced in bulk metadata corrections from 2025-11-14" Dec 3, 2025
@weissenh weissenh requested a review from mbollmann December 3, 2025 12:34