@anthology-assist
Contributor

@anthology-assist anthology-assist commented Sep 15, 2025

(Please replace this text with a description of the changes effected by this pull request.
Include a link to the corresponding Github Issue, if there is one.
Details on how to do this (can be found here).)

  1. In the Github sidebar, add the PR to the current milestone
  2. In the Github sidebar, add the PR to the "Anthology Work Items" project
  3. In the Github sidebar, under "Development", link to the corresponding ingestion issue (if applicable)
  4. Make sure the branch is merged with the latest master branch
  5. Ensure that there are editors listed in the <meta> block
  6. For workshops, add a <venue>ws</venue> tag to its meta block
  7. For workshops, add a backlink from the main event's <event> block
  8. Add events to their relevant SIGs
  9. Look at the venue listing for prior years, and ensure that the new volume titles are consistent. You can do this by clicking on the venue name from a paper page, which will take you to the venue listing.
  10. Navigate to the event page preview (e.g., https://preview.aclanthology.org/icnlsp-ingestion/events/icnlsp-2021/), and page through, to see if there are any glaring mistakes
  11. Skim through the complete listing, looking for mis-parsed author names.
  12. Download the frontmatter and verify that the table of contents matches at least three randomly-selected papers
  13. Download 3–5 PDFs (including the first and last one) and make sure they are correct (title, authors, page numbers).
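
Items 12 and 13 above describe a spot check: a handful of papers, always including the first and the last. A minimal sketch of how such a sample could be drawn is below. The function name `pick_spot_check` and the paper IDs are hypothetical and are not part of the Anthology tooling.

```python
import random


def pick_spot_check(papers, k=5, seed=None):
    """Return up to k papers to inspect: the first, the last, and random others.

    Hypothetical helper; assumes `papers` is an ordered list of unique IDs.
    """
    if k < 2:
        raise ValueError("k must be at least 2 to cover the first and last paper")
    if len(papers) <= k:
        return list(papers)
    rng = random.Random(seed)
    middle = rng.sample(papers[1:-1], k - 2)
    chosen = {papers[0], papers[-1], *middle}
    # Preserve volume order so the check reads top to bottom.
    return [p for p in papers if p in chosen]


# Illustrative IDs only, following the pattern seen in the preview URLs.
papers = [f"2025.gwc-1.{i}" for i in range(1, 21)]
sample = pick_spot_check(papers, k=5, seed=0)
```

Whatever the sample, each selected PDF still has to be opened and compared by eye against its title, authors, and page numbers.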

After the PR is closed, for all events:

  • Archive the ingestion materials in format YYYY-MM-DD-{event}
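
As an illustration of the `YYYY-MM-DD-{event}` naming above, here is a small sketch. The helper name `archive_name` and the event label are assumptions, and the thread does not say which date should be used.

```python
from datetime import date


def archive_name(event, when):
    """Build an archive folder name in the YYYY-MM-DD-{event} format.

    Hypothetical helper; `when` is a datetime.date chosen by the caller.
    """
    return f"{when.isoformat()}-{event}"


# Example with an assumed event label and the merge date from this thread.
name = archive_name("gwc-2025", date(2025, 11, 4))  # "2025-11-04-gwc-2025"
```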

After the PR is closed, for ACL events:

  • Generate the DOIs for all volumes (generate_crossref_doi_metadata.py)
  • Upload to Crossref
  • Add the DOIs to the XML in a separate PR (add_dois.py)

@anthology-assist anthology-assist added this to the 2025Q3 milestone Sep 15, 2025
@anthology-assist anthology-assist linked an issue Sep 15, 2025 that may be closed by this pull request
2 tasks
@github-actions

github-actions bot commented Sep 15, 2025

Build successful. Some useful links:

This preview will be removed when the branch is merged.

Member

@mjpost mjpost left a comment

Papers are not lined up, e.g., https://preview.aclanthology.org/gwc-25-ingestion/2025.gwc-1.3.pdf

(You checked "Download 3–5 papers", did you not see this? I usually click over to the preview, command-click three to five papers to open them in new tabs, then open each paper's PDF for a quick sanity check.)

@EricaBiagetti

Hi, thank you for taking care of this! I wanted to ask if we could wait a while before publishing the website. The board of the Global WordNet Association mentioned the possibility of obtaining a DOI for the proceedings, and I would like to check with them if they want to talk with you about this.

@mjpost
Member

mjpost commented Sep 19, 2025

Yes, we can wait—the proceedings are not correct, anyway.

@EricaBiagetti

Thank you! What shall we correct? - Sorry for this, it's the first time my colleagues and I have prepared proceedings for ACL.

@mjpost
Member

mjpost commented Sep 19, 2025

@mcandito The papers are not lined up. See this paper, for example, from the preview: https://preview.aclanthology.org/gwc-25-ingestion/2025.gwc-1.3/

@mcandito
Contributor

Hi @mjpost, I think the comment is not for me??

@anthology-assist
Contributor Author

@EricaBiagetti I double-checked the submitted material, and the mismatch originates there.

In the pdf dir, GWC2025_2025_paper_3.pdf shows this paper: "Analysis of Anachronistic Lemmas and Semantic Fields in Ancient Greek WordNet", but in the bib dir, 2025.GWC2025-proceedings.3.bib shows this paper: "An Abstract Multilingual WordNet".

There's another problem with the submitted material. Could you follow the instructions here to make sure the file names in the pdf dir match the file names in the bib dir? Currently the PDFs are named like GWC2025_2025_paper_3.pdf and the bibs like 2025.GWC2025-proceedings.3.bib, but they should share the same prefix.

Let me know once the material is updated and I can re-ingest.
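
The naming problem described above can be caught before submission with a quick comparison of file stems. The sketch below is only an illustration: the function name `compare_stems` and the `pdf`/`bib` directory layout are assumptions based on this comment, not the official ingestion check.

```python
from pathlib import Path


def compare_stems(pdf_dir, bib_dir):
    """Return (pdf_only, bib_only): file stems found in one directory but not the other.

    Hypothetical helper; assumes flat directories of *.pdf and *.bib files.
    """
    pdf_stems = {p.stem for p in Path(pdf_dir).glob("*.pdf")}
    bib_stems = {p.stem for p in Path(bib_dir).glob("*.bib")}
    return sorted(pdf_stems - bib_stems), sorted(bib_stems - pdf_stems)


# Usage (paths are placeholders):
# pdf_only, bib_only = compare_stems("pdf", "bib")
# Both lists should be empty when every PDF has a bib file with the same name.
```

This only confirms that the names line up. It would not detect the first problem, where paper 3's PDF and bib entry describe different papers, so the titles still need a manual check.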

@EricaBiagetti

Thank you @mjpost! Sure, I'll let you know once the material is updated

@EricaBiagetti

Hello, we noticed some errors in the material, which my colleague Luca is fixing, so I will have to update the material again. Sorry for this!
Also, @mjpost, did you happen to see my email? The GWA board confirmed that they would like to obtain DOIs for papers in the proceedings, and we editors were wondering if there is something we should do to get them. Thanks!

@anthology-assist
Contributor Author

@EricaBiagetti Let us know once the material is updated.

@mjpost
Member

mjpost commented Oct 8, 2025

@anthology-assist Once this is ingested, please assign DOIs to all GWC volumes here and in the past.

@EricaBiagetti

Hi, here is the updated (and hopefully final) version of the material: https://drive.google.com/file/d/1APIn71S8IyPvzHtxybMQT2JwXLKkH6J-/view?usp=sharing
Thank you!

@fcbond

fcbond commented Nov 3, 2025

Hi,

could I check if there is anything else that needs to be done here? I think Erica has fixed the papers and metadata, and we have SIGLEX and SIGSEM endorsement, so we should get DOIs.

Francis

@mjpost mjpost modified the milestones: 2025Q3, 2025Q4 Nov 3, 2025
@mjpost
Member

mjpost commented Nov 4, 2025

Hi, please see the preview: https://preview.aclanthology.org/gwc-25-ingestion/volumes/2025.gwc-1/

The PDFs are still all out of sync with the metadata.

@bavagliladri

In the latest zip folder we sent with the proceedings, the PDFs are correctly aligned with the corresponding BibTeX files (each 2025.GWC-2025-proceedings.X.pdf matches the correct 2025.GWC-2025-proceedings.X.bib).
However, in the current preview, the first PDF shown does not correspond to 2025.GWC-2025-proceedings.1.pdf. Should we do something on our side to fix this?

@mjpost
Member

mjpost commented Nov 4, 2025

Okay, this was a combination of mistakes and oversights. The volume was set as proceedings, so the new files were generated as 2025.gwc-proceedings.*, and I didn't notice this. I've corrected it now and all should be well.

I will merge this in a few hours in a gap between meetings, and then will generate DOIs soon afterward.
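
The mix-up described in this comment (files generated as 2025.gwc-proceedings.* while the preview serves 2025.gwc-1.*) is the kind of thing a small name check could flag. The sketch below is an assumption-laden illustration: the pattern is inferred from the preview URLs in this thread, and `misnamed_files` is a hypothetical helper, not part of the actual ingestion scripts.

```python
import re

# Pattern inferred from names in this thread, e.g. 2025.gwc-1.3.pdf (an assumption).
ID_PATTERN = re.compile(r"^(\d{4})\.([a-z0-9]+)-([A-Za-z0-9]+)\.(\d+)\.pdf$")


def misnamed_files(filenames, expected_volume):
    """Return the file names whose volume segment differs from expected_volume."""
    bad = []
    for name in filenames:
        match = ID_PATTERN.match(name)
        # Names that do not fit the pattern at all are reported as well.
        if match is None or match.group(3) != expected_volume:
            bad.append(name)
    return bad


files = ["2025.gwc-1.3.pdf", "2025.gwc-proceedings.3.pdf"]
print(misnamed_files(files, "1"))  # ['2025.gwc-proceedings.3.pdf']
```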

@EricaBiagetti

Hi all! Some colleagues are asking if we know exactly when the proceedings will be out, since they are preparing an application and would like to add a reference to their paper. Do we know already, or is it better if I stay vague?

@mjpost mjpost merged commit 9756319 into master Nov 4, 2025
2 checks passed
@mjpost mjpost deleted the gwc-25-ingestion branch November 4, 2025 14:33
@mjpost
Member

mjpost commented Nov 4, 2025

I just merged it now. It should be live in ~30 minutes.

@fcbond

fcbond commented Nov 4, 2025

Thank you Matt and Erica!

I can see the correctly aligned proceedings at https://aclanthology.org/events/gwc-2025/

However, they still do not appear to have any DOIs, ...

@mjpost
Member

mjpost commented Nov 4, 2025

> I will merge this in a few hours in a gap between meetings, and then will generate DOIs soon afterward.

Yes, that is the next step.

@mjpost mjpost mentioned this pull request Nov 4, 2025
@mjpost
Member

mjpost commented Nov 5, 2025

I have added DOIs to all GWC conferences in the Anthology.

@EricaBiagetti

Fantastic, thank you Matt!

Development

Successfully merging this pull request may close these issues.

Ingestion Request: {GWC2025}

7 participants