Create scrape of terms from Vocabulary #45

Open
3 of 6 tasks
alliyya opened this issue Jul 13, 2022 · 0 comments
Labels
  • Conversion: LINCS This is related to the conversion process using CIDOC-CRM and the CWRC vocabularies. (Main Branch)
  • priority:routine
  • project:bibliography extraction related to extraction of bibliography entries
  • project:biography extraction related to extraction of biography entries
  • project:writing extraction related to extraction of writing entries


alliyya commented Jul 13, 2022

Update the scrape scripts to extract terms from https://gitlab.com/calincs/infrastructure/vocabularies

CSVs to update (in order of priority):

Scripted

  • Occupations
  • Culturalforms
  • Genre (there are two files in the data folder; eliminate the irrelevant one)
  • Education (it looks like only degrees are scraped; school types may need to be scraped as well)

Manual (these could possibly be scripted, although Family isn't likely to change)

  • Family
  • Context

This is currently blocked by ongoing work on the vocabulary. However, the scripts could be tested against the current version and rerun when the vocabulary is ready.
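
A minimal sketch of how a rerun could be checked against an existing CSV, assuming each scrape produces (uri, label) pairs. The column names `uri` and `label` and the three helper functions are hypothetical and are not taken from the existing scripts; the actual CSV layouts may differ.

```python
import csv
import io


def write_terms_csv(terms, out):
    """Write (uri, label) pairs to a CSV stream, deduplicated and sorted by URI."""
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["uri", "label"])  # hypothetical column names
    for uri, label in sorted(set(terms)):
        writer.writerow([uri, label])


def read_terms_csv(src):
    """Read a CSV stream written by write_terms_csv into a {uri: label} dict."""
    return {row["uri"]: row["label"] for row in csv.DictReader(src)}


def diff_terms(old, new):
    """Compare two {uri: label} dicts; return (added, removed, relabelled) URI lists."""
    added = sorted(new.keys() - old.keys())
    removed = sorted(old.keys() - new.keys())
    relabelled = sorted(u for u in old.keys() & new.keys() if old[u] != new[u])
    return added, removed, relabelled
```

Writing the rows sorted and deduplicated keeps the CSV stable between runs, so a rerun after the vocabulary changes only shows real differences. `diff_terms` could then be used to list which terms were added, removed, or relabelled before the old CSV is overwritten.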
