2 changes: 1 addition & 1 deletion .zenodo.json
@@ -25,7 +25,7 @@
}
],
"upload_type": "dataset",
"description": "<p>Cite the source of the dataset as:</p>\n\n<blockquote>\n<p>Dunn, M. and Tresolid, T. (2021): IELex Data and Trees.</p>\n</blockquote>",
"description": "<p>Cite the source of the dataset as:</p>\n\n<blockquote>\n<p>Dunn, M. and Tresoldi, T. (2021): IELex Data and Trees. Zenodo: 10.5281/zenodo.5556801.</p>\n</blockquote>",
"license": {
"id": "CC-BY-4.0"
}
1 change: 0 additions & 1 deletion FORMS.md
@@ -8,7 +8,6 @@ The value-to-form processing is divided into two steps, implemented as methods:
- `FormSpec.clean`: Normalizes a form chunk.

These methods use the attributes of a `FormSpec` instance to configure their behaviour.

- `brackets`: `{'(': ')'}`
Pairs of strings that should be recognized as brackets, specified as `dict` mapping opening string to closing string
- `separators`: `(';', '/', ',')`
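
As a rough illustration of the two steps described in this hunk, the following sketch splits a raw value on the listed `separators` and then removes anything enclosed in the listed `brackets`. The function names `split_value`, `strip_brackets` and `value_to_forms` are hypothetical and are not the `FormSpec` API; the sketch also ignores the case of a separator occurring inside a bracketed span.

```python
import re

# Assumed defaults, copied from the attribute values quoted above.
BRACKETS = {"(": ")"}
SEPARATORS = (";", "/", ",")


def split_value(value, separators=SEPARATORS):
    """Split a raw value into non-empty chunks on any separator."""
    pattern = "|".join(re.escape(sep) for sep in separators)
    return [chunk.strip() for chunk in re.split(pattern, value) if chunk.strip()]


def strip_brackets(chunk, brackets=BRACKETS):
    """Remove bracketed spans, including the brackets themselves."""
    for opening, closing in brackets.items():
        pattern = re.escape(opening) + r".*?" + re.escape(closing)
        chunk = re.sub(pattern, "", chunk)
    return chunk.strip()


def value_to_forms(value):
    """Hypothetical helper: split first, then clean each chunk."""
    cleaned = (strip_brackets(chunk) for chunk in split_value(value))
    return [form for form in cleaned if form]


print(value_to_forms("hand (arm); manus / mano"))
```

A bracket-aware splitter would be needed for values such as `a (b, c)`, which this simplified version would cut at the comma.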
16 changes: 8 additions & 8 deletions README.md
@@ -4,7 +4,7 @@

If you use these data please cite
- the original source
> Dunn, M. and Tresolid, T. (2021): IELex Data and Trees.
> Dunn, M. and Tresoldi, T. (2021): IELex Data and Trees. Zenodo: 10.5281/zenodo.5556801.
- the derived dataset using the DOI of the [particular released version](../../releases/) you were using

## Description
@@ -27,20 +27,20 @@ This dataset does not contain the actual word forms, but rather only the cognate
![Concepticon: 100%](https://img.shields.io/badge/Concepticon-100%25-brightgreen.svg "Concepticon: 100%")
![Source: 100%](https://img.shields.io/badge/Source-100%25-brightgreen.svg "Source: 100%")

- **Varieties:** 94
- **Concepts:** 207
- **Lexemes:** 21,483
- **Varieties:** 148 (linked to 123 different Glottocodes)
- **Concepts:** 207 (linked to 207 different Concepticon concept sets)
- **Lexemes:** 33,922
- **Sources:** 1
- **Synonymy:** 1.15
- **Cognacy:** 21,483 cognates in 3,730 cognate sets (1,128 singletons)
- **Cognate Diversity:** 0.17
- **Synonymy:** 1.22
- **Cognacy:** 33,922 cognates in 3,908 cognate sets (603 singletons)
- **Cognate Diversity:** 0.11
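
The README does not say how Cognate Diversity is computed. One definition that reproduces both the old and the new reported value is (cognate sets − concepts) / (lexemes − concepts). The formula below is an assumption inferred from the numbers in this diff, and the function name is hypothetical.

```python
def cognate_diversity(cognate_sets, lexemes, concepts):
    """Assumed definition: cognate sets beyond one per concept, relative to lexemes beyond one per concept."""
    return (cognate_sets - concepts) / (lexemes - concepts)


# Figures taken from the old and new README statistics.
old = cognate_diversity(cognate_sets=3730, lexemes=21483, concepts=207)
new = cognate_diversity(cognate_sets=3908, lexemes=33922, concepts=207)

print(round(old, 2), round(new, 2))  # 0.17 0.11
```

Under this reading, the drop from 0.17 to 0.11 follows from the lexeme count growing much faster than the number of cognate sets.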

# Contributors

Name | GitHub user | Descriptin |Role
--- | --- | --- | ---
Johann-Mattis List | @LinguList | cldf conversion | Other
Dunn, Michael | | data collection | Author
Dunn, Michael | @evoling | data collection | Author
Tiago Tresoldi | @tresoldi | data preparation | Author


16 changes: 8 additions & 8 deletions cldf/README.md
@@ -8,13 +8,13 @@

property | value
--- | ---
[dc:bibliographicCitation](http://purl.org/dc/terms/bibliographicCitation) | Dunn, M. and Tresolid, T. (2021): IELex Data and Trees.
[dc:bibliographicCitation](http://purl.org/dc/terms/bibliographicCitation) | Dunn, M. and Tresoldi, T. (2021): IELex Data and Trees. Zenodo: 10.5281/zenodo.5556801.
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF Wordlist](http://cldf.clld.org/v1.0/terms.rdf#Wordlist)
[dc:identifier](http://purl.org/dc/terms/identifier) | https://doi.org/10.5281/zenodo.5556801
[dc:license](http://purl.org/dc/terms/license) | https://creativecommons.org/licenses/by/4.0/
[dcat:accessURL](http://www.w3.org/ns/dcat#accessURL) | https://github.com/lexibank/ielexfinal
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/lexibank/ielexfinal/tree/8cf2304">lexibank/ielexfinal 8cf2304</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v4.4">Glottolog v4.4</a></li><li><a href="https://github.com/concepticon/concepticon-data/tree/v2.5.0">Concepticon v2.5.0</a></li><li><a href="https://github.com/cldf-clts/clts//tree/b12a7df">CLTS v2.1.0-26-gb12a7df</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>lingpy-rcParams</strong>: <a href="./lingpy-rcParams.json">lingpy-rcParams.json</a></li><li><strong>python</strong>: 3.9.6</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/lexibank/ielexfinal/tree/v0.1">lexibank/ielexfinal v0.1</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v5.1">Glottolog v5.1</a></li><li><a href="https://github.com/concepticon/concepticon-data/tree/v3.4.0">Concepticon v3.4.0</a></li><li><a href="https://github.com/cldf-clts/clts/tree/824176a">CLTS v2.3.0-4-g824176a</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>lingpy-rcParams</strong>: <a href="./lingpy-rcParams.json">lingpy-rcParams.json</a></li><li><strong>python</strong>: 3.14.0</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[rdf:ID](http://www.w3.org/1999/02/22-rdf-syntax-ns#ID) | ielexfinal
[rdf:type](http://www.w3.org/1999/02/22-rdf-syntax-ns#type) | http://www.w3.org/ns/dcat#Distribution

@@ -33,7 +33,7 @@ This is the basis for creating rows in CLDF representations of the data by
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF FormTable](http://cldf.clld.org/v1.0/terms.rdf#FormTable)
[dc:extent](http://purl.org/dc/terms/extent) | 21483
[dc:extent](http://purl.org/dc/terms/extent) | 33922


### Columns
@@ -59,7 +59,7 @@ Name/Property | Datatype | Description
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF LanguageTable](http://cldf.clld.org/v1.0/terms.rdf#LanguageTable)
[dc:extent](http://purl.org/dc/terms/extent) | 94
[dc:extent](http://purl.org/dc/terms/extent) | 149


### Columns
@@ -72,8 +72,8 @@ Name/Property | Datatype | Description
`Glottolog_Name` | `string` |
[ISO639P3code](http://cldf.clld.org/v1.0/terms.rdf#iso639P3code) | `string` |
[Macroarea](http://cldf.clld.org/v1.0/terms.rdf#macroarea) | `string` |
[Latitude](http://cldf.clld.org/v1.0/terms.rdf#latitude) | `decimal` |
[Longitude](http://cldf.clld.org/v1.0/terms.rdf#longitude) | `decimal` |
[Latitude](http://cldf.clld.org/v1.0/terms.rdf#latitude) | `decimal`<br>&ge; -90<br>&le; 90 |
[Longitude](http://cldf.clld.org/v1.0/terms.rdf#longitude) | `decimal`<br>&ge; -180<br>&le; 180 |
`Family` | `string` |
`SubGroup` | `string` |

@@ -99,7 +99,7 @@ Name/Property | Datatype | Description
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF CognateTable](http://cldf.clld.org/v1.0/terms.rdf#CognateTable)
[dc:extent](http://purl.org/dc/terms/extent) | 21483
[dc:extent](http://purl.org/dc/terms/extent) | 33922


### Columns
27 changes: 12 additions & 15 deletions cldf/cldf-metadata.json
@@ -1,7 +1,7 @@
{
"@context": "http://www.w3.org/ns/csvw",
"aboutUrl": null,
"dc:bibliographicCitation": "Dunn, M. and Tresolid, T. (2021): IELex Data and Trees.",
"dc:bibliographicCitation": "Dunn, M. and Tresoldi, T. (2021): IELex Data and Trees. Zenodo: 10.5281/zenodo.5556801.",
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#Wordlist",
"dc:identifier": "https://doi.org/10.5281/zenodo.5556801",
"dc:isVersionOf": null,
@@ -14,25 +14,25 @@
{
"rdf:about": "https://github.com/lexibank/ielexfinal",
"rdf:type": "prov:Entity",
"dc:created": "8cf2304",
"dc:created": "v0.1",
"dc:title": "Repository"
},
{
"rdf:about": "https://github.com/glottolog/glottolog",
"rdf:type": "prov:Entity",
"dc:created": "v4.4",
"dc:created": "v5.1",
"dc:title": "Glottolog"
},
{
"rdf:about": "https://github.com/concepticon/concepticon-data",
"rdf:type": "prov:Entity",
"dc:created": "v2.5.0",
"dc:created": "v3.4.0",
"dc:title": "Concepticon"
},
{
"rdf:about": "https://github.com/cldf-clts/clts/",
"rdf:about": "https://github.com/cldf-clts/clts",
"rdf:type": "prov:Entity",
"dc:created": "v2.1.0-26-gb12a7df",
"dc:created": "v2.3.0-4-g824176a",
"dc:title": "CLTS"
}
],
@@ -43,7 +43,7 @@
},
{
"dc:title": "python",
"dc:description": "3.9.6"
"dc:description": "3.14.0"
},
{
"dc:title": "python-packages",
@@ -52,14 +52,11 @@
],
"rdf:ID": "ielexfinal",
"rdf:type": "http://www.w3.org/ns/dcat#Distribution",
"dialect": {
"commentPrefix": null
},
"tables": [
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#FormTable",
"dc:description": "\nRaw lexical data item as it can be pulled out of the original datasets.\n\nThis is the basis for creating rows in CLDF representations of the data by\n- splitting the lexical item into forms\n- cleaning the forms\n- potentially tokenizing the form\n",
"dc:extent": 21483,
"dc:extent": 33922,
"tableSchema": {
"columns": [
{
Expand Down Expand Up @@ -162,7 +159,7 @@
},
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#LanguageTable",
"dc:extent": 94,
"dc:extent": 149,
"tableSchema": {
"columns": [
{
@@ -178,7 +175,7 @@
{
"datatype": "string",
"propertyUrl": "http://cldf.clld.org/v1.0/terms.rdf#glottocode",
"valueUrl": "http://glottolog.org/resource/languoid/id/{glottolog_id}",
"valueUrl": "http://glottolog.org/resource/languoid/id/{Glottocode}",
"name": "Glottocode"
},
{
Expand Down Expand Up @@ -248,7 +245,7 @@
{
"datatype": "string",
"propertyUrl": "http://cldf.clld.org/v1.0/terms.rdf#concepticonReference",
"valueUrl": "http://concepticon.clld.org/parameters/{concepticon_id}",
"valueUrl": "http://concepticon.clld.org/parameters/{Concepticon_ID}",
"name": "Concepticon_ID"
},
{
@@ -264,7 +261,7 @@
},
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#CognateTable",
"dc:extent": 21483,
"dc:extent": 33922,
"tableSchema": {
"columns": [
{
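
The `valueUrl` changes in `cldf/cldf-metadata.json` replace template variables (`{glottolog_id}`, `{concepticon_id}`) with the names of the columns they belong to (`{Glottocode}`, `{Concepticon_ID}`). The sketch below shows why that matters if template variables are resolved against column names. The `expand_value_url` helper and the Glottocode `abcd1234` are made up for illustration, and the sketch is stricter than a real CSVW processor: it raises on an unknown variable, whereas RFC 6570 expands undefined variables to nothing.

```python
import re

OLD_TEMPLATE = "http://glottolog.org/resource/languoid/id/{glottolog_id}"
NEW_TEMPLATE = "http://glottolog.org/resource/languoid/id/{Glottocode}"


def expand_value_url(template, row):
    """Hypothetical helper: fill {name} placeholders from a row keyed by column name."""
    def substitute(match):
        name = match.group(1)
        if name not in row:
            raise KeyError(f"no column named {name!r}")
        return str(row[name])

    return re.sub(r"\{([^{}]+)\}", substitute, template)


# Made-up row; only the column names match the LanguageTable schema.
row = {"ID": "example", "Glottocode": "abcd1234"}

print(expand_value_url(NEW_TEMPLATE, row))

try:
    expand_value_url(OLD_TEMPLATE, row)
except KeyError as error:
    print("old template cannot be resolved:", error)
```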