This repository is borne out of a need for a singular database of languages mapped to their ISO 639-1 and ISO 639-3 codes with their native name ("autonym") listed against their tags.
While the data has initially been generated from standard sources, contributions are welcomed for any overly-anglicised names (those not using the native character set of the language) or those languages with name variants that have not been included.
tag3
and tag1
columns represent the ISO 639-3 and -1 tags respectively. The name
field is the "most recognisable" form of the language name, typically in English, to be used as a fallback where an autonym is not available.
The autonym
field is the name of the language in that language. If this field is blank, it means that there is no confirmed autonym for this language in this database and you may use the name
field as a fallback.
An autonym
being blank does not necessarily indicate that the name
field represents the autonym. In the case where they are the same, they should be listed in both columns.
Here are the currently utilised sources by this repository:
cldr
- Unicode Common Locale Data Repositoryethnologue
- Ethnologue, only autonyms where specified.iso639-3
- ISO 639-3, reference language names only.github
- This repository
Databases in many countries do not attract intellectual property rights, and where they do it, they very rarely attract copyright due to the raw and inexpressive nature of the data. However, to alleviate doubt, this data is being published by a resident of Sweden where sui generis database rights do not apply to non-EU datasets. CLDR and Ethnologue are both datasets published in the US, where database rights also do not apply.
However, for those who have annoying and immovable legal teams and want to use this in a product, the dataset is licensed under the Creative Commons Zero 1.0 license.
Any contributions to this repository are on the condition that the contributor relinquishes all database rights, copyrights and all other intellectual property rights or claims to the contribution.
Mark the source as github
in the data.