This software incorporates materials licensed by third parties.
- Wiktionary. Category: English countable nouns. Available at https://en.wiktionary.org/wiki/Category:English_countable_nouns [2024-06-03] Wikimedia Foundation. 2022.
- Wiktionary. Category: English uncomparable adjectives. Available at https://en.wiktionary.org/wiki/Category:English_uncomparable_adjectives [2024-07-25] Wikimedia Foundation. 2022.
- Wiktionary. Category: English uncomparable adverbs. Available at https://en.wiktionary.org/wiki/Category:English_uncomparable_adverbs [2024-07-25] Wikimedia Foundation. 2022.
- Wiktionary. Category: English uncountable nouns. Available at https://en.wiktionary.org/wiki/Category:English_uncountable_nouns [2024-06-03] Wikimedia Foundation. 2022.
Creative Commons Attribution-ShareAlike License
- Page titles in the "English countable nouns" and "English uncountable nouns" categories were used to compile
res/noun.unc
. - Page titles in the "English uncomparable adjectives" category were used to compile
res/adj.ncmp
. - Page titles in the "English uncomparable adverbs" category were used to compile
res/adv.ncmp
.
WordNet: A Lexical Database for English v3.1. Available at https://wordnet.princeton.edu/download/current-version [2024-04-15] Princeton University. 2011.
WordNet Release 3.1
This software and database is being provided to you, the LICENSEE, by
Princeton University under the following license. By obtaining, using
and/or copying this software and database, you agree that you have
read, understood, and will comply with these terms and conditions.:
Permission to use, copy, modify and distribute this software and
database and its documentation for any purpose and without fee or
royalty is hereby granted, provided that you agree to comply with
the following copyright notice and statements, including the disclaimer,
and that the same appear on ALL copies of the software, database and
documentation, including modifications that you make for internal
use or for distribution.
WordNet 3.1 Copyright 2011 by Princeton University. All rights reserved.
THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON
UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR
IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON
UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANT-
ABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE
OF THE LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT
INFRINGE ANY THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR
OTHER RIGHTS.
The name of Princeton University or Princeton may not be used in
advertising or publicity pertaining to distribution of the software
and/or database. Title to copyright in this software, database and
any associated documentation shall at all times remain with
Princeton University and LICENSEE agrees to preserve same.
Project file | Source file |
---|---|
res/adj → embed/adj |
data.adj |
res/adv → embed/adv |
data.adv |
res/noun → embed/noun |
data.noun |
res/verb → embed/verb |
data.verb |
res/adj.irr , res/adj.suf |
adj.exc , adv.exc |
adj.irr
and adj.suf
were manually constructed using their respective source files. Automated formatting via scripts was applied to the remaining source files. Below is the summary of the procedure:
- scripts/res
- Remove license text from the beginning of the file.
- Extract words from the surrounding WordNet metadata.
- Remove single-letter words.
- Remove words containing apostrophes.
- Remove compound words.
- Remove words containing numbers.
- Remove entries consisting of multiple words.
- Remove proper nouns and adjectives derived from them.
- Remove parenthesized line content.
- If any irregular verb is missing from the main list, append it.
- Remove duplicate words.
- Remove unsuitable words. The lists of excluded words are available in res/filters directory.
- Change spelling of the selected words. To review the modifications, refer to res/misc/replacements.json.
- Sort word lists alphabetically.
- scripts/embed