Skip to content

Commit

Permalink
refactor(data): corpus improvements
Browse files Browse the repository at this point in the history
  • Loading branch information
Xunnamius committed Aug 26, 2021
1 parent 9d73a7e commit f3c7409
Show file tree
Hide file tree
Showing 2 changed files with 10 additions and 7 deletions.
2 changes: 1 addition & 1 deletion data/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ manually.
Latest corpus stats:

- 7318 total lines
- 5331 generated usernames
- 5331 unique usernames
- 100 unique memes

Movie titles used:
Expand Down
15 changes: 9 additions & 6 deletions data/generate-corpus.js
Original file line number Diff line number Diff line change
Expand Up @@ -132,9 +132,12 @@ fs.writeFileSync('corpus.json', JSON.stringify({ dialogs, usernames }));

console.log('> corpus generated successfully');
console.log(`
> latest corpus stats:
> ${numLines} total lines
> ${usernames.length} generated usernames
>
> movie titles used:
> ${filteredMovies.map((movie) => movie.title).join('\n> ')}`);
Latest corpus stats:
- ${numLines} total lines
- ${usernames.length} unique usernames
- 100 unique memes
Movie titles used:
- ${filteredMovies.map((movie) => movie.title).join('\n- ')}`);

0 comments on commit f3c7409

Please sign in to comment.