Skip to content

AI NLP model to correct one of the most common error in German sentences: "das" vs. "dass".

License

Notifications You must be signed in to change notification settings

alexanderbluhm/dasordass

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

dasordass

AI NLP model to correct the usage of one of the most common errors in German sentences: "das" vs "dass".
The model architecture consists of a BERT model, pretrained exclusively on German language (HugginfaceModel), and a simple classifier layer on top. Docker image generated using cog and deployed to fly.io for inference (CPU, 2GB RAM).
Achieving 99.9% accuracy on the validation set.

Training

Trained for 10min on an RTX 2070 Super on 20k sentences from the alexanderbluhm/wiki_sentences_de_2k dataset. The dataset consists of sentences containing "das" or "dass" from the first 2,000 de wikipedia documents, split with spaCy.

Hyperparameters:

  • Steps: 1,500 (24,000 sentences, 2 epochs, 32 batch size)
  • Loss Function: Mean Squared Error
  • Optimizer: AdamW (default settings)
  • Learning Rate: 3e-5 with linear learning rate schedule
  • Warmup Steps: 100

Deployment

Adapted from: https://til.simonwillison.net/fly/fly-docker-registry

  • Create an empty fly.io application using flyctl launch
  • Build image using cog and tag it like registry.fly.io/your-app-name:unique-image-tag: cog build -t registry.fly.io/your-app-name:unique-tag-for-your-image
  • Run: flyctl auth docker
  • Push to the registry: docker push registry.fly.io/your-app-name:unique-image-tag
  • Deploy: flyctl deploy --image registry.fly.io/your-app-name:unique-image-tag
  • Adjust RAM amount in case of a memory error (2GB required for this model)

About

AI NLP model to correct one of the most common error in German sentences: "das" vs. "dass".

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages