This repository has been archived by the owner on Nov 22, 2022. It is now read-only.

PyText v0.2.0

@Titousensei released this 15 Jun 00:08

Note: This release makes the new data handler API the default and deprecates Task and Model classes using the old data handler API. We recommend that you migrate your models to the new API as soon as possible. More details here: https://www.facebook.com/groups/pytext/permalink/1038962512978256/

New Stuff

  • Most tasks and models deprecated and replaced with better versions that use the new data handler API
  • Performance improvements in the metric reporter
  • Multilingual TSV Data Source
  • LabelSmoothedCrossEntropyLoss
  • Support for pretrained word embeddings in TokenTensorizer
  • Option to use pretrained embeddings
  • TorchScript export for document classification
  • Improved logging in the trainer
  • Performance measurement: reporting of tokens_per_second and updates_per_second
  • DocumentReader from DrQA implemented in PyText (StackedBidirectionalRNN)
  • Improved and updated documentation
  • SWA(SGD|ADAM) and Adagrad optimizers
  • Numerized data cached in memory
  • TorchScript BPE tokenization
  • CLI command to update configs
  • Gradient visualization with TensorBoard

Many bug fixes and code clean-ups
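
For readers unfamiliar with the idea behind the LabelSmoothedCrossEntropyLoss entry listed under New Stuff, here is a minimal pure-Python sketch of one common label smoothing formulation. It is not PyText's implementation and does not use the PyText API; the function names and the epsilon parameter are assumptions made for illustration. The target distribution puts (1 - epsilon) on the gold label and spreads epsilon uniformly over all classes.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_sum for x in logits]


def label_smoothed_cross_entropy(logits, target, epsilon=0.1):
    """Cross entropy against a smoothed target distribution.

    Hypothetical helper for illustration only: the smoothed target is
    (1 - epsilon) * one_hot(target) + epsilon / num_classes.
    """
    log_probs = log_softmax(logits)
    num_classes = len(logits)
    nll = -log_probs[target]                  # standard cross entropy term
    uniform = -sum(log_probs) / num_classes   # cross entropy against uniform
    return (1.0 - epsilon) * nll + epsilon * uniform


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.1]
    print(label_smoothed_cross_entropy(logits, target=0, epsilon=0.0))
    print(label_smoothed_cross_entropy(logits, target=0, epsilon=0.1))
```

With epsilon set to 0 the function reduces to ordinary cross entropy; larger values penalize over-confident predictions more heavily.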