You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
⚠️ Disclaimer: Checkpoints are based on training with publicly available
datasets. Some datasets contain limitations, including non-commercial use
limitations. Please review the terms and conditions made available by third parties
before using the datasets provided. Checkpoints are licensed under
Apache 2.0.
⚠️ Disclaimer: Datasets hyperlinked from this page are not owned or distributed
by Google. Such datasets are made available by third parties. Please review the
terms and conditions made available by the third parties before using the data.
TF-Vision modeling library for computer vision provides a collection of
baselines and checkpoints for image classification, object detection, and
segmentation.
The COCO Consortium does not own the copyright of the images
corresponding to the annotations. The images are
made available by Flickr under
various Creative Commons licenses, and users of the images accept full
responsibility for the use of the dataset.
Training details:
Models finetuned from ImageNet pretrained
checkpoints adopt the 12 or 36 epochs schedule. Models trained from
scratch adopt the 350 epochs schedule.
The default training data augmentation implements horizontal flipping
and scale jittering with a random scale between [0.5, 2.0].
Unless noted, all models are trained with l2 weight regularization and
ReLU activation.
We use batch size 256 and stepwise learning rate that decays at the last
30 and 10 epoch.
We use square image as input by resizing the long side of an image to
the target size then padding the short side with zeros.