Status: Pending
Author: Jordan Hoffmann, Laurent Sifre, Oriol Vinyals, Sebastian Borgeaud
Topic: Large-Language-Models, Transformers
Category: Architecture, Optimization-No. of params, Pre-Training, Tips & Tricks
Conference: arXiv
Year: 2022
Link: https://arxiv.org/abs/2203.15556