- For a long time, I have heard of this paper and always wondered to know it properly.
- This repo is one way for me for that redemption.
- Read the paper, understand the maths.
- Correlate the knowledge with LSTMs and RNNs.
- Implement the vanilla architecture using pytorch.
- Train the model on the WMT2014 English to German Translation Task.
- Rinse and repeat on some more models that came after that.
- Start Date : 02/January/2022
- Will Complete by : 02/February/2022
- Paper : https://arxiv.org/pdf/1706.03762.pdf
- Data : https://huggingface.co/datasets/wmt14
- How to write Pytorch Code : https://nn.labml.ai/transformers/mha.html