
# TSAI Group Assignment

## Group Members

  1. Arjun Gupta
  2. Himanshu
  3. Aeshna Singh
  4. Palash Baranwal

## SESSION 10 - Transformers Review

## ASSIGNMENT

  1. Train the same code, but on different data. If you have n classes, your accuracy MUST be more than 4 * 100 / n (for example, with n = 400 classes, accuracy must exceed 1%).
  2. Submit the GitHub link that includes your notebook with training logs and a proper README file.

## DATASET USED

**ManyThings.org Dutch - English dataset**

Link: http://www.manythings.org/anki/ (tab-separated English - Dutch sentence pairs)
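As a rough sketch of how this data can be read (an illustrative assumption, not the repo's exact code): the ManyThings.org Anki-format files hold one pair per line, tab-separated, with the English sentence first, the Dutch translation second, and, in newer dumps, a third attribution column. The file name `nld.txt` below is an assumption based on the `nld-eng.zip` archive naming.

```python
# Illustrative sketch (not the notebook's exact code) for reading the
# ManyThings.org Anki-format file: english<TAB>dutch[<TAB>attribution].
def load_pairs(path="nld.txt"):  # file name is an assumption
    pairs = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip("\n").split("\t")
            if len(parts) >= 2:
                pairs.append((parts[0], parts[1]))  # (english, dutch)
    return pairs

pairs = load_pairs()
print(f"{len(pairs)} sentence pairs; first: {pairs[0]}")
```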


## DIAGRAMS

*(Architecture diagram images in the repository:)*

- Transformer
- Encoder
- Attention
- Decoder
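To make the Attention diagram concrete, here is a minimal PyTorch sketch of multi-head scaled dot-product attention, in the spirit of the referenced bentrevett notebook; the hyperparameter defaults are illustrative assumptions, not this repo's exact values.

```python
import torch
import torch.nn as nn

class MultiHeadAttention(nn.Module):
    """Sketch of the multi-head attention block from the diagrams.
    Defaults (hid_dim=256, n_heads=8) are illustrative assumptions."""
    def __init__(self, hid_dim=256, n_heads=8, dropout=0.1):
        super().__init__()
        assert hid_dim % n_heads == 0
        self.n_heads = n_heads
        self.head_dim = hid_dim // n_heads
        self.fc_q = nn.Linear(hid_dim, hid_dim)
        self.fc_k = nn.Linear(hid_dim, hid_dim)
        self.fc_v = nn.Linear(hid_dim, hid_dim)
        self.fc_o = nn.Linear(hid_dim, hid_dim)
        self.dropout = nn.Dropout(dropout)
        self.scale = self.head_dim ** 0.5  # sqrt(d_k) from the paper

    def forward(self, query, key, value, mask=None):
        B = query.shape[0]
        # Project, then split into heads: [B, n_heads, seq_len, head_dim].
        Q = self.fc_q(query).view(B, -1, self.n_heads, self.head_dim).transpose(1, 2)
        K = self.fc_k(key).view(B, -1, self.n_heads, self.head_dim).transpose(1, 2)
        V = self.fc_v(value).view(B, -1, self.n_heads, self.head_dim).transpose(1, 2)
        # Scaled dot-product attention: softmax(QK^T / sqrt(d_k)) V.
        energy = torch.matmul(Q, K.transpose(-2, -1)) / self.scale
        if mask is not None:
            energy = energy.masked_fill(mask == 0, float("-inf"))
        attention = torch.softmax(energy, dim=-1)
        x = torch.matmul(self.dropout(attention), V)
        # Merge heads back to [B, seq_len, hid_dim] and project out.
        x = x.transpose(1, 2).contiguous().view(B, -1, self.n_heads * self.head_dim)
        return self.fc_o(x), attention

# Self-attention usage: the same tensor serves as query, key, and value.
# x = torch.randn(2, 10, 256); out, attn = MultiHeadAttention()(x, x, x)
```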


## SCREENSHOTS

### TRAINING LOGS

*(screenshot: training logs)*
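The logs above come from a standard epoch loop. A hedged sketch of what such a loop looks like, following the referenced notebook's pattern; the `model(src, trg[:, :-1])` interface and the `(src, trg)` batch layout are assumptions from that notebook, not necessarily this repo's exact code.

```python
import torch

def train_epoch(model, iterator, optimizer, criterion, clip=1.0):
    # One training epoch. The decoder is fed trg[:, :-1] (teacher forcing)
    # and the loss compares predictions against the shifted target trg[:, 1:].
    # criterion is assumed to be nn.CrossEntropyLoss(ignore_index=pad_idx).
    model.train()
    epoch_loss = 0
    for src, trg in iterator:  # batches of token-id tensors [B, seq_len]
        optimizer.zero_grad()
        output, _ = model(src, trg[:, :-1])  # [B, trg_len - 1, vocab]
        output_dim = output.shape[-1]
        loss = criterion(output.reshape(-1, output_dim),
                         trg[:, 1:].reshape(-1))
        loss.backward()
        torch.nn.utils.clip_grad_norm_(model.parameters(), clip)  # stabilize
        optimizer.step()
        epoch_loss += loss.item()
    return epoch_loss / len(iterator)
```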

### EVALUATION OUTPUT

*(screenshot: evaluation output)*
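Similarly, a hedged sketch of the evaluation pass that produces a validation loss and perplexity (same interface assumptions as the training sketch above):

```python
import math
import torch

def evaluate(model, iterator, criterion):
    model.eval()
    epoch_loss = 0
    with torch.no_grad():  # no gradients needed for evaluation
        for src, trg in iterator:
            output, _ = model(src, trg[:, :-1])
            output_dim = output.shape[-1]
            loss = criterion(output.reshape(-1, output_dim),
                             trg[:, 1:].reshape(-1))
            epoch_loss += loss.item()
    return epoch_loss / len(iterator)

# Perplexity is the exponential of the mean cross-entropy loss:
# print(f"Val. Loss: {val_loss:.3f} | Val. PPL: {math.exp(val_loss):7.3f}")
```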

### TRANSLATION OUTPUT

*(screenshot: sample translations)*
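Sample translations like these are produced by greedy decoding. Below is a sketch in the spirit of the referenced notebook's `translate_sentence`; the vocabulary lookups, special tokens, and the `model.encoder` / `model.decoder` / mask-building interface are assumptions carried over from that notebook, not a verbatim copy of this repo's code.

```python
import torch

def translate_sentence(tokens, src_vocab, trg_vocab, inv_trg_vocab,
                       model, device, max_len=50):
    # src_vocab/trg_vocab: dicts token -> id; inv_trg_vocab: id -> token.
    model.eval()
    src_ids = ([src_vocab["<sos>"]]
               + [src_vocab.get(t, src_vocab["<unk>"]) for t in tokens]
               + [src_vocab["<eos>"]])
    src = torch.tensor([src_ids], device=device)
    src_mask = model.make_src_mask(src)
    with torch.no_grad():
        enc_src = model.encoder(src, src_mask)  # encode the source once
    trg_ids = [trg_vocab["<sos>"]]
    for _ in range(max_len):
        trg = torch.tensor([trg_ids], device=device)
        trg_mask = model.make_trg_mask(trg)
        with torch.no_grad():
            output, attention = model.decoder(trg, enc_src, trg_mask, src_mask)
        pred = output.argmax(-1)[:, -1].item()  # greedy: most likely next token
        trg_ids.append(pred)
        if pred == trg_vocab["<eos>"]:
            break
    return [inv_trg_vocab[i] for i in trg_ids[1:]], attention
```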

## REFERENCES

  1. Attention is All You Need (notebook): https://github.com/ammesatyajit/pytorch-seq2seq/blob/master/6%20-%20Attention%20is%20All%20You%20Need.ipynb
  2. Paper: Attention Is All You Need: https://arxiv.org/pdf/1706.03762.pdf
  3. The Illustrated Transformer: https://jalammar.github.io/illustrated-transformer/
  4. What Do Position Embeddings Learn?: https://arxiv.org/pdf/2010.04903.pdf
  5. Attention is All You Need (bentrevett notebook): https://github.com/bentrevett/pytorch-seq2seq/blob/master/6%20-%20Attention%20is%20All%20You%20Need.ipynb