Implementation of a Vanilla Transformer from scratch ("Attention Is All You Need")
-
Embeddings: represent text as numerical vectors in a lower-dimensional space; the token-to-vector mapping is a learnable parameter.
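A minimal sketch of a learnable embedding layer, assuming PyTorch as the framework; the class name and constructor parameters here are illustrative, and the sqrt(d_model) scaling follows Section 3.4 of the paper.

```python
import math
import torch.nn as nn

class InputEmbeddings(nn.Module):
    """Maps token ids to dense vectors; the weights are learned during training."""
    def __init__(self, d_model: int, vocab_size: int):
        super().__init__()
        self.d_model = d_model
        self.embedding = nn.Embedding(vocab_size, d_model)  # learnable lookup table

    def forward(self, x):
        # Scale embeddings by sqrt(d_model), as in "Attention Is All You Need" (Sec. 3.4)
        return self.embedding(x) * math.sqrt(self.d_model)
```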
-
Positional Embedding: conveys positional information to the model. The sine function fills the even dimensions of the encoding and the cosine function fills the odd dimensions: PE(pos, 2i) = sin(pos / 10000^(2i/d_model)), PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model)). The encoding is fixed: a non-learnable parameter.
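A minimal sketch of the fixed sinusoidal encoding, again assuming PyTorch; the class name and max_len default are illustrative. register_buffer keeps the table on the module without making it trainable, matching the non-learnable property above.

```python
import math
import torch
import torch.nn as nn

class PositionalEncoding(nn.Module):
    """Fixed (non-learnable) sinusoidal positional encoding."""
    def __init__(self, d_model: int, max_len: int = 5000):
        super().__init__()
        pe = torch.zeros(max_len, d_model)
        position = torch.arange(0, max_len, dtype=torch.float).unsqueeze(1)
        # 1 / 10000^(2i / d_model) for each even dimension index 2i
        div_term = torch.exp(torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model))
        pe[:, 0::2] = torch.sin(position * div_term)  # sine at even indices
        pe[:, 1::2] = torch.cos(position * div_term)  # cosine at odd indices
        self.register_buffer("pe", pe.unsqueeze(0))   # stored, but never trained

    def forward(self, x):
        # x: (batch, seq_len, d_model); add the encoding for each position
        return x + self.pe[:, : x.size(1)]
```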
-
Attention Mechanism: each output is a weighted sum of the values V, where the weights come from the similarity of queries Q and keys K: Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V. The Q, K, V projections are learnable parameters.
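A minimal sketch of scaled dot-product attention as defined in the paper; the function name and the mask convention (0 marks masked positions) are assumptions.

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    """Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V."""
    d_k = q.size(-1)
    # Similarity of every query with every key, scaled to keep softmax gradients stable
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)  # (..., seq_q, seq_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))  # block masked positions
    weights = torch.softmax(scores, dim=-1)  # attention weights sum to 1 per query
    return weights @ v                       # weighted sum of values
```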
-
Resources:
- Umar Jamil: Transformers from scratch
- Analytics Vidhya: Transformers from scratch
- Stanford Online: Transformers United Series
- Andrej Karpathy: Neural Networks: Zero to Hero (Let's build GPT)