
AnsImran/Transformer_from_Scratch_for_Text_Summarization


Note:

This version of the project handles around a thousand viewers per week, but it is not designed to scale beyond that.

If you want a highly scalable version, contact me at ansimran@protonmail.com.

The main goal here is to showcase Machine Learning skills, not full-stack AI development skills.

For an example of my work in full-stack AI development with scalability in mind, you can check this project: 👉 Production-ready self-corrective RAG.



tags: PyTorch, NumPy, pandas, Data Processing, Tokenization, Padding, Generator, Positional-Encoding, Padding-Mask, Look-Ahead-Mask, Encoder, Decoder, MultiHead-Self-Attention, Residual-Connections, Layer Normalization, Feed-Forward-Neural-Networks, Embedding-Layer, Dropout-Layer, Masked-MultiHead-Self-Attention-(Causal-Attention), Linear-Layer, Log-Softmax, Training-Loop, Epochs, Learning Rate, Batch-Size, Pad-Index, Loss-Function, Optimizer, Predictions, Gradients & Updating Weights.



Building a Transformer from Scratch for Text Summarization

Based on the Natural Language Processing Specialization by DeepLearning.ai

Course 4 – Week 2

📘 Full NLP Specialization GitHub repo: Natural Language Processing from Scratch


Results

(Result screenshots 1 and 2 are shown as images in the repository.)


1. Data Processing

  • Loading
  • Preprocessing
  • Tokenization
  • Padding
  • Generator
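
The padding and generator steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `PAD_ID`, the helper names, and the batch size are all assumptions.

```python
import torch

PAD_ID = 0  # assumed pad token id

def pad_batch(sequences, pad_id=PAD_ID):
    """Right-pad variable-length token-id lists to the longest one."""
    max_len = max(len(s) for s in sequences)
    return torch.tensor([s + [pad_id] * (max_len - len(s)) for s in sequences])

def batch_generator(token_lists, batch_size=2):
    """Yield padded (batch_size, max_len) tensors, one batch at a time."""
    for i in range(0, len(token_lists), batch_size):
        yield pad_batch(token_lists[i:i + batch_size])

batches = list(batch_generator([[5, 6, 7], [8, 9], [10]], batch_size=2))
```

Using a generator keeps only one padded batch in memory at a time, which matters when the article/summary corpus is large.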

2. Useful Functions

  • Positional Encoding
  • Padding Mask
  • Look Ahead Mask
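
The three helpers in this section can be sketched as below. The function names and mask conventions (boolean, `True` = attend) are illustrative assumptions, not the repository's actual API; the positional-encoding formula is the standard sinusoidal one from "Attention Is All You Need".

```python
import torch

def positional_encoding(max_len, d_model):
    """Sinusoidal positional encoding: sin on even dims, cos on odd dims."""
    pos = torch.arange(max_len, dtype=torch.float32).unsqueeze(1)
    div = torch.exp(torch.arange(0, d_model, 2, dtype=torch.float32)
                    * (-torch.log(torch.tensor(10000.0)) / d_model))
    pe = torch.zeros(max_len, d_model)
    pe[:, 0::2] = torch.sin(pos * div)
    pe[:, 1::2] = torch.cos(pos * div)
    return pe

def padding_mask(token_ids, pad_id=0):
    """True where attention is allowed (i.e. non-pad positions)."""
    return token_ids != pad_id

def look_ahead_mask(size):
    """Lower-triangular mask: position i may attend only to positions <= i."""
    return torch.tril(torch.ones(size, size, dtype=torch.bool))
```

The look-ahead (causal) mask is what prevents the decoder from peeking at future summary tokens during training.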

3. Transformer (Encoder-Decoder)

  • Encoder Layer
    • MultiHead Self-Attention
    • Residual Connection & Layer Normalization
    • Feed Forward Neural Network
    • Residual Connection & Layer Normalization
  • Full Encoder
    • Embedding Layer
    • Positional Encoding
    • Dropout Layer
    • Stacked Encoder Layers
  • Decoder Layer
    • Masked MultiHead Self-Attention (Causal Attention)
    • Residual Connection & Layer Normalization
    • MultiHead Cross-Attention
    • Residual Connection & Layer Normalization
    • Feed Forward Neural Network
    • Residual Connection & Layer Normalization
  • Full Decoder
    • Embedding Layer
    • Positional Encoding
    • Dropout Layer
    • Stacked Decoder Layers
  • Full Transformer
    • Encoder + Decoder + Linear Layer
    • Log Softmax
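
A condensed sketch of one encoder layer, assuming the standard Transformer arrangement (self-attention and feed-forward sublayers, each wrapped in a residual connection plus layer normalization). All sizes are illustrative, and `nn.MultiheadAttention` stands in for a hand-written attention module:

```python
import torch
import torch.nn as nn

class EncoderLayer(nn.Module):
    """One encoder layer: self-attention + FFN, each with residual + norm."""
    def __init__(self, d_model=32, n_heads=4, d_ff=64, dropout=0.1):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads,
                                          dropout=dropout, batch_first=True)
        self.ffn = nn.Sequential(nn.Linear(d_model, d_ff), nn.ReLU(),
                                 nn.Linear(d_ff, d_model))
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x, key_padding_mask=None):
        attn_out, _ = self.attn(x, x, x, key_padding_mask=key_padding_mask)
        x = self.norm1(x + attn_out)      # residual connection + normalization
        x = self.norm2(x + self.ffn(x))   # residual connection + normalization
        return x

layer = EncoderLayer()
out = layer(torch.zeros(2, 5, 32))        # (batch, seq_len, d_model)
```

The full encoder stacks several such layers after the embedding, positional-encoding, and dropout stages; a decoder layer adds a masked self-attention sublayer and a cross-attention sublayer over the encoder output.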

4. Training Loop

  • Epochs, Learning Rate, Batch Size and Pad-Index
  • Loss Function
  • Optimizer
  • Computing Loss
  • Predictions
  • Clearing Gradients
  • Updating Weights
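
The loop ingredients listed above can be sketched with a toy stand-in model. The model, sizes, and learning rate here are assumptions; only the loop structure (compute loss on log-probabilities, clear gradients, backpropagate, update weights, with the pad index ignored) mirrors the outline:

```python
import torch
import torch.nn as nn

VOCAB, PAD_ID = 50, 0  # assumed vocabulary size and pad index

# Toy stand-in for the transformer: ends in log-softmax, so NLLLoss applies.
model = nn.Sequential(nn.Embedding(VOCAB, 16), nn.Linear(16, VOCAB),
                      nn.LogSoftmax(dim=-1))
loss_fn = nn.NLLLoss(ignore_index=PAD_ID)   # pad tokens contribute no loss
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

inputs = torch.randint(1, VOCAB, (4, 7))    # (batch_size, seq_len)
targets = torch.randint(1, VOCAB, (4, 7))

losses = []
for epoch in range(3):
    log_probs = model(inputs)                              # predictions
    loss = loss_fn(log_probs.view(-1, VOCAB), targets.view(-1))
    optimizer.zero_grad()                                  # clear gradients
    loss.backward()                                        # compute gradients
    optimizer.step()                                       # update weights
    losses.append(loss.item())
```

Setting `ignore_index=PAD_ID` is what keeps padded positions from dragging the loss (and gradients) toward predicting the pad token.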

5. Inference

  • Next Word Prediction Function
  • Summarization Function
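
A sketch of the two functions above as greedy decoding, assuming `model(src, tgt)` returns log-probabilities of shape (batch, tgt_len, vocab). The SOS/EOS ids and the toy model are invented for illustration and are not the repository's actual interface:

```python
import torch

SOS_ID, EOS_ID = 1, 2  # assumed start/end-of-sequence token ids

def next_word(model, src, tgt_so_far):
    """Pick the most probable next token id (greedy decoding)."""
    log_probs = model(src, tgt_so_far)
    return log_probs[0, -1].argmax().item()

def summarize(model, src, max_len=20):
    """Repeatedly predict the next word until EOS or max_len."""
    tgt = [SOS_ID]
    for _ in range(max_len):
        tok = next_word(model, src, torch.tensor([tgt]))
        tgt.append(tok)
        if tok == EOS_ID:
            break
    return tgt

# Toy stand-in model that always prefers EOS, just to exercise the loop.
def toy_model(src, tgt):
    out = torch.zeros(1, tgt.shape[1], 10)
    out[:, :, EOS_ID] = 1.0
    return out

summary = summarize(toy_model, torch.zeros(1, 5))
```

Greedy argmax is the simplest decoding strategy; sampling or beam search would slot into `next_word` without changing the surrounding loop.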

6. Conclusion

  • Some Remarks on Results
