In this repo you can find the Tiny Shakespeare dataset (input.txt) and 17 files, each representing a step in training a GPT-2 model from scratch. Files train_get2-1 through train_get2-8 set up the training code. Files train_get2-9-speedup-1 through speedup-9 implement different techniques to speed up the training process.
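As a rough, hypothetical illustration of the kind of techniques such speedup steps typically involve (the repo's actual speedup files may use different ones), here is a minimal PyTorch sketch enabling TF32 matmuls, bfloat16 mixed precision, and torch.compile:

```python
# Hypothetical speedup sketch; not the repo's exact code.
import torch
import torch.nn as nn

torch.set_float32_matmul_precision("high")  # allow TF32 matmuls on Ampere+ GPUs

device = "cuda" if torch.cuda.is_available() else "cpu"

# Stand-in transformer block; the repo trains a full GPT-2 instead.
model = nn.TransformerEncoderLayer(d_model=768, nhead=12, batch_first=True).to(device)
model = torch.compile(model)  # PyTorch 2.x graph compilation to cut Python overhead

x = torch.randn(8, 1024, 768, device=device)
with torch.autocast(device_type=device, dtype=torch.bfloat16):  # mixed precision
    y = model(x)
print(y.shape)
```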
Target - loss less than 0.099
- Epochs - 6000
- Batch size - 8
- Tokens per sequence - 1024
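To make these settings concrete, below is a minimal, self-contained training-loop sketch using the hyperparameters above; the TinyLM stand-in model and the random get_batch() are placeholders for illustration only, not the GPT-2 code in the train_get2-* files.

```python
# Illustrative training loop with the hyperparameters listed above (placeholder model/data).
import torch
import torch.nn as nn
import torch.nn.functional as F

epochs = 6000       # training steps, as listed above
batch_size = 8      # sequences per step
block_size = 1024   # tokens per sequence
vocab_size = 50257  # GPT-2 BPE vocabulary size

class TinyLM(nn.Module):
    """Tiny stand-in for the GPT-2 model built in the train_get2-* files."""
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, 128)
        self.head = nn.Linear(128, vocab_size)

    def forward(self, idx):
        return self.head(self.emb(idx))  # (B, T, vocab)

def get_batch():
    # Placeholder random batch; the repo tokenizes input.txt instead.
    x = torch.randint(0, vocab_size, (batch_size, block_size))
    y = torch.roll(x, shifts=-1, dims=1)  # next-token targets (placeholder)
    return x, y

model = TinyLM()
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

for step in range(epochs):
    x, y = get_batch()
    logits = model(x)
    loss = F.cross_entropy(logits.view(-1, vocab_size), y.view(-1))
    optimizer.zero_grad(set_to_none=True)
    loss.backward()
    optimizer.step()
```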
As seen in the image below, after running for 6000 epochs with a batch size of 8 and 1024 tokens per sequence, the loss drops to 0.06.

Link to app - https://huggingface.co/spaces/HimankJ/GPT2_CustomTrained
