This project implements LeCun et al.'s 1998 paper "Gradient-Based Learning Applied to Document Recognition" (LeNet-5). With minimal training epochs, the model reaches a macro F1 score of 0.96 on MNIST. Code and trained models are available.
Also listed on Papers with Code: https://paperswithcode.com/paper/gradient-based-learning-applied-to-document
- Custom implementations of:
  - A convolution layer class supporting sparse connections between channels and shared weights, making it more flexible than PyTorch's `nn.Conv2d`
  - Average pooling, the loss function, and the optimizer, built from PyTorch tensor operations
  - The maximum a posteriori (MAP) loss and the stochastic diagonal Levenberg-Marquardt (SDLM) optimizer
- Strictly follows the original paper's specifications for the architecture, hyperparameters, initialization, and even the stylized 7x12 bitmaps of the digits 0-9 used as fixed weights in the RBF output layer: https://ibb.co/d6ktzc0
- Trained on the 60,000-image MNIST training set and evaluated on the 10,000-image test set
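As a rough sketch of how a channel-wise sparse connection scheme (in the spirit of LeNet-5's C3 layer) can be expressed in PyTorch, here is a masked convolution; the `SparseConv2d` name and the small example mask are illustrative, not taken from this repository:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseConv2d(nn.Module):
    """Convolution where each output map connects only to a chosen
    subset of input maps. `connections` is a hypothetical boolean
    tensor of shape (out_channels, in_channels)."""
    def __init__(self, in_channels, out_channels, kernel_size, connections):
        super().__init__()
        self.weight = nn.Parameter(
            torch.randn(out_channels, in_channels, kernel_size, kernel_size) * 0.1)
        self.bias = nn.Parameter(torch.zeros(out_channels))
        # Buffer (not a parameter) so the mask moves with .to(device)
        self.register_buffer("mask", connections.float()[:, :, None, None])

    def forward(self, x):
        # Zero out weights of absent connections before convolving
        return F.conv2d(x, self.weight * self.mask, self.bias)

# Toy example: 2 output maps over 3 input maps, each seeing only 2 inputs
mask = torch.tensor([[1, 1, 0], [0, 1, 1]], dtype=torch.bool)
layer = SparseConv2d(3, 2, 5, mask)
out = layer(torch.randn(1, 3, 32, 32))
print(out.shape)  # torch.Size([1, 2, 28, 28])
```

Masking the weight tensor keeps the layer compatible with autograd: gradients for masked-out weights are zeroed by the same multiplication, so absent connections never train.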
Macro F1 score: 0.964

Per-digit F1 scores (all above 0.93):

| Digit | F1 Score |
|-------|----------|
| 0 | 0.980 |
| 1 | 0.990 |
| 2 | 0.960 |
| 3 | 0.955 |
| 4 | 0.979 |
| 5 | 0.979 |
| 6 | 0.955 |
| 7 | 0.955 |
| 8 | 0.938 |
| 9 | 0.950 |
Implementation details:
- Input: 32x32 grayscale images
- Architecture:
  - Conv 5x5 (6 maps) -> AvgPool 2x2
  - Conv 5x5 (16 maps, sparse connections) -> AvgPool 2x2
  - FC (120) -> FC (84) -> RBF output
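The RBF output stage can be sketched as below: each class unit computes the squared Euclidean distance between the 84-dimensional penultimate activation and a fixed +1/-1 prototype bitmap, and the smallest distance wins. The `RBFOutput` name and the random prototypes are illustrative assumptions; the actual implementation uses the paper's fixed digit bitmaps:

```python
import torch
import torch.nn as nn

class RBFOutput(nn.Module):
    """Euclidean RBF output layer in the style of LeNet-5.
    `prototypes` is a (num_classes, 84) tensor of +1/-1 values
    derived from stylized digit bitmaps; it is fixed, not trained."""
    def __init__(self, prototypes):
        super().__init__()
        self.register_buffer("prototypes", prototypes)

    def forward(self, x):
        # (batch, 1, 84) - (classes, 84) -> (batch, classes, 84),
        # then sum squared differences over the feature dimension
        return ((x.unsqueeze(1) - self.prototypes) ** 2).sum(dim=-1)

# Random +1/-1 prototypes stand in for the real digit bitmaps here
protos = torch.where(torch.rand(10, 84) > 0.5, 1.0, -1.0)
rbf = RBFOutput(protos)
dist = rbf(torch.randn(4, 84))
pred = dist.argmin(dim=1)  # smallest distance = predicted digit
print(dist.shape, pred.shape)  # torch.Size([4, 10]) torch.Size([4])
```

Note the outputs are distances, not probabilities, which is why the MAP loss from the paper (rather than plain cross-entropy) is the natural training criterion.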