Deformable DETR - CSE deep learning project

Seth Vanderwilt, Zach Wilson, Zack Barnes, Richard Park

Example prediction output from our model, filtered by confidence threshold of 0.25
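
For reference, thresholding detections by confidence can be sketched as below. This is only an illustration, not the code used to produce the image: the prediction format (a list of dicts with `score`, `label` and `box` keys) and the function name `filter_by_confidence` are assumptions made for this example.

```python
def filter_by_confidence(predictions, threshold=0.25):
    """Keep only predictions whose score is at least `threshold`.

    `predictions` is assumed (hypothetically) to be a list of dicts,
    each with a float "score" in [0, 1], a "label" and a "box".
    """
    return [p for p in predictions if p["score"] >= threshold]


# Made-up predictions, only to show the call.
example_predictions = [
    {"label": "person", "score": 0.91, "box": [10, 20, 50, 80]},
    {"label": "person", "score": 0.12, "box": [60, 25, 30, 70]},
    {"label": "dog", "score": 0.25, "box": [5, 5, 40, 40]},
]
kept = filter_by_confidence(example_predictions, threshold=0.25)
```

With the made-up data above, the 0.12 box is dropped and the other two are kept (the comparison is inclusive, so a score of exactly 0.25 passes).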

`ProjectSite.md` has more information than this stale README - please go there instead

Old stuff below this line

Paper here: https://arxiv.org/abs/2010.04159

What is this?

We want to try out a new object detector, Deformable DETR, on the COCO dataset (or a subset of it).
Once we can successfully instantiate and train a model, we will modify the internals, hyperparameters, etc. and run some experiments.

Installation

We keep Deformable-DETR in this repo as a regular folder because we are modifying it.
Follow the Deformable DETR installation instructions to set up a conda environment.
If we add any other dependencies, install them with `pip install -r requirements.txt`.
[detectron2 - TBD] If the Deformable DETR training script works for us, we may just use it and forget about detectron2, or pull in only some of detectron2's useful functions.

Dataset

Whoever is able to do a full training run should download the COCO 2017 train/val images and annotations (about 20GB total) from https://cocodataset.org/#download. Start by downloading just the train2017 annotations.
Follow the Deformable DETR dataset instructions to get the right directory structure within Deformable-DETR/data/coco.
Run the `make_coco_subset.py` script to shrink the `instances_train2017.json` file down to a few images and only "person" boxes. You may first have to rename the original file to `instances_train2017_orig.json`: we want to keep the original dataset around, but the training script needs the `instances_train2017.json` filename.
Run `visualize_dataset.py` to check that the boxes show up correctly. This script could probably be improved!
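
The subsetting step can be sketched roughly as follows. This is not the contents of `make_coco_subset.py`, just a minimal illustration that assumes the standard COCO annotation layout (top-level `images`, `annotations` and `categories` lists); the function names and the `max_images` parameter are made up for this example.

```python
import json
from pathlib import Path


def make_person_subset(coco, max_images=10):
    """Return a new COCO-style dict with only "person" boxes on a few images."""
    # Look up the category id by name instead of hard-coding it.
    person_ids = {c["id"] for c in coco["categories"] if c["name"] == "person"}

    # Keep only annotations that belong to the "person" category.
    person_anns = [a for a in coco["annotations"] if a["category_id"] in person_ids]

    # Take the first `max_images` images that contain at least one person box.
    ids_with_person = {a["image_id"] for a in person_anns}
    kept_images = [im for im in coco["images"] if im["id"] in ids_with_person]
    kept_images = kept_images[:max_images]
    kept_ids = {im["id"] for im in kept_images}

    # Copy any other top-level fields (e.g. "info", "licenses") unchanged.
    skip = ("images", "annotations", "categories")
    subset = {k: v for k, v in coco.items() if k not in skip}
    subset["images"] = kept_images
    subset["annotations"] = [a for a in person_anns if a["image_id"] in kept_ids]
    subset["categories"] = [c for c in coco["categories"] if c["id"] in person_ids]
    return subset


def write_subset(orig_path, out_path, max_images=10):
    """Read the original annotation file and write the shrunken one."""
    coco = json.loads(Path(orig_path).read_text())
    Path(out_path).write_text(json.dumps(make_person_subset(coco, max_images)))
```

Under these assumptions, `write_subset("instances_train2017_orig.json", "instances_train2017.json")` would leave the original file untouched and write the small subset under the filename the training script expects.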

Training

See https://github.com/fundamentalvision/Deformable-DETR#training for the upstream training instructions. We just run `main.py` directly, e.g.

python -u main.py --wb_name "batchsize20_resnet18_3encs_3decs" \
--wb_notes "Try batch size of 20 for frozen resnet18 with 3 layers of transformer encoders + 3 layers of decoders" \
--lr_backbone 0 --backbone resnet18 --enc_layers 3 --dec_layers 3 \
--batch_size 20 \
--num_feature_levels 1 --output_dir output_r18

or something like that

We are using a Weights & Biases project and have edited the code to log losses, metrics, etc. there. We are not sure we want to upload all the checkpoints. There may also be some COCO evaluation output (detection boxes for precision and recall) in the eval folder.
