2D Object Detection Benchmark Overview (from KITTI)

Read this first: https://medium.com/@sshleifer/how-to-finetune-tensorflows-object-detection-models-on-kitti-self-driving-dataset-c8fcfe3258e9

THIS PROJECT IS NO LONGER BEING MANTAINED. The Official Object Detection repo now supports kitti! Goal: Glue between tensorflow objection detection models and kitti-2d object detection data Status: probably won't work out of the box but will save you time vs googling all over

scripts to fetch convert the kitti 2D objection detection data to TFRecords
all my code is in the object_detection/ directory
more on how stuff works can be found at

Incomplete List of Dependencies:

- Download pretrained faster-rcnn https://medium.com/r/?url=http%3A%2F%2Fdownload.tensorflow.org%2Fmodels%2Fobject_detection%2Ffaster_rcnn_inception_resnet_v2_atrous_coco_11_06_2017.tar.gz
- tensorflow
- a GPU
- have not tested on anything besides Ubuntu 16.05

Instructions after cloning

cd object_detection
./fetch_kitti.sh  # uncomment python create_dataset.py, or run separately if you get into trouble
./train_rcnn.sh
# open a separate shell and run
./eval.sh rcnn_logs samples/configs/faster_rcnn_inception_resnet_v2_atrous_kitti.config# open yet a third shell and run
tensorboard --logdir rcnn_logs
# go to sleep...in the morning,
./freeze.sh samples/configs/faster_rcnn_inception_resnet_v2_atrous_kitti.config faster_rcnn_logs/model.ckpt-431399  faster_rcnn_frozen
jupyter notebook
# find kitti_inference.ipynb and try to figure out what is going on

References

object_detection/vod_converter is shamelessly stolen from github.com/nghiattran/vod-converter with a few modifications
tensorflow object detection: https://github.com/tensorflow/models/tree/master/object_detection

2D Object Detection Benchmark Overview (from KITTI)

The goal in the 2D object detection task is to train object detectors for the classes 'Car', 'Pedestrian', and 'Cyclist'. The object detectors must provide as output the 2D 0-based bounding box in the image using the format specified above, as well as a detection score, indicating the confidence in the detection. All other values must be set to their default values (=invalid), see above. One text file per image must be provided in a zip archive, where each file can contain many detections, depending on the number of objects per image. In our evaluation we only evaluate detections/ objects larger than 25 pixel (height) in the image and do not count 'Van' as false positives for 'Car' or 'Sitting Person' as false positive for 'Pedestrian' due to their similarity in appearance. As evaluation criterion we follow PASCAL and require the intersection-over-union of bounding boxes to be larger than 50% for an object to be detected correctly.


Validation Results (794 valid images, 6900 train images)

Faster-RCNN
========================================================
Category          [email protected]
car               0.959948
cyclist           0.846211
dontcare          0.339320
misc              0.844625
pedestrian        0.792805
person_sitting    0.670089
tram              0.940657
truck             0.943405
van               0.936856
Total             0.808213


SSD Mobilenet
===========================
Category          [email protected]
car               0.723661
cyclist           0.390498
dontcare          0.073786
misc              0.493499
pedestrian        0.257245
person_sitting    0.573592
tram              0.800318
truck             0.641025
van               0.579114
Total             0.503638


Final Total Loss
================

rcnn    0.474066  43 hours   2.90 steps per second
ssd     2.778544  127 hours  1.15 steps per second

Name		Name	Last commit message	Last commit date
Latest commit History 1,295 Commits
adv_imagenet_models		adv_imagenet_models
adversarial_crypto		adversarial_crypto
adversarial_text		adversarial_text
attention_ocr		attention_ocr
audioset		audioset
autoencoder		autoencoder
cognitive_mapping_and_planning		cognitive_mapping_and_planning
compression		compression
differential_privacy		differential_privacy
domain_adaptation		domain_adaptation
im2txt		im2txt
inception		inception
learned_optimizer		learned_optimizer
learning_to_remember_rare_events		learning_to_remember_rare_events
lfads		lfads
lm_1b		lm_1b
namignizer		namignizer
neural_gpu		neural_gpu
neural_programmer		neural_programmer
next_frame_prediction		next_frame_prediction
object_detection		object_detection
pcl_rl		pcl_rl
ptn		ptn
qa_kg		qa_kg
real_nvp		real_nvp
rebar		rebar
resnet		resnet
skip_thoughts		skip_thoughts
slim		slim
street		street
swivel		swivel
syntaxnet		syntaxnet
textsum		textsum
transformer		transformer
tutorials		tutorials
video_prediction		video_prediction
.gitignore		.gitignore
.gitmodules		.gitmodules
AUTHORS		AUTHORS
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
ISSUE_TEMPLATE.md		ISSUE_TEMPLATE.md
LICENSE		LICENSE
README.md		README.md
WORKSPACE		WORKSPACE
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Incomplete List of Dependencies:

Instructions after cloning

References

2D Object Detection Benchmark Overview (from KITTI)

About

Releases

Packages

Contributors 245

Languages

License

sshleifer/object_detection_kitti

Folders and files

Latest commit

History

Repository files navigation

Incomplete List of Dependencies:

Instructions after cloning

References

2D Object Detection Benchmark Overview (from KITTI)

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 245

Languages

Packages