Skip to content

Toy problem to get started with object detection and bounding box regression using MNIST

Notifications You must be signed in to change notification settings

adityassrana/object-detection-mnist

Repository files navigation

object-detection-mnist

In these notebooks we will learn how to implement a convolutional neural network (CNN) regressor to localize digits of the MNIST dataset. We will use the PyTorch library for training our model.

  1. The input to our model will be a 64 x 64 image with a MNIST digit at any location, and the output of the model are four real numbers that define a bounding box (x, y, width, and height) around the digit. Open In Collab

  1. Then we will modify our model by adding a classification output, so that it can jointly predict the bounding box of the digit and also its class label. Open In Collab

References

The above notebooks are adapted versions of an assignment given by Lluis Gomez i Bigorda for my M5:Visual Recognition class

About

Toy problem to get started with object detection and bounding box regression using MNIST

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published