existing → bounding boxes then image classifier

too slow

Image (448x448) → CNN on Resized image (get bounding boxes and probabilities) → non-maximal suppression (suppresses lower confidence score)

Untitled

  1. SxS grid
  2. Predict B bounding boxes (x, y, w, h, confidence) along with confidence scores (IOU)
  3. Predict C conditional class Probabilities