ROI Pooling

Region of interest pooling explained (deepsense.ai)

It's a NN layer taking two inputs:

  • A fixed-size feature map from a CNN

  • An N x 5 matrix of RoIs, where N is the number of RoIs and each row holds the image index plus the 4 bounding-box coordinates

How does it work?

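For every RoI, it takes the corresponding section of the feature map and pools it down to a pre-defined size (e.g. 5x5). To do so, it divides that section into a grid of the target size and takes the maximum value in each grid cell.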
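
The operation described above can be sketched in plain Python for a single-channel feature map. This is only an illustration, not the layer from any particular framework. The function name `roi_pool`, the `(x1, y1, x2, y2)` coordinate order with inclusive corners given in feature-map cells, and the floor-based bin edges are all assumptions made for the example. It also assumes each grid cell is reduced by taking its maximum, and that the RoI is at least as large as the output grid.

```python
def roi_pool(feature_map, roi, out_h, out_w):
    """Max-pool one RoI of a 2D feature map into an out_h x out_w grid.

    feature_map: list of rows (H x W), a single channel.
    roi: (x1, y1, x2, y2), inclusive corners in feature-map cells
         (hypothetical layout chosen for this sketch).
    Assumes the RoI is at least out_h cells tall and out_w cells wide.
    """
    x1, y1, x2, y2 = roi
    roi_h = y2 - y1 + 1
    roi_w = x2 - x1 + 1

    # Floor-based edges give contiguous, non-overlapping bins that may
    # differ in size when the RoI is not a multiple of the grid.
    row_edges = [y1 + (i * roi_h) // out_h for i in range(out_h + 1)]
    col_edges = [x1 + (j * roi_w) // out_w for j in range(out_w + 1)]

    pooled = []
    for i in range(out_h):
        out_row = []
        for j in range(out_w):
            cell = [
                feature_map[r][c]
                for r in range(row_edges[i], row_edges[i + 1])
                for c in range(col_edges[j], col_edges[j + 1])
            ]
            out_row.append(max(cell))  # keep the strongest activation
        pooled.append(out_row)
    return pooled


feature_map = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [8, 9, 10, 11],
    [12, 13, 14, 15],
]
print(roi_pool(feature_map, (0, 0, 3, 3), 2, 2))  # [[5, 7], [13, 15]]
```

Whatever the RoI size, the output always has shape `out_h x out_w`, which is what lets a fixed-size fully connected layer follow this step.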

Note: the RoI size does not need to be a multiple of the grid size (e.g. a 7x5 region can be divided into a 2x2 grid, with cells of unequal size)
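
To make the 7x5 example concrete, here is a small sketch of how the uneven cells could be computed. The helper names `bin_edges` and `bin_sizes` are hypothetical, and rounding the edges down is just one possible choice; implementations may round differently.

```python
def bin_edges(size, n_bins):
    """Split `size` cells into `n_bins` contiguous bins, rounding edges down."""
    return [(i * size) // n_bins for i in range(n_bins + 1)]


def bin_sizes(size, n_bins):
    """Width of each bin produced by bin_edges."""
    edges = bin_edges(size, n_bins)
    return [edges[i + 1] - edges[i] for i in range(n_bins)]


# A 7x5 region split into a 2x2 grid:
print(bin_sizes(7, 2))  # [3, 4]
print(bin_sizes(5, 2))  # [2, 3]
```

With this rounding, the 7-cell side splits into bins of 3 and 4 cells and the 5-cell side into bins of 2 and 3, so the four grid cells cover 3x2, 4x2, 3x3 and 4x3 cells of the region.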
