
paper, code

TL;DR

  • task : long-tail object detection
  • problem : detection data (e.g., COCO, LVIS) is long-tailed and models are trained on it directly, but the evaluation metric, mAP, is an AUC-style ranking metric, so there is a gap between the training objective and the evaluation.
  • idea : replace mAP with a probabilistic surrogate and bound it by a weighted pairwise ranking error under class-margin bounds for detection (the ranking error measures the frequency with which a negative sample x’ ranks higher than a positive sample x), then optimize that bound.
  • architecture : Mask R-CNN, Cascade Mask R-CNN
  • objective : ECM loss
  • baseline : CE Loss, Federated Loss, Seesaw Loss, LOCE loss
  • data : LVIS v1, Open Images
  • result : SOTA
  • contribution : no hyper-parameters to tune for the long-tail problem
  • Limitations or things I don’t understand : I don’t follow all the formulas. The paper says there is no penalty for duplicate detections of the same object. Would it work with DETR?

Details

  • Most of the literature implicitly or explicitly re-weights the loss.
  • Equalization Loss: removes negative gradients for rare classes
  • assumes rare classes are suppressed by negative gradients coming from other classes
  • Balanced Group Softmax (BaGS): splits classes into groups by training-set frequency and applies softmax + cross-entropy within each group
  • Federated Loss: computes negative gradients only for classes annotated in the image
  • Equalization Loss v2: balances the cumulative ratio of positive to negative gradients per class
  • Seesaw Loss: down-weights the negative gradients that high-frequency classes impose on rare classes
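
To make the re-weighting idea concrete, here is a minimal sketch of the Federated-Loss-style masking described above: per-class sigmoid BCE where negative gradients are kept only for classes annotated in the image. The function name and the `annotated` mask interface are my own illustration, not the paper's code.

```python
import numpy as np

def federated_bce_grad_mask(logits, targets, annotated):
    """Sketch of the federated-loss idea: per-class sigmoid BCE, but the
    negative terms are kept only for classes annotated in the image.
    `annotated` is a boolean mask over classes (hypothetical interface)."""
    probs = 1.0 / (1.0 + np.exp(-logits))
    # standard per-class binary cross-entropy
    bce = -(targets * np.log(probs) + (1 - targets) * np.log(1 - probs))
    # positives are always kept; negatives only for annotated classes
    keep = np.logical_or(targets > 0, annotated)
    return np.where(keep, bce, 0.0)
```

With `targets = [1, 0, 0]` and `annotated = [True, True, False]`, the third class contributes no loss (and hence no negative gradient).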

Learning with class-margins

Key Developments

  • preliminary: the class-margin bound [image]

If I read it right, evaluating the class loss with a margin added upper-bounds the plain class loss, so minimizing the margin version also drives the error down. This bound is proven in an earlier paper.
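
A quick numeric sanity check of that direction of the bound (a sketch, not the paper's proof): a sample misclassified at threshold 0 is always counted as an error at any margin m ≥ 0, so the margin error upper-bounds the plain error.

```python
import numpy as np

rng = np.random.default_rng(0)
scores = rng.normal(size=1000)           # scores of positive samples
margin = 0.5                             # any margin m >= 0

err_plain = np.mean(scores <= 0)         # misclassified at threshold 0
err_margin = np.mean(scores <= margin)   # error unless the score clears the margin

# scores <= 0 implies scores <= margin, so the margin error can only be larger
assert err_margin >= err_plain
```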

  • Evaluation metric : mAP [image]

Replacing this with a probabilistic formulation gives the following [image]

This can be bounded by a weighted pairwise ranking error under the class-margin bound [image]

Here, the pairwise ranking error is the frequency with which a negative sample x’ is ranked higher than a positive sample x.
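
That quantity is easy to write down directly. A minimal sketch (my own helper, not the paper's code) that counts mis-ranked (positive, negative) pairs:

```python
import numpy as np

def pairwise_ranking_error(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs where the negative sample x'
    scores at least as high as the positive sample x."""
    pos = np.asarray(pos_scores, dtype=float)[:, None]  # shape (P, 1)
    neg = np.asarray(neg_scores, dtype=float)[None, :]  # shape (1, N)
    return np.mean(neg >= pos)                          # average over all P*N pairs
```

For example, `pairwise_ranking_error([2.0, 1.0], [0.5, 1.5])` gives 0.25: of the four pairs, only (1.0, 1.5) is mis-ranked.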

The ranking loss, in turn, can be bounded by binary errors with a threshold added [image]
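
The intuition, as I understand it: for any threshold m, a pair can only be mis-ranked (negative ≥ positive) if the positive fails to clear m or the negative reaches m, so the ranking error is bounded by the sum of two thresholded binary errors. A numeric check of that union-bound step:

```python
import numpy as np

rng = np.random.default_rng(1)
pos = rng.normal(1.0, 1.0, size=500)    # positive-sample scores
neg = rng.normal(-1.0, 1.0, size=500)   # negative-sample scores
m = 0.0                                 # an arbitrary threshold

rank_err = np.mean(neg[None, :] >= pos[:, None])   # pairwise ranking error
bin_bound = np.mean(pos <= m) + np.mean(neg >= m)  # two thresholded binary errors

# if neg >= pos held while pos > m and neg < m, we'd have neg < m < pos: contradiction
assert rank_err <= bin_bound
```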

Rearranging this expression and combining it with the class-margin bound above gives the tightest margins [image]

To recap: we want to minimize the margin-based error, which amounts to applying a sigmoid whose decision threshold is the margin rather than 0.5 [image]
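
One way to realize "a sigmoid whose threshold is the margin" is to shift the logits by the logit of the margin, so the 0.5-crossing of the shifted sigmoid sits exactly where the unshifted probability equals m. This shift-by-logit form is my reading of the idea, not necessarily the paper's exact parameterization:

```python
import math

def margin_sigmoid(score, margin_prob):
    """Sigmoid shifted so its 0.5-crossing occurs where the unshifted
    sigmoid probability equals `margin_prob` (the margin)."""
    shift = math.log(margin_prob / (1.0 - margin_prob))  # logit of the margin
    return 1.0 / (1.0 + math.exp(-(score - shift)))

def margin_bce(score, label, margin_prob):
    """Binary cross-entropy against the margin-shifted sigmoid."""
    p = margin_sigmoid(score, margin_prob)
    return -math.log(p) if label == 1 else -math.log(1.0 - p)
```

With `margin_prob = 0.5` the shift is zero and this reduces to ordinary sigmoid BCE; a larger margin makes a positive sample pay more loss at the same score.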

  • where $m_c$ is the value that bounds the ranking error, which can itself be expressed as a bound [image]

With these, the score function becomes a weighted sum using the tightest margins [image]
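
Putting the pieces together, the overall objective is a weighted sum of per-class margin losses. The sketch below shows only the structure; the margin and weight values are hypothetical placeholders, whereas the actual ECM loss derives the tightest margins from per-class positive/negative counts:

```python
import numpy as np

def ecm_style_loss(logits, targets, margins, weights):
    """Structural sketch: per-class sigmoid BCE where each class's sigmoid is
    shifted by the logit of its margin m_c, combined as a weighted sum.
    `margins` and `weights` are placeholders, not the paper's derived values."""
    logits = np.asarray(logits, dtype=float)
    shift = np.log(margins / (1.0 - margins))        # logit of each class margin m_c
    p = 1.0 / (1.0 + np.exp(-(logits - shift)))      # margin-shifted sigmoid per class
    bce = -(targets * np.log(p) + (1 - targets) * np.log(1 - p))
    return float(np.sum(weights * bce))              # weighted sum over classes
```

With all margins at 0.5 and unit weights this collapses to plain per-class sigmoid BCE, which makes the role of the margins easy to see.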

Results

[image: results tables]