Improving object detection performance on hard-to-detect instances in DOTA

Authors

Said Harb
Tim Schmidt

Dataset

The dataset used in our experiments is DOTA, specifically version 1.5. It consists of 2,806 aerial images of varying size, ranging from 800 x 800 to 20,000 x 20,000 pixels. The images are labeled with the following 16 classes:

large vehicle
small vehicle
helicopter
plane
ship
swimming pool
container crane
storage tank
bridge
harbor
roundabout
baseball diamond
basketball court
ground track field
tennis court
soccer ball field
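YOLO-style detectors expect each label as a class index plus a normalized, axis-aligned box, whereas DOTA annotation files list four corner points followed by the category name and a difficulty flag. The following is a minimal sketch of such a conversion, not the conversion used in this project. It assumes that class indices follow the order of the list above and that category names in the annotation files are hyphenated (e.g. `swimming-pool`). The function name `dota_line_to_yolo` is hypothetical.

```python
# Class names in the order listed above; using this order as the index
# assignment is an assumption made for illustration.
CLASSES = [
    "large vehicle", "small vehicle", "helicopter", "plane", "ship",
    "swimming pool", "container crane", "storage tank", "bridge", "harbor",
    "roundabout", "baseball diamond", "basketball court",
    "ground track field", "tennis court", "soccer ball field",
]
CLASS_TO_ID = {name: idx for idx, name in enumerate(CLASSES)}


def dota_line_to_yolo(line, img_w, img_h):
    """Convert one DOTA annotation line to a YOLO label line.

    Expected input: "x1 y1 x2 y2 x3 y3 x4 y4 category difficult".
    The oriented box is replaced by its axis-aligned bounding rectangle.
    """
    parts = line.split()
    coords = [float(v) for v in parts[:8]]
    # Assumption: hyphenated category names map to the spaced names above.
    name = parts[8].replace("-", " ")
    xs, ys = coords[0::2], coords[1::2]
    x_min, x_max = min(xs), max(xs)
    y_min, y_max = min(ys), max(ys)
    # YOLO format: class index, box center and size, normalized to [0, 1].
    cx = (x_min + x_max) / 2 / img_w
    cy = (y_min + y_max) / 2 / img_h
    w = (x_max - x_min) / img_w
    h = (y_max - y_min) / img_h
    return f"{CLASS_TO_ID[name]} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}"


example = dota_line_to_yolo("100 200 300 200 300 400 100 400 plane 0", 1000, 1000)
print(example)  # 3 0.200000 0.300000 0.200000 0.200000
```

Taking the bounding rectangle discards the orientation of the original box, which is a simplification for detectors that only predict axis-aligned boxes.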

Model(s)

The model employed is YOLOv5n, the nano (smallest) variant of YOLOv5.

The latest version of the model and further information about it can be found here: Link

Results

Both approaches considerably improved detection performance on hard-to-detect instances. However, the oversampling approach proved slightly better, as its experiment achieved the best metrics.
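To illustrate the general idea behind oversampling, the sketch below repeats every image that contains at least one hard-to-detect class in the training list. It is only one possible way to oversample and is not necessarily the procedure used in these experiments. The function name `oversample`, the repetition factor, the file names, and the choice of hard classes are all hypothetical.

```python
from collections import Counter


def oversample(image_classes, hard_classes, factor=3):
    """Return a training list in which images containing a hard class
    appear `factor` times and all other images appear once."""
    if factor < 1:
        raise ValueError("factor must be at least 1")
    train_list = []
    for image_id, classes in image_classes.items():
        # Repeat the image if it shares at least one class with hard_classes.
        repeats = factor if hard_classes & set(classes) else 1
        train_list.extend([image_id] * repeats)
    return train_list


# Hypothetical toy annotations: image file name -> classes present in it.
image_classes = {
    "P0001.png": ["plane", "plane"],
    "P0002.png": ["container crane", "harbor"],
    "P0003.png": ["small vehicle", "helicopter"],
}
# Hypothetical choice of hard-to-detect classes.
hard = {"container crane", "helicopter"}

train_list = oversample(image_classes, hard, factor=3)
counts = Counter(train_list)
print(counts)
```

Repeating whole images also repeats the easy instances they contain, so the repetition factor trades off exposure to rare classes against a shift in the overall class distribution.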

The following images show the mAP overall and the AP per class for the best experiment:

Results 1

Results 2