Training Tricks

MMSegmentation support following training tricks out of box.

Different Learning Rate(LR) for Backbone and Heads

In semantic segmentation, some methods make the LR of heads larger than backbone to achieve better performance or faster convergence.

In MMSegmentation, you may add following lines to config to make the LR of heads 10 times of backbone.

optim_wrapper=dict(
    paramwise_cfg = dict(
        custom_keys={
            'head': dict(lr_mult=10.)}))

With this modification, the LR of any parameter group with 'head' in name will be multiplied by 10. You may refer to MMEngine documentation for further details.

Online Hard Example Mining (OHEM)

We implement pixel sampler for training sampling, like OHEM (Online Hard Example Mining), which is used for remove the "easy" examples for model training. Here is an example config of training PSPNet with OHEM enabled.

_base_ = './pspnet_r50-d8_4xb2-40k_cityscapes-512x1024.py'
model=dict(
    decode_head=dict(
        sampler=dict(type='OHEMPixelSampler', thresh=0.7, min_kept=100000)) )

In this way, only pixels with confidence score under 0.7 are used to train. And we keep at least 100000 pixels during training. If thresh is not specified, pixels of top min_kept loss will be selected.

Class Balanced Loss

For dataset that is not balanced in classes distribution, you may change the loss weight of each class. Here is an example for cityscapes dataset.

_base_ = './pspnet_r50-d8_4xb2-40k_cityscapes-512x1024.py'
model=dict(
    decode_head=dict(
        loss_decode=dict(
            type='CrossEntropyLoss', use_sigmoid=False, loss_weight=1.0,
            # DeepLab used this class weight for cityscapes
            class_weight=[0.8373, 0.9180, 0.8660, 1.0345, 1.0166, 0.9969, 0.9754,
                        1.0489, 0.8786, 1.0023, 0.9539, 0.9843, 1.1116, 0.9037,
                        1.0865, 1.0955, 1.0865, 1.1529, 1.0507])))

class_weight will be passed into CrossEntropyLoss as weight argument. Please refer to PyTorch Doc for details.

Multiple Losses

For loss calculation, we support multiple losses training concurrently. Here is an example config of training unet on DRIVE dataset, whose loss function is 1:3 weighted sum of CrossEntropyLoss and DiceLoss:

_base_ = './fcn_unet_s5-d16_64x64_40k_drive.py'
model = dict(
    decode_head=dict(loss_decode=[
        dict(type='CrossEntropyLoss', loss_name='loss_ce', loss_weight=1.0),
        dict(type='DiceLoss', loss_name='loss_dice', loss_weight=3.0)
    ]),
    auxiliary_head=dict(loss_decode=[
        dict(type='CrossEntropyLoss', loss_name='loss_ce', loss_weight=1.0),
        dict(type='DiceLoss', loss_name='loss_dice', loss_weight=3.0)
    ]),
)

In this way, loss_weight and loss_name will be weight and name in training log of corresponding loss, respectively.

Note: If you want this loss item to be included into the backward graph, loss_ must be the prefix of the name.

monkey_cici / mmsegmentation

Training Tricks

Different Learning Rate(LR) for Backbone and Heads

Online Hard Example Mining (OHEM)

Class Balanced Loss

Multiple Losses

简介

发行版

贡献者

近期动态

monkey_cici / mmsegmentation .gitee-modal { width: 500px !important; }

Training Tricks

Different Learning Rate(LR) for Backbone and Heads

Online Hard Example Mining (OHEM)

Class Balanced Loss

Multiple Losses

简介

发行版

开源评估指数源自 OSS-Compass 评估体系，评估体系围绕以下三个维度对项目展开评估：

贡献者

近期动态

搜索帮助

monkey_cici / mmsegmentation