TypeError: binary_cross_entropy() got an unexpected keyword argument 'ignore_index' #162

fengyouliang · 2020-09-26T14:17:37Z

i changed use_sigmoid=False to use_simoid=True, encounter this problem:
TypeError: binary_cross_entropy() got an unexpected keyword argument 'ignore_index'

collect env

sys.platform: linux
Python: 3.7.7 (default, Mar 23 2020, 22:36:06) [GCC 7.3.0]
CUDA available: True
CUDA_HOME: /usr/local/cuda
NVCC: Cuda compilation tools, release 10.1, V10.1.243
GPU 0,1,2,3: GeForce RTX 2080 Ti
GCC: gcc (Ubuntu 7.4.0-1ubuntu1~18.04.1) 7.4.0
PyTorch: 1.5.1
PyTorch compiling details: PyTorch built with:
  - GCC 7.3
  - C++ Version: 201402
  - Intel(R) Math Kernel Library Version 2020.0.0 Product Build 20191122 for Intel(R) 64 architecture applications
  - Intel(R) MKL-DNN v0.21.1 (Git Hash 7d2fd500bc78936d1d648ca713b901012f470dbc)
  - OpenMP 201511 (a.k.a. OpenMP 4.5)
  - NNPACK is enabled
  - CPU capability usage: AVX2
  - CUDA Runtime 10.1
  - NVCC architecture flags: -gencode;arch=compute_37,code=sm_37;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_37,code=compute_37
  - CuDNN 7.6.3
  - Magma 2.5.2
  - Build settings: BLAS=MKL, BUILD_TYPE=Release, CXX_FLAGS= -Wno-deprecated -fvisibility-inlines-hidden -fopenmp -DNDEBUG -DUSE_FBGEMM -DUSE_QNNPACK -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DUSE_INTERNAL_THREADPOOL_IMPL -O2 -fPIC -Wno-narrowing -Wall -Wextra -Werror=return-type -Wno-missing-field-initializers -Wno-type-limits -Wno-array-bounds -Wno-unknown-pragmas -Wno-sign-compare -Wno-unused-parameter -Wno-unused-variable -Wno-unused-function -Wno-unused-result -Wno-strict-overflow -Wno-strict-aliasing -Wno-error=deprecated-declarations -Wno-stringop-overflow -Wno-error=pedantic -Wno-error=redundant-decls -Wno-error=old-style-cast -fdiagnostics-color=always -faligned-new -Wno-unused-but-set-variable -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, PERF_WITH_AVX512=1, USE_CUDA=ON, USE_EXCEPTION_PTR=1, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_STATIC_DISPATCH=OFF, 

TorchVision: 0.6.0a0+35d732a
OpenCV: 4.2.0
MMCV: 1.1.0
MMSegmentation: 0.5.1+2981245
MMCV Compiler: GCC 7.4
MMCV CUDA Compiler: 10.1

config:

# model settings
norm_cfg = dict(type='BN', requires_grad=True)
model = dict(
    type='EncoderDecoder',
    pretrained=None,
    backbone=dict(
        type='ResNetV1c',
        depth=50,
        num_stages=4,
        out_indices=(0, 1, 2, 3),
        dilations=(1, 1, 2, 4),
        strides=(1, 2, 1, 1),
        norm_cfg=norm_cfg,
        norm_eval=False,
        style='pytorch',
        contract_dilation=True),
    decode_head=dict(
        type='PSPHead',
        in_channels=2048,
        in_index=3,
        channels=512,
        pool_scales=(1, 2, 3, 6),
        dropout_ratio=0.1,
        num_classes=8,
        norm_cfg=norm_cfg,
        align_corners=False,
        loss_decode=dict(
            type='CrossEntropyLoss', use_sigmoid=True, loss_weight=1.0)),
    auxiliary_head=dict(
        type='FCNHead',
        in_channels=1024,
        in_index=2,
        channels=256,
        num_convs=1,
        concat_input=False,
        dropout_ratio=0.1,
        num_classes=8,
        norm_cfg=norm_cfg,
        align_corners=False,
        loss_decode=dict(
            type='CrossEntropyLoss', use_sigmoid=True, loss_weight=0.4)))
# model training and testing settings
train_cfg = dict()
test_cfg = dict(mode='whole')


# dataset settings
dataset_type = 'Dataset'
data_root = '/datasets/'
img_norm_cfg = dict(
    mean=[123.675, 116.28, 103.53], std=[58.395, 57.12, 57.375], to_rgb=True)
image_size = (256, 256)
train_pipeline = [
    dict(type='LoadImageFromFile'),
    dict(type='LoadAnnotations', ),  # imdecode_backend='cv2'
    dict(type='Resize', img_scale=image_size, multiscale_mode="value"),
    dict(type='RandomFlip', flip_ratio=0.5),
    dict(type='PhotoMetricDistortion'),
    dict(type='Normalize', **img_norm_cfg),
    dict(type='Pad', size=image_size, pad_val=0, seg_pad_val=255),
    dict(type='DefaultFormatBundle'),
    dict(type='Collect', keys=['img', 'gt_semantic_seg']),
]
test_pipeline = [
    dict(type='LoadImageFromFile'),
    dict(
        type='MultiScaleFlipAug',
        img_scale=image_size,
        # img_ratios=[0.5, 0.75, 1.0, 1.25, 1.5, 1.75],
        flip=False,
        transforms=[
            dict(type='Resize', keep_ratio=True),
            dict(type='RandomFlip'),
            dict(type='Normalize', **img_norm_cfg),
            dict(type='ImageToTensor', keys=['img']),
            dict(type='Collect', keys=['img']),
        ])
]
data = dict(
    samples_per_gpu=16,
    workers_per_gpu=0,
    train=dict(
        type=dataset_type,
        data_root=data_root,
        img_dir='train/image',
        ann_dir='train/label_cvt',
        split='train/split/train.txt',
        pipeline=train_pipeline),
    val=dict(
        type=dataset_type,
        data_root=data_root,
        img_dir='train/image',
        ann_dir='train/label_cvt',
        split='train/split/val.txt',
        pipeline=test_pipeline),
    test=dict(
        type=dataset_type,
        data_root=data_root,
        img_dir='test',
        split='train/split/test.txt',
        pipeline=test_pipeline))

# optimizer
optimizer = dict(type='SGD', lr=0.01, momentum=0.9, weight_decay=0.0005)
optimizer_config = dict()
# learning policy
lr_config = dict(policy='poly', power=0.9, min_lr=1e-4, by_epoch=False)
# runtime settings
total_iters = 20000
checkpoint_config = dict(by_epoch=False, interval=2000)
evaluation = dict(interval=2000, metric='mIoU')

# yapf:disable
log_config = dict(
    interval=100,
    hooks=[
        dict(type='TextLoggerHook', by_epoch=False),
        dict(type='TensorboardLoggerHook')
    ])
# yapf:enable
dist_params = dict(backend='nccl')
log_level = 'INFO'
load_from = '/pth/psp/pspnet_r50-d8_512x512_160k_ade20k_20200615_184358-1890b0bd.pth'
resume_from = None
workflow = [('train', 1)]
cudnn_benchmark = True

The text was updated successfully, but these errors were encountered:

xvjiarui · 2020-09-26T17:46:33Z

Hi @fengyouliang
It seems to be a bug. Would you mind creating a PR to fix it?

fengyouliang · 2020-09-28T14:44:58Z

Hi @fengyouliang
It seems to be a bug. Would you mind creating a PR to fix it?

have no idea how to fix it

lmunan · 2020-10-05T09:08:26Z

I also encountered this problem and don’t know how to solve it @xvjiarui

make text model generic

* Add the hmr head and discriminator for SMPL parameters. Add test codes and test data. * Modify the codes according to review.

Re-direct the link of CSN and TIN in the ModelZoo section to their README files, following the common practice of all other models.

xvjiarui mentioned this issue Oct 25, 2020

[Enhancement] Support ignore_index for sigmoid BCE #210

Merged

hellock closed this as completed in #210 Nov 17, 2020

aravind-h-v pushed a commit to aravind-h-v/mmsegmentation that referenced this issue Mar 27, 2023

[LDMTextToImagePipeline] make text model generic (open-mmlab#162)

543ee1e

make text model generic

wjkim81 pushed a commit to wjkim81/mmsegmentation that referenced this issue Dec 3, 2023

Add hmr head for human mesh estimation task. (open-mmlab#162)

273367d

* Add the hmr head and discriminator for SMPL parameters. Add test codes and test data. * Modify the codes according to review.

sibozhang pushed a commit to sibozhang/mmsegmentation that referenced this issue Mar 22, 2024

Update README.md (open-mmlab#162)

2d2e9a7

Re-direct the link of CSN and TIN in the ModelZoo section to their README files, following the common practice of all other models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TypeError: binary_cross_entropy() got an unexpected keyword argument 'ignore_index' #162

TypeError: binary_cross_entropy() got an unexpected keyword argument 'ignore_index' #162

fengyouliang commented Sep 26, 2020

xvjiarui commented Sep 26, 2020

fengyouliang commented Sep 28, 2020

lmunan commented Oct 5, 2020

TypeError: binary_cross_entropy() got an unexpected keyword argument 'ignore_index' #162

TypeError: binary_cross_entropy() got an unexpected keyword argument 'ignore_index' #162

Comments

fengyouliang commented Sep 26, 2020

xvjiarui commented Sep 26, 2020

fengyouliang commented Sep 28, 2020

lmunan commented Oct 5, 2020