Mixconv_pytorch

This repo is a PyTorch implementation of the Google paper "MixConv: Mixed Depthwise Convolutional Kernels".

This code mimics the official TensorFlow implementation (https://github.com/tensorflow/tpu/tree/master/models/official/mnasnet/mixnet).
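
For readers new to the paper: MixConv splits the channels of a depthwise convolution into groups and gives each group its own kernel size, then concatenates the group outputs. The sketch below shows only the channel bookkeeping in plain Python. The function names are hypothetical and not taken from this repo, and handing the leftover channels to the first group is an assumption about how uneven splits are handled.

```python
def split_channels(channels, num_groups):
    """Split `channels` into `num_groups` near-equal parts.

    Assumption: any remainder is added to the first group.
    """
    split = [channels // num_groups] * num_groups
    split[0] += channels - sum(split)
    return split


def mixconv_plan(channels, kernel_sizes):
    """Pair each kernel size with the number of channels it would convolve."""
    groups = split_channels(channels, len(kernel_sizes))
    return list(zip(kernel_sizes, groups))


# 40 channels shared between 3x3, 5x5 and 7x7 depthwise kernels.
plan = mixconv_plan(40, [3, 5, 7])
print(plan)  # [(3, 14), (5, 13), (7, 13)]
```

In a real layer, each (kernel size, channel count) pair would correspond to one depthwise convolution applied to its slice of the input tensor.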

Dependencies

Python 3.5+
PyTorch v1.0.0+

How to use

python train_cifar.py --lr 0.016 --batch-size 256 -a mixnet-s --dtype cifar100 --optim adam --scheduler exp --epochs 650
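
To get a feel for what the command above implies, here is a minimal sketch of an exponential learning-rate schedule. It assumes `--scheduler exp` means per-epoch exponential decay. The decay factor `gamma` is a made-up value for illustration; check `train_cifar.py` for the value actually used.

```python
def exponential_lr(base_lr, gamma, epoch):
    """Learning rate after `epoch` epochs of exponential decay."""
    return base_lr * gamma ** epoch


base_lr = 0.016  # from --lr in the command above
gamma = 0.99     # assumed decay factor, not taken from the repo
total_epochs = 650  # from --epochs in the command above

# Print the learning rate at a few points of the run.
for epoch in (0, 1, 100, total_epochs - 1):
    print(f"epoch {epoch:3d}: lr = {exponential_lr(base_lr, gamma, epoch):.6g}")
```

Over 650 epochs even a decay factor close to 1 shrinks the learning rate by orders of magnitude, so the choice of `gamma` is worth checking when accuracy is lower than expected.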

Reproduction and Results

CIFAR 100

Network	Top 1	#Params	#Flops
Mixnet-S	in progress	2.7M (this code)	3.2M (this code)
Mixnet-M	in progress	3.6M (this code)	4.4M (this code)
Mixnet-L	in progress	5.8M (this code)	not available (known bug, fix pending)

ImageNet

Network	Top 1	#Params	#Flops
Mixnet-S	in progress	4.1M (this code)	259M (this code)
Mixnet-M	in progress	5.0M (this code)	360M (this code)
Mixnet-L	in progress	7.3M (this code)	580M (this code)
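
As a rough guide to where such numbers come from, the sketch below counts parameters and multiply-adds for one mixed depthwise layer. It is an illustration only: it assumes bias-free depthwise convolutions and counts one multiply-add per weight per output position, which may differ from how `count_params_flops.py` counts. The function names and the example layer shape are hypothetical.

```python
def depthwise_conv_cost(channels, kernel_size, out_h, out_w):
    """Return (params, multiply-adds) for a bias-free depthwise convolution."""
    params = channels * kernel_size * kernel_size
    mult_adds = params * out_h * out_w
    return params, mult_adds


def mixconv_cost(group_channels, kernel_sizes, out_h, out_w):
    """Sum the cost of each channel group, one depthwise conv per kernel size."""
    total_params = 0
    total_mult_adds = 0
    for channels, kernel_size in zip(group_channels, kernel_sizes):
        params, mult_adds = depthwise_conv_cost(channels, kernel_size, out_h, out_w)
        total_params += params
        total_mult_adds += mult_adds
    return total_params, total_mult_adds


# Hypothetical layer: 40 channels split 14/13/13 over 3x3, 5x5, 7x7 kernels,
# producing a 28x28 feature map.
params, mult_adds = mixconv_cost([14, 13, 13], [3, 5, 7], 28, 28)
print(params, mult_adds)
```

Larger kernels dominate the cost: in this example the 7x7 group holds more than half of the layer's weights.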

Discussion

Currently, the accuracy is much lower than the numbers reported in the paper. Rigorous, constructive feedback on how to train MixConv properly in PyTorch is welcome.
