wide-resnet-cifar100-kaggle

Category: Artificial Intelligence / Neural Networks / Deep Learning
Development tool: Lua
File size: 659KB
Downloads: 0
Upload date: 2018-02-22 23:41:24
Uploader: sh-1993
Description: Wide ResNet ensemble implementation for a CIFAR100 Kaggle submission; scores over 83.97% on the Kaggle private leaderboard.

File list (name, size in bytes, date):
IMAGES (0, 2018-02-23)
IMAGES\cifar100_image.png (238365, 2018-02-23)
IMAGES\cifar10_image.png (231057, 2018-02-23)
IMAGES\img_356.lua (336, 2018-02-23)
IMAGES\svhn_image.png (180204, 2018-02-23)
checkpoints.lua (3392, 2018-02-23)
dataloader.lua (3500, 2018-02-23)
datasets (0, 2018-02-23)
datasets\cifar100-gen.lua (2060, 2018-02-23)
datasets\cifar100.lua (1630, 2018-02-23)
datasets\init.lua (1351, 2018-02-23)
datasets\transforms.lua (4917, 2018-02-23)
ensemble.lua (2987, 2018-02-23)
main.lua (3561, 2018-02-23)
networks (0, 2018-02-23)
networks\init.lua (4637, 2018-02-23)
networks\wide-resnet.lua (6794, 2018-02-23)
opts.lua (6137, 2018-02-23)
scripts (0, 2018-02-23)
scripts\ensemble_test.sh (494, 2018-02-23)
scripts\ensemble_train.sh (2546, 2018-02-23)
scripts\test.sh (580, 2018-02-23)
scripts\train.sh (476, 2018-02-23)
test.lua (4449, 2018-02-23)
train.lua (7601, 2018-02-23)

# Wide Residual Networks Using Ensemble for CIFAR100 Kaggle Submission (.csv is generated)

This is taken from https://github.com/meliketoy/wide-residual-network/ and modified into a solution for the STAT946 Kaggle challenge - Deep Learning. It scores 0.83971 on the private leaderboard: https://www.kaggle.com/c/fall2017stat946/leaderboard

NOTE: Before submitting the solution, make sure to remove the "true_labels" column from the generated csv file.

All my experiments were run on a GeForce GTX 1080.

Wide-residual network implementation for CIFAR-100: a Torch implementation of Sergey Zagoruyko's [Wide Residual Networks](https://arxiv.org/pdf/1605.07146v2.pdf).

To figure out what width and depth do in wide-residual networks, several experiments were run with different width and depth settings. It turns out that **increasing the number of filters (increasing width)** helps the model more than making it deeper. Last but not least, simply averaging a few models trained with different parameter settings gave a significant increase in both top-1 and top-5 accuracy. Test accuracy reached **84.19%** on CIFAR-100 with only **meanstd** normalization.

## Requirements

See the [installation instruction](INSTALL.md) for a step-by-step installation guide.
See the [server instruction](SERVER.md) for server setup.

- Install [Torch](http://torch.ch/docs/getting-started.html)
- Install [cuda-8.0](https://developer.nvidia.com/cuda-downloads)
- Install [cudnn v5.1](https://developer.nvidia.com/cudnn)
- Install luarocks packages

```bash
$ luarocks install cutorch
$ luarocks install xlua
$ luarocks install optnet
```

## Directories and datasets

- modelState : the best model will be saved in this directory
- datasets : data preparation & preprocessing directory
- networks : wide-residual network model structure files
- gen : the generated .t7 file for each dataset will be saved here
- scripts : directory containing the run scripts

## Best Results

Weighting each model's contribution, rather than averaging them uniformly, promises further accuracy gains. You can see that the ensemble network improves on the results of single WRNs.

Test error (%, random flip, **meanstd** normalization, median of 5 runs) on CIFAR:

| Dataset   | Network      | Top1 Err(%) |
|:---------:|:------------:|:-----------:|
| CIFAR-100 | WRN-28x10    | 18.85       |
| CIFAR-100 | Ensemble-WRN | **15.81**   |

## How to run

You can train on cifar100 by editing and running the script below.

```bash
$ ./scripts/train.sh
```

You can test your own trained cifar100 model by editing and running the script below. This prints the test results and saves the .csv file for submission to Kaggle.

```bash
$ ./scripts/test.sh
```

To train an ensemble, follow the steps below.

```bash
$ vi ensemble.lua
# Press :32 in vi, which will move your cursor to line 32
ens_depth        = torch.Tensor({28, 28, 28, 28, 40, 40})
ens_widen_factor = torch.Tensor({10, 20, 20, 20, 10, 14})
ens_nExperiment  = torch.Tensor({ 1,  1,  2,  3,  1,  1})
```

then edit and run

```bash
$ ./scripts/ensemble_train.sh
```

This will create the .csv file for Kaggle submission.

If you just want to build an ensemble of already-trained models, edit ensemble.lua as described above, then edit and run scripts/ensemble_test.sh:

```bash
$ ./scripts/ensemble_test.sh
```

## Implementation Details

* CIFAR-100

| epoch     | learning rate | weight decay | Optimizer | Momentum | Nesterov |
|:---------:|:-------------:|:------------:|:---------:|:--------:|:--------:|
| 0 ~ 60    | 0.1           | 0.0005       | Momentum  | 0.9      | true     |
| 61 ~ 120  | 0.02          | 0.0005       | Momentum  | 0.9      | true     |
| 121 ~ 160 | 0.004         | 0.0005       | Momentum  | 0.9      | true     |
| 161 ~ 200 | 0.0008        | 0.0005       | Momentum  | 0.9      | true     |
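The schedule above simply divides the base rate of 0.1 by 5 at epochs 60, 120, and 160. A minimal Lua sketch of that rule, written from the table's values rather than taken from train.lua:

```lua
-- Learning rate as a function of epoch, mirroring the table above:
-- 0.1 for epochs 1-60, then divided by 5 at epochs 60, 120 and 160.
local function learningRate(epoch)
   local decay = 0
   if epoch > 160 then decay = 3
   elseif epoch > 120 then decay = 2
   elseif epoch > 60 then decay = 1
   end
   return 0.1 * math.pow(0.2, decay)
end

print(learningRate(30), learningRate(90), learningRate(150), learningRate(190))
-- 0.1  0.02  0.004  0.0008
```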
## CIFAR-100 Results

![alt tag](IMAGES/cifar100_image.png)

Below are the test-set accuracies for **CIFAR-100** training.

NOTE: The 28x20 configuration consumes the most GPU memory, ~11 GB on a single GPU; if you get an error, it is likely due to insufficient GPU memory. Also, scripts/ensemble_test.sh may run into memory issues; in that case, either reduce the number of models in the ensemble by modifying ensemble.lua, or edit test.lua to load the models differently.

**Accuracy is the average of 5 runs.**

| network           | dropout | preprocess | GPU:0 | GPU:1 | per epoch    | Top1 acc(%) | Top5 acc(%) |
|:-----------------:|:-------:|:----------:|:-----:|:-----:|:------------:|:-----------:|:-----------:|
| pre-ResNet-1001   | 0       | meanstd    | -     | -     | 3 min 25 sec | 77.29       | 93.44       |
| wide-resnet 28x10 | 0       | ZCA        | 5.90G | -     | 2 min 03 sec | 80.03       | 95.01       |
| wide-resnet 28x10 | 0       | meanstd    | 5.90G | -     | 2 min 03 sec | 81.01       | 95.44       |
| wide-resnet 28x10 | 0.3     | meanstd    | 5.90G | -     | 2 min 03 sec | 81.47       | 95.53       |
| wide-resnet 28x20 | 0.3     | meanstd    | 8.13G | 6.93G | 4 min 05 sec | **82.43**   | **96.02**   |
| wide-resnet 40x10 | 0.3     | meanstd    | 8.93G | -     | 3 min 06 sec | 81.47       | 95.65       |
| wide-resnet 40x14 | 0.3     | meanstd    | 7.39G | ~6G   | 3 min 23 sec | 81.83       | 95.50       |
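The **meanstd** preprocessing referenced above is plain per-channel normalization: subtract each channel's training-set mean and divide by its standard deviation. A minimal Torch sketch of the idea, with hypothetical `trainData`/`testData` tensors of shape N x 3 x 32 x 32 (the repo's actual version lives in datasets/transforms.lua):

```lua
require 'torch'

-- Per-channel mean/std normalization ("meanstd"), computed on the
-- training set and applied to both splits. Tensor names are assumptions.
local function meanstdNormalize(trainData, testData)
   local mean, std = {}, {}
   for c = 1, 3 do
      mean[c] = trainData[{ {}, c, {}, {} }]:mean()
      std[c]  = trainData[{ {}, c, {}, {} }]:std()
      for _, split in ipairs({trainData, testData}) do
         split[{ {}, c, {}, {} }]:add(-mean[c]):div(std[c])
      end
   end
   return mean, std  -- keep these to normalize any future inputs identically
end
```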
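The ensemble itself boils down to prediction averaging: run each trained WRN on the same batch, average the class scores, and take the argmax. A sketch of that idea with hypothetical names (`models`, `input`); see ensemble.lua for the actual implementation:

```lua
require 'nn'

-- Average the outputs of several trained models and pick the
-- highest-scoring class for each example in the batch.
local function ensemblePredict(models, input)
   local avg
   for _, model in ipairs(models) do
      model:evaluate()                    -- inference mode: fix dropout/batchnorm
      local out = model:forward(input)
      avg = avg and avg:add(out) or out:clone()
   end
   avg:div(#models)
   local _, pred = avg:max(2)             -- argmax over the class dimension
   return pred                            -- N x 1 tensor of predicted labels
end
```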
