README.md: 24 additions & 5 deletions
@@ -30,12 +30,15 @@ The model was trained on the standard [MNIST](http://yann.lecun.com/exdb/mnist/)
 
 *Note: you don't have to manually download, preprocess, and load the MNIST dataset as [TorchVision](https://github.com/pytorch/vision) will take care of this step for you.*
 
+I have tried using other datasets. See the [Other Datasets](#other-datasets) section below for more details.
+
 ## Requirements
 - Python 3
 - Tested with version 3.6.4
 - [PyTorch](http://pytorch.org/)
-- Tested with version 0.2.0.post4 and 0.3.0.post4
+- Tested with version 0.3.0.post4
 - Code will not run with version 0.1.2 due to `keepdim` not available in this version.
+- Code will not run with version 0.2.0 because its `softmax` function does not take a dimension argument.
| Input image width to the convolution | 28 | --input-width 28 |
+| Input image height to the convolution | 28 | --input-height 28 |
 
 ## Results
 
@@ -160,7 +166,8 @@ Test loss. Lowest test error: 0.2002%
 
 ### Training Speed
 
-Around `3.25s / batch` or `25min / epoch` on a single Testla K80 GPU.
+- Around `5.97s / batch` or `8min / epoch` on a single Tesla K80 GPU with a batch size of 704.
+- Around `3.25s / batch` or `25min / epoch` on a single Tesla K80 GPU with a batch size of 128.
 
 
 
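The per-batch and per-epoch timings in the Training Speed hunk can be cross-checked with a little arithmetic. The sketch below is only a sanity check: it assumes one epoch covers the full 60,000-image MNIST training split and that a final partial batch counts as one step. `epoch_minutes` is a hypothetical helper, not something from the repository.

```python
import math

# Assumption: one epoch is the standard MNIST training split of 60,000 images.
MNIST_TRAIN_IMAGES = 60_000


def epoch_minutes(seconds_per_batch, batch_size, num_images=MNIST_TRAIN_IMAGES):
    """Estimate wall-clock minutes per epoch from a per-batch timing."""
    # A trailing partial batch still costs one step, hence the ceiling.
    batches = math.ceil(num_images / batch_size)
    return batches * seconds_per_batch / 60


print(round(epoch_minutes(5.97, 704), 1))  # 8.6, in line with "8min / epoch"
print(round(epoch_minutes(3.25, 128), 1))  # 25.4, in line with "25min / epoch"
```

Under these assumptions the larger batch is slower per step but needs far fewer steps (86 instead of 469), which is where the shorter epoch comes from.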
@@ -251,7 +258,7 @@ decoder.fc2.bias: [1024]
 decoder.fc3.weight: [784, 1024]
 decoder.fc3.bias: [784]
 
-Total number of parameters (with reconstruction network): 8227088 (8 million)
+Total number of parameters on (with reconstruction network): 8227088 (8 million)
 ```
 
 ### TensorBoard
@@ -271,15 +278,27 @@ $ tensorboard --logdir runs
 ```
 5. Open TensorBoard dashboard in your web browser using this URL: http://localhost:6006
 
+### Other Datasets
+
+#### CIFAR10
+
+In the spirit of experimentation, I have tried other datasets. I have updated the implementation so that it supports CIFAR10. Note that I have not thoroughly tested the capsule model on CIFAR10.
+
+To train and test the model on CIFAR10, run the following commands: