├── .gitignore
├── README.md
├── doc
    ├── chainer_basic.md
    ├── chainer_caffemodel.md
    ├── chainer_convolution.md
    ├── chainer_install.md
    ├── chainer_layer.md
    ├── chainer_optimizer.md
    ├── chainer_trainer.md
    ├── environment.md
    ├── exercise_1.md
    ├── mnist.md
    ├── numpy_1.md
    ├── numpy_2.md
    ├── numpy_3.md
    ├── numpy_4.md
    ├── numpy_5.md
    ├── reinforcement.md
    └── translation.md
├── image
    └── digit
    │   ├── digit_0.png
    │   ├── digit_1.png
    │   ├── digit_2.png
    │   ├── digit_3.png
    │   ├── digit_4.png
    │   ├── digit_5.png
    │   ├── digit_6.png
    │   ├── digit_7.png
    │   ├── digit_8.png
    │   └── digit_9.png
└── src
    ├── bouncing
        ├── make_animation.py
        └── train.py
    ├── caffemodel
        ├── labels.txt
        └── predict.py
    ├── ex1
        ├── data_test.txt
        ├── data_train.txt
        ├── example
        │   ├── image
        │   │   └── .gitkeep
        │   └── train.py
        ├── image
        │   └── .gitkeep
        └── train.py
    ├── ex1_2
        ├── data_test.txt
        ├── data_train.txt
        ├── example
        │   ├── image
        │   │   └── .gitkeep
        │   └── train.py
        ├── image
        │   └── .gitkeep
        └── train.py
    ├── ex2
        ├── zundoko.py
        └── zundoko_lstm.py
    ├── gan
        ├── image
        │   └── .gitkeep
        └── train.py
    ├── human_activity
        ├── make_animation.py
        └── train_lstm.py
    ├── mnist
        ├── net.py
        ├── predict.py
        └── train.py
    ├── open_ai_gym
        └── train.py
    ├── translation
        ├── dataset
        │   ├── dataset.json
        │   └── dev.json
        ├── make_dataset.py
        ├── model
        │   └── .gitkeep
        └── train.py
    └── vae
        ├── image
            └── .gitkeep
        ├── net.py
        ├── train.py
        └── train_m2.py


/.gitignore:
--------------------------------------------------------------------------------
1 | *.pyc
2 | *.model
3 | *.caffemodel
4 | src/**/*.png
5 | 


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
 1 | # Deep Learning
 2 | 
 3 | ## 実行環境
 4 | 
 5 | * [Chainer](http://chainer.org/) 1.15
 6 | * [Pillow](http://pillow.readthedocs.io/en/3.0.x/index.html)
 7 | 
 8 | ## 目次
 9 | 
10 | ### 準備
11 | 
12 | * [Deep Learning 開発に必要な環境](/doc/environment.md)
13 | * [Chainerのインストール方法](/doc/chainer_install.md)
14 | 
15 | ### Numpyの基本
16 | 
17 | * [Numpyについて、配列の生成](/doc/numpy_1.md)
18 | * [要素へのアクセス](/doc/numpy_2.md)
19 | * [配列の変形と連結](/doc/numpy_3.md)
20 | * [配列の演算](/doc/numpy_4.md)
21 | * [統計処理と線形代数](/doc/numpy_5.md)
22 | 
23 | ### Chainerを使ったDeep Lerningの実践
24 | 
25 | * [手書き数字の認識](/doc/mnist.md)
26 | * [Chainerの基本的な使い方](/doc/chainer_basic.md)
27 | * [Convolutional Neural Network](/doc/chainer_convolution.md)
28 | * [Layerについて](/doc/chainer_layer.md)
29 | * [最適化アルゴリズム](/doc/chainer_optimizer.md)
30 | * [Caffe modelの使用](/doc/chainer_caffemodel.md)
31 | * [翻訳](/doc/translation.md)
32 | * [強化学習](/doc/reinforcement.md)
33 | 
34 | ### 演習
35 | 
36 | * [演習1 領域の分割](/doc/exercise_1.md)
37 | 
38 | ### その他
39 | 
40 | * [ChainerのTrainerについて](/doc/chainer_trainer.md)
41 | 
42 | ### 参考資料
43 | 
44 | * [Neural Networks and Deep Learning](http://neuralnetworksanddeeplearning.com/) ([日本語訳](http://nnadl-ja.github.io/nnadl_site_ja/index.html))
45 | * Convolutional Neural Network
46 |     * [How do Convolutional Neural Networks work?](http://brohrer.github.io/how_convolutional_neural_networks_work.html)
47 |     * [Convolution arithmetic](https://github.com/vdumoulin/conv_arithmetic)
48 |     * [Convolutional Neural Networks (CNNs): An Illustrated Explanation](http://xrds.acm.org/blog/2016/06/convolutional-neural-networks-cnns-illustrated-explanation/)
49 | * Optimizer
50 |     * [An overview of gradient descent optimization algorithms](http://sebastianruder.com/optimizing-gradient-descent/) [日本語訳](http://postd.cc/optimizing-gradient-descent/)
51 | * Recurrent Neural Network, LSTM
52 |     * [わかるLSTM ～ 最近の動向と共に](http://qiita.com/t_Signull/items/21b82be280b46f467d1b)
53 |     * [Understanding LSTM Networks](http://colah.github.io/posts/2015-08-Understanding-LSTMs/) [日本語訳](http://qiita.com/KojiOhki/items/89cd7b69a8a6239d67ca)
54 | 


--------------------------------------------------------------------------------
/doc/chainer_basic.md:
--------------------------------------------------------------------------------
  1 | # Chainerの基本的な使い方
  2 | 
  3 | Chainerに限らずニューラルネットワークのフレームワークは機能ごとにモジュール化されており、必要な機能を組み合わせて目的に沿ったニューラルネットワークを構築したり、学習を行ったりすることが容易になっている。
  4 | 
  5 | 注意: ここの説明はTrainerを使用していないためChainer公式のMNISTデータセットの学習とは異なる。Trainerを使用しない理由は[ChainerのTrainerについて](chainer_trainer.md)を参照すること。
  6 | 
  7 | ## 学習とは
  8 | 
  9 | ニューラルネットワークのパラメータを最適な値に調整することを指す。
 10 | 
 11 | ## ニューラルネットワークを使った教師あり学習の流れ
 12 | 
 13 | ### データセットを用意する
 14 | 
 15 | 通常以下の3つを用意する。それぞれ入力データと教師データとがある。
 16 | 
 17 | * 学習データ (train data)  
 18 | パラメータ学習に使用するデータ
 19 | * validationデータ (validation data)  
 20 | どのiteration/epochでのモデルが最適であるか比較したり、ハイパーパラメータのどの値が適切かを比較するのに使用するデータ
 21 | * テストデータ (test data)  
 22 | 学習後のモデルを評価するのに使用するデータ
 23 | 
 24 | ### ニューラルネットワークとoptimizerを構築する
 25 | 
 26 | * ニューラルネットワークを構築する  
 27 | ニューラルネットワークの各層はモジュール化されており、モジュールを選択して結合することで容易に構築できるようになっている。必要ならパラメータをファイルから読み込む
 28 | * optimizerを構築する  
 29 | どのoptimizerを使用するか決め、ニューラルネットワークと関連付ける
 30 | 
 31 | ### 学習
 32 | 
 33 | 以下の処理を繰り返して学習を行う。以下の処理1回を1 iteration と呼ぶ。これに対して学習データを一通り使用するまでiterationを繰り返すことを1 epochと呼ぶ
 34 | 
 35 | * 学習データをニューラルネットワークに入力する
 36 | * ニューラルネットワークの出力と正解データから損失を計算する
 37 | * 損失から各パラメータの勾配を求める
 38 | * 勾配を基にパラメータを更新する
 39 | 
 40 | ### 学習後のパラメータをファイルに保存する
 41 | 
 42 | 学習後のパラメータを保存しておいて予測時に使用する
 43 | 
 44 | ## 教師あり学習の詳細
 45 | 
 46 | ### データセットを用意する
 47 | 
 48 | データセットごとに異なる。以下にMNISTデータの場合について述べる。
 49 | 
 50 | #### 読み込み
 51 | 
 52 | Chainerでは`chainer.datasets.get_mnist`を使うことでMNISTデータセットを取得することができる。
 53 | 内部ではデータセットのダウンロードを行っているので初回呼び出し時にはインターネット環境に接続している必要がある。
 54 | 
 55 | 以下のようにするとtrain_dataに学習データが、test_dataにテストデータが格納される。
 56 | 
 57 | ```
 58 | > import chainer
 59 | > train_data, test_data = chainer.datasets.get_mnist()
 60 | ```
 61 | 
 62 | 学習データとvalidationデータを分離する場合には`chainer.datasets.split_dataset_random`を使う。
 63 | 
 64 | ```
 65 | > train_data, valid_data = chainer.datasets.split_dataset_random(train_data, len(train_data) - 5000)
 66 | ```
 67 | 
 68 | #### データの取得
 69 | 
 70 | `Iterator`を使うことで学習データ、テストデータを順に取得することができる。
 71 | `Iterator`については[Chainer Referance ManualのIterator examples](http://docs.chainer.org/en/stable/reference/iterators.html)を参照のこと。
 72 | 
 73 | ```
 74 | > batch_size = 100
 75 | > train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
 76 | > batch = train_iterator.next()
 77 | ```
 78 | 
 79 | `batch`はlistで、各要素は1枚の画像を表すnumpy配列と正解ラベルを表す整数とのtupleである。
 80 | これを複数画像のnumpy配列と正解ラベルのnumpy配列に変換するには`chainer.dataset.convert.concat_examples`を使う。
 81 | 
 82 | ```
 83 | > from chainer.dataset import convert
 84 | > x, t = convert.concat_examples(batch)
 85 | ```
 86 | 
 87 | ### ニューラルネットワークを構築する
 88 | 
 89 | 以下にFully Connected Layer 3層からならニューラルネットワークの例を挙げる。
 90 | 
 91 | ```
 92 | class MLP(chainer.Chain):
 93 | 
 94 |     def __init__(self, n_in, n_out, n_hidden):
 95 |         # Networkの持つLinkを定義する
 96 |         # superクラスの__init__を使う方法と
 97 |         # add_link()メソッドで追加する方法とがある
 98 |         # Chainer1.12からL.Linear()の第1引数はNoneでよく、
 99 |         # その場合入力サイズは実際のデータから自動的に決まる
100 |         super(MLP, self).__init__(
101 |             l1=L.Linear(n_in, n_hidden),
102 |             l2=L.Linear(n_hidden, n_hidden),
103 |             l3=L.Linear(n_hidden, n_out),
104 |         )
105 | 
106 |     def __call__(self, x, train=True):
107 |         # Linkはcallableとなっており、関数として呼び出すとLinkの処理
108 |         # L.LinearはWx + bを計算する(W, bはL.Linearが持つ重み行列とバイアス項)
109 |         h = self.l1(x)
110 |         # Dropoutを実行する
111 |         # Dropoutは学習時と予測時とで挙動が異なるので、trainでどちらであるかを指定する必要がある
112 |         h = F.dropout(h, 0.5, train=train)
113 |         # ReLUを実行する
114 |         h = F.relu(h)
115 |         h = self.l2(h)
116 |         h = F.dropout(h, 0.5, train=train)
117 |         h = F.relu(h)
118 |         h = self.l3(h)
119 |         return h
120 | ```
121 | 
122 | クラスの定義は以下のようになる。
123 | 
124 | * `chainer.Chain`を継承したクラスを定義する
125 | * `__init__`で必要な`link`を追加する。`link`はパラメータつきの層として機能し、ここで追加した`link`の持つパラメータが学習対象のパラメータとなる。
126 | * `__call__`で各層をどう結合するかを定義する。
127 | 
128 | `__init__`と`__call__`の引数は自由に決めることができる。例えば`__init__`の引数に中間層の大きさを渡したり、`__call__`が複数の入力データを受け付けるようにすることができる。
129 | 
130 | 以下のようにニューラルネットワークのインスタンスを生成する。
131 | 
132 | ```
133 | net = MLP(28 * 28, 10, 100)
134 | ```
135 | 
136 | ### optimizerを構築する
137 | 
138 | optimizerの構築は、optimizerのインスタンスを生成して`setup`メソッドでニューラルネットワークと関連付けるだけである。
139 | `chainer.optimizers`以下に複数の最適化アルゴリズムのモジュールがあり、使用したいものを選ぶことができる。
140 | 
141 | ```
142 | from chainer import optimizers
143 | 
144 | optimizer = optimizers.Adam()
145 | optimizer.setup(net)
146 | ```
147 | 
148 | ### 学習を行う
149 | 
150 | #### 学習データをニューラルネットワークに入力する
151 | 
152 | ニューラルネットワークに入力するには
153 | * 入力のnumpy配列(またはcupy配列)から`chainer.Variable`インスタンスを生成する。(省略可)
154 | * ニューラルネットワークインスタンスを関数として呼び出す。このとき`chainer.Variable`インスタンスをの引数とする。
155 | 
156 | ```
157 | y = net(chainer.Variable(x), train=True)
158 | ```
159 | 
160 | Chainer 1.10から以下のように`chainer.Variable`インスタンス生成を省略できるようになった。numpy配列を直接渡した場合には内部で`Variable`インスタンスを生成する。
161 | 
162 | ```
163 | y = net(x, train=True)
164 | ```
165 | 
166 | #### ニューラルネットワークの出力と正解データから損失を計算する
167 | 
168 | ここでは画像分類でよく使われる損失関数(loss function, cost function)を計算する。
169 | 
170 | ```
171 | from chainer import functions as F
172 | 
173 | loss = F.softmax_cross_entropy(y, chainer.Variable(t))
174 | ```
175 | 
176 | 以下のように`chainer.Variable`の省略が可能である。
177 | 
178 | ```
179 | loss = F.softmax_cross_entropy(y, t)
180 | ```
181 | 
182 | #### 損失から各パラメータの勾配を求める
183 | 
184 | 以下のように`cleargrads`メソッドでパラメータの勾配を初期化し、`backward`メソッドで勾配を計算する。
185 | 
186 | ```
187 | net.cleargrads()
188 | loss.backward()
189 | ```
190 | 
191 | Chainer 1.14までは勾配初期化に`zerograds`メソッドを使用していたが1.15からdeprecatedとなった。
192 | 
193 | #### 勾配を基にパラメータを更新する
194 | 
195 | optimizerの`update`メソッドを呼ぶだけである。
196 | 
197 | ```
198 | optimizer.update()
199 | ```
200 | 
201 | ### 学習後のパラメータをファイルに保存する
202 | 
203 | パラメータをファイルに保存するには`chainer.serializers`を使う。
204 | ここでは`save_npz`メソッドを使って保存する例を挙げる。
205 | 
206 | ```
207 | from chainer import serializers
208 | 
209 | serializers.save_npz('mnist.model', net)
210 | ```
211 | 
212 | `save_npz`メソッドで保存したファイルは`load_npz`メソッドで読み込むことができる。
213 | 
214 | ```
215 | serializers.load_npz('mnist.model', net)
216 | ```
217 | 
218 | `serializers`には他にも保存・読み込みを行うためのメソッドがある。
219 | 詳細は[Chainer Reference ManualのSerializersの項](http://docs.chainer.org/en/stable/reference/serializers.html)を参照すること。
220 | 
221 | ## ニューラルネットワークを使った予測
222 | 
223 | ### ニューラルネットワークを構築する
224 | 
225 | 学習時と同様にニューラルネットワークのインスタンスを生成した後、ファイルからパラメータを読み込む。
226 | 
227 | ```
228 | from chainer import serializers
229 | 
230 | net = MLP(28 * 28, 10, 100)
231 | serializers.load_npz('mnist.model', net)
232 | ```
233 | 
234 | ### 入力データを生成する
235 | 
236 | 画像を読み込み、numpy配列(またはcupy配列)に変換する。ここではPillowを使って画像を読み込んでいる。注意する点として、学習データの値の範囲が0～1で、背景が黒(0)なので、予測時の入力データも同様になるように変換する必要がある。
237 | 
238 | 複数の画像を一度に入力することも可能である。その場合入力となる配列のshapeは(画像枚数, 画像のピクセル数)となる
239 | 
240 | ```
241 | import numpy as np
242 | from PIL import Image
243 | 
244 | image = Image.open('sample.png').convert('L').reseize((28, 28), Image.BILINEAR)
245 | image = 1 - np.asarray(image).astype(np.float32) / 255
246 | image = image.reshape((1, -1))
247 | ```
248 | 
249 | ### ニューラルネットワークに入力する
250 | 
251 | ニューラルネットワークインスタンスに入力となる配列を渡す。今回使用する`MLP`クラスは、引数として学習時かどうかを判別する`train`をとるが、今は予測時なので`False`を渡す。
252 | 
253 | `chainer.Variable`の`volatile=True`は出力が呼び出された`function`への参照を持たないことを意味する。`volatile`を有効にすると`function`の呼び出しを逆順にたどれなくなるのでバックプロパゲーションができなくなるが、消費メモリ量は減る。
254 | 
255 | ```
256 | y = net(chainer.Variable(image, volatile=True), train=False)
257 | ```
258 | 
259 | 以下のように`chainer.Variable`を省略することが可能だが、無駄なメモリ消費を抑えるために`Variable`を使用して`volatile`を有効にしたほうが良い。
260 | 
261 | ```
262 | y = net(image, train=False)
263 | ```
264 | 
265 | ### Softmax値を求める
266 | 
267 | 出力結果としてどのラベルが選ばれたかを知るためにはニューラルネットワークのどの出力が最大であるかわかればよいが、出力の意味をわかりやすくするためにSoftmax値を求める。
268 | Softmax値を求めるには`chainer.functions.softmax`を使用する。
269 | 出力`Variable`で、`data`プロパティを参照することでnumpy配列(またはcupy配列)が得られる。
270 | 
271 | ```
272 | from chainer import functions as F
273 | 
274 | result = F.softmax(y).data
275 | ```
276 | 


--------------------------------------------------------------------------------
/doc/chainer_caffemodel.md:
--------------------------------------------------------------------------------
 1 | # Caffe modelの使用
 2 | 
 3 | ## Caffe modelとは
 4 | Deep Learningフレームワークである[Caffe](http://caffe.berkeleyvision.org/)を使って学習したモデルである。多数のモデルファイルが公開されており、研究用途で使われることが多い。画像分類に使われるモデルをそのまま使用したり、画像の特徴量を抽出して別の用途(例えば物体検出、キャプション生成など)に使用することができる。
 5 | 
 6 | ## Caffe modelの属性の調べ方
 7 | 
 8 | Caffe modelはprototextファイルに記述されている。
 9 | prototxt内のレイヤー定義については[Caffe公式ドキュメントのLayers](http://caffe.berkeleyvision.org/tutorial/layers.html)を参照すること。
10 | 
11 | 例えば[GoogleNet](https://github.com/BVLC/caffe/tree/master/models/bvlc_googlenet)の[train_val.prototxt](https://github.com/BVLC/caffe/blob/master/models/bvlc_googlenet/train_val.prototxt)から次のことがわかる
12 | 
13 | * 入力は`data`レイヤー
14 | * `transform_param`から入力の画像サイズは224、BGRの平均値がそれぞれ104, 117, 123である
15 | * 出力は`loss3/classifier`
16 | 
17 | ## Caffe modelの読み込み
18 | 
19 | caffemodelファイルへのパスを指定して`chainer.functions.caffe.CaffeFunction`のインスタンスを生成する。
20 | 
21 | 例:
22 | 
23 | ```
24 | from chainer.functions import caffe
25 | 
26 | model = caffe.CaffeFunction('bvlc_googlenet.caffemodel')
27 | ```
28 | 
29 | ## Caffe modelの実行
30 | 
31 | 生成したインスタンスを関数として呼び出す
32 | * 入力
33 |     * `inputs`: 入力データを`dict`形式で渡す。`dict`のkeyはモデルに依存する
34 |     * `outputs`: 出力レイヤー名の`list`
35 |     * `disable`: 使用しないレイヤー名の`list`を渡す。出力レイヤーに関与しないレイヤーがある場合に指定する
36 |     * `train`: Trueなら学習モード、それ以外はテストモードになる。back propagationが必要な場合はTrueを指定する
37 | * 出力
38 |     * 指定したレイヤーの出力の`tuple`
39 | 
40 | 例:
41 | 
42 | ```
43 | y, = model(
44 |     inputs={'data': chainer.Variable(x, volatile=True)},
45 |     outputs=['loss3/classifier'],
46 |     disable=['loss1/ave_pool', 'loss2/ave_pool'],
47 |     train=False)
48 | ```
49 | 
50 | * 入力はnumpyもしくはcupy配列である
51 | * 入力のshapeは(mini_batch, color, height, width)であることが多い。
52 | * 入力からは平均画像を引く必要がある。
53 | 平均画像はモデルによって異なる。
54 | * 色空間はRGBではなくBGRであることが多い。
55 | 
56 | ## サンプルプログラム
57 | 
58 | ### ソースディレクトリ
59 | 
60 | (root dir)/src/caffemodel
61 | 
62 | ### 実行方法
63 | 
64 | ```
65 | $ cd src/caffemodel
66 | $ python predict.py image_dir
67 | ```
68 | 
69 | オプション:
70 | * `-m <caffemodel path>` : GoogleNetモデルのファイルパス(default: bvlc_googlenet.caffemodel)
71 | * `-l <label file path>` : ラベルファイルのパス。ソースツリーに含まれるのでデフォルト値で問題ない(default: labels.txt)
72 | * `-g <GPU index>` : 使用するGPUを指定する。-1の場合はCPUを使用(default: -1)
73 | 
74 | ## よく使用されるCaffe model
75 | 
76 | * [Model Zoo](https://github.com/BVLC/caffe/wiki/Model-Zoo)  
77 | 様々なCaffe modelへのリンクがある。
78 | * [Residual Network](https://github.com/KaimingHe/deep-residual-networks)  
79 | ILSVRC 2015 で使用されたResidual Network。
80 | * [VGG 16-layers](https://gist.github.com/ksimonyan/211839e770f7b538e2d8)  
81 | ILSVRC 2014 で使用されたVGG 16-layersモデル。
82 | * [VGG 19-layers](https://gist.github.com/ksimonyan/3785162f95cd2d5fee77)
83 | ILSVRC 2014 で使用されたVGG 19-layersモデル。
84 | * [GoogleNet](https://github.com/BVLC/caffe/tree/master/models/bvlc_googlenet)
85 | ILSVRC 2014 で使用されたGoogleNetモデル。
86 | * [AlexNet](https://github.com/BVLC/caffe/tree/master/models/bvlc_alexnet)  
87 | LSVRC-2010 で使用されたAlexNetモデル。
88 | 


--------------------------------------------------------------------------------
/doc/chainer_convolution.md:
--------------------------------------------------------------------------------
 1 | # Convolutional Neural Network (CNN, 畳み込みネットワーク)
 2 | 
 3 | Convolutional Neural Networkは、Convolutional Layerを持ったNeural Networkを指す。
 4 | Convolutional LayerとReLU等のActivation Functionを組み合わせてNeural Networkを構築することが多い。
 5 | 以降は2次元のConvolutional Layerについて説明する。
 6 | 
 7 | Convolutional Layerの図つきの説明としては以下がわかりやすい。
 8 | * [How do Convolutional Neural Networks work?](http://brohrer.github.io/how_convolutional_neural_networks_work.html)
 9 | * [Convolution arithmetic](https://github.com/vdumoulin/conv_arithmetic)
10 | * [Convolutional Neural Networks (CNNs): An Illustrated Explanation](http://xrds.acm.org/blog/2016/06/convolutional-neural-networks-cnns-illustrated-explanation/)
11 | 
12 | ## Convolutional Layer (畳み込み層)
13 | 
14 | ### 処理の概要
15 | 
16 | * 入力データは`(入力チャンネル数, 高さ, 幅)`の3次元。(ミニバッチも考慮すると`(ミニバッチ数, チャンネル数, 高さ, 幅)`の4次元となる)
17 | * 入力データから`(入力チャンネル数, kh, kw)`の領域を取り出す。ここで`kh`と`kw`はカーネルと呼ばれる領域の高さと幅である。
18 | * 取り出したデータをベクトル化し(1列に並べ)、行列`W`を掛けてバイアス`b`を足す。行列のサイズは`(出力チャンネル数, 入力チャンネル数 * kh * kw)`となる。
19 | * 上記の操作を、取り出す位置を一定間隔ずつずらしながら行う。このとき使用する`W`と`b`は共通である。
20 | * 出力したベクトルを並べる。出力は`(出力チャンネル数, 出力高さ, 出力幅)`となる。出力の高さと幅については後述する。
21 | 
22 | ### Convolutional Layerの特徴
23 | 
24 | * 局所的な特徴を抽出することができる。
25 | * `W`と`b`が共通なので、Fully Connected Layerに比べるとLayerのパラメータサイズが小さくなる。
26 | 
27 | ### Convolutional Layerの属性
28 | 
29 | Convolutional Layerは以下の属性を持つ。
30 | 
31 | * 入力チャンネル数、出力チャンネル数  
32 | 入力と出力の特徴量の大きさを表す。例えばRGB値を持つカラー画像であればチャンネル数は3となる。
33 | * カーネルサイズ  
34 | 畳み込み演算を実行する領域(カーネル)の幅と高さを表す。
35 | * stride  
36 | カーネルを移動する間隔である。
37 | * パディング
38 | 入力データの高さと幅方向の上下に挿入するパディングである。通常はパディングした位置には0を挿入する。
39 | 
40 | 出力データの幅と高さは以下のようになる。
41 | 
42 | ```
43 | # in_width, in_height: 入力高さ
44 | # in_height, out_height: 入力高さ
45 | # pad_w, pad_h: パディング幅、高さ
46 | # kernel_w, kernel_h: カーネル幅、高さ
47 | # stride_w, stride_h: stride幅、高さ
48 | 
49 | out_height = (in_height + 2 * pad_h - kernel_h) / stride_h + 1
50 | out_width = (in_width + 2 * pad_w - kernel_w) / stride_w + 1
51 | ```
52 | 
53 | Convolutional LayerでForward propagationを行うコードは以下のようになる。(カーネルサイズ、stride、パディングは高さ方向と幅方向で同じにしてある。)
54 | 
55 | ```
56 | import numpy as np
57 | import six
58 | 
59 | def convolution_2d(x, w, ksize, stride=1, pad=0):
60 |     batch_size, in_channel, width, height = x.shape
61 |     out_channel = w.shape[0]
62 | 
63 |     if pad > 0:
64 |         h = np.pad(x, ((0, 0), (0, 0), (pad, pad), (pad, pad)), 'constant')
65 |     else:
66 |         h = x
67 | 
68 |     out_height = (height + 2 * pad - ksize) // stride + 1
69 |     out_width = (width + 2 * pad - ksize) // stride + 1
70 |     y = np.zeros((batch_size, out_channel, out_height, out_width), dtype=x.dtype)
71 | 
72 |     for i in six.moves.range(batch_size):
73 |         for j in six.moves.range(out_height):
74 |             for k in six.moves.range(out_width):
75 |                 feature = np.ravel(h[i, :, j * stride:j * stride + ksize, k * stride:k * stride + ksize])
76 |                 y[i, :, j, k] = np.dot(w, feature)
77 | 
78 |     return y
79 | ```
80 | 
81 | ## ChainerでのConvolution Layerの利用
82 | 
83 | `chainer.links.Convolution2D`または`chainer.links.ConvolutionND`を使う。
84 | `Convolution2D`の入力配列のサイズは`(ミニバッチ数, チャンネル数, 高さ, 幅)`の4次元にする必要がある。`ConvolutionND`の入力配列のサイズは`(ミニバッチ数, チャンネル数, 各次元のサイズ)`のN+2次元にする必要がある。
85 | 


--------------------------------------------------------------------------------
/doc/chainer_install.md:
--------------------------------------------------------------------------------
 1 | # Chainerのインストール方法
 2 | 
 3 | ## Linux, MacOS
 4 | 
 5 | [公式のInstall Guide](http://docs.chainer.org/en/stable/install.html) を参照してください
 6 | 
 7 | ## Windows7以降
 8 | 
 9 | 公式にはWindowsをサポートしていませんが動作することを確認しています。
10 | 少し古いですが以下の手順でインストールできると思います。
11 | 
12 | [Windows上にChainer v1.5+CUDA+cuDNNを一番簡単に入れれる方法](http://qiita.com/okuta/items/f985b9da6de33a016a75)
13 | 
14 | `pip install chainer` でインストールできない場合
15 | 
16 | 0. `pip uninstall chainer`で古いChainerを削除
17 | 0. `pip list` でパッケージ一覧を表示してChainerが表示されないことを確認
18 | 0. [Chainerソースコード](https://github.com/pfnet/chainer)のClone & `python setup.py install` でビルドする
19 | 0. 依存パッケージのnumpyのビルドに失敗する場合は`pip install numpy`でnumpyを個別にインストールした後、setup.pyの`    'numpy>=1.9.0',`の1行をコメントアウトして`python setup.py install`を行う
20 | 


--------------------------------------------------------------------------------
/doc/chainer_layer.md:
--------------------------------------------------------------------------------
  1 | # Layerについて
  2 | 
  3 | Deep Learningでは様々なLayerを組み合わせる必要がある。ここでは使用頻度の高いLayerの特徴を説明する。
  4 | 
  5 | 以下のスライドではLayerのパラメータと学習結果との関係について説明している。Layerについて一通り学習した後に読むと良い。
  6 | 
  7 | http://www.slideshare.net/takanoriogata1121/160924-deep-learning-tuningathon
  8 | 
  9 | ## chainer.functionsとchainer.linksの違い。
 10 | 
 11 | `chainer.functions`はパラメータを持たないLayerを集めたパッケージ、`chainer.links`はパラメータを持つLayerを集めたパッケージである。`chainer.links`にある機能のパラメータを学習対象とせずに自分で指定することもできる。その場合は`chainer.functions`にある同等の機能を使用する。例えば`chainer.links.Liner`のパラメータを指定したい場合には`chainer.functions.linear`を使う。
 12 | 
 13 | ## Connection
 14 | 
 15 | 線形変換を行うLayerについて説明する。
 16 | 
 17 | ### Fully Connected (links.Linear)
 18 | 
 19 | 入力と出力が全結合したLayerである。
 20 | パラメータとして行列`W`とバイアスベクトル`b`を持ち、入力`x`に対して`Wx + b`が出力となる。
 21 | 
 22 | ### 2 Dimension Convolution (links.Convolution2D)
 23 | 
 24 | 2次元Convolutional Layerである。
 25 | 
 26 | ### N Dimension Convolution (links.ConvolutionND)
 27 | 
 28 | N次元Convolutional Layerである。
 29 | 
 30 | ### Embed ID (links.EmbedID)
 31 | 
 32 | 整数値をベクトルに変換するのに使用する。例えば自然言語処理で単語IDをベクトルに変換する時に使用する。IDとベクトル値との対応が学習対象となる。
 33 | 
 34 | ## Activation Function
 35 | 
 36 | Activation Function (活性化関数)となるLayerについて説明する。
 37 | 
 38 | ### Rectified Linear Unit (ReLU) (functions.relu)
 39 | 
 40 | Rectified Linear Unit (ReLU)を実行する。`x`を入力として`max(0, x)`で定義される。
 41 | 
 42 | ### Tanh (functions.tanh)
 43 | 
 44 | tanhを実行する。形状はSigmoidと同じだが、出力の範囲が-1～1である点が異なる。使用頻度はReLUに比べると少ないが、出力の範囲を限定したい場合に使われる。
 45 | 
 46 | ### Sigmoid (functions.sigmoid)
 47 | 
 48 | Sigmoid関数を実行する。最近はあまり使われない。
 49 | 
 50 | ### Softmax (functions.softmax)
 51 | 
 52 | Softmax値を計算する。
 53 | 
 54 | ### Log Softmax (functions.log_softmax)
 55 | 
 56 | Softmax値のlogをとる。単純に`functions.softmax`の出力のlogをとるのでは、`softmax`の出力が0に丸められることがあるので支障がある。softmaxのlogをとる場合は`log_softmax`を使う方が安全である。
 57 | 
 58 | ### Exponential Linear Unit (ELU) (functions.elu)
 59 | 
 60 | ReLUは入力が負の時に勾配が消失するというデメリットがあり、それを改善するために導入された。出力は入力を`x`、`alpha`を定数として`x >= 0`のとき`x`、`x < 0`のとき`alpha * (exp(x) - 1)`となる。使用頻度は低い。
 61 | 
 62 | ### Leaky ReLU (functions.leaky_relu)
 63 | 
 64 | ELUと同じくReLUの勾配消失問題を改善するために導入された。`x`を入力、`alpha`を0～1の範囲にある定数として、出力は`max(alpha * x, x)`となる。使用頻度は低い。
 65 | 
 66 | ## Normalization
 67 | 
 68 | データにノイズを加えるLayerについて説明する。ノイズを加えることで過学習を防ぎ汎化性を高めることができる。
 69 | 
 70 | ### Dropout (functions.dropout)
 71 | 
 72 | 学習時に出力の一部を確率的に0にする。0にしない箇所の出力は確率に応じて大きくする。0にする確率を`p`とする場合、出力は`1 / (1 - p)`倍にする。予測時には出力の操作は行わない。
 73 | 使用頻度は高く、Fully Connected LayerまたはConvolutional Layerの直後で使われることが多い。
 74 | 
 75 | ## Normalization
 76 | 
 77 | Normalizationを行うことで学習速度や精度を向上させることができる。
 78 | 
 79 | ### Batch Normalization (links.BatchNormalization)
 80 | 
 81 | 出力をミニバッチ内で平均0, 分散1にすることでその後のLayerの学習を行いやすくする。
 82 | Convolutional Layerの直後に挿入することが多い。
 83 | 使用頻度は非常に高く、CNNを使う場合導入はほぼ必須といえる。
 84 | 
 85 | ## Spatial Pooling
 86 | 
 87 | 2 x 2, 3 x 3など、空間的に近い領域の出力の平均または最大値をとって出力の数を減らすことをPoolingと呼ぶ。
 88 | 
 89 | ### Max Pooling (functions.max_pooling_2d)
 90 | 
 91 | 局所領域の最大値を出力する。画像認識では、複数のCNNの後にMax Poolingを挿入するというのを何回か繰り返すことが多い。
 92 | 
 93 | ### Average Pooling (functions.average_pooling_2d)
 94 | 
 95 | 局所領域の平均値を出力する。画像認識では最終出力の数段前にAverage Poolingを挿入して、幅・高さ方向の全平均をとる手法がある。
 96 | 
 97 | ## 損失関数
 98 | 
 99 | 使用頻度の高い損失関数が用意されている。
100 | 
101 | ### Softmax Cross Entropy (functions.softmax_cross_entropy)
102 | 
103 | Softmax Cross Entropyを計算する。多値分類でよく使われる。出力は求めたSoftmax Cross Entropyの平均となる。
104 | 
105 | ### 平均二乗誤差 (functions.mean_squared_error)
106 | 
107 | 平均二乗誤差を計算する。回帰でよく使われる。
108 | 
109 | ## 配列操作
110 | 
111 | numpyに備わっている配列操作の一部を`chainer.functions`でも行うことができる。
112 | 
113 | ### 連結 (functions.concat)
114 | 
115 | 2つの配列を連結する。
116 | 
117 | ### reshape (functions.reshape)
118 | 
119 | 配列のreshapeを行う。
120 | 
121 | ### 行列積 (functions.matmul)
122 | 
123 | 2つの行列の積を求める。
124 | 
125 | ### バッチごとの行列積 (functions.matmul)
126 | 
127 | バッチごとに2つの行列の積を求める。配列`a`と配列`b`とがあるとき、`c = functions.batch_matmul(a, b)`とすると、i = 0, 1, 2, ...について`c[i]`は`a[i]`と`b[i]`の積となる。
128 | 
129 | ## 数値計算
130 | 
131 | ### 演算子
132 | 
133 | numpy配列の計算のように`a + b - c`、`a ** b`といった記述で演算子を使用できる。以下の演算子を使用できる。
134 | 
135 | * 正負の反転: `-a`
136 | * 加算: `a + b`
137 | * 減算: `a - b`
138 | * 乗算: `a * b`
139 | * 除算: `a / b`
140 | * べき乗: `a ** b`
141 | * 絶対値: `abs(a)`
142 | * 行列積(Python 3.5以降): `a @ b`  
143 | `chainer.functions.matmul`でも実行可能なのであえて互換性のない`@`を使用する必要はない。
144 | 
145 | ### 合計 (functions.sum)
146 | 
147 | 合計値を計算する。特定の軸に沿った合計値を出力することもできる。
148 | 


--------------------------------------------------------------------------------
/doc/chainer_optimizer.md:
--------------------------------------------------------------------------------
 1 | # 最適化アルゴリズム
 2 | 
 3 | ## 最急降下法と確率的勾配法
 4 | 
 5 | * [最急降下法(Steepest descent method)(Wikipedia)](https://ja.wikipedia.org/wiki/%E6%9C%80%E6%80%A5%E9%99%8D%E4%B8%8B%E6%B3%95)
 6 | * [確率的勾配法(Stochastic Gradient Descent, SGD)(Wikipedia)](https://ja.wikipedia.org/wiki/%E7%A2%BA%E7%8E%87%E7%9A%84%E5%8B%BE%E9%85%8D%E9%99%8D%E4%B8%8B%E6%B3%95)
 7 | 
 8 | 最急降下法と確率的勾配法の違い
 9 | * 最急降下法
10 |     * 学習データを全て使って勾配を求め、パラメータ更新を行う。
11 | * 確率的勾配法
12 |     * 学習データの一部をランダムに選択し、選択したデータを使って勾配を求める。選択した一部のデータをミニバッチと呼ぶ。
13 |     epoch毎に学習データをランダムに並び替えて一定数ずつ取り出す手法がよく使われる。
14 |     * 学習データの一部を使うので、最急降下法に比べiteration毎の計算量が少ない。
15 |     * 最急降下法は局所最適解に陥りやすいが、確率的勾配法はより適切な解を発見しやすい。
16 | 
17 | ## モーメントつきSGD
18 | 
19 | パラメータの変更量に前回のパラメータの変更量の一定割合を加える手法をモーメント付きSGD(Momentum SGD)と呼ぶ。モーメントつきにすることで収束を早めることができる。
20 | 
21 | ## 学習率の決め方
22 | 
23 | 学習率を決めるときは、最初に大きめの値を試み、損失関数が発散するようなら小さくしていく、という方法をとると良い。
24 | 
25 | ## 学習率のスケジューリング
26 | 
27 | SGDでは学習率を大きくすると収束が早くなるが、損失関数の変動が不安定になる。この問題を解決するために、損失関数がある程度収束したら学習率を小さくする手法がある。一定のepochまたはiteration毎に学習率を0.1倍したり、全体epoch数の50%と75%で学習率を0.1倍する手法などがある。
28 | 
29 | ## SGDの改良アルゴリズム
30 | 
31 | [An overview of gradient descent optimization algorithms](http://sebastianruder.com/optimizing-gradient-descent/) [日本語訳](http://postd.cc/optimizing-gradient-descent/)
32 | 
33 | ## どのアルゴリズムを使うのがよいか？
34 | 
35 | アルゴリズムがいくつもあって迷うが、最初はAdamを使ってみるのが良いと思う。
36 | 
37 | SGDの改良アルゴリズムがいくつか提案されているが、(モーメントつき)SGDも廃れているわけではなくよく使われている。
38 | 


--------------------------------------------------------------------------------
/doc/chainer_trainer.md:
--------------------------------------------------------------------------------
 1 | # ChainerのTrainerについて
 2 | 
 3 | Chainerは1.11.0からTrainerを使って学習ループを抽象化しているが、以下のような問題があるので、ここでは使用しない。
 4 | 
 5 | ## 機能が不足している
 6 | 
 7 | 以下のようにTrainerの機能が不足している。
 8 | TriggerやExtensionの実装を行えば対応可能だが、Trainerを使わずに自前でループを実装したほうが実装コストが低くなり、Trainerを使うメリットがなくなる。
 9 | 
10 | ### 学習率のスケジューリングが困難
11 | 
12 | ~~Deep Learningの学習ではよく「100epochごとに学習率を0.1倍する」などといった学習率のスケジューリングを行うのだが、
13 | [ExponentialShift](http://docs.chainer.org/en/stable/reference/extensions.html#exponentialshift)、[LinearShift](http://docs.chainer.org/en/stable/reference/extensions.html#linearshift)といったクラスだと1iterationごとにしか学習率を変更できないので不便である。より柔軟なスケジューリングを行うためにはこれらのクラスをTriggerで制御することが必要となる。~~
14 | 
15 | 2016/9/24追記: `trainer.extend`で`trigger`を指定できるのでExponentialShift, LinearShiftによる学習率変化はTriggerオブジェクトで制御可能である。
16 | 
17 | また「100, 150, 200epochで学習率を0.1倍する」、「しばらくvalidation dataの精度が上がらなかったら学習率を下げる」などのように複雑な条件もあるので新たなTriggerの実装も必要となる。
18 | 
19 | ### Extensionからmetricsを簡単に取得できない
20 | 
21 | Trainerでは1epochごとのloss、accuracyなどのmetricsの計算を[LogReport](http://docs.chainer.org/en/stable/reference/extensions.html#logreport)が行っているが、他のExtensionから参照できないようになっている。
22 | Extensionで1epochごとのaccuracyが必要になる場合には、Extension内でLogReportを持ったり、LogReportと同じ処理をする必要があり、LogReportが重複する構造になってしまう。
23 | 
24 | ### metricsのファイル保存、グラフ化
25 | 
26 | metricsをファイルに保存したり、グラフとして表示したりしたい。
27 | 
28 | ## Classifierという謎の役割を持ったクラスがある
29 | 
30 | Chainerのmnist、ptbのexampleを見ると`chainer.links.Classifier`というクラスを使っているが、以下のようにどのような役割を持っているのか不明である。
31 | 
32 | * Classifierの要件が不明。Classifierという名前から分類タスクで使用するものと思われるが、他のタスクを行うためにどのような実装をすればよいのかまとまった説明がない。
33 | * `chainer.links`の下にある低レイヤーモジュールなのに、比較的高レイヤーの`chainer.reporter`を参照している。
34 | 


--------------------------------------------------------------------------------
/doc/environment.md:
--------------------------------------------------------------------------------
 1 | # Deep Learning 開発に必要な環境
 2 | 
 3 | このリポジトリにあるコードを実行するときは、Chainerをインストールできる環境であればOS等の環境は自由に選んでかまいません。
 4 | 
 5 | 以下は本格的にDeep Learningを学びたい人向けの環境です。
 6 | 
 7 | ## OS
 8 | 
 9 | * Ubuntuがおすすめ。
10 | ほとんどのフレームワークはUbuntuをサポートしている。
11 | * Macはハードウェアに制限がかかるのでおすすめしない。
12 | * Windowsは対応していないフレームワークがあるので、様々なフレームワークを試したい人にはおすすめしない。  
13 | * Windowsで動作を(MNISTデータセットを学習できるレベルで)確認したフレームワーク:
14 |     * [Chainer](http://chainer.org/)
15 |     * [mxnet](http://mxnet.io/)
16 |     * [Theano](http://deeplearning.net/software/theano/index.html)
17 |     * [Keras(バックグラウンドとしてTheano使用)](https://keras.io/ja/)
18 | * Windowsで動作するはずのフレームワーク
19 |     * [CNTK](https://github.com/Microsoft/CNTK)
20 |     * [Caffe](http://caffe.berkeleyvision.org/)  
21 |     [Windows版Caffe](https://github.com/niuzhiheng/caffe)があるが古い
22 | 
23 | ## NVIDIA製チップセットを搭載したグラフィックボード(Highly recommended)
24 | 
25 | ニューラルネットワークはGPUを使うことで数倍から数十倍高速に動作する。
26 | GPUのチップセットメーカーは複数あるが、ほとんどのフレームワークはCUDAにしか対応していないのでNVIDIAしか選択肢がない状態である。
27 | 


--------------------------------------------------------------------------------
/doc/exercise_1.md:
--------------------------------------------------------------------------------
 1 | # 演習1
 2 | 
 3 | 学習データを元にして領域の分割を行う。
 4 | 学習データとして点の座標とその点が所属するカテゴリが複数与えられている。
 5 | 
 6 | ## 実行方法
 7 | 
 8 | 0. `/src/ex1/train.py`のTODO部分を実装する
 9 | 0. 以下のコマンドで`/src/ex1/train.py`を実行する
10 | ```
11 | cd src/ex1
12 | python train.py
13 | ```
14 | 
15 | `/src/ex1_2`ディレクトリにも同様のコードがある。こちらのほうが難易度が高い。
16 | 
17 | ## 実行結果
18 | 
19 | 正しく実装されていれば、epoch毎の学習データのloss/accuracy、テストデータのloss, accuracyを出力する。
20 | 
21 | また`/src/ex1/image`ディレクトリに`result_train.png`と`result_test.png`を生成する。
22 | `result_train.png`は学習データの点をプロットした画像で、`result_test.png`はテストデータの点をプロットした画像である。
23 | 
24 | 画像の点と領域は以下の意味を持つ。
25 | 
26 | * 赤い点: カテゴリ0に所属する点
27 | * 青い点: カテゴリ1に所属する点
28 | * 赤い領域: 学習済みのモデルがカテゴリ0に所属すると判断した領域
29 | * 青い領域: 学習済みのモデルがカテゴリ1に所属すると判断した領域
30 | 
31 | ## 学習データ
32 | 
33 | 学習データは`data_train.txt`、テストデータは`data_test.txt`に保存されている。
34 | 以下のように各行に"x座標"、"y座標"、"所属するカテゴリ"が記録されている。
35 | 
36 | ```
37 | 7.95, 2.18, 0
38 | 2.07, -5.19, 1
39 | -2.65, 6.56, 1
40 | 3.92, -4.12, 1
41 | ```
42 | 


--------------------------------------------------------------------------------
/doc/mnist.md:
--------------------------------------------------------------------------------
 1 | # 手書き数字の認識
 2 | 
 3 | ## MNISTデータセット
 4 | 
 5 | http://yann.lecun.com/exdb/mnist/
 6 | 
 7 | * 手書き数字の画像データセット
 8 | * 28 x 28px の白黒画像
 9 | * 学習データ60,000枚、テストデータ10,000枚
10 | 
11 | ## 実行方法
12 | 
13 | ### ソースディレクトリ
14 | 
15 | (root dir)/src/mnist
16 | 
17 | ### 学習
18 | 
19 | ```
20 | $ cd src/mnist
21 | $ python train.py
22 | ```
23 | 
24 | オプション:
25 | * `-m <model name>` : ニューラルネットワークのモデルを指定する(default: mlp)
26 |     * mlp: Fully Connected Layerのみ
27 |     * cnn: Convolutional Neural Networkを使用
28 | * `-g <GPU index>` : 使用するGPUを指定する。-1の場合はCPUを使用(default: -1)
29 | * `-e <epoch>` : epoch数を指定する(default: 100)
30 | * `-b <batch size>` : ミニバッチ数を指定する
31 | * `-p <prefix>` : モデルパラメータ保存時のファイル名のprefixを指定する
32 | 
33 | ### 予測
34 | 
35 | ```
36 | $ python predict.py mnist.model sample.png
37 | ```
38 | 
39 | パラメータ:
40 | * 学習済みモデルパラメータファイル
41 | * 画像ファイルパス。画像は28 x 28px にリサイズされる
42 | 
43 | オプション:
44 | * `-m <model name>` : ニューラルネットワークのモデルを指定する。学習時のモデルと一緒にすること(default: mlp)
45 | * `-g <GPU index>` : 使用するGPUを指定する。-1の場合はCPUを使用(default: -1)
46 | 


--------------------------------------------------------------------------------
/doc/numpy_1.md:
--------------------------------------------------------------------------------
  1 | # Numpy
  2 | 
  3 | [公式ドキュメント](http://www.numpy.org/)
  4 | 
  5 | ## Numpyとは
  6 | 
  7 | Pythonの数値計算用パッケージで、以下の機能をもつ。
  8 | 
  9 | * 多次元配列操作
 10 | * 線形代数
 11 | 
 12 | 内部はC/C++とFortranで実装されており、高速に動作する。また配列の中身とインデックスとを別々に管理しており、効率的なメモリ管理を行っている。
 13 | 
 14 | Numpyを効率よく使用する方法については以下が参考になる。
 15 | 
 16 | http://ipython-books.github.io/featured-01/
 17 | 
 18 | ## 使い方
 19 | 
 20 | まずnumpyをimportする。`np`をエイリアスとすることが多い。
 21 | 
 22 | ```
 23 | >>> import numpy as np
 24 | ```
 25 | 
 26 | ### numpy.xxxとnumpy.ndarray.xxxについて
 27 | 
 28 | Numpyのドキュメントを読むと`numpy.xxx`と`numpy.ndarray.xxx`とがあることがわかる。
 29 | 以降では`numpy.xxx`について説明するが、通常同じ機能を持つメソッドがnumpy配列のインスタンスメソッド`numpy.ndarray.xxx`として定義されている。
 30 | 
 31 | ```
 32 | >>> x = np.array([1,2,3], np.float32)
 33 | >>> x
 34 | array([ 1.,  2.,  3.], dtype=float32)
 35 | >>> np.sum(x)
 36 | 6.0
 37 | >>> x.sum()
 38 | 6.0
 39 | ```
 40 | 
 41 | ## 配列の生成
 42 | 
 43 | ### numpy.array
 44 | 
 45 | listまたはnumpy配列を元に配列を生成する。
 46 | 
 47 | ```
 48 | >>> x = np.array([[1,2],[3,4]], dtype=np.float32)
 49 | >>> x
 50 | array([[ 1.,  2.],
 51 |        [ 3.,  4.]], dtype=float32)
 52 | ```
 53 | 
 54 | `dtype`で要素の型を指定する。Chainerを使用する時には通常整数は`np.int32`、浮動小数は`np.float32`を指定する。
 55 | 
 56 | `dtype`を指定しない場合は、元のnumpy配列またはlistの型になる。
 57 | 
 58 | ### numpy.asarray
 59 | 
 60 | `numpy.asarray`とほとんど同じだが、生成する配列が入力した配列と同じ場合にはコピーを生成しない。
 61 | 
 62 | ```
 63 | >>> x = np.array([1,2,3], dtype=np.float32)
 64 | >>> y = np.asarray(x)
 65 | >>> x is y
 66 | True
 67 | ```
 68 | 
 69 | ### numpy.copy
 70 | 
 71 | 配列のコピーには`numpy.copy`を使う
 72 | 
 73 | ```
 74 | >>> x = np.array([1,2,3], dtype=np.float32)
 75 | >>> x
 76 | array([ 1.,  2.,  3.], dtype=float32)
 77 | >>> y = np.copy(x)
 78 | >>> y
 79 | array([ 1.,  2.,  3.], dtype=float32)
 80 | >>> x is y
 81 | False
 82 | ```
 83 | 
 84 | ### numpy.zeros, numpy.ones, numpy.full
 85 | 
 86 | `numpy.zeros`を使うと要素を0で埋めた配列を生成できる。
 87 | 
 88 | ```
 89 | >>> x = np.zeros((2,3), dtype=np.float32)
 90 | >>> x
 91 | array([[ 0.,  0.,  0.],
 92 |        [ 0.,  0.,  0.]], dtype=float32)
 93 | ```
 94 | 
 95 | `numpy.ones`を使うと要素を1で埋めた配列を生成できる。
 96 | 
 97 | ```
 98 | >>> x = np.ones((2,3), dtype=np.float32)
 99 | >>> x
100 | array([[ 1.,  1.,  1.],
101 |        [ 1.,  1.,  1.]], dtype=float32)
102 | ```
103 | 
104 | `numpy.full`を使うと要素を指定した値で埋めた配列を生成できる。
105 | 
106 | ```
107 | >>> x = np.full((2, 3), 10, dtype=np.float32)
108 | >>> x
109 | array([[ 10.,  10.,  10.],
110 |        [ 10.,  10.,  10.]], dtype=float32)
111 | ```
112 | 
113 | ### numpy.zeros_like, numpy.ones_like, numpy.full_like
114 | 
115 | 別の配列と同じ形状で中身を0, 1, または指定した値の配列を生成できる。
116 | 
117 | ```
118 | >>> a = np.asarray([[1,2],[3,4]], dtype=np.float32)
119 | >>> a
120 | array([[ 1.,  2.],
121 |        [ 3.,  4.]], dtype=float32)
122 | >>> x = np.zeros_like(a)
123 | >>> x
124 | array([[ 0.,  0.],
125 |        [ 0.,  0.]], dtype=float32)
126 | >>> x = np.ones_like(a)
127 | >>> x
128 | array([[ 1.,  1.],
129 |        [ 1.,  1.]], dtype=float32)
130 | >>> x = np.full_like(a, 10)
131 | >>> x
132 | array([[ 10.,  10.],
133 |        [ 10.,  10.]], dtype=float32)
134 | ```
135 | 
136 | ### numpy.arange
137 | 
138 | 等差数列を生成する。開始、終了、間隔を指定することができる。開始位置は省略すると0、間隔は省略すると1になる。
139 | 
140 | ```
141 | >>> x = np.arange(5, dtype=np.int32)
142 | >>> x
143 | array([0, 1, 2, 3, 4])
144 | >>>
145 | >>> x = np.arange(3, 9, 2, dtype=np.float32)
146 | >>> x
147 | array([ 3.,  5.,  7.], dtype=float32)
148 | ```
149 | 


--------------------------------------------------------------------------------
/doc/numpy_2.md:
--------------------------------------------------------------------------------
  1 | # Numpy
  2 | 
  3 | ## 要素へのアクセス
  4 | 
  5 | ### Indexing
  6 | 
  7 | 配列の要素には`[]`を使ってアクセスできる。
  8 | 
  9 | ```
 10 | >>> x = np.arange(0, 20, 2, dtype=np.int32)
 11 | >>> x
 12 | array([ 0,  2,  4,  6,  8, 10, 12, 14, 16, 18])
 13 | >>>
 14 | >>> x[3]
 15 | 6
 16 | ```
 17 | 
 18 | インデックスを負にすると(要素数 + 負のインデックス値)にアクセスする。
 19 | 
 20 | ```
 21 | >>> x[-1]
 22 | 18
 23 | >>> x[-3]
 24 | 14
 25 | ```
 26 | 
 27 | listを使って複数の要素にアクセスすることができる。
 28 | 
 29 | ```
 30 | >>> x[[5, 0, 2]]
 31 | array([10,  0,  4])
 32 | ```
 33 | 
 34 | 多次元配列の場合、`[i0, i1, ...], [j0, j1, ...]`を指定すると(i0, j0), (i1, j1), ...を座標とする要素を取得できる。
 35 | 
 36 | ```
 37 | >>> x = np.array([[1, 2, 3], [4, 5, 6]], dtype=np.float32)
 38 | >>> x
 39 | array([[ 1.,  2.,  3.],
 40 |        [ 4.,  5.,  6.]], dtype=float32)
 41 | >>> x[[0, 1], [1, 0]]
 42 | array([ 2.,  4.], dtype=float32)
 43 | ```
 44 | 
 45 | またインデックスのtupleを渡すこともできる。
 46 | 
 47 | ```
 48 | >>> x[[(0, 1), (1, 0)]]
 49 | array([ 2.,  4.], dtype=float32)
 50 | ```
 51 | 
 52 | ### Slicing
 53 | 
 54 | `[(開始位置):(終了位置):(間隔)]`で、指定した範囲の要素にアクセスできる。(間隔)は省略できる。(開始位置)を指定しない場合には0となり、(終了位置)を指定しない場合には末尾となる。
 55 | 
 56 | ```
 57 | >>> x = np.arange(10, dtype=np.int32)
 58 | >>> x
 59 | array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
 60 | >>>
 61 | >>> x[2:6]
 62 | array([2, 3, 4, 5])
 63 | >>> x[2:6:2]
 64 | array([2, 4])
 65 | >>> x[7:]
 66 | array([7, 8, 9])
 67 | >>> x[:3]
 68 | array([0, 1, 2])
 69 | >>> x[:]
 70 | array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
 71 | ```
 72 | 
 73 | (開始位置)、(終了位置)に負の値を指定すると(要素数 + 負のインデックス値)を意味する。
 74 | 
 75 | ```
 76 | >>> x[-2:]
 77 | array([8, 9])
 78 | >>> x[:-4]
 79 | array([0, 1, 2, 3, 4, 5])
 80 | ```
 81 | 
 82 | (間隔)を負の値にするとインデックの大きいほうから小さいほうへの順序になる。
 83 | 
 84 | ```
 85 | >>> x[5:2:-1]
 86 | array([5, 4, 3])
 87 | >>> x[::-1]
 88 | array([9, 8, 7, 6, 5, 4, 3, 2, 1, 0])
 89 | ```
 90 | 
 91 | 以下のように正の値と負の値を混在させることもできるがわかりにくくなるのでやめたほうが良い。
 92 | 
 93 | ```
 94 | >>> x[-7:8]
 95 | array([3, 4, 5, 6, 7])
 96 | ```
 97 | 
 98 | ### 条件を満たす要素へのアクセス
 99 | 
100 | 以下のように比較演算子を使うと条件を満たす位置が`True`、それ以外は`False`となる配列を得られる。
101 | 
102 | ```
103 | >>> x = np.arange(10, dtype=np.int32)
104 | >>> x
105 | array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
106 | >>> x > 5
107 | array([False, False, False, False, False, False,  True,  True,  True,  True], dtype=bool)
108 | ```
109 | 
110 | これを利用して条件を満たす要素にアクセスしたり、条件を満たす箇所だけ変更したりすることができる。
111 | 
112 | ```
113 | >>> x[x > 5]
114 | array([6, 7, 8, 9])
115 | >>> x[x > 5] = 0
116 | >>> x
117 | array([0, 1, 2, 3, 4, 5, 0, 0, 0, 0])
118 | ```
119 | 


--------------------------------------------------------------------------------
/doc/numpy_3.md:
--------------------------------------------------------------------------------
  1 | # Numpy
  2 | 
  3 | ## 配列の変形と連結
  4 | 
  5 | ### shapeとnumpy.reshape
  6 | 
  7 | 配列の形状はshapeプロパティを参照することで得られる
  8 | 
  9 | ```
 10 | >>> x = np.array([[1, 2, 3], [4, 5, 6]], dtype=np.float32)
 11 | >>> x
 12 | array([[ 1.,  2.,  3.],
 13 |        [ 4.,  5.,  6.]], dtype=float32)
 14 | >>> x.shape
 15 | (2L, 3L)
 16 | ```
 17 | 
 18 | `numpy.reshape`を使うとshapeを変更できる。変更後の総要素数が一致しない場合にはエラーとなる。
 19 | 
 20 | ```
 21 | >>> np.reshape(x, (3, 2))
 22 | array([[ 1.,  2.],
 23 |        [ 3.,  4.],
 24 |        [ 5.,  6.]], dtype=float32)
 25 | ```
 26 | 
 27 | shapeの1つに負の値を指定すると自動的に計算される。
 28 | 
 29 | ```
 30 | >>> np.reshape(x, (-1, 2))
 31 | array([[ 1.,  2.],
 32 |        [ 3.,  4.],
 33 |        [ 5.,  6.]], dtype=float32)
 34 | ```
 35 | 
 36 | ### numpy.transpose
 37 | 
 38 | `numpy.transpose`を使うと軸を入れ替えることができる。
 39 | 
 40 | ```
 41 | >>> x = np.arange(12, dtype=np.float32).reshape((2, 2, 3))
 42 | >>> x
 43 | array([[[  0.,   1.,   2.],
 44 |         [  3.,   4.,   5.]],
 45 | 
 46 |        [[  6.,   7.,   8.],
 47 |         [  9.,  10.,  11.]]], dtype=float32)
 48 | >>> x.shape
 49 | (2L, 2L, 3L)
 50 | >>> y = np.transpose(x, (2, 1, 0))
 51 | >>> y
 52 | array([[[  0.,   6.],
 53 |         [  3.,   9.]],
 54 | 
 55 |        [[  1.,   7.],
 56 |         [  4.,  10.]],
 57 | 
 58 |        [[  2.,   8.],
 59 |         [  5.,  11.]]], dtype=float32)
 60 | >>> y.shape
 61 | (3L, 2L, 2L)
 62 | ```
 63 | 
 64 | ### numpy.swapaxes
 65 | 
 66 | `numpy.swapaxes`を使うと2つの軸を入れ替えることができる。
 67 | 
 68 | ```
 69 | >>> x = np.arange(12, dtype=np.float32).reshape((2, 2, 3))
 70 | >>> x.shape
 71 | (2L, 2L, 3L)
 72 | >>> y = np.swapaxes(x, 1, 2)
 73 | >>> y
 74 | array([[[  0.,   3.],
 75 |         [  1.,   4.],
 76 |         [  2.,   5.]],
 77 | 
 78 |        [[  6.,   9.],
 79 |         [  7.,  10.],
 80 |         [  8.,  11.]]], dtype=float32)
 81 | >>> y.shape
 82 | (2L, 3L, 2L)
 83 | ```
 84 | 
 85 | ### numpy.concatenate
 86 | 
 87 | `numpy.concatenate`を使うと複数の配列を特定の軸に沿って連結することができる。
 88 | 
 89 | ```
 90 | >>> x = np.array([[1, 2], [3, 4]], dtype=np.float32)
 91 | >>> y = np.array([[5, 6], [7, 8]], dtype=np.float32)
 92 | >>>
 93 | >>> z = np.concatenate((x, y), axis=1)
 94 | >>>
 95 | >>> x
 96 | array([[ 1.,  2.],
 97 |        [ 3.,  4.]], dtype=float32)
 98 | >>> y
 99 | array([[ 5.,  6.],
100 |        [ 7.,  8.]], dtype=float32)
101 | >>>
102 | >>> z
103 | array([[ 1.,  2.,  5.,  6.],
104 |        [ 3.,  4.,  7.,  8.]], dtype=float32)
105 | ```
106 | 


--------------------------------------------------------------------------------
/doc/numpy_4.md:
--------------------------------------------------------------------------------
  1 | # Numpy
  2 | 
  3 | ## 配列の演算
  4 | 
  5 | ### 要素ごとの演算
  6 | 
  7 | `+`,`-`,`*`,`/`などの演算子を使うと配列とスカラ、または配列の要素同士の演算を行うことができる。
  8 | 
  9 | ```
 10 | 
 11 | >>> x = np.array([1, 2, 3, 4], dtype=np.float32)
 12 | >>> x
 13 | array([ 1.,  2.,  3.,  4.], dtype=float32)
 14 | >>>
 15 | >>> x + 2
 16 | array([ 3.,  4.,  5.,  6.], dtype=float32)
 17 | >>> x - 2
 18 | array([-1.,  0.,  1.,  2.], dtype=float32)
 19 | >>> x * 2
 20 | array([ 2.,  4.,  6.,  8.], dtype=float32)
 21 | >>> x / 2
 22 | array([ 0.5,  1. ,  1.5,  2. ], dtype=float32)
 23 | >>>
 24 | >>> y = np.array([2, 3, 4, 5], dtype=np.float32)
 25 | >>> y
 26 | array([ 2.,  3.,  4.,  5.], dtype=float32)
 27 | >>>
 28 | >>> x + y
 29 | array([ 3.,  5.,  7.,  9.], dtype=float32)
 30 | >>> x - y
 31 | array([-1., -1., -1., -1.], dtype=float32)
 32 | >>> x * y
 33 | array([  2.,   6.,  12.,  20.], dtype=float32)
 34 | >>> x / y
 35 | array([ 0.5       ,  0.66666669,  0.75      ,  0.80000001], dtype=float32)
 36 | ```
 37 | 
 38 | `**`, `numpy.exp`, `numpy.log`, `numpy.sin`なども使える。
 39 | 
 40 | ```
 41 | >>> x ** 2
 42 | array([  1.,   4.,   9.,  16.], dtype=float32)
 43 | >>> np.exp(x)
 44 | array([  2.71828175,   7.38905621,  20.08553696,  54.59814835], dtype=float32)
 45 | >>> np.log(x)
 46 | array([ 0.        ,  0.69314718,  1.09861231,  1.38629436], dtype=float32)
 47 | >>> np.sin(x)
 48 | array([ 0.84147096,  0.90929741,  0.14112   , -0.7568025 ], dtype=float32)
 49 | ```
 50 | 
 51 | ### Broadcasting
 52 | 
 53 | 2つの配列の次元、要素数が異なっている場合でも配列の演算を行うことができる。
 54 | 全ての次元について、以下が成立すればよい。
 55 | 
 56 | * 要素数の大きさが同じ
 57 | * どちらかの要素数が1
 58 | 
 59 | 要素数が少ない側は要素数を多い側と同じになるようにbroadcastする。
 60 | 
 61 | 詳しくは[Numpy ManualのBroadcastingの項](http://docs.scipy.org/doc/numpy/user/basics.broadcasting.html)を参照すること。
 62 | 
 63 | ```
 64 | >>> x = np.zeros((2,2,2), dtype=np.float32)
 65 | >>> y = np.array([[1, 2]], dtype=np.float32)
 66 | >>> x
 67 | array([[[ 0.,  0.],
 68 |         [ 0.,  0.]],
 69 | 
 70 |        [[ 0.,  0.],
 71 |         [ 0.,  0.]]], dtype=float32)
 72 | >>> y
 73 | array([[ 1.,  2.]], dtype=float32)
 74 | >>> y.shape
 75 | (1L, 2L)
 76 | >>> x + y
 77 | array([[[ 1.,  2.],
 78 |         [ 1.,  2.]],
 79 | 
 80 |        [[ 1.,  2.],
 81 |         [ 1.,  2.]]], dtype=float32)
 82 | ```
 83 | 
 84 | ### numpy.dot
 85 | 
 86 | `numpy.dot`を使うとベクトル同士の積、行列とベクトルの積、行列同士の積を計算できる。
 87 | 
 88 | 引数が両方とも1次元の場合は内積を計算する。
 89 | 
 90 | ```
 91 | >>> x = np.asarray([1, 2, 3], dtype=np.float32)
 92 | >>> y = np.asarray([4, 5, 6], dtype=np.float32)
 93 | >>> np.dot(x, y)
 94 | ```
 95 | 
 96 | 一方が行列の場合は行列の乗算となる。
 97 | 
 98 | ```
 99 | >>> x = np.array([[1, 2], [3, 4]], dtype=np.float32)
100 | >>> y = np.array([[2, 3], [4, 5]], dtype=np.float32)
101 | >>> z = np.asarray([10, 20], dtype=np.float32)
102 | >>>
103 | >>> np.dot(x, y)
104 | array([[ 10.,  13.],
105 |        [ 22.,  29.]], dtype=float32)
106 | >>> np.dot(x, z)
107 | array([  50.,  110.], dtype=float32)
108 | ```
109 | 
110 | 配列が3次元以上の場合は、1個目の配列の最後の軸の要素と、2個目の配列の最後から2番目の軸の要素をそれぞれ掛けた値の輪をとる。
111 | 例えば3次元同士の配列を使って`numpy.dot`を呼ぶと戻り値は4次元配列となるが、各要素は以下のようになる。
112 | 
113 | ```
114 | dot(a, b)[i,j,k,m] = sum(a[i,j,:] * b[k,:,m])
115 | ```
116 | 
117 | ```
118 | >>> x = np.arange(8, dtype=np.float32).reshape((2, 2, 2))
119 | >>> x
120 | array([[[ 0.,  1.],
121 |         [ 2.,  3.]],
122 | 
123 |        [[ 4.,  5.],
124 |         [ 6.,  7.]]], dtype=float32)
125 | >>> y = np.arange(12, dtype=np.float32).reshape((3, 2, 2))
126 | >>> y
127 | array([[[  0.,   1.],
128 |         [  2.,   3.]],
129 | 
130 |        [[  4.,   5.],
131 |         [  6.,   7.]],
132 | 
133 |        [[  8.,   9.],
134 |         [ 10.,  11.]]], dtype=float32)
135 | >>>
136 | >>> z = np.dot(x, y)
137 | >>> z
138 | array([[[[   2.,    3.],
139 |          [   6.,    7.],
140 |          [  10.,   11.]],
141 | 
142 |         [[   6.,   11.],
143 |          [  26.,   31.],
144 |          [  46.,   51.]]],
145 | 
146 | 
147 |        [[[  10.,   19.],
148 |          [  46.,   55.],
149 |          [  82.,   91.]],
150 | 
151 |         [[  14.,   27.],
152 |          [  66.,   79.],
153 |          [ 118.,  131.]]]], dtype=float32)
154 | >>> z.shape
155 | (2L, 2L, 3L, 2L)
156 | ```
157 | 


--------------------------------------------------------------------------------
/doc/numpy_5.md:
--------------------------------------------------------------------------------
 1 | # Numpy
 2 | 
 3 | ## 統計処理と線形代数
 4 | 
 5 | ### 統計処理
 6 | 
 7 | numpyを使って統計処理を行うことができる。例えば以下のメソッドを使える。
 8 | 
 9 | * `numpy.sum`: 合計
10 | * `numpy.mean`: 平均
11 | * `numpy.var`: 分散
12 | * `numpy.std`: 標準偏差
13 | 
14 | ```
15 | >>> x = np.arange(10, dtype=np.float32)
16 | >>>
17 | >>> x.sum()
18 | 45.0
19 | >>> np.sum(x)
20 | 45.0
21 | >>> np.mean(x)
22 | 4.5
23 | >>> np.var(x)
24 | 8.25
25 | >>> np.std(x)
26 | 2.8722813
27 | ```
28 | 
29 | `axis`を指定することで特定の次元に沿った計算を行うこともできる。
30 | 
31 | ```
32 | >>> x = np.arange(9, dtype=np.float32).reshape((3,3))
33 | >>> x
34 | array([[ 0.,  1.,  2.],
35 |        [ 3.,  4.,  5.],
36 |        [ 6.,  7.,  8.]], dtype=float32)
37 | >>> x.sum(axis=1)
38 | array([  3.,  12.,  21.], dtype=float32)
39 | ```
40 | 
41 | ### 線形代数
42 | 
43 | `numpy.linalg`パッケージを使うと線形代数の処理を行うことができる。詳しくは[Numpy ManualのLinear algebraの項](http://docs.scipy.org/doc/numpy/reference/routines.linalg.html)を参照すること。
44 | 
45 | `numpy.linalg.inv`を使って逆行列を求める。
46 | 
47 | ```
48 | >>> x = np.array([[1, 2], [3, 4]], dtype=np.float32)
49 | >>> x
50 | array([[ 1.,  2.],
51 |        [ 3.,  4.]], dtype=float32)
52 | >>> y = np.linalg.inv(x)
53 | >>> y
54 | array([[-2. ,  1. ],
55 |        [ 1.5, -0.5]], dtype=float32)
56 | >>> np.dot(x, y)
57 | array([[ 1.,  0.],
58 |        [ 0.,  1.]], dtype=float32)
59 | ```
60 | 
61 | `np.linalg.eig`を使って固有値、固有ベクトルを求める。
62 | 
63 | ```
64 | >>> e_value, e_vector = np.linalg.eig(x)
65 | >>> e_value
66 | array([-0.37228131,  5.37228155], dtype=float32)
67 | >>> e_vector
68 | array([[-0.82456481, -0.41597354],
69 |        [ 0.56576747, -0.90937668]], dtype=float32)
70 | >>>
71 | >>> np.dot(x, e_vector[:,0])
72 | array([ 0.30697012, -0.21062446], dtype=float32)
73 | >>> e_vector[:, 0] * e_value[0]
74 | array([ 0.30697006, -0.21062465], dtype=float32)
75 | ```
76 | 


--------------------------------------------------------------------------------
/doc/reinforcement.md:
--------------------------------------------------------------------------------
 1 | # 強化学習
 2 | 
 3 | OpenAI Gymが提供する環境で強化学習を行う。
 4 | 以下の機能をサポートしている。
 5 | 
 6 | * Q学習
 7 | * "CartPole-v0"と"MountainCar-v0"の実行
 8 | * Experience Replay
 9 | 
10 | ## 参考にした実装
11 | 
12 | https://github.com/jaara/ai_examples
13 | 
14 | ## 必要な環境
15 | 
16 | * [OpenAI Gym](https://gym.openai.com/docs)
17 | `pip install gym`でインストールできる。
18 | 
19 | ## 実行方法
20 | 
21 | ### ソースディレクトリ
22 | 
23 | (rood dir)/src/open_ai_gym
24 | 
25 | ### 学習
26 | 
27 | ```
28 | cd src/open_ai_gym
29 | python train.py
30 | ```
31 | 
32 | オプション:
33 | * `-e <environmant>` : 実行するEnvironment("cart_pole"または"mountain_car")(default: "cart_pole")
34 | * `-s <skip size>` : 描画をスキップするエピソード数。描画をスキップすることで高速化する(default: 0)
35 | * `-b <batch size>` : 学習時ミニバッチサイズ(default: 32)
36 | * `-p <pool size>` : Experience Replay用に保持するiteration数(default: 2000)
37 | * `-t <train iteration>` : 一度に学習する回数(default: 10)
38 | * `--episode <episode num>` : 実行するエピソード数(default 1000)
39 | * `--use-double-q` : 指定するとDoule Q-learningを使用する
40 | 


--------------------------------------------------------------------------------
/doc/translation.md:
--------------------------------------------------------------------------------
 1 | # 翻訳
 2 | 
 3 | ## 実行方法
 4 | 
 5 | ### ソースディレクトリ
 6 | 
 7 | (rood dir)/src/translation
 8 | 
 9 | ### 学習データ生成
10 | 
11 | 0. 以下のリポジトリをクローンする。  
12 | https://github.com/odashi/small_parallel_enja
13 | 0. 以下のコマンドを実行する。(dataset_repository_rootはクローンしたデータセットリポジトリのルートディレクトリ)。
14 | ```
15 | python make_dataset.py dataset/dataset.json dataset/dataset.pkl -d (dataset_repository_root)
16 | ```
17 | 
18 | ### 学習
19 | 
20 | ```
21 | python train.py [dataset file] [input language] [output languate] [output model path] [output result path]
22 | ```
23 | 
24 | 例:
25 | 
26 | ```
27 | $ cd src/translation
28 | $ python train.py dataset/dataset.pkl en ja model/en_ja.model model/en_ja_result.txt
29 | ```
30 | 
31 | パラメータ:
32 | * `dataset file` : データセットファイルのパス
33 | * `input language` : 入力言語
34 | * `output language` : 出力言語
35 | * `output model path` : 出力モデルファイルパス
36 | * `output result path` : テストデータの翻訳結果の出力ファイルパス
37 | 
38 | オプション:
39 | * `-e <epoch>` : 学習epoch数(default: 50)
40 | * `-b <batch size>` : ミニバッチサイズ(default: 100)
41 | * `--hidden-size <size>` : 隠れ層のサイズ(default: 256)
42 | * `--max-result-len <length>` : テストデータ翻訳結果の最大単語数
43 | 


--------------------------------------------------------------------------------
/image/digit/digit_0.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_0.png


--------------------------------------------------------------------------------
/image/digit/digit_1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_1.png


--------------------------------------------------------------------------------
/image/digit/digit_2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_2.png


--------------------------------------------------------------------------------
/image/digit/digit_3.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_3.png


--------------------------------------------------------------------------------
/image/digit/digit_4.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_4.png


--------------------------------------------------------------------------------
/image/digit/digit_5.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_5.png


--------------------------------------------------------------------------------
/image/digit/digit_6.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_6.png


--------------------------------------------------------------------------------
/image/digit/digit_7.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_7.png


--------------------------------------------------------------------------------
/image/digit/digit_8.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_8.png


--------------------------------------------------------------------------------
/image/digit/digit_9.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/image/digit/digit_9.png


--------------------------------------------------------------------------------
/src/bouncing/make_animation.py:
--------------------------------------------------------------------------------
 1 | import argparse
 2 | import importlib
 3 | import numpy as np
 4 | import matplotlib.pyplot as plt
 5 | import matplotlib.animation as animation
 6 | import six
 7 | 
 8 | from chainer import serializers
 9 | 
10 | def update(index, scat, items):
11 |     scat.set_offsets(items[:, index, :])
12 |     return scat
13 | 
14 | def plot_animation(data, interval=50):
15 |     fig = plt.figure()
16 |     ax = fig.add_subplot(1,1,1)
17 |     ax.set_xlim([0, 50])
18 |     ax.set_ylim([0, 5])
19 |     ax.set_xlabel('X')
20 |     ax.set_ylabel('Y')
21 |     ax.set_title('Bouncing Ball')
22 | 
23 |     scat = ax.scatter(data[:, 0, 0], data[:, 0, 1], c=['b', 'r'], s=50, alpha=1)
24 |     # Creating the Animation object
25 |     max_frame = data.shape[1]
26 |     anim = animation.FuncAnimation(
27 |         fig, update, max_frame, fargs=(scat, data),
28 |         interval=interval, blit=False
29 |     )
30 |     plt.show()
31 | 
32 | def make_data(size, length, vx=5.0, vy=5.0, fps=20):
33 |     g = 9.8
34 |     # elapsed time
35 |     t = np.arange(length).astype(np.float32) / fps
36 |     t = t.reshape((1, -1, 1)).repeat(size, axis=0)
37 |     # calculate x, y positions
38 |     x = vx * t
39 |     interval = vy * 2 / g
40 |     count = np.floor(t / interval)
41 |     z = t - interval * count - interval * 0.5
42 |     y = 0.5 * g * ((interval * 0.5) ** 2 - (z) ** 2)
43 |     # output dimensions
44 |     # (size, length, (x and y))
45 |     return np.concatenate((x, y), axis=2)
46 | 
47 | def predict(net, xs):
48 |     result = np.zeros_like(xs)
49 |     result[:, 0, :] = xs[:, 0, :]
50 |     net.reset()
51 |     for i in six.moves.range(3):
52 |         x = xs[:, i, :]
53 |         y = net(x, train=False)
54 |         result[:, i + 1, :] = y.data[:, :]
55 |     y = xs[:, 3, :]
56 |     for i in six.moves.range(3, xs.shape[1] - 1):
57 |         y = net(y, train=False)
58 |         result[:, i + 1, :] = y.data[:, :]
59 |     return result
60 | 
61 | def load_model(file_path, module_name='train'):
62 |     module = importlib.import_module(module_name)
63 |     net = module.Bouncing()
64 |     serializers.load_npz(file_path, net)
65 |     return net
66 | 
67 | if __name__ == '__main__':
68 |     parser = argparse.ArgumentParser(description='Physical action training')
69 |     parser.add_argument('model_path', type=str, help='Model file path')
70 |     parser.add_argument('-x', type=float, default=5.0, help='X velocity')
71 |     parser.add_argument('-y', type=float, default=5.0, help='Y velocity')
72 |     args = parser.parse_args()
73 | 
74 |     net = load_model(args.model_path)
75 |     data = make_data(1, 100, vx=args.x, vy=args.y)
76 |     predicted = predict(net, data)
77 |     data = np.concatenate((data, predicted), axis=0)
78 |     plot_animation(data, interval=50)
79 | 


--------------------------------------------------------------------------------
/src/bouncing/train.py:
--------------------------------------------------------------------------------
  1 | # -*- coding: utf-8 -*-
  2 | 
  3 | import argparse
  4 | import numpy as np
  5 | import six
  6 | import time
  7 | 
  8 | import chainer
  9 | from chainer import functions as F
 10 | from chainer import links as L
 11 | from chainer import serializers
 12 | 
 13 | class Bouncing(chainer.Chain):
 14 | 
 15 |     def __init__(self):
 16 |         super(Bouncing, self).__init__(
 17 |             enc=L.Linear(2, 8),
 18 |             lstm=L.LSTM(8, 8),
 19 |             dec=L.Linear(8, 2),
 20 |         )
 21 | 
 22 |     def __call__(self, x, train=True):
 23 |         h1 = self.enc(x)
 24 |         h2 = self.lstm(h1)
 25 |         dx = self.dec(h2)
 26 |         return x + dx
 27 | 
 28 |     def reset(self):
 29 |         self.lstm.reset_state()
 30 | 
 31 | def make_train_data(size, length, fps=20):
 32 |     g = 9.8
 33 |     # initial velocity
 34 |     vx = np.random.random((size, 1, 1)).astype(np.float32) * 5 + 2.5
 35 |     vy = np.random.random((size, 1, 1)).astype(np.float32) * 5 + 2.5
 36 |     # elapsed time
 37 |     t = np.arange(length).astype(np.float32) / fps
 38 |     t = t.reshape((1, -1, 1)).repeat(size, axis=0)
 39 |     # calculate x, y positions
 40 |     x = vx * t
 41 |     interval = vy * 2 / g
 42 |     count = np.floor(t / interval)
 43 |     z = t - interval * count - interval * 0.5
 44 |     y = 0.5 * g * ((interval * 0.5) ** 2 - (z) ** 2)
 45 |     # output dimensions
 46 |     # (size, length, (x and y))
 47 |     return np.concatenate((x, y), axis=2)
 48 | 
 49 | def train_one(net, optimizer, xs):
 50 |     loss = 0
 51 |     h = None
 52 |     ts = xs[1:]
 53 |     update_interval = 10
 54 |     sum_loss = 0
 55 |     net.reset()
 56 |     for i in six.moves.range(3):
 57 |         x = xs[:, i, :]
 58 |         y = net(x, train=True)
 59 |     for i in six.moves.range(3, xs.shape[1] - 1):
 60 |         x = xs[:, i, :]
 61 |         t = xs[:, i + 1, :]
 62 |         y = net(x, train=True)
 63 |         loss += F.mean_squared_error(y, t)
 64 |         if (i + 1) % update_interval == 0:
 65 |             net.cleargrads()
 66 |             loss.backward()
 67 |             optimizer.update()
 68 |             sum_loss += float(loss.data)
 69 |             loss.unchain_backward()
 70 |             loss = 0
 71 |     if float(loss.data) > 0:
 72 |         net.cleargrads()
 73 |         loss.backward()
 74 |         optimizer.update()
 75 |         sum_loss += float(loss.data)
 76 |     return float(loss.data)
 77 | 
 78 | def train(epoch_num, model_path):
 79 |     batch_size = 5
 80 |     max_data_length = 100
 81 |     it_per_epoch = 100
 82 |     net = Bouncing()
 83 |     optimizer = chainer.optimizers.Adam(alpha=0.001)
 84 |     optimizer.setup(net)
 85 |     last_clock = time.clock()
 86 |     for epoch in six.moves.range(epoch_num):
 87 |         loss = 0
 88 |         for it in six.moves.range(it_per_epoch):
 89 |             data_length = min((epoch // 10 + 1) * 10, max_data_length)
 90 |             x = make_train_data(batch_size, data_length)
 91 |             loss += train_one(net, optimizer, x)
 92 |         current_clock = time.clock()
 93 |         print('epoch {} done {}s elapsed'.format(epoch + 1, current_clock - last_clock))
 94 |         print('training loss: {}'.format(loss / it_per_epoch))
 95 |         last_clock = current_clock
 96 |         serializers.save_npz(model_path, net)
 97 |         if epoch + 1 == int(epoch_num * 0.5):
 98 |             optimizer.alpha *= 0.1
 99 | 
100 | if __name__ == '__main__':
101 |     parser = argparse.ArgumentParser("Bouncing ball training")
102 |     parser.add_argument('--epoch', '-e', type=int, default=100, help='Number of epochs')
103 |     parser.add_argument('--model_path', '-m', type=str, default='bouncing.model', help='Model file path')
104 |     args = parser.parse_args()
105 |     train(args.epoch, args.model_path)
106 | 


--------------------------------------------------------------------------------
/src/caffemodel/labels.txt:
--------------------------------------------------------------------------------
   1 | tench, Tinca tinca
   2 | goldfish, Carassius auratus
   3 | great white shark, white shark, man-eater, man-eating shark, Carcharodon carcharias
   4 | tiger shark, Galeocerdo cuvieri
   5 | hammerhead, hammerhead shark
   6 | electric ray, crampfish, numbfish, torpedo
   7 | stingray
   8 | cock
   9 | hen
  10 | ostrich, Struthio camelus
  11 | brambling, Fringilla montifringilla
  12 | goldfinch, Carduelis carduelis
  13 | house finch, linnet, Carpodacus mexicanus
  14 | junco, snowbird
  15 | indigo bunting, indigo finch, indigo bird, Passerina cyanea
  16 | robin, American robin, Turdus migratorius
  17 | bulbul
  18 | jay
  19 | magpie
  20 | chickadee
  21 | water ouzel, dipper
  22 | kite
  23 | bald eagle, American eagle, Haliaeetus leucocephalus
  24 | vulture
  25 | great grey owl, great gray owl, Strix nebulosa
  26 | European fire salamander, Salamandra salamandra
  27 | common newt, Triturus vulgaris
  28 | eft
  29 | spotted salamander, Ambystoma maculatum
  30 | axolotl, mud puppy, Ambystoma mexicanum
  31 | bullfrog, Rana catesbeiana
  32 | tree frog, tree-frog
  33 | tailed frog, bell toad, ribbed toad, tailed toad, Ascaphus trui
  34 | loggerhead, loggerhead turtle, Caretta caretta
  35 | leatherback turtle, leatherback, leathery turtle, Dermochelys coriacea
  36 | mud turtle
  37 | terrapin
  38 | box turtle, box tortoise
  39 | banded gecko
  40 | common iguana, iguana, Iguana iguana
  41 | American chameleon, anole, Anolis carolinensis
  42 | whiptail, whiptail lizard
  43 | agama
  44 | frilled lizard, Chlamydosaurus kingi
  45 | alligator lizard
  46 | Gila monster, Heloderma suspectum
  47 | green lizard, Lacerta viridis
  48 | African chameleon, Chamaeleo chamaeleon
  49 | Komodo dragon, Komodo lizard, dragon lizard, giant lizard, Varanus komodoensis
  50 | African crocodile, Nile crocodile, Crocodylus niloticus
  51 | American alligator, Alligator mississipiensis
  52 | triceratops
  53 | thunder snake, worm snake, Carphophis amoenus
  54 | ringneck snake, ring-necked snake, ring snake
  55 | hognose snake, puff adder, sand viper
  56 | green snake, grass snake
  57 | king snake, kingsnake
  58 | garter snake, grass snake
  59 | water snake
  60 | vine snake
  61 | night snake, Hypsiglena torquata
  62 | boa constrictor, Constrictor constrictor
  63 | rock python, rock snake, Python sebae
  64 | Indian cobra, Naja naja
  65 | green mamba
  66 | sea snake
  67 | horned viper, cerastes, sand viper, horned asp, Cerastes cornutus
  68 | diamondback, diamondback rattlesnake, Crotalus adamanteus
  69 | sidewinder, horned rattlesnake, Crotalus cerastes
  70 | trilobite
  71 | harvestman, daddy longlegs, Phalangium opilio
  72 | scorpion
  73 | black and gold garden spider, Argiope aurantia
  74 | barn spider, Araneus cavaticus
  75 | garden spider, Aranea diademata
  76 | black widow, Latrodectus mactans
  77 | tarantula
  78 | wolf spider, hunting spider
  79 | tick
  80 | centipede
  81 | black grouse
  82 | ptarmigan
  83 | ruffed grouse, partridge, Bonasa umbellus
  84 | prairie chicken, prairie grouse, prairie fowl
  85 | peacock
  86 | quail
  87 | partridge
  88 | African grey, African gray, Psittacus erithacus
  89 | macaw
  90 | sulphur-crested cockatoo, Kakatoe galerita, Cacatua galerita
  91 | lorikeet
  92 | coucal
  93 | bee eater
  94 | hornbill
  95 | hummingbird
  96 | jacamar
  97 | toucan
  98 | drake
  99 | red-breasted merganser, Mergus serrator
 100 | goose
 101 | black swan, Cygnus atratus
 102 | tusker
 103 | echidna, spiny anteater, anteater
 104 | platypus, duckbill, duckbilled platypus, duck-billed platypus, Ornithorhynchus anatinus
 105 | wallaby, brush kangaroo
 106 | koala, koala bear, kangaroo bear, native bear, Phascolarctos cinereus
 107 | wombat
 108 | jellyfish
 109 | sea anemone, anemone
 110 | brain coral
 111 | flatworm, platyhelminth
 112 | nematode, nematode worm, roundworm
 113 | conch
 114 | snail
 115 | slug
 116 | sea slug, nudibranch
 117 | chiton, coat-of-mail shell, sea cradle, polyplacophore
 118 | chambered nautilus, pearly nautilus, nautilus
 119 | Dungeness crab, Cancer magister
 120 | rock crab, Cancer irroratus
 121 | fiddler crab
 122 | king crab, Alaska crab, Alaskan king crab, Alaska king crab, Paralithodes camtschatica
 123 | American lobster, Northern lobster, Maine lobster, Homarus americanus
 124 | spiny lobster, langouste, rock lobster, crawfish, crayfish, sea crawfish
 125 | crayfish, crawfish, crawdad, crawdaddy
 126 | hermit crab
 127 | isopod
 128 | white stork, Ciconia ciconia
 129 | black stork, Ciconia nigra
 130 | spoonbill
 131 | flamingo
 132 | little blue heron, Egretta caerulea
 133 | American egret, great white heron, Egretta albus
 134 | bittern
 135 | crane (bird)
 136 | limpkin, Aramus pictus
 137 | European gallinule, Porphyrio porphyrio
 138 | American coot, marsh hen, mud hen, water hen, Fulica americana
 139 | bustard
 140 | ruddy turnstone, Arenaria interpres
 141 | red-backed sandpiper, dunlin, Erolia alpina
 142 | redshank, Tringa totanus
 143 | dowitcher
 144 | oystercatcher, oyster catcher
 145 | pelican
 146 | king penguin, Aptenodytes patagonica
 147 | albatross, mollymawk
 148 | grey whale, gray whale, devilfish, Eschrichtius gibbosus, Eschrichtius robustus
 149 | killer whale, killer, orca, grampus, sea wolf, Orcinus orca
 150 | dugong, Dugong dugon
 151 | sea lion
 152 | Chihuahua
 153 | Japanese spaniel
 154 | Maltese dog, Maltese terrier, Maltese
 155 | Pekinese, Pekingese, Peke
 156 | Shih-Tzu
 157 | Blenheim spaniel
 158 | papillon
 159 | toy terrier
 160 | Rhodesian ridgeback
 161 | Afghan hound, Afghan
 162 | basset, basset hound
 163 | beagle
 164 | bloodhound, sleuthhound
 165 | bluetick
 166 | black-and-tan coonhound
 167 | Walker hound, Walker foxhound
 168 | English foxhound
 169 | redbone
 170 | borzoi, Russian wolfhound
 171 | Irish wolfhound
 172 | Italian greyhound
 173 | whippet
 174 | Ibizan hound, Ibizan Podenco
 175 | Norwegian elkhound, elkhound
 176 | otterhound, otter hound
 177 | Saluki, gazelle hound
 178 | Scottish deerhound, deerhound
 179 | Weimaraner
 180 | Staffordshire bullterrier, Staffordshire bull terrier
 181 | American Staffordshire terrier, Staffordshire terrier, American pit bull terrier, pit bull terrier
 182 | Bedlington terrier
 183 | Border terrier
 184 | Kerry blue terrier
 185 | Irish terrier
 186 | Norfolk terrier
 187 | Norwich terrier
 188 | Yorkshire terrier
 189 | wire-haired fox terrier
 190 | Lakeland terrier
 191 | Sealyham terrier, Sealyham
 192 | Airedale, Airedale terrier
 193 | cairn, cairn terrier
 194 | Australian terrier
 195 | Dandie Dinmont, Dandie Dinmont terrier
 196 | Boston bull, Boston terrier
 197 | miniature schnauzer
 198 | giant schnauzer
 199 | standard schnauzer
 200 | Scotch terrier, Scottish terrier, Scottie
 201 | Tibetan terrier, chrysanthemum dog
 202 | silky terrier, Sydney silky
 203 | soft-coated wheaten terrier
 204 | West Highland white terrier
 205 | Lhasa, Lhasa apso
 206 | flat-coated retriever
 207 | curly-coated retriever
 208 | golden retriever
 209 | Labrador retriever
 210 | Chesapeake Bay retriever
 211 | German short-haired pointer
 212 | vizsla, Hungarian pointer
 213 | English setter
 214 | Irish setter, red setter
 215 | Gordon setter
 216 | Brittany spaniel
 217 | clumber, clumber spaniel
 218 | English springer, English springer spaniel
 219 | Welsh springer spaniel
 220 | cocker spaniel, English cocker spaniel, cocker
 221 | Sussex spaniel
 222 | Irish water spaniel
 223 | kuvasz
 224 | schipperke
 225 | groenendael
 226 | malinois
 227 | briard
 228 | kelpie
 229 | komondor
 230 | Old English sheepdog, bobtail
 231 | Shetland sheepdog, Shetland sheep dog, Shetland
 232 | collie
 233 | Border collie
 234 | Bouvier des Flandres, Bouviers des Flandres
 235 | Rottweiler
 236 | German shepherd, German shepherd dog, German police dog, alsatian
 237 | Doberman, Doberman pinscher
 238 | miniature pinscher
 239 | Greater Swiss Mountain dog
 240 | Bernese mountain dog
 241 | Appenzeller
 242 | EntleBucher
 243 | boxer
 244 | bull mastiff
 245 | Tibetan mastiff
 246 | French bulldog
 247 | Great Dane
 248 | Saint Bernard, St Bernard
 249 | Eskimo dog, husky
 250 | malamute, malemute, Alaskan malamute
 251 | Siberian husky
 252 | dalmatian, coach dog, carriage dog
 253 | affenpinscher, monkey pinscher, monkey dog
 254 | basenji
 255 | pug, pug-dog
 256 | Leonberg
 257 | Newfoundland, Newfoundland dog
 258 | Great Pyrenees
 259 | Samoyed, Samoyede
 260 | Pomeranian
 261 | chow, chow chow
 262 | keeshond
 263 | Brabancon griffon
 264 | Pembroke, Pembroke Welsh corgi
 265 | Cardigan, Cardigan Welsh corgi
 266 | toy poodle
 267 | miniature poodle
 268 | standard poodle
 269 | Mexican hairless
 270 | timber wolf, grey wolf, gray wolf, Canis lupus
 271 | white wolf, Arctic wolf, Canis lupus tundrarum
 272 | red wolf, maned wolf, Canis rufus, Canis niger
 273 | coyote, prairie wolf, brush wolf, Canis latrans
 274 | dingo, warrigal, warragal, Canis dingo
 275 | dhole, Cuon alpinus
 276 | African hunting dog, hyena dog, Cape hunting dog, Lycaon pictus
 277 | hyena, hyaena
 278 | red fox, Vulpes vulpes
 279 | kit fox, Vulpes macrotis
 280 | Arctic fox, white fox, Alopex lagopus
 281 | grey fox, gray fox, Urocyon cinereoargenteus
 282 | tabby, tabby cat
 283 | tiger cat
 284 | Persian cat
 285 | Siamese cat, Siamese
 286 | Egyptian cat
 287 | cougar, puma, catamount, mountain lion, painter, panther, Felis concolor
 288 | lynx, catamount
 289 | leopard, Panthera pardus
 290 | snow leopard, ounce, Panthera uncia
 291 | jaguar, panther, Panthera onca, Felis onca
 292 | lion, king of beasts, Panthera leo
 293 | tiger, Panthera tigris
 294 | cheetah, chetah, Acinonyx jubatus
 295 | brown bear, bruin, Ursus arctos
 296 | American black bear, black bear, Ursus americanus, Euarctos americanus
 297 | ice bear, polar bear, Ursus Maritimus, Thalarctos maritimus
 298 | sloth bear, Melursus ursinus, Ursus ursinus
 299 | mongoose
 300 | meerkat, mierkat
 301 | tiger beetle
 302 | ladybug, ladybeetle, lady beetle, ladybird, ladybird beetle
 303 | ground beetle, carabid beetle
 304 | long-horned beetle, longicorn, longicorn beetle
 305 | leaf beetle, chrysomelid
 306 | dung beetle
 307 | rhinoceros beetle
 308 | weevil
 309 | fly
 310 | bee
 311 | ant, emmet, pismire
 312 | grasshopper, hopper
 313 | cricket
 314 | walking stick, walkingstick, stick insect
 315 | cockroach, roach
 316 | mantis, mantid
 317 | cicada, cicala
 318 | leafhopper
 319 | lacewing, lacewing fly
 320 | dragonfly, darning needle, devil's darning needle, sewing needle, snake feeder, snake doctor, mosquito hawk, skeeter hawk
 321 | damselfly
 322 | admiral
 323 | ringlet, ringlet butterfly
 324 | monarch, monarch butterfly, milkweed butterfly, Danaus plexippus
 325 | cabbage butterfly
 326 | sulphur butterfly, sulfur butterfly
 327 | lycaenid, lycaenid butterfly
 328 | starfish, sea star
 329 | sea urchin
 330 | sea cucumber, holothurian
 331 | wood rabbit, cottontail, cottontail rabbit
 332 | hare
 333 | Angora, Angora rabbit
 334 | hamster
 335 | porcupine, hedgehog
 336 | fox squirrel, eastern fox squirrel, Sciurus niger
 337 | marmot
 338 | beaver
 339 | guinea pig, Cavia cobaya
 340 | sorrel
 341 | zebra
 342 | hog, pig, grunter, squealer, Sus scrofa
 343 | wild boar, boar, Sus scrofa
 344 | warthog
 345 | hippopotamus, hippo, river horse, Hippopotamus amphibius
 346 | ox
 347 | water buffalo, water ox, Asiatic buffalo, Bubalus bubalis
 348 | bison
 349 | ram, tup
 350 | bighorn, bighorn sheep, cimarron, Rocky Mountain bighorn, Rocky Mountain sheep, Ovis canadensis
 351 | ibex, Capra ibex
 352 | hartebeest
 353 | impala, Aepyceros melampus
 354 | gazelle
 355 | Arabian camel, dromedary, Camelus dromedarius
 356 | llama
 357 | weasel
 358 | mink
 359 | polecat, fitch, foulmart, foumart, Mustela putorius
 360 | black-footed ferret, ferret, Mustela nigripes
 361 | otter
 362 | skunk, polecat, wood pussy
 363 | badger
 364 | armadillo
 365 | three-toed sloth, ai, Bradypus tridactylus
 366 | orangutan, orang, orangutang, Pongo pygmaeus
 367 | gorilla, Gorilla gorilla
 368 | chimpanzee, chimp, Pan troglodytes
 369 | gibbon, Hylobates lar
 370 | siamang, Hylobates syndactylus, Symphalangus syndactylus
 371 | guenon, guenon monkey
 372 | patas, hussar monkey, Erythrocebus patas
 373 | baboon
 374 | macaque
 375 | langur
 376 | colobus, colobus monkey
 377 | proboscis monkey, Nasalis larvatus
 378 | marmoset
 379 | capuchin, ringtail, Cebus capucinus
 380 | howler monkey, howler
 381 | titi, titi monkey
 382 | spider monkey, Ateles geoffroyi
 383 | squirrel monkey, Saimiri sciureus
 384 | Madagascar cat, ring-tailed lemur, Lemur catta
 385 | indri, indris, Indri indri, Indri brevicaudatus
 386 | Indian elephant, Elephas maximus
 387 | African elephant, Loxodonta africana
 388 | lesser panda, red panda, panda, bear cat, cat bear, Ailurus fulgens
 389 | giant panda, panda, panda bear, coon bear, Ailuropoda melanoleuca
 390 | barracouta, snoek
 391 | eel
 392 | coho, cohoe, coho salmon, blue jack, silver salmon, Oncorhynchus kisutch
 393 | rock beauty, Holocanthus tricolor
 394 | anemone fish
 395 | sturgeon
 396 | gar, garfish, garpike, billfish, Lepisosteus osseus
 397 | lionfish
 398 | puffer, pufferfish, blowfish, globefish
 399 | abacus
 400 | abaya
 401 | academic gown, academic robe, judge's robe
 402 | accordion, piano accordion, squeeze box
 403 | acoustic guitar
 404 | aircraft carrier, carrier, flattop, attack aircraft carrier
 405 | airliner
 406 | airship, dirigible
 407 | altar
 408 | ambulance
 409 | amphibian, amphibious vehicle
 410 | analog clock
 411 | apiary, bee house
 412 | apron
 413 | ashcan, trash can, garbage can, wastebin, ash bin, ash-bin, ashbin, dustbin, trash barrel, trash bin
 414 | assault rifle, assault gun
 415 | backpack, back pack, knapsack, packsack, rucksack, haversack
 416 | bakery, bakeshop, bakehouse
 417 | balance beam, beam
 418 | balloon
 419 | ballpoint, ballpoint pen, ballpen, Biro
 420 | Band Aid
 421 | banjo
 422 | bannister, banister, balustrade, balusters, handrail
 423 | barbell
 424 | barber chair
 425 | barbershop
 426 | barn
 427 | barometer
 428 | barrel, cask
 429 | barrow, garden cart, lawn cart, wheelbarrow
 430 | baseball
 431 | basketball
 432 | bassinet
 433 | bassoon
 434 | bathing cap, swimming cap
 435 | bath towel
 436 | bathtub, bathing tub, bath, tub
 437 | beach wagon, station wagon, wagon, estate car, beach waggon, station waggon, waggon
 438 | beacon, lighthouse, beacon light, pharos
 439 | beaker
 440 | bearskin, busby, shako
 441 | beer bottle
 442 | beer glass
 443 | bell cote, bell cot
 444 | bib
 445 | bicycle-built-for-two, tandem bicycle, tandem
 446 | bikini, two-piece
 447 | binder, ring-binder
 448 | binoculars, field glasses, opera glasses
 449 | birdhouse
 450 | boathouse
 451 | bobsled, bobsleigh, bob
 452 | bolo tie, bolo, bola tie, bola
 453 | bonnet, poke bonnet
 454 | bookcase
 455 | bookshop, bookstore, bookstall
 456 | bottlecap
 457 | bow
 458 | bow tie, bow-tie, bowtie
 459 | brass, memorial tablet, plaque
 460 | brassiere, bra, bandeau
 461 | breakwater, groin, groyne, mole, bulwark, seawall, jetty
 462 | breastplate, aegis, egis
 463 | broom
 464 | bucket, pail
 465 | buckle
 466 | bulletproof vest
 467 | bullet train, bullet
 468 | butcher shop, meat market
 469 | cab, hack, taxi, taxicab
 470 | caldron, cauldron
 471 | candle, taper, wax light
 472 | cannon
 473 | canoe
 474 | can opener, tin opener
 475 | cardigan
 476 | car mirror
 477 | carousel, carrousel, merry-go-round, roundabout, whirligig
 478 | carpenter's kit, tool kit
 479 | carton
 480 | car wheel
 481 | cash machine, cash dispenser, automated teller machine, automatic teller machine, automated teller, automatic teller, ATM
 482 | cassette
 483 | cassette player
 484 | castle
 485 | catamaran
 486 | CD player
 487 | cello, violoncello
 488 | cellular telephone, cellular phone, cellphone, cell, mobile phone
 489 | chain
 490 | chainlink fence
 491 | chain mail, ring mail, mail, chain armor, chain armour, ring armor, ring armour
 492 | chain saw, chainsaw
 493 | chest
 494 | chiffonier, commode
 495 | chime, bell, gong
 496 | china cabinet, china closet
 497 | Christmas stocking
 498 | church, church building
 499 | cinema, movie theater, movie theatre, movie house, picture palace
 500 | cleaver, meat cleaver, chopper
 501 | cliff dwelling
 502 | cloak
 503 | clog, geta, patten, sabot
 504 | cocktail shaker
 505 | coffee mug
 506 | coffeepot
 507 | coil, spiral, volute, whorl, helix
 508 | combination lock
 509 | computer keyboard, keypad
 510 | confectionery, confectionary, candy store
 511 | container ship, containership, container vessel
 512 | convertible
 513 | corkscrew, bottle screw
 514 | cornet, horn, trumpet, trump
 515 | cowboy boot
 516 | cowboy hat, ten-gallon hat
 517 | cradle
 518 | crane (machine)
 519 | crash helmet
 520 | crate
 521 | crib, cot
 522 | Crock Pot
 523 | croquet ball
 524 | crutch
 525 | cuirass
 526 | dam, dike, dyke
 527 | desk
 528 | desktop computer
 529 | dial telephone, dial phone
 530 | diaper, nappy, napkin
 531 | digital clock
 532 | digital watch
 533 | dining table, board
 534 | dishrag, dishcloth
 535 | dishwasher, dish washer, dishwashing machine
 536 | disk brake, disc brake
 537 | dock, dockage, docking facility
 538 | dogsled, dog sled, dog sleigh
 539 | dome
 540 | doormat, welcome mat
 541 | drilling platform, offshore rig
 542 | drum, membranophone, tympan
 543 | drumstick
 544 | dumbbell
 545 | Dutch oven
 546 | electric fan, blower
 547 | electric guitar
 548 | electric locomotive
 549 | entertainment center
 550 | envelope
 551 | espresso maker
 552 | face powder
 553 | feather boa, boa
 554 | file, file cabinet, filing cabinet
 555 | fireboat
 556 | fire engine, fire truck
 557 | fire screen, fireguard
 558 | flagpole, flagstaff
 559 | flute, transverse flute
 560 | folding chair
 561 | football helmet
 562 | forklift
 563 | fountain
 564 | fountain pen
 565 | four-poster
 566 | freight car
 567 | French horn, horn
 568 | frying pan, frypan, skillet
 569 | fur coat
 570 | garbage truck, dustcart
 571 | gasmask, respirator, gas helmet
 572 | gas pump, gasoline pump, petrol pump, island dispenser
 573 | goblet
 574 | go-kart
 575 | golf ball
 576 | golfcart, golf cart
 577 | gondola
 578 | gong, tam-tam
 579 | gown
 580 | grand piano, grand
 581 | greenhouse, nursery, glasshouse
 582 | grille, radiator grille
 583 | grocery store, grocery, food market, market
 584 | guillotine
 585 | hair slide
 586 | hair spray
 587 | half track
 588 | hammer
 589 | hamper
 590 | hand blower, blow dryer, blow drier, hair dryer, hair drier
 591 | hand-held computer, hand-held microcomputer
 592 | handkerchief, hankie, hanky, hankey
 593 | hard disc, hard disk, fixed disk
 594 | harmonica, mouth organ, harp, mouth harp
 595 | harp
 596 | harvester, reaper
 597 | hatchet
 598 | holster
 599 | home theater, home theatre
 600 | honeycomb
 601 | hook, claw
 602 | hoopskirt, crinoline
 603 | horizontal bar, high bar
 604 | horse cart, horse-cart
 605 | hourglass
 606 | iPod
 607 | iron, smoothing iron
 608 | jack-o'-lantern
 609 | jean, blue jean, denim
 610 | jeep, landrover
 611 | jersey, T-shirt, tee shirt
 612 | jigsaw puzzle
 613 | jinrikisha, ricksha, rickshaw
 614 | joystick
 615 | kimono
 616 | knee pad
 617 | knot
 618 | lab coat, laboratory coat
 619 | ladle
 620 | lampshade, lamp shade
 621 | laptop, laptop computer
 622 | lawn mower, mower
 623 | lens cap, lens cover
 624 | letter opener, paper knife, paperknife
 625 | library
 626 | lifeboat
 627 | lighter, light, igniter, ignitor
 628 | limousine, limo
 629 | liner, ocean liner
 630 | lipstick, lip rouge
 631 | Loafer
 632 | lotion
 633 | loudspeaker, speaker, speaker unit, loudspeaker system, speaker system
 634 | loupe, jeweler's loupe
 635 | lumbermill, sawmill
 636 | magnetic compass
 637 | mailbag, postbag
 638 | mailbox, letter box
 639 | maillot
 640 | maillot, tank suit
 641 | manhole cover
 642 | maraca
 643 | marimba, xylophone
 644 | mask
 645 | matchstick
 646 | maypole
 647 | maze, labyrinth
 648 | measuring cup
 649 | medicine chest, medicine cabinet
 650 | megalith, megalithic structure
 651 | microphone, mike
 652 | microwave, microwave oven
 653 | military uniform
 654 | milk can
 655 | minibus
 656 | miniskirt, mini
 657 | minivan
 658 | missile
 659 | mitten
 660 | mixing bowl
 661 | mobile home, manufactured home
 662 | Model T
 663 | modem
 664 | monastery
 665 | monitor
 666 | moped
 667 | mortar
 668 | mortarboard
 669 | mosque
 670 | mosquito net
 671 | motor scooter, scooter
 672 | mountain bike, all-terrain bike, off-roader
 673 | mountain tent
 674 | mouse, computer mouse
 675 | mousetrap
 676 | moving van
 677 | muzzle
 678 | nail
 679 | neck brace
 680 | necklace
 681 | nipple
 682 | notebook, notebook computer
 683 | obelisk
 684 | oboe, hautboy, hautbois
 685 | ocarina, sweet potato
 686 | odometer, hodometer, mileometer, milometer
 687 | oil filter
 688 | organ, pipe organ
 689 | oscilloscope, scope, cathode-ray oscilloscope, CRO
 690 | overskirt
 691 | oxcart
 692 | oxygen mask
 693 | packet
 694 | paddle, boat paddle
 695 | paddlewheel, paddle wheel
 696 | padlock
 697 | paintbrush
 698 | pajama, pyjama, pj's, jammies
 699 | palace
 700 | panpipe, pandean pipe, syrinx
 701 | paper towel
 702 | parachute, chute
 703 | parallel bars, bars
 704 | park bench
 705 | parking meter
 706 | passenger car, coach, carriage
 707 | patio, terrace
 708 | pay-phone, pay-station
 709 | pedestal, plinth, footstall
 710 | pencil box, pencil case
 711 | pencil sharpener
 712 | perfume, essence
 713 | Petri dish
 714 | photocopier
 715 | pick, plectrum, plectron
 716 | pickelhaube
 717 | picket fence, paling
 718 | pickup, pickup truck
 719 | pier
 720 | piggy bank, penny bank
 721 | pill bottle
 722 | pillow
 723 | ping-pong ball
 724 | pinwheel
 725 | pirate, pirate ship
 726 | pitcher, ewer
 727 | plane, carpenter's plane, woodworking plane
 728 | planetarium
 729 | plastic bag
 730 | plate rack
 731 | plow, plough
 732 | plunger, plumber's helper
 733 | Polaroid camera, Polaroid Land camera
 734 | pole
 735 | police van, police wagon, paddy wagon, patrol wagon, wagon, black Maria
 736 | poncho
 737 | pool table, billiard table, snooker table
 738 | pop bottle, soda bottle
 739 | pot, flowerpot
 740 | potter's wheel
 741 | power drill
 742 | prayer rug, prayer mat
 743 | printer
 744 | prison, prison house
 745 | projectile, missile
 746 | projector
 747 | puck, hockey puck
 748 | punching bag, punch bag, punching ball, punchball
 749 | purse
 750 | quill, quill pen
 751 | quilt, comforter, comfort, puff
 752 | racer, race car, racing car
 753 | racket, racquet
 754 | radiator
 755 | radio, wireless
 756 | radio telescope, radio reflector
 757 | rain barrel
 758 | recreational vehicle, RV, R.V.
 759 | reel
 760 | reflex camera
 761 | refrigerator, icebox
 762 | remote control, remote
 763 | restaurant, eating house, eating place, eatery
 764 | revolver, six-gun, six-shooter
 765 | rifle
 766 | rocking chair, rocker
 767 | rotisserie
 768 | rubber eraser, rubber, pencil eraser
 769 | rugby ball
 770 | rule, ruler
 771 | running shoe
 772 | safe
 773 | safety pin
 774 | saltshaker, salt shaker
 775 | sandal
 776 | sarong
 777 | sax, saxophone
 778 | scabbard
 779 | scale, weighing machine
 780 | school bus
 781 | schooner
 782 | scoreboard
 783 | screen, CRT screen
 784 | screw
 785 | screwdriver
 786 | seat belt, seatbelt
 787 | sewing machine
 788 | shield, buckler
 789 | shoe shop, shoe-shop, shoe store
 790 | shoji
 791 | shopping basket
 792 | shopping cart
 793 | shovel
 794 | shower cap
 795 | shower curtain
 796 | ski
 797 | ski mask
 798 | sleeping bag
 799 | slide rule, slipstick
 800 | sliding door
 801 | slot, one-armed bandit
 802 | snorkel
 803 | snowmobile
 804 | snowplow, snowplough
 805 | soap dispenser
 806 | soccer ball
 807 | sock
 808 | solar dish, solar collector, solar furnace
 809 | sombrero
 810 | soup bowl
 811 | space bar
 812 | space heater
 813 | space shuttle
 814 | spatula
 815 | speedboat
 816 | spider web, spider's web
 817 | spindle
 818 | sports car, sport car
 819 | spotlight, spot
 820 | stage
 821 | steam locomotive
 822 | steel arch bridge
 823 | steel drum
 824 | stethoscope
 825 | stole
 826 | stone wall
 827 | stopwatch, stop watch
 828 | stove
 829 | strainer
 830 | streetcar, tram, tramcar, trolley, trolley car
 831 | stretcher
 832 | studio couch, day bed
 833 | stupa, tope
 834 | submarine, pigboat, sub, U-boat
 835 | suit, suit of clothes
 836 | sundial
 837 | sunglass
 838 | sunglasses, dark glasses, shades
 839 | sunscreen, sunblock, sun blocker
 840 | suspension bridge
 841 | swab, swob, mop
 842 | sweatshirt
 843 | swimming trunks, bathing trunks
 844 | swing
 845 | switch, electric switch, electrical switch
 846 | syringe
 847 | table lamp
 848 | tank, army tank, armored combat vehicle, armoured combat vehicle
 849 | tape player
 850 | teapot
 851 | teddy, teddy bear
 852 | television, television system
 853 | tennis ball
 854 | thatch, thatched roof
 855 | theater curtain, theatre curtain
 856 | thimble
 857 | thresher, thrasher, threshing machine
 858 | throne
 859 | tile roof
 860 | toaster
 861 | tobacco shop, tobacconist shop, tobacconist
 862 | toilet seat
 863 | torch
 864 | totem pole
 865 | tow truck, tow car, wrecker
 866 | toyshop
 867 | tractor
 868 | trailer truck, tractor trailer, trucking rig, rig, articulated lorry, semi
 869 | tray
 870 | trench coat
 871 | tricycle, trike, velocipede
 872 | trimaran
 873 | tripod
 874 | triumphal arch
 875 | trolleybus, trolley coach, trackless trolley
 876 | trombone
 877 | tub, vat
 878 | turnstile
 879 | typewriter keyboard
 880 | umbrella
 881 | unicycle, monocycle
 882 | upright, upright piano
 883 | vacuum, vacuum cleaner
 884 | vase
 885 | vault
 886 | velvet
 887 | vending machine
 888 | vestment
 889 | viaduct
 890 | violin, fiddle
 891 | volleyball
 892 | waffle iron
 893 | wall clock
 894 | wallet, billfold, notecase, pocketbook
 895 | wardrobe, closet, press
 896 | warplane, military plane
 897 | washbasin, handbasin, washbowl, lavabo, wash-hand basin
 898 | washer, automatic washer, washing machine
 899 | water bottle
 900 | water jug
 901 | water tower
 902 | whiskey jug
 903 | whistle
 904 | wig
 905 | window screen
 906 | window shade
 907 | Windsor tie
 908 | wine bottle
 909 | wing
 910 | wok
 911 | wooden spoon
 912 | wool, woolen, woollen
 913 | worm fence, snake fence, snake-rail fence, Virginia fence
 914 | wreck
 915 | yawl
 916 | yurt
 917 | web site, website, internet site, site
 918 | comic book
 919 | crossword puzzle, crossword
 920 | street sign
 921 | traffic light, traffic signal, stoplight
 922 | book jacket, dust cover, dust jacket, dust wrapper
 923 | menu
 924 | plate
 925 | guacamole
 926 | consomme
 927 | hot pot, hotpot
 928 | trifle
 929 | ice cream, icecream
 930 | ice lolly, lolly, lollipop, popsicle
 931 | French loaf
 932 | bagel, beigel
 933 | pretzel
 934 | cheeseburger
 935 | hotdog, hot dog, red hot
 936 | mashed potato
 937 | head cabbage
 938 | broccoli
 939 | cauliflower
 940 | zucchini, courgette
 941 | spaghetti squash
 942 | acorn squash
 943 | butternut squash
 944 | cucumber, cuke
 945 | artichoke, globe artichoke
 946 | bell pepper
 947 | cardoon
 948 | mushroom
 949 | Granny Smith
 950 | strawberry
 951 | orange
 952 | lemon
 953 | fig
 954 | pineapple, ananas
 955 | banana
 956 | jackfruit, jak, jack
 957 | custard apple
 958 | pomegranate
 959 | hay
 960 | carbonara
 961 | chocolate sauce, chocolate syrup
 962 | dough
 963 | meat loaf, meatloaf
 964 | pizza, pizza pie
 965 | potpie
 966 | burrito
 967 | red wine
 968 | espresso
 969 | cup
 970 | eggnog
 971 | alp
 972 | bubble
 973 | cliff, drop, drop-off
 974 | coral reef
 975 | geyser
 976 | lakeside, lakeshore
 977 | promontory, headland, head, foreland
 978 | sandbar, sand bar
 979 | seashore, coast, seacoast, sea-coast
 980 | valley, vale
 981 | volcano
 982 | ballplayer, baseball player
 983 | groom, bridegroom
 984 | scuba diver
 985 | rapeseed
 986 | daisy
 987 | yellow lady's slipper, yellow lady-slipper, Cypripedium calceolus, Cypripedium parviflorum
 988 | corn
 989 | acorn
 990 | hip, rose hip, rosehip
 991 | buckeye, horse chestnut, conker
 992 | coral fungus
 993 | agaric
 994 | gyromitra
 995 | stinkhorn, carrion fungus
 996 | earthstar
 997 | hen-of-the-woods, hen of the woods, Polyporus frondosus, Grifola frondosa
 998 | bolete
 999 | ear, spike, capitulum
1000 | toilet tissue, toilet paper, bathroom tissue
1001 | 


--------------------------------------------------------------------------------
/src/caffemodel/predict.py:
--------------------------------------------------------------------------------
 1 | # -*- coding: utf-8 -*-
 2 | 
 3 | import argparse
 4 | import numpy as np
 5 | import os
 6 | from PIL import Image
 7 | 
 8 | import chainer
 9 | from chainer import cuda
10 | from chainer import functions as F
11 | from chainer.functions import caffe
12 | 
13 | 
14 | def load_image(file_path, device=-1):
15 |     mean = np.asarray([104, 117, 123], dtype=np.float32)
16 |     # 中央の正方形領域を切り抜き、入力サイズを224 x 224
17 |     # 入力サイズはモデルによって異なる
18 |     image = Image.open(file_path).convert('RGB')
19 |     w, h = image.size
20 |     if w > h:
21 |         offset_x = (w - h) // 2
22 |         offset_y = 0
23 |         size = h
24 |     else:
25 |         offset_x = 0
26 |         offset_y = (h - w) // 2
27 |         size = w
28 |     image = image.crop((offset_x, offset_y, size, size)).resize((224, 224))
29 |     # RGBからBGRに変換する
30 |     # caffemodelは通常BGRを入力とする
31 |     x = np.asarray(image, dtype=np.float32)[:,:,::-1] - mean
32 |     # shapeを(batch, channel, height, width)に変換
33 |     x = x.transpose((2, 0, 1)).reshape((1, 3, 224, 224))
34 |     # 必要ならcupy配列に変換
35 |     if device >= 0:
36 |         x = cuda.to_gpu(x, device)
37 |     return x
38 | 
39 | def predict(model, x):
40 |     # caffemodelインスタンスを関数として呼ぶことでレイヤーの出力を得ることができる。
41 |     # 入力はinputsでdict形式で指定する
42 |     # 出力はレイヤー名のリストを指定する
43 |     # disableパラメータで使用しないレイヤーを指定できる。出力に関与しないレイヤーを使用しないことで無駄な計算を抑えることができる。
44 |     y, = model(inputs={'data': chainer.Variable(x, volatile=True)}, outputs=['loss3/classifier'], disable=['loss1/ave_pool', 'loss2/ave_pool'], train=False)
45 |     return F.softmax(y)
46 | 
47 | def is_image_file(path):
48 |     base, ext = os.path.splitext()
49 |     os.path.isfile(path)
50 | 
51 | def print_top(x, categories, top=10):
52 |     scores = x.reshape((-1,))
53 |     result = sorted(zip(scores, categories), reverse=True)
54 |     for i, (score, label) in enumerate(result[:top]):
55 |         print('{:>3d} {:>6.2f}% {}'.format(i + 1, float(score) * 100, label))
56 | 
57 | 
58 | if __name__ == '__main__':
59 |     parser = argparse.ArgumentParser('predict image category')
60 |     parser.add_argument('image_path', type=str, help='Image file  or directory path')
61 |     parser.add_argument('--model', '-m', type=str, default='bvlc_googlenet.caffemodel', help='caffe model file path')
62 |     parser.add_argument('--label', '-l', type=str, default='labels.txt', help='label file path')
63 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index. negative value indicates CPU')
64 |     args = parser.parse_args()
65 | 
66 |     device = args.gpu
67 |     categories = np.loadtxt(args.label, str, delimiter='\n')
68 |     caffe_model = caffe.CaffeFunction(args.model)
69 |     if device >= 0:
70 |         caffe_model.to_gpu(device)
71 | 
72 |     if os.path.isdir(args.image_path):
73 |         image_files = os.listdir(args.image_path)
74 |         image_paths = map(lambda f: os.path.join(args.image_path, f), image_files)
75 |         image_paths = filter(os.path.isfile, image_paths)
76 |     else:
77 |         image_paths = [args.image_path]
78 | 
79 |     for image_path in image_paths:
80 |         try:
81 |             x = load_image(image_path, device)
82 |             y = predict(caffe_model, x)
83 |             print(os.path.basename(image_path))
84 |             # cupy配列はソートに時間がかかるのでnumpy配列に変換する
85 |             print_top(cuda.to_cpu(y.data), categories)
86 |         except IOError:
87 |             print('cannot load {}'.format(image_path))
88 | 


--------------------------------------------------------------------------------
/src/ex1/data_test.txt:
--------------------------------------------------------------------------------
 1 | 2.92, -6.85, 1
 2 | 6.92, -1.65, 1
 3 | -5.89, 6.34, 1
 4 | 3.90, -3.10, 1
 5 | -5.11, 6.61, 1
 6 | -2.61, -4.00, 0
 7 | -7.89, 4.80, 1
 8 | -9.52, 8.90, 1
 9 | -6.44, -8.27, 0
10 | -0.18, 3.20, 1
11 | -7.41, -4.15, 0
12 | -7.73, 9.88, 1
13 | 2.21, 8.65, 0
14 | 8.22, 1.68, 0
15 | -2.95, 6.34, 1
16 | 9.87, 2.16, 0
17 | 2.61, 1.74, 0
18 | 8.50, 9.68, 0
19 | -4.74, 8.44, 1
20 | -2.33, -5.38, 0
21 | -0.93, -4.84, 0
22 | -6.15, 3.46, 1
23 | -8.79, 9.77, 1
24 | -8.57, -9.01, 0
25 | -0.54, -9.48, 0
26 | 2.22, 5.13, 0
27 | 9.33, -7.75, 1
28 | -1.54, 7.15, 1
29 | -0.36, 7.45, 1
30 | 2.97, 2.34, 0
31 | 6.84, 5.49, 0
32 | -3.95, 2.26, 1
33 | -0.92, -6.99, 0
34 | 6.41, -9.42, 1
35 | 0.96, -0.07, 1
36 | 2.34, 6.67, 0
37 | 8.75, -7.65, 1
38 | 5.00, -2.92, 1
39 | 3.42, 3.37, 0
40 | 1.18, -6.38, 1
41 | -0.24, 8.02, 1
42 | -6.72, 2.95, 1
43 | -5.97, 1.17, 1
44 | -2.79, 1.54, 1
45 | 0.52, 9.24, 0
46 | -1.33, -1.86, 0
47 | 7.57, -0.41, 1
48 | 8.91, -3.04, 1
49 | -7.77, 9.66, 1
50 | -1.10, -5.82, 0
51 | 


--------------------------------------------------------------------------------
/src/ex1/data_train.txt:
--------------------------------------------------------------------------------
  1 | 7.95, 2.18, 0
  2 | 2.07, -5.19, 1
  3 | -2.65, 6.56, 1
  4 | 3.92, -4.12, 1
  5 | -1.57, -9.04, 0
  6 | 2.24, -5.98, 1
  7 | -4.21, 1.15, 1
  8 | -1.79, 2.32, 1
  9 | 5.98, 4.81, 0
 10 | 3.49, -9.13, 1
 11 | -4.50, 7.10, 1
 12 | -3.27, -2.77, 0
 13 | -2.28, 4.10, 1
 14 | -0.95, -1.54, 0
 15 | 5.65, -3.53, 1
 16 | -2.36, 3.15, 1
 17 | -0.80, -6.42, 0
 18 | 3.36, 6.17, 0
 19 | 7.85, -2.09, 1
 20 | 1.98, 1.73, 0
 21 | -0.90, 7.88, 1
 22 | -6.94, -1.43, 0
 23 | -7.74, -9.88, 0
 24 | 9.51, -7.43, 1
 25 | -7.85, -7.24, 0
 26 | -7.16, -8.31, 0
 27 | 2.01, -2.53, 1
 28 | 4.90, -7.28, 1
 29 | 9.32, 8.76, 0
 30 | -8.42, 7.04, 1
 31 | 5.13, 7.42, 0
 32 | 2.09, 6.07, 0
 33 | 3.66, -6.79, 1
 34 | -2.05, 0.18, 1
 35 | 0.04, -7.46, 1
 36 | 6.09, -2.41, 1
 37 | 2.61, -5.04, 1
 38 | 8.06, -8.99, 1
 39 | 7.04, -1.53, 1
 40 | 4.98, 5.65, 0
 41 | 1.30, 9.28, 0
 42 | -7.82, 2.74, 1
 43 | 0.90, 4.24, 0
 44 | 0.57, 6.13, 0
 45 | 7.16, -5.06, 1
 46 | 2.93, -9.62, 1
 47 | -1.05, -7.02, 0
 48 | 5.94, -8.53, 1
 49 | -4.78, -8.36, 0
 50 | 1.68, 5.53, 0
 51 | 3.85, 2.57, 0
 52 | -2.65, -2.76, 0
 53 | -1.61, 4.66, 1
 54 | -1.23, -6.82, 0
 55 | -6.10, 1.98, 1
 56 | 5.39, 4.08, 0
 57 | -5.99, -7.68, 0
 58 | -0.39, -7.82, 0
 59 | -3.81, -5.42, 0
 60 | -6.98, -3.58, 0
 61 | -2.93, 3.14, 1
 62 | 3.09, 8.48, 0
 63 | -2.38, -5.77, 0
 64 | 3.05, -1.90, 1
 65 | -6.94, -7.69, 0
 66 | 5.32, -4.06, 1
 67 | -0.44, -9.49, 0
 68 | -5.62, -8.19, 0
 69 | 7.23, 5.06, 0
 70 | 3.90, -7.83, 1
 71 | 8.50, -8.28, 1
 72 | 0.35, 0.66, 0
 73 | -7.30, 1.20, 1
 74 | 3.23, 9.44, 0
 75 | 8.51, -8.12, 1
 76 | -0.69, -6.02, 0
 77 | 0.23, -3.74, 1
 78 | -6.38, 9.39, 1
 79 | -1.27, 1.81, 1
 80 | 2.49, -5.17, 1
 81 | 7.83, 6.36, 0
 82 | -3.44, -3.57, 0
 83 | 2.23, 5.04, 0
 84 | -5.78, 6.93, 1
 85 | -2.03, 0.09, 1
 86 | -6.47, -1.19, 0
 87 | -4.20, -4.20, 0
 88 | -3.35, 3.89, 1
 89 | 2.45, 7.00, 0
 90 | -5.52, -3.59, 0
 91 | -9.53, 0.05, 1
 92 | 3.44, 0.26, 0
 93 | -3.30, -3.47, 0
 94 | -8.21, 2.29, 1
 95 | -9.66, -9.03, 0
 96 | -4.45, 9.60, 1
 97 | 0.34, 4.08, 0
 98 | 9.56, -3.26, 1
 99 | -0.15, -0.44, 0
100 | -4.31, -2.49, 0
101 | 


--------------------------------------------------------------------------------
/src/ex1/example/image/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/src/ex1/example/image/.gitkeep


--------------------------------------------------------------------------------
/src/ex1/example/train.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import numpy as np
  5 | from PIL import Image, ImageDraw
  6 | import six
  7 | 
  8 | import chainer
  9 | from chainer import functions as F
 10 | from chainer import links as L
 11 | from chainer import optimizers
 12 | from chainer import serializers
 13 | from chainer.dataset import convert
 14 | 
 15 | # ヒント
 16 | # link:
 17 | # L.Linear
 18 | #
 19 | # function:
 20 | # F.relu, F.tanh, F.sigmoid
 21 | # F.softmax_cross_entropy
 22 | #
 23 | # optimizer:
 24 | # optimizers.Adam, optimizers.MomentumSGD
 25 | 
 26 | class ClassifierNet(chainer.Chain):
 27 |     def __init__(self):
 28 |         super(ClassifierNet, self).__init__(
 29 |             # TODO linkを追加する
 30 |             # 例: l1=L.Linear(2, 10),
 31 |             l1=L.Linear(2, 8),
 32 |             l2=L.Linear(8, 8),
 33 |             l3=L.Linear(8, 2),
 34 |         )
 35 | 
 36 |     def __call__(self, x, train=True):
 37 |         # TODO ニューラルネットワークの構成を定義する
 38 |         # 入力xのshapeは(batch_size, 2)
 39 |         # 出力のshapeは(batch_size, 2)
 40 |         h = F.relu(self.l1(x))
 41 |         h = F.relu(self.l2(h))
 42 |         return self.l3(h)
 43 | 
 44 | def main(train_x, train_y, test_x, test_y):
 45 |     # TODO 必要に応じてepoch_num, bach_sizeを修正
 46 |     epoch_num = 100
 47 |     batch_size = 10
 48 | 
 49 |     train_data = chainer.datasets.TupleDataset(train_x, train_y)
 50 |     test_data = chainer.datasets.TupleDataset(test_x, test_y)
 51 | 
 52 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
 53 | 
 54 |     # TODO ニューラルネットワークの生成
 55 |     # net = ...
 56 |     net = ClassifierNet()
 57 |     # TODO optimizerの生成
 58 |     # optimizer = ...
 59 |     optimizer = optimizers.Adam(alpha=0.01)
 60 |     # TODO optimizerとモデルの関連付け
 61 |     optimizer.setup(net)
 62 | 
 63 |     train_loss_sum = 0
 64 |     train_acc_sum = 0
 65 |     train_num = 0
 66 |     while train_iterator.epoch < epoch_num:
 67 |         batch = train_iterator.next()
 68 |         x, t = convert.concat_examples(batch)
 69 |         # x: 入力データ shapeは(batch_size, 2)
 70 |         # t: 正解ラベル shapeは(batch_size,)
 71 | 
 72 |         # TODO ニューラルネットワークの実行
 73 |         # y = ...
 74 |         y = net(x, train=True)
 75 |         # TODO 損失の計算
 76 |         # loss = ...
 77 |         loss = F.softmax_cross_entropy(y, t)
 78 |         # TODO 精度の計算
 79 |         # acc = ...
 80 |         acc = F.accuracy(y, t)
 81 |         # TODO ネットワークの勾配初期化
 82 |         net.cleargrads()
 83 |         # TODO バックプロパゲーションを行い勾配を計算する
 84 |         loss.backward()
 85 |         # TODO パラメータを更新する
 86 |         optimizer.update()
 87 | 
 88 |         train_loss_sum += float(loss.data) * len(x)
 89 |         train_acc_sum += float(acc.data) * len(x)
 90 |         train_num += len(x)
 91 |         if train_iterator.is_new_epoch:
 92 |             train_loss = train_loss_sum / train_num
 93 |             train_acc = train_acc_sum / train_num
 94 |             print('epoch {} done'.format(train_iterator.epoch))
 95 |             print('train loss: {} accracy: {}'.format(train_loss, train_acc))
 96 | 
 97 |             # TODO テストデータの精度を計算
 98 |             test_loss, test_acc = evaluate(net, test_data)
 99 |             print('test loss: {} accuracy: {}'.format(test_loss, test_acc))
100 | 
101 |             train_loss_sum = 0
102 |             train_acc_sum = 0
103 |             train_num = 0
104 | 
105 |     save_image(net, train_x, train_y, test_x, test_y)
106 | 
107 | def evaluate(net, dataset):
108 |     batch_size = 100
109 |     iterator = chainer.iterators.SerialIterator(dataset, batch_size, repeat=False, shuffle=False)
110 |     loss_sum = 0
111 |     acc_sum = 0
112 |     num = 0
113 |     for batch in iterator:
114 |         x, t = convert.concat_examples(batch)
115 |         x = chainer.Variable(x, volatile=True)
116 |         t = chainer.Variable(t, volatile=True)
117 |         # TODO ネットワーク実行
118 |         # y = ...
119 |         y = net(x, train=False)
120 |         # TODO 損失を計算
121 |         # loss = ...
122 |         loss = F.softmax_cross_entropy(y, t)
123 |         # TODO 精度を計算
124 |         # acc = ...
125 |         acc = F.accuracy(y, t)
126 | 
127 |         n = len(x.data)
128 |         loss_sum += float(loss.data) * n
129 |         acc_sum += float(acc.data) * n
130 |         num += n
131 |     return loss_sum / num, acc_sum / num
132 | 
133 | def point_to_pixel(p):
134 |     return (p[0] + 10) * 25, (10 - p[1]) * 25
135 | 
136 | def save_image(net, train_x, train_y, test_x, test_y):
137 |     image = Image.new('RGB', (500, 500), (255, 255, 255))
138 |     draw = ImageDraw.Draw(image)
139 | 
140 |     colors = [(255, 192, 192), (192, 192, 255)]
141 |     for i in six.moves.range(100):
142 |         x = np.zeros((100, 2), dtype=np.float32)
143 |         x[:,1] = float(i) * 0.2 - 10 + 0.1
144 |         x[:,0] = np.arange(100).astype(np.float32) * 0.2 - 10 + 0.1
145 |         y = net(x, train=False)
146 |         labels = y.data.argmax(axis=1)
147 |         for p, label in zip(x, labels):
148 |             pos_x, pos_y = point_to_pixel(p)
149 |             draw.rectangle((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[label])
150 |     draw.line((250, 0, 250, 500), fill=0)
151 |     draw.line((0, 250, 500, 250), fill=0)
152 |     test_image = image.copy()
153 | 
154 |     colors = [(255, 0, 0), (0, 0, 255)]
155 |     for x, y in zip(train_x, train_y):
156 |         pos_x, pos_y = point_to_pixel(x)
157 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
158 |     image.save('image/result_train.png')
159 | 
160 |     draw = ImageDraw.Draw(test_image)
161 |     for x, y in zip(test_x, test_y):
162 |         pos_x, pos_y = point_to_pixel(x)
163 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
164 |     test_image.save('image/result_test.png')
165 | 
166 | def load_data(file_path):
167 |     positions = []
168 |     labels = []
169 |     with open(file_path) as f:
170 |         for line in f:
171 |             x, y, label = line.split(',')
172 |             positions.append([float(x), float(y)])
173 |             labels.append(int(label))
174 |     positions = np.asarray(positions, dtype=np.float32)
175 |     labels = np.asarray(labels, dtype=np.int32)
176 |     return positions, labels
177 | 
178 | if __name__ == '__main__':
179 |     train_x, train_y = load_data('../data_train.txt')
180 |     test_x, test_y = load_data('../data_test.txt')
181 |     main(train_x, train_y, test_x, test_y)
182 | 


--------------------------------------------------------------------------------
/src/ex1/image/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/src/ex1/image/.gitkeep


--------------------------------------------------------------------------------
/src/ex1/train.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import numpy as np
  5 | from PIL import Image, ImageDraw
  6 | import six
  7 | 
  8 | import chainer
  9 | from chainer import functions as F
 10 | from chainer import links as L
 11 | from chainer import optimizers
 12 | from chainer import serializers
 13 | from chainer.dataset import convert
 14 | 
 15 | # ヒント
 16 | # chainer links:
 17 | # L.Linear
 18 | #
 19 | # chainer functions:
 20 | # F.relu, F.tanh, F.sigmoid
 21 | # F.softmax_cross_entropy
 22 | #
 23 | # chainer optimizers:
 24 | # optimizers.Adam, optimizers.MomentumSGD
 25 | # optimizer methods:
 26 | # optimzier.setup(), optimizer,update()
 27 | #
 28 | # Variable methods:
 29 | # var.backward()
 30 | 
 31 | class ClassifierNet(chainer.Chain):
 32 |     def __init__(self):
 33 |         super(ClassifierNet, self).__init__(
 34 |             # TODO linkを追加する
 35 |             # 例: l1=L.Linear(2, 10),
 36 | 
 37 |         )
 38 | 
 39 |     def __call__(self, x, train=True):
 40 |         # TODO ニューラルネットワークの構成を定義する
 41 |         # 入力xのshapeは(batch_size, 2)
 42 |         # 出力のshapeは(batch_size, 2)
 43 | 
 44 | 
 45 | def main(train_x, train_y, test_x, test_y):
 46 |     # TODO 必要に応じてepoch_num, bach_sizeを修正
 47 |     epoch_num = 100
 48 |     batch_size = 10
 49 | 
 50 |     train_data = chainer.datasets.TupleDataset(train_x, train_y)
 51 |     test_data = chainer.datasets.TupleDataset(test_x, test_y)
 52 | 
 53 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
 54 | 
 55 |     # TODO ニューラルネットワークの生成
 56 |     # net = ...
 57 | 
 58 |     # TODO optimizerの生成
 59 |     # optimizer = ...
 60 | 
 61 |     # TODO optimizerとモデルの関連付け
 62 | 
 63 | 
 64 |     train_loss_sum = 0
 65 |     train_acc_sum = 0
 66 |     train_num = 0
 67 |     while train_iterator.epoch < epoch_num:
 68 |         batch = train_iterator.next()
 69 |         x, t = convert.concat_examples(batch)
 70 |         # x: 入力データ shapeは(batch_size, 2)
 71 |         # t: 正解ラベル shapeは(batch_size,)
 72 | 
 73 |         # TODO ニューラルネットワークの実行
 74 |         # y = ...
 75 | 
 76 |         # TODO 損失の計算
 77 |         # loss = ...
 78 | 
 79 |         # TODO 精度の計算
 80 |         # acc = ...
 81 | 
 82 |         # TODO ネットワークの勾配初期化
 83 | 
 84 |         # TODO バックプロパゲーションを行い勾配を計算する
 85 | 
 86 |         # TODO パラメータを更新する
 87 | 
 88 | 
 89 |         train_loss_sum += float(loss.data) * len(x)
 90 |         train_acc_sum += float(acc.data) * len(x)
 91 |         train_num += len(x)
 92 |         if train_iterator.is_new_epoch:
 93 |             train_loss = train_loss_sum / train_num
 94 |             train_acc = train_acc_sum / train_num
 95 |             print('epoch {} done'.format(train_iterator.epoch))
 96 |             print('train loss: {} accracy: {}'.format(train_loss, train_acc))
 97 | 
 98 |             # TODO テストデータの精度を計算
 99 |             # コメント部分を有効にする
100 |             # test_loss, test_acc = evaluate(net, test_data)
101 |             # print('test loss: {} accuracy: {}'.format(test_loss, test_acc))
102 | 
103 |             train_loss_sum = 0
104 |             train_acc_sum = 0
105 |             train_num = 0
106 | 
107 |     save_image(net, train_x, train_y, test_x, test_y)
108 | 
109 | def evaluate(net, dataset):
110 |     batch_size = 100
111 |     iterator = chainer.iterators.SerialIterator(dataset, batch_size, repeat=False, shuffle=False)
112 |     loss_sum = 0
113 |     acc_sum = 0
114 |     num = 0
115 |     for batch in iterator:
116 |         x, t = convert.concat_examples(batch)
117 |         x = chainer.Variable(x, volatile=True)
118 |         t = chainer.Variable(t, volatile=True)
119 |         # TODO ネットワーク実行
120 |         # y = ...
121 | 
122 |         # TODO 損失を計算
123 |         # loss = ...
124 | 
125 |         # TODO 精度を計算
126 |         # acc = ...
127 | 
128 | 
129 |         n = len(x.data)
130 |         loss_sum += float(loss.data) * n
131 |         acc_sum += float(acc.data) * n
132 |         num += n
133 |     return loss_sum / num, acc_sum / num
134 | 
135 | def point_to_pixel(p):
136 |     return (p[0] + 10) * 25, (10 - p[1]) * 25
137 | 
138 | def save_image(net, train_x, train_y, test_x, test_y):
139 |     image = Image.new('RGB', (500, 500), (255, 255, 255))
140 |     draw = ImageDraw.Draw(image)
141 | 
142 |     colors = [(255, 192, 192), (192, 192, 255)]
143 |     for i in six.moves.range(100):
144 |         x = np.zeros((100, 2), dtype=np.float32)
145 |         x[:,1] = float(i) * 0.2 - 10 + 0.1
146 |         x[:,0] = np.arange(100).astype(np.float32) * 0.2 - 10 + 0.1
147 |         y = net(x, train=False)
148 |         labels = y.data.argmax(axis=1)
149 |         for p, label in zip(x, labels):
150 |             pos_x, pos_y = point_to_pixel(p)
151 |             draw.rectangle((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[label])
152 |     draw.line((250, 0, 250, 500), fill=0)
153 |     draw.line((0, 250, 500, 250), fill=0)
154 |     test_image = image.copy()
155 | 
156 |     colors = [(255, 0, 0), (0, 0, 255)]
157 |     for x, y in zip(train_x, train_y):
158 |         pos_x, pos_y = point_to_pixel(x)
159 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
160 |     image.save('image/result_train.png')
161 | 
162 |     draw = ImageDraw.Draw(test_image)
163 |     for x, y in zip(test_x, test_y):
164 |         pos_x, pos_y = point_to_pixel(x)
165 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
166 |     test_image.save('image/result_test.png')
167 | 
168 | def load_data(file_path):
169 |     positions = []
170 |     labels = []
171 |     with open(file_path) as f:
172 |         for line in f:
173 |             x, y, label = line.split(',')
174 |             positions.append([float(x), float(y)])
175 |             labels.append(int(label))
176 |     positions = np.asarray(positions, dtype=np.float32)
177 |     labels = np.asarray(labels, dtype=np.int32)
178 |     return positions, labels
179 | 
180 | if __name__ == '__main__':
181 |     train_x, train_y = load_data('data_train.txt')
182 |     test_x, test_y = load_data('data_test.txt')
183 |     main(train_x, train_y, test_x, test_y)
184 | 


--------------------------------------------------------------------------------
/src/ex1_2/data_test.txt:
--------------------------------------------------------------------------------
 1 | 0.56, 0.57, 0
 2 | -1.92, 1.55, 0
 3 | 0.41, 1.16, 0
 4 | 4.68, -3.22, 0
 5 | -7.29, 4.98, 0
 6 | 5.75, 3.74, 0
 7 | -3.05, 0.24, 0
 8 | 0.05, 1.54, 0
 9 | 5.29, 4.58, 0
10 | -1.66, 1.67, 0
11 | -0.94, 1.82, 0
12 | -0.13, 1.64, 0
13 | -2.35, 1.24, 0
14 | 4.74, -3.15, 0
15 | 2.10, 7.27, 0
16 | 4.39, 5.71, 0
17 | -1.58, 1.70, 0
18 | 6.35, 1.27, 0
19 | 1.53, 7.50, 0
20 | 5.67, -1.85, 0
21 | 0.65, -4.80, 0
22 | 1.77, 7.41, 0
23 | 0.00, 7.85, 0
24 | 6.09, 2.86, 0
25 | -0.27, -4.65, 0
26 | -2.16, 1.39, 0
27 | 6.32, 1.72, 0
28 | -2.21, 1.36, 0
29 | -0.85, 1.82, 0
30 | -2.96, 0.45, 0
31 | -0.02, 4.72, 1
32 | 7.83, -4.29, 1
33 | 9.02, -1.89, 1
34 | 1.24, 4.25, 1
35 | 1.21, -1.79, 1
36 | -1.10, 4.81, 1
37 | -0.46, -1.06, 1
38 | 2.07, -7.84, 1
39 | 3.11, -0.09, 1
40 | -0.55, -0.55, 1
41 | 2.66, -0.91, 1
42 | 0.99, -1.82, 1
43 | -0.22, -1.40, 1
44 | -4.03, -6.04, 1
45 | 1.21, 4.27, 1
46 | 1.88, 3.81, 1
47 | -2.92, 4.42, 1
48 | 0.33, 4.63, 1
49 | -0.03, -1.55, 1
50 | 2.71, -0.84, 1
51 | -0.23, -7.82, 1
52 | 0.36, -1.74, 1
53 | -1.34, 4.80, 1
54 | -5.81, -3.60, 1
55 | 0.48, -1.77, 1
56 | -6.04, -3.00, 1
57 | 9.23, -1.14, 1
58 | 1.46, 4.12, 1
59 | -4.65, 3.25, 1
60 | -0.05, -1.54, 1
61 | 


--------------------------------------------------------------------------------
/src/ex1_2/data_train.txt:
--------------------------------------------------------------------------------
  1 | 0.46, 1.07, 0
  2 | 5.77, -1.65, 0
  3 | 2.88, -4.43, 0
  4 | 6.03, 3.02, 0
  5 | -3.20, -1.71, 0
  6 | 4.91, 5.11, 0
  7 | -0.25, 1.70, 0
  8 | 0.45, 1.10, 0
  9 | -2.39, 1.20, 0
 10 | -2.31, 1.28, 0
 11 | -0.62, -4.53, 0
 12 | -0.89, 1.82, 0
 13 | 0.19, 1.43, 0
 14 | -7.36, 4.90, 0
 15 | -3.71, 7.44, 0
 16 | 3.86, -3.91, 0
 17 | 1.91, -4.72, 0
 18 | -7.00, 5.30, 0
 19 | 4.47, -3.42, 0
 20 | -1.44, 1.74, 0
 21 | -2.09, 1.44, 0
 22 | 6.36, 0.84, 0
 23 | 0.85, -4.81, 0
 24 | -2.23, 1.34, 0
 25 | 2.11, -4.68, 0
 26 | 0.15, 1.46, 0
 27 | -3.27, -0.66, 0
 28 | 2.76, 6.95, 0
 29 | -0.86, -4.44, 0
 30 | 1.38, -4.80, 0
 31 | -3.20, -0.20, 0
 32 | -0.41, 1.75, 0
 33 | -0.22, 1.68, 0
 34 | 2.39, 7.14, 0
 35 | 1.39, -4.80, 0
 36 | 2.49, 7.09, 0
 37 | 0.53, 0.42, 0
 38 | -1.14, 1.80, 0
 39 | -0.28, -4.64, 0
 40 | -0.63, -4.53, 0
 41 | -5.83, 6.32, 0
 42 | -0.43, 1.76, 0
 43 | 0.56, 0.70, 0
 44 | 0.50, 0.98, 0
 45 | -1.61, 1.69, 0
 46 | -1.67, -3.98, 0
 47 | 5.93, -1.30, 0
 48 | 5.77, -1.65, 0
 49 | -6.39, 5.88, 0
 50 | -8.76, 2.61, 0
 51 | -5.90, -3.39, 1
 52 | -0.51, -0.96, 1
 53 | -2.77, -6.94, 1
 54 | 2.26, 3.46, 1
 55 | -0.77, -7.72, 1
 56 | -5.55, 2.08, 1
 57 | -0.48, -1.03, 1
 58 | -3.50, -6.47, 1
 59 | 2.52, -1.07, 1
 60 | -3.68, -6.34, 1
 61 | -5.76, 1.67, 1
 62 | 1.18, 4.28, 1
 63 | 1.58, -1.70, 1
 64 | 1.46, 4.12, 1
 65 | 4.15, -7.27, 1
 66 | 3.37, -7.56, 1
 67 | -0.09, -1.51, 1
 68 | -5.74, -3.76, 1
 69 | -4.14, -5.94, 1
 70 | -2.38, 4.61, 1
 71 | 3.16, 0.05, 1
 72 | 6.14, -6.08, 1
 73 | 1.49, 4.10, 1
 74 | 2.43, -7.79, 1
 75 | -5.28, 2.50, 1
 76 | 5.84, -6.32, 1
 77 | -5.97, -3.21, 1
 78 | 1.81, -1.60, 1
 79 | 1.66, 3.99, 1
 80 | 3.25, 0.49, 1
 81 | 0.30, -1.72, 1
 82 | 9.23, -1.16, 1
 83 | 2.15, -1.40, 1
 84 | -3.12, 4.33, 1
 85 | -0.24, -1.38, 1
 86 | -0.27, 4.76, 1
 87 | 1.82, -1.60, 1
 88 | -5.98, -3.18, 1
 89 | 3.52, -7.51, 1
 90 | 3.24, 0.40, 1
 91 | -5.08, -4.89, 1
 92 | -4.94, 2.93, 1
 93 | -5.74, -3.76, 1
 94 | -4.62, 3.28, 1
 95 | 0.60, -1.80, 1
 96 | 0.31, -1.72, 1
 97 | -5.38, 2.36, 1
 98 | -6.30, -0.12, 1
 99 | -1.76, 4.75, 1
100 | 6.40, -5.87, 1
101 | 


--------------------------------------------------------------------------------
/src/ex1_2/example/image/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/src/ex1_2/example/image/.gitkeep


--------------------------------------------------------------------------------
/src/ex1_2/example/train.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import numpy as np
  5 | from PIL import Image, ImageDraw
  6 | import six
  7 | 
  8 | import chainer
  9 | from chainer import functions as F
 10 | from chainer import links as L
 11 | from chainer import optimizers
 12 | from chainer import serializers
 13 | from chainer.dataset import convert
 14 | 
 15 | # ヒント
 16 | # link:
 17 | # L.Linear
 18 | #
 19 | # function:
 20 | # F.relu, F.tanh, F.sigmoid
 21 | # F.softmax_cross_entropy
 22 | #
 23 | # optimizer:
 24 | # optimizers.Adam, optimizers.MomentumSGD
 25 | 
 26 | class ClassifierNet(chainer.Chain):
 27 |     def __init__(self):
 28 |         super(ClassifierNet, self).__init__(
 29 |             # TODO linkを追加する
 30 |             # 例: l1=L.Linear(2, 10),
 31 |             l1=L.Linear(2, 8),
 32 |             l2=L.Linear(8, 8),
 33 |             l3=L.Linear(8, 8),
 34 |             l4=L.Linear(8, 2),
 35 |         )
 36 | 
 37 |     def __call__(self, x, train=True):
 38 |         # TODO ニューラルネットワークの構成を定義する
 39 |         # 入力xのshapeは(batch_size, 2)
 40 |         # 出力のshapeは(batch_size, 2)
 41 |         h = F.relu(self.l1(x))
 42 |         h = F.relu(self.l2(h))
 43 |         h = F.relu(self.l3(h))
 44 |         return self.l4(h)
 45 | 
 46 | def main(train_x, train_y, test_x, test_y):
 47 |     # TODO 必要に応じてepoch_num, bach_sizeを修正
 48 |     epoch_num = 200
 49 |     batch_size = 10
 50 | 
 51 |     train_data = chainer.datasets.TupleDataset(train_x, train_y)
 52 |     test_data = chainer.datasets.TupleDataset(test_x, test_y)
 53 | 
 54 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
 55 | 
 56 |     # TODO ニューラルネットワークの生成
 57 |     # net = ...
 58 |     net = ClassifierNet()
 59 |     # TODO optimizerの生成
 60 |     # optimizer = ...
 61 |     optimizer = optimizers.MomentumSGD(0.005)
 62 |     # TODO optimizerとモデルの関連付け
 63 |     optimizer.setup(net)
 64 | 
 65 |     train_loss_sum = 0
 66 |     train_acc_sum = 0
 67 |     train_num = 0
 68 |     while train_iterator.epoch < epoch_num:
 69 |         batch = train_iterator.next()
 70 |         x, t = convert.concat_examples(batch)
 71 |         # x: 入力データ shapeは(batch_size, 2)
 72 |         # t: 正解ラベル shapeは(batch_size,)
 73 | 
 74 |         # TODO ニューラルネットワークの実行
 75 |         # y = ...
 76 |         y = net(x, train=True)
 77 |         # TODO 損失の計算
 78 |         # loss = ...
 79 |         loss = F.softmax_cross_entropy(y, t)
 80 |         # TODO 精度の計算
 81 |         # acc = ...
 82 |         acc = F.accuracy(y, t)
 83 |         # TODO ネットワークの勾配初期化
 84 |         net.cleargrads()
 85 |         # TODO バックプロパゲーションを行い勾配を計算する
 86 |         loss.backward()
 87 |         # TODO パラメータを更新する
 88 |         optimizer.update()
 89 | 
 90 |         train_loss_sum += float(loss.data) * len(x)
 91 |         train_acc_sum += float(acc.data) * len(x)
 92 |         train_num += len(x)
 93 |         if train_iterator.is_new_epoch:
 94 |             train_loss = train_loss_sum / train_num
 95 |             train_acc = train_acc_sum / train_num
 96 |             print('epoch {} done'.format(train_iterator.epoch))
 97 |             print('train loss: {} accracy: {}'.format(train_loss, train_acc))
 98 | 
 99 |             # TODO テストデータの精度を計算
100 |             test_loss, test_acc = evaluate(net, test_data)
101 |             print('test loss: {} accuracy: {}'.format(test_loss, test_acc))
102 | 
103 |             train_loss_sum = 0
104 |             train_acc_sum = 0
105 |             train_num = 0
106 | 
107 |     save_image(net, train_x, train_y, test_x, test_y)
108 | 
109 | def evaluate(net, dataset):
110 |     batch_size = 100
111 |     iterator = chainer.iterators.SerialIterator(dataset, batch_size, repeat=False, shuffle=False)
112 |     loss_sum = 0
113 |     acc_sum = 0
114 |     num = 0
115 |     for batch in iterator:
116 |         x, t = convert.concat_examples(batch)
117 |         x = chainer.Variable(x, volatile=True)
118 |         t = chainer.Variable(t, volatile=True)
119 |         # TODO ネットワーク実行
120 |         # y = ...
121 |         y = net(x, train=False)
122 |         # TODO 損失を計算
123 |         # loss = ...
124 |         loss = F.softmax_cross_entropy(y, t)
125 |         # TODO 精度を計算
126 |         # acc = ...
127 |         acc = F.accuracy(y, t)
128 | 
129 |         n = len(x.data)
130 |         loss_sum += float(loss.data) * n
131 |         acc_sum += float(acc.data) * n
132 |         num += n
133 |     return loss_sum / num, acc_sum / num
134 | 
135 | def point_to_pixel(p):
136 |     return (p[0] + 10) * 25, (10 - p[1]) * 25
137 | 
138 | def save_image(net, train_x, train_y, test_x, test_y):
139 |     image = Image.new('RGB', (500, 500), (255, 255, 255))
140 |     draw = ImageDraw.Draw(image)
141 | 
142 |     colors = [(255, 192, 192), (192, 192, 255)]
143 |     for i in six.moves.range(100):
144 |         x = np.zeros((100, 2), dtype=np.float32)
145 |         x[:,1] = float(i) * 0.2 - 10 + 0.1
146 |         x[:,0] = np.arange(100).astype(np.float32) * 0.2 - 10 + 0.1
147 |         y = net(x, train=False)
148 |         labels = y.data.argmax(axis=1)
149 |         for p, label in zip(x, labels):
150 |             pos_x, pos_y = point_to_pixel(p)
151 |             draw.rectangle((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[label])
152 |     draw.line((250, 0, 250, 500), fill=0)
153 |     draw.line((0, 250, 500, 250), fill=0)
154 |     test_image = image.copy()
155 | 
156 |     colors = [(255, 0, 0), (0, 0, 255)]
157 |     for x, y in zip(train_x, train_y):
158 |         pos_x, pos_y = point_to_pixel(x)
159 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
160 |     image.save('image/result_train.png')
161 | 
162 |     draw = ImageDraw.Draw(test_image)
163 |     for x, y in zip(test_x, test_y):
164 |         pos_x, pos_y = point_to_pixel(x)
165 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
166 |     test_image.save('image/result_test.png')
167 | 
168 | def load_data(file_path):
169 |     positions = []
170 |     labels = []
171 |     with open(file_path) as f:
172 |         for line in f:
173 |             x, y, label = line.split(',')
174 |             positions.append([float(x), float(y)])
175 |             labels.append(int(label))
176 |     positions = np.asarray(positions, dtype=np.float32)
177 |     labels = np.asarray(labels, dtype=np.int32)
178 |     return positions, labels
179 | 
180 | if __name__ == '__main__':
181 |     train_x, train_y = load_data('../data_train.txt')
182 |     test_x, test_y = load_data('../data_test.txt')
183 |     main(train_x, train_y, test_x, test_y)
184 | 


--------------------------------------------------------------------------------
/src/ex1_2/image/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/src/ex1_2/image/.gitkeep


--------------------------------------------------------------------------------
/src/ex1_2/train.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import numpy as np
  5 | from PIL import Image, ImageDraw
  6 | import six
  7 | 
  8 | import chainer
  9 | from chainer import functions as F
 10 | from chainer import links as L
 11 | from chainer import optimizers
 12 | from chainer import serializers
 13 | from chainer.dataset import convert
 14 | 
 15 | # ヒント
 16 | # chainer links:
 17 | # L.Linear
 18 | #
 19 | # chainer functions:
 20 | # F.relu, F.tanh, F.sigmoid
 21 | # F.softmax_cross_entropy
 22 | #
 23 | # chainer optimizers:
 24 | # optimizers.Adam, optimizers.MomentumSGD
 25 | # optimizer methods:
 26 | # optimzier.setup(), optimizer,update()
 27 | #
 28 | # Variable methods:
 29 | # var.backward()
 30 | 
 31 | class ClassifierNet(chainer.Chain):
 32 |     def __init__(self):
 33 |         super(ClassifierNet, self).__init__(
 34 |             # TODO linkを追加する
 35 |             # 例: l1=L.Linear(2, 10),
 36 | 
 37 |         )
 38 | 
 39 |     def __call__(self, x, train=True):
 40 |         # TODO ニューラルネットワークの構成を定義する
 41 |         # 入力xのshapeは(batch_size, 2)
 42 |         # 出力のshapeは(batch_size, 2)
 43 | 
 44 | 
 45 | def main(train_x, train_y, test_x, test_y):
 46 |     # TODO 必要に応じてepoch_num, bach_sizeを修正
 47 |     epoch_num = 100
 48 |     batch_size = 10
 49 | 
 50 |     train_data = chainer.datasets.TupleDataset(train_x, train_y)
 51 |     test_data = chainer.datasets.TupleDataset(test_x, test_y)
 52 | 
 53 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
 54 | 
 55 |     # TODO ニューラルネットワークの生成
 56 |     # net = ...
 57 | 
 58 |     # TODO optimizerの生成
 59 |     # optimizer = ...
 60 | 
 61 |     # TODO optimizerとモデルの関連付け
 62 | 
 63 | 
 64 |     train_loss_sum = 0
 65 |     train_acc_sum = 0
 66 |     train_num = 0
 67 |     while train_iterator.epoch < epoch_num:
 68 |         batch = train_iterator.next()
 69 |         x, t = convert.concat_examples(batch)
 70 |         # x: 入力データ shapeは(batch_size, 2)
 71 |         # t: 正解ラベル shapeは(batch_size,)
 72 | 
 73 |         # TODO ニューラルネットワークの実行
 74 |         # y = ...
 75 | 
 76 |         # TODO 損失の計算
 77 |         # loss = ...
 78 | 
 79 |         # TODO 精度の計算
 80 |         # acc = ...
 81 | 
 82 |         # TODO ネットワークの勾配初期化
 83 | 
 84 |         # TODO バックプロパゲーションを行い勾配を計算する
 85 | 
 86 |         # TODO パラメータを更新する
 87 | 
 88 | 
 89 |         train_loss_sum += float(loss.data) * len(x)
 90 |         train_acc_sum += float(acc.data) * len(x)
 91 |         train_num += len(x)
 92 |         if train_iterator.is_new_epoch:
 93 |             train_loss = train_loss_sum / train_num
 94 |             train_acc = train_acc_sum / train_num
 95 |             print('epoch {} done'.format(train_iterator.epoch))
 96 |             print('train loss: {} accracy: {}'.format(train_loss, train_acc))
 97 | 
 98 |             # TODO テストデータの精度を計算
 99 |             # コメント部分を有効にする
100 |             # test_loss, test_acc = evaluate(net, test_data)
101 |             # print('test loss: {} accuracy: {}'.format(test_loss, test_acc))
102 | 
103 |             train_loss_sum = 0
104 |             train_acc_sum = 0
105 |             train_num = 0
106 | 
107 |     save_image(net, train_x, train_y, test_x, test_y)
108 | 
109 | def evaluate(net, dataset):
110 |     batch_size = 100
111 |     iterator = chainer.iterators.SerialIterator(dataset, batch_size, repeat=False, shuffle=False)
112 |     loss_sum = 0
113 |     acc_sum = 0
114 |     num = 0
115 |     for batch in iterator:
116 |         x, t = convert.concat_examples(batch)
117 |         x = chainer.Variable(x, volatile=True)
118 |         t = chainer.Variable(t, volatile=True)
119 |         # TODO ネットワーク実行
120 |         # y = ...
121 | 
122 |         # TODO 損失を計算
123 |         # loss = ...
124 | 
125 |         # TODO 精度を計算
126 |         # acc = ...
127 | 
128 | 
129 |         n = len(x.data)
130 |         loss_sum += float(loss.data) * n
131 |         acc_sum += float(acc.data) * n
132 |         num += n
133 |     return loss_sum / num, acc_sum / num
134 | 
135 | def point_to_pixel(p):
136 |     return (p[0] + 10) * 25, (10 - p[1]) * 25
137 | 
138 | def save_image(net, train_x, train_y, test_x, test_y):
139 |     image = Image.new('RGB', (500, 500), (255, 255, 255))
140 |     draw = ImageDraw.Draw(image)
141 | 
142 |     colors = [(255, 192, 192), (192, 192, 255)]
143 |     for i in six.moves.range(100):
144 |         x = np.zeros((100, 2), dtype=np.float32)
145 |         x[:,1] = float(i) * 0.2 - 10 + 0.1
146 |         x[:,0] = np.arange(100).astype(np.float32) * 0.2 - 10 + 0.1
147 |         y = net(x, train=False)
148 |         labels = y.data.argmax(axis=1)
149 |         for p, label in zip(x, labels):
150 |             pos_x, pos_y = point_to_pixel(p)
151 |             draw.rectangle((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[label])
152 |     draw.line((250, 0, 250, 500), fill=0)
153 |     draw.line((0, 250, 500, 250), fill=0)
154 |     test_image = image.copy()
155 | 
156 |     colors = [(255, 0, 0), (0, 0, 255)]
157 |     for x, y in zip(train_x, train_y):
158 |         pos_x, pos_y = point_to_pixel(x)
159 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
160 |     image.save('image/result_train.png')
161 | 
162 |     draw = ImageDraw.Draw(test_image)
163 |     for x, y in zip(test_x, test_y):
164 |         pos_x, pos_y = point_to_pixel(x)
165 |         draw.ellipse((pos_x - 2, pos_y - 2, pos_x + 2, pos_y + 2), colors[y])
166 |     test_image.save('image/result_test.png')
167 | 
168 | def load_data(file_path):
169 |     positions = []
170 |     labels = []
171 |     with open(file_path) as f:
172 |         for line in f:
173 |             x, y, label = line.split(',')
174 |             positions.append([float(x), float(y)])
175 |             labels.append(int(label))
176 |     positions = np.asarray(positions, dtype=np.float32)
177 |     labels = np.asarray(labels, dtype=np.int32)
178 |     return positions, labels
179 | 
180 | if __name__ == '__main__':
181 |     train_x, train_y = load_data('data_train.txt')
182 |     test_x, test_y = load_data('data_test.txt')
183 |     main(train_x, train_y, test_x, test_y)
184 | 


--------------------------------------------------------------------------------
/src/ex2/zundoko.py:
--------------------------------------------------------------------------------
 1 | # -*- coding: utf-8 -*-
 2 | 
 3 | import numpy as np
 4 | import six
 5 | 
 6 | import chainer
 7 | from chainer import functions as F
 8 | from chainer import links as L
 9 | from chainer import optimizers
10 | from chainer import Variable
11 | 
12 | np.random.seed()
13 | zun = 0
14 | doko = 1
15 | input_num = 2
16 | none = 0
17 | kiyoshi = 1
18 | output_num = 2
19 | input_words = ['ズン', 'ドコ']
20 | output_words = [None, '＼キ・ヨ・シ！／']
21 | # for Windows command prompt
22 | #input_words = ['Zun', 'Doko']
23 | #output_words = [None, 'Ki.Yo.Shi!']
24 | hidden_num = 8
25 | update_iteration = 20
26 | 
27 | class Zundoko(chainer.Chain):
28 |     def __init__(self):
29 |         super(Zundoko, self).__init__(
30 |             word=L.EmbedID(input_num, hidden_num),
31 |             l1 = L.Linear(hidden_num, hidden_num),
32 |             out=L.Linear(hidden_num, output_num),
33 |         )
34 | 
35 |     def __call__(self, x, prev_h=None, train=True):
36 |         h0 = self.word(x)
37 |         # next_h = ...
38 |         # y = ...
39 |         return y, next_h
40 | 
41 | kiyoshi_list = [zun, zun, zun, zun, doko]
42 | kiyoshi_pattern = 0
43 | kiyoshi_mask = (1 << len(kiyoshi_list)) - 1
44 | for token in kiyoshi_list:
45 |     kiyoshi_pattern = (kiyoshi_pattern << 1) | token
46 | 
47 | zundoko = Zundoko()
48 | optimizer = optimizers.Adam(alpha=0.01)
49 | optimizer.setup(zundoko)
50 | 
51 | def train():
52 |     loss = 0
53 |     acc = 0
54 |     batch_size = 20
55 |     recent_pattern = np.zeros((batch_size,), dtype=np.int32)
56 |     h = None
57 |     for i in six.moves.range(200):
58 |         x = np.random.randint(0, input_num, batch_size).astype(np.int32)
59 |         y, h = zundoko(x, h, train=True)
60 |         recent_pattern = ((recent_pattern << 1) | x) & kiyoshi_mask
61 |         if i < len(kiyoshi_list):
62 |             t = np.full((batch_size,), none, dtype=np.int32)
63 |         else:
64 |             t = np.where(recent_pattern == kiyoshi_pattern, kiyoshi, none).astype(np.int32)
65 |         loss += F.softmax_cross_entropy(y, t)
66 |         acc += float(F.accuracy(y, t).data)
67 |         if (i + 1) % update_iteration == 0:
68 |             zundoko.cleargrads()
69 |             loss.backward()
70 |             loss.unchain_backward()
71 |             optimizer.update()
72 |             print('train loss: {} accuracy: {}'.format(loss.data, acc / update_iteration))
73 |             loss = 0
74 |             acc = 0
75 | 
76 | def predict():
77 |     batch_size = 1
78 |     h = None
79 |     for i in six.moves.range(200):
80 |         x = np.random.randint(0, input_num, batch_size).astype(np.int32)
81 |         x = Variable(x, volatile=True)
82 |         y, h = zundoko(x, h, train=False)
83 |         print input_words[x.data[0]]
84 |         out_word = output_words[np.argmax(y.data[0])]
85 |         if out_word != None:
86 |             print(out_word)
87 |             break
88 | 
89 | for iteration in range(25):
90 |     train()
91 | 
92 | for i in range(5):
93 |     print('prediction: {}'.format(i + 1))
94 |     predict()
95 | 


--------------------------------------------------------------------------------
/src/ex2/zundoko_lstm.py:
--------------------------------------------------------------------------------
 1 | # -*- coding: utf-8 -*-
 2 | 
 3 | import numpy as np
 4 | import six
 5 | 
 6 | import chainer
 7 | from chainer import functions as F
 8 | from chainer import links as L
 9 | from chainer import optimizers
10 | from chainer import Variable
11 | 
12 | np.random.seed()
13 | zun = 0
14 | doko = 1
15 | input_num = 2
16 | none = 0
17 | kiyoshi = 1
18 | output_num = 2
19 | input_words = ['ズン', 'ドコ']
20 | output_words = [None, '＼キ・ヨ・シ！／']
21 | # for Windows command prompt
22 | #input_words = ['Zun', 'Doko']
23 | #output_words = [None, 'Ki.Yo.Shi!']
24 | hidden_num = 8
25 | update_iteration = 20
26 | 
27 | class Zundoko(chainer.Chain):
28 |     def __init__(self):
29 |         super(Zundoko, self).__init__(
30 |             word=L.EmbedID(input_num, hidden_num),
31 |             lstm = L.LSTM(hidden_num, hidden_num),
32 |             out=L.Linear(hidden_num, output_num),
33 |         )
34 | 
35 |     def __call__(self, x, prev_h=None, train=True):
36 |         h0 = self.word(x)
37 |         # y = ...
38 |         return y
39 | 
40 | kiyoshi_list = [zun, zun, zun, zun, doko]
41 | kiyoshi_pattern = 0
42 | kiyoshi_mask = (1 << len(kiyoshi_list)) - 1
43 | for token in kiyoshi_list:
44 |     kiyoshi_pattern = (kiyoshi_pattern << 1) | token
45 | 
46 | zundoko = Zundoko()
47 | optimizer = optimizers.Adam(alpha=0.01)
48 | optimizer.setup(zundoko)
49 | 
50 | def train():
51 |     loss = 0
52 |     acc = 0
53 |     batch_size = 20
54 |     recent_pattern = np.zeros((batch_size,), dtype=np.int32)
55 |     h = None
56 |     for i in six.moves.range(200):
57 |         x = np.random.randint(0, input_num, batch_size).astype(np.int32)
58 |         y = zundoko(x, train=True)
59 |         recent_pattern = ((recent_pattern << 1) | x) & kiyoshi_mask
60 |         if i < len(kiyoshi_list):
61 |             t = np.full((batch_size,), none, dtype=np.int32)
62 |         else:
63 |             t = np.where(recent_pattern == kiyoshi_pattern, kiyoshi, none).astype(np.int32)
64 |         loss += F.softmax_cross_entropy(y, t)
65 |         acc += float(F.accuracy(y, t).data)
66 |         if (i + 1) % update_iteration == 0:
67 |             zundoko.cleargrads()
68 |             loss.backward()
69 |             loss.unchain_backward()
70 |             optimizer.update()
71 |             print('train loss: {} accuracy: {}'.format(loss.data, acc / update_iteration))
72 |             loss = 0
73 |             acc = 0
74 | 
75 | def predict():
76 |     batch_size = 1
77 |     h = None
78 |     for i in six.moves.range(200):
79 |         x = np.random.randint(0, input_num, batch_size).astype(np.int32)
80 |         x = Variable(x, volatile=True)
81 |         y = zundoko(x, train=False)
82 |         print input_words[x.data[0]]
83 |         out_word = output_words[np.argmax(y.data[0])]
84 |         if out_word != None:
85 |             print(out_word)
86 |             break
87 | 
88 | for iteration in range(25):
89 |     train()
90 | 
91 | for i in range(5):
92 |     print('prediction: {}'.format(i + 1))
93 |     predict()
94 | 


--------------------------------------------------------------------------------
/src/gan/image/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/src/gan/image/.gitkeep


--------------------------------------------------------------------------------
/src/gan/train.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import argparse
  5 | import math
  6 | import numpy as np
  7 | import six
  8 | import time
  9 | from PIL import Image
 10 | 
 11 | import chainer
 12 | from chainer import cuda
 13 | from chainer import functions as F
 14 | from chainer import links as L
 15 | from chainer import optimizers
 16 | from chainer import serializers
 17 | from chainer.dataset import convert
 18 | 
 19 | latent_size = 30
 20 | 
 21 | class Generator(chainer.Chain):
 22 | 
 23 |     def __init__(self):
 24 |         initialW = chainer.initializers.Normal(0.02)
 25 |         super(Generator, self).__init__(
 26 |             fc1=L.Linear(latent_size, 3 * 3 * 128, initialW=initialW),
 27 |             deconv2=L.Deconvolution2D(128, 64, 3, stride=2, initialW=initialW),
 28 |             bn2=L.BatchNormalization(64),
 29 |             deconv3=L.Deconvolution2D(64, 32, 4, stride=2, pad=1, initialW=initialW),
 30 |             bn3=L.BatchNormalization(32),
 31 |             deconv4=L.Deconvolution2D(32, 1, 4, stride=2, pad=1, initialW=initialW),
 32 |         )
 33 | 
 34 |     def __call__(self, x, train=True):
 35 |         h1 = F.reshape(F.relu(self.fc1(x)), (-1, 128, 3, 3))
 36 |         h2 = F.relu(self.bn2(self.deconv2(h1), test=not train))
 37 |         h3 = F.relu(self.bn3(self.deconv3(h2), test=not train))
 38 |         return self.deconv4(h3)
 39 | 
 40 | 
 41 | class Discriminator(chainer.Chain):
 42 | 
 43 |     def __init__(self):
 44 |         initialW = chainer.initializers.Normal(0.02)
 45 |         super(Discriminator, self).__init__(
 46 |             conv1=L.Convolution2D(1, 32, 4, stride=2, pad=1, initialW=initialW),
 47 |             bn1=L.BatchNormalization(32),
 48 |             conv2=L.Convolution2D(32, 64, 4, stride=2, pad=1, initialW=initialW),
 49 |             bn2=L.BatchNormalization(64),
 50 |             conv3=L.Convolution2D(64, 128, 3, stride=2, initialW=initialW),
 51 |             bn3=L.BatchNormalization(128),
 52 |             fc4=L.Linear(3 * 3 * 128, 1),
 53 |         )
 54 | 
 55 |     def __call__(self, x, train=True):
 56 |         h0 = F.reshape(x, (-1, 1, 28, 28))
 57 |         h1 = F.leaky_relu(self.bn1(self.conv1(h0), test=not train))
 58 |         h2 = F.leaky_relu(self.bn2(self.conv2(h1), test=not train))
 59 |         h3 = F.leaky_relu(self.bn3(self.conv3(h2), test=not train))
 60 |         return self.fc4(h3)
 61 | 
 62 | 
 63 | # for data augmentation
 64 | def translate(x, offset):
 65 |     size = 28
 66 |     org_shape = x.shape
 67 |     batch_size = x.shape[0]
 68 |     xp = cuda.get_array_module(x)
 69 |     x = x.reshape((-1, 1, size, size))
 70 |     y = xp.zeros_like(x)
 71 |     y = y.reshape((-1, 1, size, size))
 72 |     offsets = xp.random.randint(-offset, offset + 1, (batch_size, 2))
 73 |     for i in six.moves.range(batch_size):
 74 |         left, top = offsets[i]
 75 |         right = min(size, size + left)
 76 |         bottom = min(size, size + top)
 77 |         left = max(0, left)
 78 |         top = max(0, top)
 79 |         y[i,:,size-bottom:size-top,size-right:size-left] = x[i,:,top:bottom,left:right]
 80 |     return y.reshape(org_shape)
 81 | 
 82 | def update_generator(gen, dis, gen_optimizer, z):
 83 |     xp = cuda.get_array_module(z)
 84 | 
 85 |     # generate
 86 |     x_fake = gen(z)
 87 | 
 88 |     #discriminate
 89 |     y_fake = dis(x_fake)
 90 | 
 91 |     t_real = xp.ones_like(y_fake.data, dtype=np.int32)
 92 |     loss_gen = F.sigmoid_cross_entropy(y_fake, t_real)
 93 |     gen.cleargrads()
 94 |     loss_gen.backward()
 95 |     gen_optimizer.update()
 96 | 
 97 |     return loss_gen
 98 | 
 99 | def update_discriminator(gen, dis, dis_optimizer, x, z):
100 |     xp = cuda.get_array_module(x)
101 | 
102 |     # generate
103 |     x_fake = gen(z)
104 | 
105 |     #discriminate
106 |     y_real = dis(x)
107 |     y_fake = dis(x_fake)
108 | 
109 |     # target
110 |     t_real = xp.ones_like(y_real.data, dtype=np.int32)
111 |     t_fake = xp.zeros_like(y_fake.data, dtype=np.int32)
112 |     loss_dis = F.sigmoid_cross_entropy(y_real, t_real) + F.sigmoid_cross_entropy(y_fake, t_fake)
113 |     dis.cleargrads()
114 |     loss_dis.backward()
115 |     dis_optimizer.update()
116 | 
117 |     return loss_dis
118 | 
119 | def update(gen, dis, gen_optimizer, dis_optimizer, x, z):
120 |     xp = cuda.get_array_module(x)
121 | 
122 |     # generate
123 |     x_fake = gen(z)
124 | 
125 |     #discriminate
126 |     y_real = dis(x)
127 |     y_fake = dis(x_fake)
128 | 
129 |     # target
130 |     t_real = xp.ones_like(y_real.data, dtype=np.int32)
131 |     t_fake = xp.zeros_like(y_fake.data, dtype=np.int32)
132 |     loss_dis = F.sigmoid_cross_entropy(y_real, t_real) + F.sigmoid_cross_entropy(y_fake, t_fake)
133 |     dis.cleargrads()
134 |     loss_dis.backward()
135 |     dis_optimizer.update()
136 | 
137 |     t_real = xp.ones_like(y_fake.data, dtype=np.int32)
138 |     loss_gen = F.sigmoid_cross_entropy(y_fake, t_real)
139 |     gen.cleargrads()
140 |     loss_gen.backward()
141 |     gen_optimizer.update()
142 | 
143 |     return loss_gen, loss_dis
144 | 
145 | def generate_image(gen, z, file_path):
146 |     x = gen(z, train=True)
147 |     im = cuda.to_cpu(x.data)
148 |     im = im.reshape((10, 10, 28, 28)).transpose((0, 2, 1, 3)).reshape((280, 280))
149 |     im = ((1 - im) * 128).clip(0, 255).astype(np.uint8)
150 |     Image.fromarray(im).save(file_path)
151 | 
152 | def train(gen, dis, gen_optimizer, dis_optimizer, iterator, iteration, name):
153 |     xp = gen.xp
154 |     loss_gen_sum = 0
155 |     loss_dis_sum = 0
156 |     loss_num = 0
157 |     test_z1 = xp.random.uniform(-1, 1, (10, 1, latent_size)).astype(np.float32)
158 |     test_z2 = xp.random.uniform(-1, 1, (10, 1, latent_size)).astype(np.float32)
159 |     weights = xp.arange(10).astype(np.float32).reshape((1, 10, 1)).repeat(10, axis=0) / 9
160 |     test_z = (1 - weights) * test_z1 + weights * test_z2
161 |     test_z = test_z.reshape((-1, latent_size))
162 |     last_clock = time.clock()
163 |     for i in six.moves.range(iteration):
164 |         batch = train_iterator.next()
165 |         x = convert.concat_examples(batch)
166 |         x = xp.asarray(translate(x, 2) - 1)
167 |         z = xp.random.uniform(-1, 1, (x.shape[0], latent_size)).astype(np.float32)
168 |         loss_gen, loss_dis = update(gen, dis, gen_optimizer, dis_optimizer, x, z)
169 |         loss_dis_sum += float(loss_dis.data)
170 |         loss_gen_sum += float(loss_gen.data)
171 |         loss_num += 1
172 | 
173 |         if (i + 1) % 200 == 0:
174 |             current_clock = time.clock()
175 |             print('iteration {} done {}s elapsed'.format(i + 1, current_clock - last_clock))
176 |             last_clock = current_clock
177 |             print('gen loss: {}'.format(loss_gen_sum / loss_num))
178 |             print('dis loss: {}'.format(loss_dis_sum / loss_num))
179 |             loss_gen_sum = 0
180 |             loss_dis_sum = 0
181 |             loss_num = 0
182 |             generate_image(gen, test_z, '{0}_{1:06d}.png'.format(name, i + 1))
183 |             serializers.save_npz('{}_gen.model'.format(name), gen)
184 |             serializers.save_npz('{}_dis.model'.format(name), dis)
185 |     train_iterator.finalize()
186 | 
187 | if __name__ == '__main__':
188 | 
189 |     parser = argparse.ArgumentParser(description='Generative Adversarial Net sample')
190 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index, -1 indicates CPU')
191 |     parser.add_argument('--iter', '-i', type=int, default=5000, help='Number of iterations')
192 |     parser.add_argument('--batch-size', '-b', type=int, default=100, help='Mini batch size')
193 |     parser.add_argument('--name', '-n', type=str, default='image/gan', help='saved file name')
194 |     args = parser.parse_args()
195 | 
196 |     batch_size = args.batch_size
197 |     gen = Generator()
198 |     dis = Discriminator()
199 |     gpu_device = args.gpu
200 |     if gpu_device >= 0:
201 |         chainer.cuda.get_device(gpu_device).use()
202 |         gen.to_gpu(gpu_device)
203 |         dis.to_gpu(gpu_device)
204 |         xp = cuda.cupy
205 |     else:
206 |         xp = np
207 |     gen_optimizer = optimizers.Adam(0.0002, beta1=0.5)
208 |     gen_optimizer.setup(gen)
209 |     dis_optimizer = optimizers.Adam(0.0002, beta1=0.5)
210 |     dis_optimizer.setup(dis)
211 | 
212 |     train_data, test_data = chainer.datasets.get_mnist(False, scale=2)
213 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
214 |     train(gen, dis, gen_optimizer, dis_optimizer, train_iterator, args.iter, args.name)
215 | 


--------------------------------------------------------------------------------
/src/human_activity/make_animation.py:
--------------------------------------------------------------------------------
 1 | import argparse
 2 | import numpy as np
 3 | import matplotlib.pyplot as plt
 4 | import matplotlib.animation as animation
 5 | import mpl_toolkits.mplot3d.axes3d as p3
 6 | import six
 7 | 
 8 | def update(index, scat, items):
 9 |     item = items[index]
10 |     scat.set_offsets(item[:, :2])
11 |     scat.set_3d_properties(item[:, 2], 'z')
12 |     return scat
13 | 
14 | def plot_3d_animation(data, step=1, interval=100):
15 |     sub_data = np.ascontiguousarray(data[::step,:,:])
16 |     fig = plt.figure()
17 |     ax = p3.Axes3D(fig)
18 |     ax.set_xlim3d([-2000, 2000])
19 |     ax.set_ylim3d([-2000, 2000])
20 |     ax.set_zlim3d([-1000, 2000])
21 |     ax.set_xlabel('X')
22 |     ax.set_ylabel('Y')
23 |     ax.set_zlabel('Z')
24 |     ax.set_title('Human Activity')
25 | 
26 |     scat = ax.scatter(sub_data[0, :, 0], sub_data[0, :, 1], sub_data[0, :, 2], c=['g', 'r', 'r', 'r', 'r', 'b', 'b', 'b', 'b'], s=50, alpha=1)
27 | 
28 |     # Creating the Animation object
29 |     anim = animation.FuncAnimation(
30 |         fig, update, len(sub_data), fargs=(scat, sub_data),
31 |         interval=interval, blit=False
32 |     )
33 |     plt.show()
34 | 
35 | def preprocess(x):
36 |     n, m, ch = x.shape
37 |     h = x.reshape((n, -1, 3))
38 |     # subtract head x,y positions and constant z position
39 |     base = np.mean(h, axis=(0, 1), keepdims=True)
40 |     base[:, :, 2] = 800
41 |     h = h - base
42 |     # calculate body direction
43 |     lankle_x = np.mean(h[:, 6, 0], axis=0, keepdims=True)
44 |     lankle_y = np.mean(h[:, 6, 1], axis=0, keepdims=True)
45 |     rankle_x = np.mean(h[:, 8, 0], axis=0, keepdims=True)
46 |     rankle_y = np.mean(h[:, 8, 1], axis=0, keepdims=True)
47 |     angle = np.arctan2(lankle_y - rankle_y, lankle_x - rankle_x)
48 |     angle = angle.reshape((-1,))
49 |     cos = np.cos(angle)
50 |     sin = np.sin(angle)
51 |     rot = np.asarray([
52 |         [cos, -sin, 0],
53 |         [sin, cos, 0],
54 |         [0, 0, 1]], dtype=np.float32)
55 |     return np.dot(h, rot).reshape(x.shape)
56 | 
57 | def read_file(file_path):
58 |     data = np.loadtxt(file_path, usecols=six.moves.range(1, 28), dtype=np.float32)
59 |     row_num, col_num = data.shape
60 |     return data.reshape((row_num, -1, 3))
61 | 
62 | if __name__ == '__main__':
63 |     parser = argparse.ArgumentParser(description='Physical action training')
64 |     parser.add_argument('file_path', type=str, help='data file path')
65 |     args = parser.parse_args()
66 | 
67 |     data = read_file(args.file_path)
68 |     data = preprocess(data)
69 |     plot_3d_animation(data, step=8, interval=40)
70 | 


--------------------------------------------------------------------------------
/src/human_activity/train_lstm.py:
--------------------------------------------------------------------------------
  1 | # -*- coding: utf-8 -*-
  2 | 
  3 | import argparse
  4 | from datetime import datetime
  5 | import json
  6 | import numpy as np
  7 | import os
  8 | import six
  9 | import time
 10 | 
 11 | import chainer
 12 | from chainer import cuda
 13 | from chainer import functions as F
 14 | from chainer import links as L
 15 | from chainer import optimizers
 16 | from chainer import serializers
 17 | from chainer.dataset import convert
 18 | 
 19 | step_size = 10
 20 | window_size = 80
 21 | offset_size = 10
 22 | 
 23 | normal_actions = [
 24 |     'Bowing',
 25 |     'Clapping',
 26 |     'Handshaking',
 27 |     'Hugging',
 28 |     'Jumping',
 29 |     'Running',
 30 |     'Seating',
 31 |     'Standing',
 32 |     'Walking',
 33 |     'Waving',
 34 | ]
 35 | 
 36 | def rotate(x, angle):
 37 |     xp = cuda.get_array_module(x)
 38 |     cos = xp.cos(-angle)
 39 |     sin = xp.sin(-angle)
 40 |     rot = xp.zeros((angle.shape[0], 3, 3), dtype=np.float32)
 41 |     rot[:, 0, 0] = cos
 42 |     rot[:, 0, 1] = -sin
 43 |     rot[:, 1, 0] = sin
 44 |     rot[:, 1, 1] = cos
 45 |     rot[:, 2, 2] = 1
 46 |     y = xp.zeros_like(x)
 47 |     for i in six.moves.range(len(y)):
 48 |         y[i, :, :, :] = xp.dot(rot[i], x[i]).transpose((1, 0, 2))
 49 |     return y
 50 | 
 51 | def preprocess(data):
 52 |     x, t = data
 53 |     n, ch, m = x.shape
 54 |     h = x.reshape((n, -1, 3, m))
 55 |     # subtract head x,y positions and constant z position
 56 |     base = h[:,0:1,:,0:1].copy()
 57 |     base[:, :, 2, :] = 0.8
 58 |     h = h - base
 59 |     # calculate body direction
 60 |     lankle_x = h[:, 6, 0, 0:1]
 61 |     lankle_y = h[:, 6, 1, 0:1]
 62 |     rankle_x = h[:, 8, 0, 0:1]
 63 |     rankle_y = h[:, 8, 1, 0:1]
 64 |     angle = np.arctan2(lankle_y - rankle_y, lankle_x - rankle_x)
 65 |     angle = angle.reshape((-1,))
 66 |     # normalize direction
 67 |     h = rotate(h, angle)
 68 |     return h.reshape(x.shape), t
 69 | 
 70 | def transform(x, scale_size, offset, rot_size):
 71 |     xp = cuda.get_array_module(x)
 72 |     n, ch, m = x.shape
 73 |     angle = (xp.random.random(n).astype(np.float32) - 0.5) * (np.pi / 180 * rot_size * 2)
 74 |     scale = 1 + (xp.random.random((n, 1, 1)).astype(np.float32) - 0.5) * scale_size * 2
 75 |     offsets = xp.random.uniform(-offset, offset, (n, 1, 1)).astype(np.float32)
 76 |     x = rotate(x.reshape((n, -1, 3, m)), angle).reshape((n, ch, m))
 77 |     return x * scale + offset
 78 | 
 79 | class ActionClassifier(chainer.Chain):
 80 | 
 81 |     def __init__(self):
 82 |         super(ActionClassifier, self).__init__(
 83 |             enc1=L.Linear(27, 50),
 84 |             enc2=L.Linear(50, 50),
 85 |             lstm1=L.LSTM(50, 50),
 86 |             dec1=L.Linear(50, 50),
 87 |             dec2=L.Linear(50, 10),
 88 |         )
 89 | 
 90 |     def __call__(self, x, train=True):
 91 |         h = F.relu(self.enc1(x))
 92 |         h = F.dropout(h, 0.5, train=train)
 93 |         h = F.relu(self.enc2(h))
 94 |         h = self.lstm1(h)
 95 |         h = F.relu(self.dec1(h))
 96 |         h = F.dropout(h, 0.5, train=train)
 97 |         h = self.dec2(h)
 98 |         return h
 99 | 
100 |     def reset(self):
101 |         self.lstm1.reset_state()
102 | 
103 | def read_file(file_path):
104 |     data = np.loadtxt(file_path, usecols=six.moves.range(1, 28), dtype=np.float32)
105 |     data = data[::step_size]
106 |     row_num, col_num = data.shape
107 |     n = (row_num - window_size) // offset_size + 1
108 |     org_data = data
109 |     data = np.zeros((n, window_size, col_num), dtype=np.float32)
110 |     for i in six.moves.range(n):
111 |         data[i, :, :] = org_data[i * offset_size:i * offset_size + window_size]
112 |     data = data / 500
113 |     return data.reshape((n, window_size, col_num)).transpose((0, 2, 1))
114 | 
115 | def read_data(data_dir, indices):
116 |     xs = []
117 |     ys = []
118 |     for index in indices:
119 |         sub_dir = 'sub{}'.format(index)
120 |         for i, action_name in enumerate(normal_actions):
121 |             file_name = '{}.txt'.format(action_name)
122 |             path = os.path.join(data_dir, sub_dir, 'normal', file_name)
123 |             x = read_file(path)
124 |             xs.append(x)
125 |             ys.append(np.full((x.shape[0],), i, dtype=np.int32))
126 |         label_offset = len(normal_actions)
127 |     return np.concatenate(xs, axis=0), np.concatenate(ys, axis=0)
128 | 
129 | def evaluate(net, dataset, batch_size, device=None):
130 |     iterator = chainer.iterators.SerialIterator(dataset, batch_size, repeat=False, shuffle=False)
131 |     loss_sum = 0
132 |     acc_sum = 0
133 |     num = 0
134 |     acc_table = np.zeros((10, 10), dtype=np.float32)
135 |     for batch in iterator:
136 |         raw_x, raw_t = convert.concat_examples(batch, device)
137 |         t = chainer.Variable(raw_t, volatile=True)
138 |         net.reset()
139 |         for i in six.moves.range(raw_x.shape[2]):
140 |             x = chainer.Variable(raw_x[:,:,i], volatile=True)
141 |             y = net(x, train=False)
142 |         loss = F.softmax_cross_entropy(y, t)
143 |         acc = F.accuracy(y, t)
144 |         n = len(raw_x)
145 |         loss_sum += float(loss.data) * n
146 |         acc_sum += float(acc.data) * n
147 |         num += n
148 |         selected = np.argmax(cuda.to_cpu(y.data), axis=1)
149 |         for i, j in zip(raw_t, selected):
150 |             acc_table[i, j] += 1
151 |     return loss_sum / num, acc_sum / num, acc_table / np.sum(acc_table, axis=1, keepdims=True)
152 | 
153 | def train(epoch_num, batch_size, gpu_device, train_data, valid_data):
154 |     net = ActionClassifier()
155 |     if gpu_device >= 0:
156 |         chainer.cuda.get_device(gpu_device).use()
157 |         net.to_gpu(gpu_device)
158 |         xp = cuda.cupy
159 |     else:
160 |         xp = np
161 |     optimizer = optimizers.Adam()
162 |     optimizer.setup(net)
163 |     optimizer.add_hook(chainer.optimizer.WeightDecay(0.0001))
164 | 
165 |     train_dataset = chainer.datasets.TupleDataset(*train_data)
166 |     valid_dataset = chainer.datasets.TupleDataset(*valid_data)
167 |     train_iterator = chainer.iterators.SerialIterator(train_dataset, batch_size)
168 | 
169 |     train_loss_sum = 0
170 |     train_acc_sum = 0
171 |     train_num = 0
172 |     last_clock = time.clock()
173 |     log = {
174 |         'log': [],
175 |         'max_valid_acc': 0,
176 |     }
177 |     save_dir = 'result_{}'.format(datetime.now().strftime('%Y-%m-%d_%H-%M-%S'))
178 |     os.mkdir(save_dir)
179 |     log_file_name = os.path.join(save_dir, 'log.txt')
180 |     model_file_path = os.path.join(save_dir, 'action.model')
181 | 
182 |     while train_iterator.epoch < epoch_num:
183 |         batch = train_iterator.next()
184 |         x, t = convert.concat_examples(batch)
185 |         x = transform(xp.asarray(x), 0.1, 0.2, 30)
186 |         t = xp.asarray(t)
187 |         net.reset()
188 |         for i in six.moves.range(x.shape[2]):
189 |             y = net(x[:,:,i])
190 |         loss = F.softmax_cross_entropy(y, t)
191 |         acc = F.accuracy(y, t)
192 |         net.cleargrads()
193 |         loss.backward()
194 |         optimizer.update()
195 |         train_loss_sum += float(loss.data) * len(x)
196 |         train_acc_sum += float(acc.data) * len(x)
197 |         train_num += len(x)
198 |         if train_iterator.is_new_epoch:
199 |             train_loss = train_loss_sum / train_num
200 |             train_acc = train_acc_sum / train_num
201 |             valid_loss, valid_acc, acc_table = evaluate(net, valid_dataset, batch_size, gpu_device)
202 |             current_clock = time.clock()
203 |             print('epoch {} done {}s elapsed'.format(train_iterator.epoch, current_clock - last_clock))
204 |             last_clock = current_clock
205 |             print('train loss: {} accuracy: {}'.format(train_loss, train_acc))
206 |             print('valid loss: {} accuracy: {}'.format(valid_loss, valid_acc))
207 |             train_loss_sum = 0
208 |             train_acc_sum = 0
209 |             train_num = 0
210 |             log['log'].append({
211 |                 'epoch': train_iterator.epoch,
212 |                 'train_loss': train_loss,
213 |                 'train_acc': train_acc,
214 |                 'valid_loss': valid_loss,
215 |                 'valid_acc': valid_acc,
216 |             })
217 |             if valid_acc > log['max_valid_acc']:
218 |                 log['max_valid_acc'] = valid_acc
219 |                 log['acc_table'] = acc_table.tolist()
220 |                 serializers.save_npz(model_file_path, net)
221 |             with open(log_file_name, 'w') as f:
222 |                 json.dump(log, f, indent=4)
223 |             if train_iterator.epoch == int(epoch_num * 0.5) or train_iterator.epoch == int(epoch_num * 0.75):
224 |                 optimizer.alpha *= 0.1 ** 0.5
225 |     train_iterator.finalize()
226 |     print('max valid accuracy: {}'.format(log['max_valid_acc']))
227 | 
228 | if __name__ == '__main__':
229 |     parser = argparse.ArgumentParser(description='Physical action training')
230 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index, -1 indicates CPU')
231 |     parser.add_argument('--epoch', '-e', type=int, default=100, help='Number of epochs')
232 |     parser.add_argument('--batch-size', '-b', type=int, default=50, help='Mini batch size')
233 |     parser.add_argument('--data-dir', '-d', type=str, default='Vicon Physical Action Data Set', help='Dataset file directory')
234 |     args = parser.parse_args()
235 | 
236 |     print('loading train data...')
237 |     train_data = read_data(args.data_dir, [1, 2, 3, 4, 5, 6, 7])
238 |     train_data = preprocess(train_data)
239 |     print('train data num: {}'.format(len(train_data[0])))
240 |     print('loading valid data...')
241 |     valid_data = read_data(args.data_dir, [8, 9])
242 |     valid_data = preprocess(valid_data)
243 |     print('valid data num: {}'.format(len(valid_data[0])))
244 |     print('start training')
245 |     train(args.epoch, args.batch_size, args.gpu, train_data, valid_data)
246 | 


--------------------------------------------------------------------------------
/src/mnist/net.py:
--------------------------------------------------------------------------------
 1 | #!/usr/bin/env python
 2 | # -*- coding: utf-8 -*-
 3 | 
 4 | import chainer
 5 | from chainer import functions as F
 6 | from chainer import links as L
 7 | 
 8 | # Fully Connected LayerのみのMulti Layer Perceptron
 9 | class MLP(chainer.Chain):
10 | 
11 |     def __init__(self, n_in, n_out, n_hidden):
12 |         # Networkの持つLinkを定義する
13 |         # superクラスの__init__を使う方法と
14 |         # add_link()メソッドで追加する方法とがある
15 |         # Chainer1.12からL.Linear()の第1引数はNoneでよく、
16 |         # その場合入力サイズは実際のデータから自動的に決まる
17 |         super(MLP, self).__init__(
18 |             l1=L.Linear(n_in, n_hidden),
19 |             l2=L.Linear(n_hidden, n_hidden),
20 |             l3=L.Linear(n_hidden, n_out),
21 |         )
22 | 
23 |     def __call__(self, x, train=True):
24 |         # Linkはcallableとなっており、関数として呼び出すとLinkの処理
25 |         # L.LinearはWx + bを計算する(W, bはL.Linearが持つ重み行列とバイアス項)
26 |         h = self.l1(x)
27 |         # Dropoutを実行する場合、学習時と予測時とで挙動が異なるので、trainでどちらであるかを指定する必要がある
28 |         # h = F.dropout(h, 0.25, train=train)
29 |         # ReLUを実行する
30 |         h = F.relu(h)
31 |         h = self.l2(h)
32 |         # h = F.dropout(h, 0.25, train=train)
33 |         h = F.relu(h)
34 |         h = self.l3(h)
35 |         return h
36 | 
37 | # Convolutional Neural Network
38 | class CNN(chainer.Chain):
39 | 
40 |     def __init__(self):
41 |         super(CNN, self).__init__(
42 |             conv1=L.Convolution2D(1, 16, 5),
43 |             conv2=L.Convolution2D(16, 32, 3, pad=1),
44 |             conv3=L.Convolution2D(32, 64, 3, pad=1),
45 |             bn1=L.BatchNormalization(16),
46 |             bn2=L.BatchNormalization(32),
47 |             bn3=L.BatchNormalization(64),
48 |             fc4=L.Linear(3 * 3 * 64, 128),
49 |             fc5=L.Linear(128, 10),
50 |         )
51 | 
52 |     def __call__(self, x, train=True):
53 |         # L.Convolution2Dの入力は4次元である必要があるのでreshapeする
54 |         # 各軸は(batch, channel, y, x)を意味する
55 |         h = F.reshape(x, (-1, 1, 28, 28))
56 |         h = self.conv1(h)
57 |         h = self.bn1(h, test=not train)
58 |         h = F.relu(h)
59 |         h = F.max_pooling_2d(h, 2)
60 |         h = self.conv2(h)
61 |         h = self.bn2(h, test=not train)
62 |         h = F.relu(h)
63 |         h = F.max_pooling_2d(h, 2)
64 |         h = self.conv3(h)
65 |         h = self.bn3(h, test=not train)
66 |         h = F.relu(h)
67 |         h = F.max_pooling_2d(h, 2)
68 |         h = self.fc4(h)
69 |         h = F.relu(h)
70 |         h = self.fc5(h)
71 |         return h
72 | 


--------------------------------------------------------------------------------
/src/mnist/predict.py:
--------------------------------------------------------------------------------
 1 | #!/usr/bin/env python
 2 | # -*- coding: utf-8 -*-
 3 | 
 4 | import argparse
 5 | import numpy as np
 6 | import six
 7 | from PIL import Image
 8 | 
 9 | import chainer
10 | from chainer import cuda
11 | from chainer import functions as F
12 | from chainer import serializers
13 | 
14 | import net as net_module
15 | 
16 | def predict(net, image):
17 |     # 予測時にはtrain=Falseを指定する
18 |     y = net(chainer.Variable(image, volatile=True), train=False)
19 |     return F.softmax(y).data
20 | 
21 | def main(args):
22 |     if args.model == 'cnn':
23 |         net = net_module.CNN()
24 |     else:
25 |         net = net_module.MLP(28 * 28, 10, 100)
26 |     gpu_device = args.gpu
27 |     if gpu_device >= 0:
28 |         chainer.cuda.get_device(gpu_device).use()
29 |         net.to_gpu(gpu_device)
30 |         xp = cuda.cupy
31 |     else:
32 |         xp = np
33 |     serializers.load_npz(args.model_file, net)
34 |     image = Image.open(args.image_file).convert('L').resize((28, 28), Image.BILINEAR)
35 |     # 学習データは値の範囲が0～1なのでそれに合わせるために255で割る
36 |     # 学習データは背景が0なので反転する
37 |     image = 1 - xp.asarray(image).astype(np.float32) / 255
38 |     image = image.reshape((1, -1))
39 |     probs = cuda.to_cpu(predict(net, image))[0]
40 |     results = sorted(zip(six.moves.range(10), probs), key=lambda x: -x[1])
41 |     for n, p in results:
42 |         print('{0:d}: {1:.4f}'.format(n, p))
43 | 
44 | if __name__ == '__main__':
45 | 
46 |     parser = argparse.ArgumentParser(description='MNIST prediction')
47 |     parser.add_argument('model_file', type=str, help='Model file path')
48 |     parser.add_argument('image_file', type=str, help='Image file path')
49 |     parser.add_argument('--model', '-m', type=str, default='mlp', choices=['mlp', 'cnn'], help='Neural network model')
50 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index, -1 indicates CPU')
51 |     args = parser.parse_args()
52 | 
53 |     main(args)
54 | 


--------------------------------------------------------------------------------
/src/mnist/train.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import argparse
  5 | import numpy as np
  6 | import six
  7 | import time
  8 | 
  9 | import chainer
 10 | from chainer import cuda
 11 | from chainer import functions as F
 12 | from chainer import optimizers
 13 | from chainer import serializers
 14 | from chainer.dataset import convert
 15 | 
 16 | import net as net_module
 17 | 
 18 | # Data augmentationのために
 19 | # データを-offset～offsetの範囲で平行移動する
 20 | def translate(x, offset):
 21 |     size = 28
 22 |     org_shape = x.shape
 23 |     batch_size = x.shape[0]
 24 |     xp = cuda.get_array_module(x)
 25 |     x = x.reshape((-1, 1, size, size))
 26 |     y = xp.zeros_like(x)
 27 |     y = y.reshape((-1, 1, size, size))
 28 |     offsets = xp.random.randint(-offset, offset + 1, (batch_size, 2))
 29 |     for i in six.moves.range(batch_size):
 30 |         left, top = offsets[i]
 31 |         right = min(size, size + left)
 32 |         bottom = min(size, size + top)
 33 |         left = max(0, left)
 34 |         top = max(0, top)
 35 |         y[i,:,size-bottom:size-top,size-right:size-left] = x[i,:,top:bottom,left:right]
 36 |     return y.reshape(org_shape)
 37 | 
 38 | def update(net, x, t, loss_func):
 39 |     y = net(x)
 40 |     loss = loss_func(y, t)
 41 | 
 42 | def evaluate(net, dataset, batch_size, device=None):
 43 |     # データを1回ずつ使用する場合はrepeat=Falseにする
 44 |     # そうしないと`for batch in iterator`が終了しない
 45 |     iterator = chainer.iterators.SerialIterator(dataset, batch_size, repeat=False, shuffle=False)
 46 |     loss_sum = 0
 47 |     acc_sum = 0
 48 |     num = 0
 49 |     for batch in iterator:
 50 |         raw_x, raw_t = convert.concat_examples(batch, device)
 51 |         # backpropagationは必要ないのでvolatileをTrueにする
 52 |         x = chainer.Variable(raw_x, volatile=True)
 53 |         t = chainer.Variable(raw_t, volatile=True)
 54 |         y = net(x)
 55 |         loss = F.softmax_cross_entropy(y, t)
 56 |         acc = F.accuracy(y, t)
 57 |         n = len(raw_x)
 58 |         loss_sum += float(loss.data) * n
 59 |         acc_sum += float(acc.data) * n
 60 |         num += n
 61 |     return loss_sum / num, acc_sum / num
 62 | 
 63 | if __name__ == '__main__':
 64 | 
 65 |     parser = argparse.ArgumentParser(description='MNIST training')
 66 |     parser.add_argument('--model', '-m', type=str, default='mlp', choices=['mlp', 'cnn'], help='Neural network model')
 67 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index, -1 indicates CPU')
 68 |     parser.add_argument('--epoch', '-e', type=int, default=100, help='Number of epochs')
 69 |     parser.add_argument('--batch-size', '-b', type=int, default=100, help='Mini batch size')
 70 |     parser.add_argument('--prefix', '-p', type=str, default=None, help='prefix of saved file name')
 71 |     args = parser.parse_args()
 72 | 
 73 |     n_epoch = args.epoch
 74 |     batch_size = args.batch_size
 75 |     if args.prefix is None:
 76 |         prefix = args.model
 77 |     else:
 78 |         prefix = args.prefix
 79 |     if args.model == 'cnn':
 80 |         net = net_module.CNN()
 81 |     else:
 82 |         net = net_module.MLP(28 * 28, 10, 100)
 83 |     gpu_device = args.gpu
 84 |     if gpu_device >= 0:
 85 |         chainer.cuda.get_device(gpu_device).use()
 86 |         net.to_gpu(gpu_device)
 87 |         xp = cuda.cupy
 88 |     else:
 89 |         xp = np
 90 |     optimizer = optimizers.Adam()
 91 |     optimizer.setup(net)
 92 | 
 93 |     # MNISTデータセットを読み込む
 94 |     # get_mnistはMNISTデータセットファイルがなければダウンロードを行うので
 95 |     # 初回実行時は時間がかかる
 96 |     # データセットは"~/.chainer/dataset"以下に保存される
 97 |     train_data, test_data = chainer.datasets.get_mnist()
 98 |     # train dataとvalidation dataに分離する
 99 |     train_data, valid_data = chainer.datasets.split_dataset_random(train_data, len(train_data) - 5000)
100 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
101 |     train_loss_sum = 0
102 |     train_acc_sum = 0
103 |     train_num = 0
104 |     best_valid_acc = 0
105 |     best_test_acc = 0
106 |     last_clock = time.clock()
107 | 
108 |     while train_iterator.epoch < n_epoch:
109 |         # 入力値と正解ラベルを取得
110 |         # x: 入力値
111 |         # t: 正解ラベル
112 |         batch = train_iterator.next()
113 |         x, t = convert.concat_examples(batch)
114 |         x = xp.asarray(translate(x, 2))
115 |         t = xp.asarray(t)
116 |         # ネットワークの実行
117 |         y = net(x)
118 |         # 損失の計算
119 |         loss = F.softmax_cross_entropy(y, t)
120 |         # 精度の計算(学習時に必須ではない)
121 |         acc = F.accuracy(y, t)
122 |         # ネットワークの勾配初期化
123 |         net.cleargrads()
124 |         # バックプロパゲーションを行い勾配を計算する
125 |         loss.backward()
126 |         # パラメータを更新する
127 |         optimizer.update()
128 |         # 損失、精度の累積
129 |         train_loss_sum += float(loss.data) * len(x)
130 |         train_acc_sum += float(acc.data) * len(x)
131 |         train_num += len(x)
132 |         if train_iterator.is_new_epoch:
133 |             train_loss = train_loss_sum / train_num
134 |             train_acc = train_acc_sum / train_num
135 |             valid_loss, valid_acc = evaluate(net, valid_data, batch_size, gpu_device)
136 |             test_loss, test_acc = evaluate(net, test_data, batch_size, gpu_device)
137 |             current_clock = time.clock()
138 |             print('epoch {} done {}s elapsed'.format(train_iterator.epoch, current_clock - last_clock))
139 |             last_clock = current_clock
140 |             print('train loss: {} accuracy: {}'.format(train_loss, train_acc))
141 |             print('valid loss: {} accuracy: {}'.format(valid_loss, valid_acc))
142 |             print('test  loss: {} accuracy: {}'.format(test_loss, test_acc))
143 |             train_loss_sum = 0
144 |             train_acc_sum = 0
145 |             train_num = 0
146 |             if valid_acc > best_valid_acc:
147 |                 best_valid_acc = valid_acc
148 |                 best_test_acc = test_acc
149 |                 serializers.save_npz('{}.model'.format(prefix), net)
150 |     train_iterator.finalize()
151 | 
152 |     print('best test accuracy: {}'.format(best_test_acc))
153 | 


--------------------------------------------------------------------------------
/src/open_ai_gym/train.py:
--------------------------------------------------------------------------------
  1 | import argparse
  2 | import gym
  3 | import six
  4 | import numpy as np
  5 | import random
  6 | 
  7 | import chainer
  8 | from chainer import functions as F
  9 | from chainer import links as L
 10 | 
 11 | class Agent(chainer.Chain):
 12 |     gamma = 0.99
 13 |     initial_epsilon = 1
 14 |     epsilon_reduction = 0.001
 15 |     min_epsilon = 0.01
 16 | 
 17 |     def __init__(self, input_size, output_size, hidden_size):
 18 |         initialW = chainer.initializers.HeNormal(0.01)
 19 |         super(Agent, self).__init__(
 20 |             fc1=F.Linear(input_size, hidden_size, initialW=initialW),
 21 |             fc2=F.Linear(hidden_size, hidden_size, initialW=initialW),
 22 |             fc3=F.Linear(hidden_size, output_size, initialW=initialW),
 23 |         )
 24 |         self.epsilon = self.initial_epsilon
 25 |         self.output_size = output_size
 26 | 
 27 |     def __call__(self, x):
 28 |         h = F.relu(self.fc1(x))
 29 |         h = F.relu(self.fc2(h))
 30 |         h = self.fc3(h)
 31 |         return h
 32 | 
 33 |     def randomize_action(self, action):
 34 |         if random.random() < self.epsilon:
 35 |             return random.randint(0, self.output_size - 1)
 36 |         return action
 37 | 
 38 |     def reduce_epsilon(self):
 39 |         self.epsilon = (self.epsilon - self.min_epsilon) * (1 - self.epsilon_reduction) + self.min_epsilon
 40 | 
 41 |     def adjust_reward(self, state, reward, done):
 42 |         return reward
 43 | 
 44 |     def normalize_state(self, state):
 45 |         return np.asarray(state, dtype=np.float32)
 46 | 
 47 | class CartPoleAgent(Agent):
 48 |     gamma = 0.9
 49 |     initial_epsilon = 1
 50 |     min_epsilon = 0.01
 51 |     epsilon_reduction = 0.001
 52 | 
 53 |     def __init__(self):
 54 |         super(CartPoleAgent, self).__init__(4, 2, 24)
 55 | 
 56 |     def adjust_reward(self, state, reward, done):
 57 |         return reward
 58 | 
 59 |     def normalize_state(self, state):
 60 |         scale = np.asarray([1 / 2.4, 1 / 4.0, 1 / 0.2, 1 / 3.0], dtype=np.float32)
 61 |         return np.asarray(state, dtype=np.float32) * scale
 62 | 
 63 | class MountainCarAgent(Agent):
 64 |     gamma = 0.99
 65 |     initial_epsilon = 0.8
 66 |     min_epsilon = 0.1
 67 |     epsilon_reduction = 0.0001
 68 | 
 69 |     def __init__(self):
 70 |         super(MountainCarAgent, self).__init__(2, 3, 64)
 71 | 
 72 |     def adjust_reward(self, state, reward, done):
 73 |         return reward
 74 | 
 75 |     def normalize_state(self, state):
 76 |         scale = np.asarray([1 / 1.2, 1 / 0.07], dtype=np.float32)
 77 |         return np.asarray(state, dtype=np.float32) * scale
 78 | 
 79 | class ExperiencePool(object):
 80 | 
 81 |     def __init__(self, size, state_shape):
 82 |         self.size = size
 83 |         self.states = np.zeros(((size,) + state_shape), dtype=np.float32)
 84 |         self.actions = np.zeros((size,), dtype=np.int32)
 85 |         self.rewards = np.zeros((size,), dtype=np.float32)
 86 |         self.nexts = np.zeros((size,), dtype=np.float32)
 87 |         self.pos = 0
 88 | 
 89 |     def add(self, state, action, reward, done):
 90 |         index = self.pos % self.size
 91 |         self.states[index, ...] = state
 92 |         self.actions[index] = action
 93 |         self.rewards[index] = reward
 94 |         if done:
 95 |             self.nexts[index] = 0
 96 |         else:
 97 |             self.nexts[index] = 1
 98 |         self.pos += 1
 99 | 
100 |     def available_size(self):
101 |         if self.pos > self.size:
102 |             return self.size - 1
103 |         return self.pos - 1
104 | 
105 |     def __getitem__(self, index):
106 |         if self.pos < self.size:
107 |             offset = 0
108 |         else:
109 |             offset = self.pos % self.size - self.size
110 |         index += offset
111 |         return self.states[index], self.actions[index], self.rewards[index], self.states[index + 1], self.nexts[index]
112 | 
113 | def update(agent, target_agent, optimizer, ex_pool, batch_size):
114 |     available_size = ex_pool.available_size()
115 |     if available_size < batch_size:
116 |         return
117 |     indices = np.random.permutation(available_size)[:batch_size]
118 |     data = [ex_pool[i] for i in indices]
119 |     state, action, reward, next_state, has_next = zip(*data)
120 |     state = np.asarray(state)
121 |     action = np.asarray(action)
122 |     reward = np.asarray(reward)
123 |     next_state = np.asarray(next_state)
124 |     has_next = np.asarray(has_next)
125 | 
126 |     q = F.select_item(agent(state), action)
127 |     next_action = np.argmax(agent(next_state).data, axis=1)
128 |     y = reward + agent.gamma * has_next * target_agent(next_state).data[(six.moves.range(len(next_action))), next_action]
129 |     loss = F.mean_squared_error(q, y)
130 |     agent.cleargrads()
131 |     loss.backward()
132 |     optimizer.update()
133 | 
134 | def parse_arg():
135 |     parser = argparse.ArgumentParser('Open AI Gym learning sample')
136 |     parser.add_argument('--env', '-e', type=str, choices=['cart_pole', 'mountain_car'], help='Environment name')
137 |     parser.add_argument('--skip_render', '-s', type=int, default=0, help='Episodes nterval to skip rendering')
138 |     parser.add_argument('--batch-size', '-b', type=int, default=32, help='Batch size for taining')
139 |     parser.add_argument('--pool-size', '-p', type=int, default=2000, help='Experiance pool size')
140 |     parser.add_argument('--train-iter', '-t', type=int, default=10, help='Number of training iterations')
141 |     parser.add_argument('--episode', type=int, default=1000, help='Number of episodes')
142 |     parser.add_argument('--episode-len', type=int, default=1000, help='Length of an episode')
143 |     parser.add_argument('--use-double-q', action='store_true', help='Use Double Q-learning')
144 |     return parser.parse_args()
145 | 
146 | def main():
147 |     args = parse_arg()
148 |     episode_num = args.episode
149 |     episode_length = args.episode_len
150 |     pool_size = args.pool_size
151 |     batch_size = args.batch_size
152 |     train_num = args.train_iter
153 |     update_count = 0
154 |     update_agent_interval = 100
155 |     use_double_q = args.use_double_q
156 | 
157 |     env_name = args.env
158 |     if env_name == 'mountain_car':
159 |         env = gym.make('MountainCar-v0')
160 |         agent = MountainCarAgent()
161 |     else:
162 |         env = gym.make('CartPole-v0')
163 |         agent = CartPoleAgent()
164 |     skip_rendering_interval = args.skip_render
165 | 
166 |     if use_double_q:
167 |         target_agent = agent.copy()
168 |     else:
169 |         target_agent = agent
170 |     optimizer = chainer.optimizers.Adam()
171 |     optimizer.setup(agent)
172 |     ex_pool = ExperiencePool(pool_size, env.observation_space.shape)
173 | 
174 |     for episode in six.moves.range(episode_num):
175 |         raw_state = env.reset()
176 |         state = agent.normalize_state(raw_state)
177 |         need_render = skip_rendering_interval <= 0 or episode % skip_rendering_interval == 0
178 |         for t in six.moves.range(episode_length):
179 |             if need_render:
180 |                 env.render()
181 |             action = np.argmax(agent(np.expand_dims(state, 0)).data)
182 |             action = agent.randomize_action(action)
183 | 
184 |             prev_state = state
185 |             raw_state, raw_reward, done, info = env.step(action)
186 |             reward = agent.adjust_reward(raw_state, raw_reward, done)
187 |             state = agent.normalize_state(raw_state)
188 |             ex_pool.add(prev_state, action, reward, done or t == episode_length - 1)
189 |             for i in six.moves.range(train_num):
190 |                 update(agent, target_agent, optimizer, ex_pool, batch_size)
191 |             update_count += 1
192 |             agent.reduce_epsilon()
193 |             if use_double_q and update_count % update_agent_interval == 0:
194 |                 target_agent = agent.copy()
195 |             if done:
196 |                 print('Episode {} finished after {} timesteps'.format(episode + 1, t + 1))
197 |                 break
198 |         if not done:
199 |             print('Epsode {} completed'.format(episode + 1))
200 | 
201 | if __name__ == '__main__':
202 |     main()
203 | 


--------------------------------------------------------------------------------
/src/translation/dataset/dataset.json:
--------------------------------------------------------------------------------
 1 | {
 2 |   "en": {
 3 |     "train": "train.en",
 4 |     "test": "test.en"
 5 |   },
 6 |   "ja": {
 7 |     "train": "train.ja",
 8 |     "test": "test.ja"
 9 |   }
10 | }
11 | 


--------------------------------------------------------------------------------
/src/translation/dataset/dev.json:
--------------------------------------------------------------------------------
 1 | {
 2 |   "en": {
 3 |     "train": "dev.en",
 4 |     "test": "test.en"
 5 |   },
 6 |   "ja": {
 7 |     "train": "dev.ja",
 8 |     "test": "test.ja"
 9 |   }
10 | }
11 | 


--------------------------------------------------------------------------------
/src/translation/make_dataset.py:
--------------------------------------------------------------------------------
 1 | import argparse
 2 | import json
 3 | import numpy as np
 4 | import os
 5 | import six
 6 | 
 7 | pickle = six.moves.cPickle
 8 | 
 9 | def count_words(sentences):
10 |     word_counts = {}
11 |     for tokens in sentences:
12 |         for token in tokens:
13 |             if token in word_counts:
14 |                 word_counts[token] += 1
15 |             else:
16 |                 word_counts[token] = 1
17 |     return word_counts
18 | 
19 | def tokens_to_ids(tokens, word_ids, unknown_id):
20 |     return [word_ids[token] if token in word_ids else unknown_id for token in tokens]
21 | 
22 | def make(data_dir, train_file, test_file, min_count):
23 |     with open(os.path.join(data_dir, train_file)) as f:
24 |         sentences = [tokens.split() for tokens in f]
25 |     word_counts = count_words(sentences)
26 |     words = [w for w, c in word_counts.items() if c >= min_count]
27 |     begin_id = 0
28 |     end_id = 1
29 |     unknown_id = 2
30 |     words = ['<S>', '</S>', '<UNK>'] + words
31 |     word_ids = {w: i for i, w in enumerate(words)}
32 |     train_token_ids = [tokens_to_ids(tokens, word_ids, unknown_id) for tokens in sentences]
33 |     with open(os.path.join(data_dir, test_file)) as f:
34 |         sentences = [tokens.split() for tokens in f]
35 |     test_token_ids = [tokens_to_ids(tokens, word_ids, unknown_id) for tokens in sentences]
36 |     return {
37 |         'words': words,
38 |         'train': train_token_ids,
39 |         'test': test_token_ids,
40 |     }
41 | 
42 | if __name__ == '__main__':
43 |     parser = argparse.ArgumentParser('Make translation dataset')
44 |     parser.add_argument('input_path', type=str, help='Input json file path')
45 |     parser.add_argument('output_path', type=str, help='Output file path')
46 |     parser.add_argument('--data-dir', '-d', type=str, default='.', help='Data file directory')
47 |     parser.add_argument('--min-count', '-m', type=int, default=1, help='Minimum count for training words')
48 |     args = parser.parse_args()
49 | 
50 |     dataset = {}
51 |     with open(args.input_path) as f:
52 |         config = json.load(f)
53 |     for lang, lang_config in config.items():
54 |         train_file = lang_config['train']
55 |         test_file = lang_config['test']
56 |         dataset[lang] = make(args.data_dir, train_file, test_file, args.min_count)
57 |     with open(args.output_path, 'wb') as f:
58 |         pickle.dump(dataset, f, pickle.HIGHEST_PROTOCOL)
59 | 


--------------------------------------------------------------------------------
/src/translation/model/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/src/translation/model/.gitkeep


--------------------------------------------------------------------------------
/src/translation/train.py:
--------------------------------------------------------------------------------
  1 | import argparse
  2 | import numpy as np
  3 | import os
  4 | import six
  5 | import time
  6 | 
  7 | import chainer
  8 | from chainer import cuda
  9 | from chainer import functions as F
 10 | from chainer import links as L
 11 | from chainer import serializers
 12 | 
 13 | pickle = six.moves.cPickle
 14 | 
 15 | begin_id = 0
 16 | end_id = 1
 17 | 
 18 | class EncoderDecoder(chainer.Chain):
 19 | 
 20 |     def __init__(self, input_size, output_size, hidden_size=256):
 21 |         super(EncoderDecoder, self).__init__(
 22 |             enc_embed=L.EmbedID(input_size, hidden_size, ignore_label=-1),
 23 |             dec_embed=L.EmbedID(output_size, hidden_size, ignore_label=-1),
 24 |             enc1=L.LSTM(hidden_size, hidden_size),
 25 |             enc2=L.LSTM(hidden_size, hidden_size),
 26 |             dec1=L.LSTM(hidden_size, hidden_size),
 27 |             dec2=L.LSTM(hidden_size, hidden_size),
 28 |             dec_out=L.Linear(hidden_size, output_size),
 29 |         )
 30 |         self.c = None
 31 |         self.h = None
 32 | 
 33 |     def __call__(self, x, decode=False, train=True):
 34 |         if decode:
 35 |             return self.decode(x, train=train)
 36 |         return self.encode(x, train=train)
 37 | 
 38 |     def encode(self, x, train=True):
 39 |         h = self.enc_embed(x)
 40 |         h = self.enc1(h)
 41 |         h = self.enc2(h)
 42 |         return h
 43 | 
 44 |     def decode(self, x, train=True):
 45 |         if self.dec1.c is None:
 46 |             self.dec1.set_state(self.enc1.c, self.enc1.h)
 47 |             self.dec2.set_state(self.enc2.c, self.enc2.h)
 48 |         h = self.dec_embed(x)
 49 |         h = self.dec1(h)
 50 |         h = self.dec2(h)
 51 |         h = F.dropout(h, train=train)
 52 |         return self.dec_out(h)
 53 | 
 54 |     def reset_state(self):
 55 |         self.enc1.reset_state()
 56 |         self.enc2.reset_state()
 57 |         self.dec1.reset_state()
 58 |         self.dec2.reset_state()
 59 | 
 60 |     def permutate(self, order):
 61 |         for link in [self.enc1, self.enc2]:
 62 |             link.c = F.permutate(link.c, order)
 63 |             link.h = F.permutate(link.h, order)
 64 | 
 65 |     def get_sub_state(self, index):
 66 |         state = []
 67 |         for link in [self.dec1, self.dec2]:
 68 |             c = None
 69 |             h = None
 70 |             if link.c is not None:
 71 |                 c = link.c.data[index, ...].copy()
 72 |             if link.h is not None:
 73 |                 h = link.h.data[index, ...].copy()
 74 |             state.append((c, h))
 75 |         return tuple(state)
 76 | 
 77 |     def set_sub_state(self, index, state):
 78 |         for s, link in zip(state, [self.dec1, self.dec2]):
 79 |             c, h = s
 80 |             if c is not None and link.c is not None:
 81 |                 link.c.data[index, ...] = c
 82 |             if h is not None and link.h is not None:
 83 |                 link.h.data[index, ...] = h
 84 | 
 85 | def sort_batch(inputs, targets):
 86 |     input_order = np.argsort(np.asarray(list(map(len, inputs)), dtype=np.int32))[::-1]
 87 |     target_order = np.argsort(np.asarray(list(map(len, targets)), dtype=np.int32))[::-1]
 88 |     sorted_inputs = [inputs[i] for i in input_order]
 89 |     sorted_targets = [targets[i] for i in target_order]
 90 |     inv_input_order = np.zeros_like(input_order, dtype=np.int32)
 91 |     inv_input_order[input_order] = np.arange(len(input_order), dtype=np.int32)
 92 |     return sorted_inputs, sorted_targets, inv_input_order[target_order]
 93 | 
 94 | def concat_examples(examples):
 95 |     inputs, targets = zip(*examples)
 96 |     return sort_batch(inputs, targets)
 97 | 
 98 | def forward(net, xs, ts, permutation, train=True):
 99 |     xp = net.xp
100 |     loss = 0
101 |     acc = 0
102 |     num = 0
103 |     net.reset_state()
104 |     for x in F.transpose_sequence(xs):
105 |         v = chainer.Variable(xp.asarray(x.data), volatile=not train)
106 |         net(v, decode=False, train=train)
107 |     net.permutate(permutation)
108 |     t_seq = F.transpose_sequence(ts)
109 |     t = xp.asarray(t_seq[0].data)
110 |     for next_t in t_seq[1:]:
111 |         n = len(next_t.data)
112 |         y = net(t[:n], decode=True, train=train)
113 |         next_t = xp.asarray(next_t.data)
114 |         loss += F.softmax_cross_entropy(y, next_t) * n
115 |         acc += F.accuracy(y, next_t) * n
116 |         num += n
117 |         t = next_t
118 |     return num, loss, acc
119 | 
120 | def evaluate(net, inputs, targets, batch_size):
121 |     dataset = chainer.datasets.TupleDataset(inputs, targets)
122 |     iterator = chainer.iterators.SerialIterator(dataset, batch_size, shuffle=False, repeat=False)
123 |     loss_sum = 0
124 |     acc_sum = 0
125 |     total_num = 0
126 |     for batch in iterator:
127 |         xs, ts, permutation = concat_examples(batch)
128 |         num, loss, acc = forward(net, xs, ts, permutation, train=False)
129 |         loss_sum += float(loss.data)
130 |         acc_sum += float(acc.data)
131 |         total_num += float(num)
132 |     return total_num, loss_sum, acc_sum
133 | 
134 | def test(net, inputs, test_token_len, beam_width=10):
135 |     xp = net.xp
136 |     from_sentences = []
137 |     sentences = []
138 |     for xs in inputs:
139 |         net.reset_state()
140 |         for raw_x in xs.data:
141 |             x = xp.full((beam_width,), raw_x, dtype=np.int32)
142 |             x = chainer.Variable(x, volatile=True)
143 |             net(x, decode=False, train=False)
144 |         candidates = [(None, [begin_id], 0)]
145 |         for i in six.moves.range(test_token_len):
146 |             next_candidates = []
147 |             current_candidates = []
148 |             x = []
149 |             for sub_state, tokens, likelihood in candidates:
150 |                 if tokens[-1] == end_id:
151 |                     continue
152 |                 if sub_state != None:
153 |                     net.set_sub_state(len(x), sub_state)
154 |                 current_candidates.append((len(x), tokens, likelihood))
155 |                 x.append(tokens[-1])
156 |             x = chainer.Variable(xp.asarray(x, dtype=np.int32), volatile=True)
157 |             y = F.log_softmax(net(x, decode=True, train=False))
158 |             for j, tokens, likelihood in current_candidates:
159 |                 sub_state = net.get_sub_state(j)
160 |                 token_likelihoods = cuda.to_cpu(y.data[0])
161 |                 top_tokens = token_likelihoods.argsort()[-beam_width:]
162 |                 next_candidates.extend([(sub_state, tokens + [j], likelihood + token_likelihoods[j]) for j in top_tokens])
163 |             candidates = sorted(next_candidates, key=lambda x: -x[2])[:beam_width]
164 |             if all([candidate[1][-1] == end_id for candidate in candidates]):
165 |                 break
166 |         sentences.append(candidates[0][1][1:-1])
167 |     return sentences
168 | 
169 |     for xs in inputs:
170 |         while len(tokens) < test_token_len:
171 |             token_id = chainer.Variable(xp.asarray([token_id], dtype=np.int32), volatile=True)
172 |             y = net(token_id, decode=True, train=False)
173 |             token_id = int(xp.argmax(y.data[0]))
174 |             if token_id == end_id:
175 |                 break
176 |             tokens.append(token_id)
177 |         sentences.append(tokens)
178 |     return sentences
179 | 
180 | def output_result(sentences, words, output_path, epoch):
181 |     base, ext = os.path.splitext(output_path)
182 |     with open('{0}_{1:04d}{2}'.format(base, epoch, ext), 'w') as f:
183 |         for token_ids in sentences:
184 |             tokens = map(lambda x: words[x], token_ids)
185 |             f.write(' '.join(tokens) + '\n')
186 | 
187 | def train(net, optimizer, inputs, targets, epoch_num, batch_size, output_path, test_words=None, test_inputs=None, test_targets=None, test_output_path=None, test_token_len=20):
188 |     xp = net.xp
189 |     train_dataset = chainer.datasets.TupleDataset(inputs, targets)
190 |     train_iterator = chainer.iterators.SerialIterator(train_dataset, batch_size)
191 |     train_loss_sum = 0
192 |     train_acc_sum = 0
193 |     train_num = 0
194 |     last_clock = time.clock()
195 |     while train_iterator.epoch < epoch_num:
196 |         batch = train_iterator.next()
197 |         xs, ts, permutation = concat_examples(batch)
198 |         num, loss, acc = forward(net, xs, ts, permutation)
199 |         net.cleargrads()
200 |         loss /= num
201 |         loss.backward()
202 |         optimizer.update()
203 |         train_loss_sum += float(loss.data) * num
204 |         train_acc_sum += float(acc.data)
205 |         train_num += float(num)
206 |         if train_iterator.is_new_epoch:
207 |             print('epoch {} done'.format(train_iterator.epoch))
208 |             print('train loss: {}'.format(train_loss_sum / train_num))
209 |             if test_inputs is not None and test_targets is not None:
210 |                 test_num, test_loss, test_acc = evaluate(net, test_inputs, test_targets, batch_size)
211 |             print('train acc:  {}'.format(train_acc_sum / train_num))
212 |             print('test loss: {}'.format(test_loss / test_num))
213 |             print('test acc:  {}'.format(test_acc / test_num))
214 |             if test_words is not None and test_inputs is not None and test_output_path is not None:
215 |                 test_sentences = test(net, test_inputs, test_token_len)
216 |                 output_result(test_sentences, test_words, test_output_path, train_iterator.epoch)
217 |             current_clock = time.clock()
218 |             print('{}s elapsed'.format(current_clock - last_clock))
219 |             last_clock = current_clock
220 |             train_loss_sum = 0
221 |             train_acc_sum = 0
222 |             train_num = 0
223 |             serializers.save_npz(output_path, net)
224 |             if train_iterator.epoch >= 12 and (train_iterator.epoch - 12) % 8 == 0:
225 |                 optimizer.alpha *= 0.5
226 | 
227 | if __name__ == '__main__':
228 |     parser = argparse.ArgumentParser(description='Translation training')
229 |     parser.add_argument('data_path', type=str, help='Dataset file path')
230 |     parser.add_argument('input_lang', type=str, help='Input language')
231 |     parser.add_argument('output_lang', type=str, help='Output language')
232 |     parser.add_argument('output_model', type=str, help='Output model file path')
233 |     parser.add_argument('test_result', type=str, help='Test result file path')
234 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index, -1 indicates CPU')
235 |     parser.add_argument('--epoch', '-e', type=int, default=50, help='Number of epochs')
236 |     parser.add_argument('--batch-size', '-b', type=int, default=100, help='Mini batch size')
237 |     parser.add_argument('--hidden-size', type=int, default=256, help='Model hidden layer size')
238 |     parser.add_argument('--max_result_len', type=int, default=20, help='Maximum test result token length')
239 |     args = parser.parse_args()
240 | 
241 |     with open(args.data_path, 'rb') as f:
242 |         dataset = pickle.load(f)
243 |     input_words = dataset[args.input_lang]['words']
244 |     input_train = dataset[args.input_lang]['train']
245 |     input_test = dataset[args.input_lang]['test']
246 |     output_words = dataset[args.output_lang]['words']
247 |     output_train = dataset[args.output_lang]['train']
248 |     output_test = dataset[args.output_lang]['test']
249 | 
250 |     inputs = list(map(lambda x: chainer.Variable(np.asarray(x[::-1], dtype=np.int32), volatile=True), input_train))
251 |     max_output_len = max(map(len, output_train))
252 |     targets = list(map(lambda x: chainer.Variable(np.asarray([begin_id] + x + [end_id], dtype=np.int32), volatile=True), output_train))
253 |     test_inputs = list(map(lambda x: chainer.Variable(np.asarray(x[::-1], dtype=np.int32), volatile=True), input_test))
254 |     test_targets = list(map(lambda x: chainer.Variable(np.asarray([begin_id] + x + [end_id], dtype=np.int32), volatile=True), output_test))
255 | 
256 |     net = EncoderDecoder(len(input_words), len(output_words), args.hidden_size)
257 |     optimizer = chainer.optimizers.Adam(alpha=0.001)
258 |     optimizer.setup(net)
259 |     if args.gpu >= 0:
260 |         cuda.get_device(args.gpu).use()
261 |         net.to_gpu(args.gpu)
262 |     train(net, optimizer, inputs, targets, args.epoch, args.batch_size, args.output_model, test_words=output_words, test_inputs=test_inputs, test_targets=test_targets, test_output_path=args.test_result, test_token_len=args.max_result_len)
263 | 


--------------------------------------------------------------------------------
/src/vae/image/.gitkeep:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/dsanno/deep-learning-study/3b648848f65299fb478e8d3746eb9ebbf87ad0b3/src/vae/image/.gitkeep


--------------------------------------------------------------------------------
/src/vae/net.py:
--------------------------------------------------------------------------------
 1 | import numpy as np
 2 | import chainer
 3 | import chainer.functions as F
 4 | import chainer.links as L
 5 | from chainer import cuda, Variable
 6 | 
 7 | class VAE(chainer.Chain):
 8 | 
 9 |     def __init__(self, hidden_size=500, latent_size=30):
10 |         super(VAE, self).__init__(
11 |             enc1 = L.Linear(28 * 28, hidden_size),
12 |             enc2 = L.Linear(hidden_size, hidden_size),
13 |             enc_mean = L.Linear(hidden_size, latent_size),
14 |             enc_var  = L.Linear(hidden_size, latent_size),
15 |             dec1 = L.Linear(latent_size, hidden_size),
16 |             dec2 = L.Linear(hidden_size, hidden_size),
17 |             dec3 = L.Linear(hidden_size, 28 * 28)
18 |         )
19 | 
20 |     def __call__(self, x, train=True):
21 |         xp = self.xp
22 |         h1 = F.relu(self.enc1(x))
23 |         h2 = F.relu(self.enc2(h1))
24 |         mean = self.enc_mean(h2)
25 |         var  = 0.5 * self.enc_var(h2)
26 |         rand = xp.random.normal(0, 1, var.data.shape).astype(mean.dtype)
27 |         z  = mean + F.exp(var) * rand
28 |         g1 = F.relu(self.dec1(z))
29 |         g2 = F.relu(self.dec2(g1))
30 |         g3 = F.sigmoid(self.dec3(g2))
31 |         return (g3, mean, var)
32 | 
33 |     def generate(self, z):
34 |         g1 = F.relu(self.dec1(z))
35 |         g2 = F.relu(self.dec2(g1))
36 |         return F.sigmoid(self.dec3(g2))
37 | 
38 | class VAEM2(chainer.Chain):
39 | 
40 |     def __init__(self, hidden_size=500, latent_size=30, class_size=10):
41 |         super(VAEM2, self).__init__(
42 |             enc_embed=L.EmbedID(class_size, hidden_size),
43 |             enc1=L.Linear(28 * 28, hidden_size),
44 |             enc2=L.Linear(hidden_size, hidden_size),
45 |             enc_mean=L.Linear(hidden_size, latent_size),
46 |             enc_var=L.Linear(hidden_size, latent_size),
47 |             dec_embed=L.EmbedID(class_size, hidden_size),
48 |             dec1=L.Linear(latent_size, hidden_size),
49 |             dec2=L.Linear(hidden_size, hidden_size),
50 |             dec3=L.Linear(hidden_size, 28 * 28)
51 |         )
52 | 
53 |     def __call__(self, x, t, gen_t, train=True):
54 |         xp = self.xp
55 |         h1 = F.relu(self.enc1(x) + self.enc_embed(t))
56 |         h2 = F.relu(self.enc2(h1))
57 |         mean = self.enc_mean(h2)
58 |         var  = 0.5 * self.enc_var(h2)
59 |         rand = xp.random.normal(0, 1, var.data.shape).astype(mean.dtype)
60 |         z  = mean + F.exp(var) * rand
61 |         g1 = F.relu(self.dec1(z) + self.dec_embed(gen_t))
62 |         g2 = F.relu(self.dec2(g1))
63 |         g3 = F.sigmoid(self.dec3(g2))
64 |         return (g3, mean, var)
65 | 
66 |     def generate(self, z, t):
67 |         g1 = F.relu(self.dec1(z) + self.dec_embed(t))
68 |         g2 = F.relu(self.dec2(g1))
69 |         return F.sigmoid(self.dec3(g2))
70 | 


--------------------------------------------------------------------------------
/src/vae/train.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import argparse
  5 | import math
  6 | import numpy as np
  7 | import six
  8 | import time
  9 | from PIL import Image
 10 | 
 11 | import chainer
 12 | from chainer import cuda
 13 | from chainer import functions as F
 14 | from chainer import links as L
 15 | from chainer import optimizers
 16 | from chainer import serializers
 17 | from chainer.dataset import convert
 18 | 
 19 | from net import VAE
 20 | 
 21 | latent_size = 30
 22 | 
 23 | 
 24 | # for data augmentation
 25 | def translate(x, offset):
 26 |     size = 28
 27 |     org_shape = x.shape
 28 |     batch_size = x.shape[0]
 29 |     xp = cuda.get_array_module(x)
 30 |     x = x.reshape((-1, 1, size, size))
 31 |     y = xp.zeros_like(x)
 32 |     y = y.reshape((-1, 1, size, size))
 33 |     offsets = xp.random.randint(-offset, offset + 1, (batch_size, 2))
 34 |     for i in six.moves.range(batch_size):
 35 |         left, top = offsets[i]
 36 |         right = min(size, size + left)
 37 |         bottom = min(size, size + top)
 38 |         left = max(0, left)
 39 |         top = max(0, top)
 40 |         y[i,:,size-bottom:size-top,size-right:size-left] = x[i,:,top:bottom,left:right]
 41 |     return y.reshape(org_shape)
 42 | 
 43 | def update(net, optimizer, x):
 44 |     xp = cuda.get_array_module(x)
 45 |     div_weight = 1
 46 | 
 47 |     y, mean, var = net(x)
 48 |     loss = F.mean_squared_error(x, y) + div_weight * F.gaussian_kl_divergence(mean, var) / float(y.size)
 49 |     net.cleargrads()
 50 |     loss.backward()
 51 |     optimizer.update()
 52 |     return loss
 53 | 
 54 | def generate_image(net, z, file_path):
 55 |     x = net.generate(z)
 56 |     im = cuda.to_cpu(x.data)
 57 |     im = im.reshape((10, 10, 28, 28)).transpose((0, 2, 1, 3)).reshape((280, 280))
 58 |     im = ((1 - im) * 256).clip(0, 255).astype(np.uint8)
 59 |     Image.fromarray(im).save(file_path)
 60 | 
 61 | def train(net, optimizer, iterator, iteration, name):
 62 |     xp = net.xp
 63 |     loss_sum = 0
 64 |     loss_num = 0
 65 |     test_z1 = xp.random.uniform(-1, 1, (10, 1, latent_size)).astype(np.float32)
 66 |     test_z2 = xp.random.uniform(-1, 1, (10, 1, latent_size)).astype(np.float32)
 67 |     weights = xp.arange(10).astype(np.float32).reshape((1, 10, 1)).repeat(10, axis=0) / 9
 68 |     test_z = (1 - weights) * test_z1 + weights * test_z2
 69 |     test_z = test_z.reshape((-1, latent_size))
 70 |     last_clock = time.clock()
 71 |     for i in six.moves.range(iteration):
 72 |         batch = train_iterator.next()
 73 |         x, t = convert.concat_examples(batch)
 74 |         x = xp.asarray(translate(x, 2))
 75 |         loss = update(net, optimizer, x)
 76 |         loss_sum += float(loss.data)
 77 |         loss_num += 1
 78 | 
 79 |         if (i + 1) % 1000 == 0:
 80 |             current_clock = time.clock()
 81 |             print('iteration {} done {}s elapsed'.format(i + 1, current_clock - last_clock))
 82 |             last_clock = current_clock
 83 |             print('loss: {}'.format(loss_sum / loss_num))
 84 |             loss_sum = 0
 85 |             loss_num = 0
 86 |             generate_image(net, test_z, '{0}_{1:06d}.png'.format(name, i + 1))
 87 |             serializers.save_npz('{}.model'.format(name), net)
 88 |     train_iterator.finalize()
 89 | 
 90 | if __name__ == '__main__':
 91 | 
 92 |     parser = argparse.ArgumentParser(description='Variational Auto Encoder sample')
 93 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index, -1 indicates CPU')
 94 |     parser.add_argument('--iter', '-i', type=int, default=30000, help='Number of iterations')
 95 |     parser.add_argument('--batch-size', '-b', type=int, default=100, help='Mini batch size')
 96 |     parser.add_argument('--name', '-n', type=str, default='image/vae', help='saved file name')
 97 |     args = parser.parse_args()
 98 | 
 99 |     batch_size = args.batch_size
100 |     net = VAE()
101 |     gpu_device = args.gpu
102 |     if gpu_device >= 0:
103 |         chainer.cuda.get_device(gpu_device).use()
104 |         net.to_gpu(gpu_device)
105 |         xp = cuda.cupy
106 |     else:
107 |         xp = np
108 |     optimizer = optimizers.Adam()
109 |     optimizer.setup(net)
110 | 
111 |     train_data, test_data = chainer.datasets.get_mnist()
112 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
113 |     train(net, optimizer, train_iterator, args.iter, args.name)
114 | 


--------------------------------------------------------------------------------
/src/vae/train_m2.py:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env python
  2 | # -*- coding: utf-8 -*-
  3 | 
  4 | import argparse
  5 | import math
  6 | import numpy as np
  7 | import six
  8 | import time
  9 | from PIL import Image
 10 | 
 11 | import chainer
 12 | from chainer import cuda
 13 | from chainer import functions as F
 14 | from chainer import links as L
 15 | from chainer import optimizers
 16 | from chainer import serializers
 17 | from chainer.dataset import convert
 18 | 
 19 | from net import VAEM2
 20 | 
 21 | latent_size = 30
 22 | 
 23 | 
 24 | # for data augmentation
 25 | def translate(x, offset):
 26 |     size = 28
 27 |     org_shape = x.shape
 28 |     batch_size = x.shape[0]
 29 |     xp = cuda.get_array_module(x)
 30 |     x = x.reshape((-1, 1, size, size))
 31 |     y = xp.zeros_like(x)
 32 |     y = y.reshape((-1, 1, size, size))
 33 |     offsets = xp.random.randint(-offset, offset + 1, (batch_size, 2))
 34 |     for i in six.moves.range(batch_size):
 35 |         left, top = offsets[i]
 36 |         right = min(size, size + left)
 37 |         bottom = min(size, size + top)
 38 |         left = max(0, left)
 39 |         top = max(0, top)
 40 |         y[i,:,size-bottom:size-top,size-right:size-left] = x[i,:,top:bottom,left:right]
 41 |     return y.reshape(org_shape)
 42 | 
 43 | def update(net, optimizer, x, t):
 44 |     xp = cuda.get_array_module(x)
 45 |     div_weight = 1
 46 | 
 47 |     y, mean, var = net(x, t, t)
 48 |     loss = F.mean_squared_error(x, y) + div_weight * F.gaussian_kl_divergence(mean, var) / float(y.size)
 49 |     net.cleargrads()
 50 |     loss.backward()
 51 |     optimizer.update()
 52 |     return loss
 53 | 
 54 | def generate_image(net, x, t, file_path):
 55 |     xp = cuda.get_array_module(x)
 56 |     class_size = 10
 57 |     x2 = xp.repeat(x, class_size, axis=0)
 58 |     t2 = xp.repeat(t, class_size, axis=0)
 59 |     gen_t = xp.arange(class_size * x.shape[0]).astype(np.int32) % class_size
 60 |     y, mean, var = net(x2, t2, gen_t)
 61 |     im_x = cuda.to_cpu(x)
 62 |     im_x = im_x.reshape((280, 28))
 63 |     im_y = cuda.to_cpu(y.data).reshape((10, 10, 28, 28)).transpose((0, 2, 1, 3)).reshape((280, 280))
 64 |     im = np.concatenate((im_x, im_y), axis=1)
 65 |     im = ((1 - im) * 256).clip(0, 255).astype(np.uint8)
 66 |     Image.fromarray(im).save(file_path)
 67 | 
 68 | def train(net, optimizer, iterator, iteration, test_data, name):
 69 |     xp = net.xp
 70 |     loss_sum = 0
 71 |     loss_num = 0
 72 |     test_x, test_t = convert.concat_examples(test_data[:10])
 73 |     test_x = xp.asarray(test_x)
 74 |     test_t = xp.asarray(test_t)
 75 |     last_clock = time.clock()
 76 |     for i in six.moves.range(iteration):
 77 |         batch = train_iterator.next()
 78 |         x, t = convert.concat_examples(batch)
 79 |         x = xp.asarray(translate(x, 2))
 80 |         t = xp.asarray(t)
 81 |         loss = update(net, optimizer, x, t)
 82 |         loss_sum += float(loss.data)
 83 |         loss_num += 1
 84 | 
 85 |         if (i + 1) % 1000 == 0:
 86 |             current_clock = time.clock()
 87 |             print('iteration {} done {}s elapsed'.format(i + 1, current_clock - last_clock))
 88 |             last_clock = current_clock
 89 |             print('loss: {}'.format(loss_sum / loss_num))
 90 |             loss_sum = 0
 91 |             loss_num = 0
 92 |             generate_image(net, test_x, test_t, '{0}_{1:06d}.png'.format(name, i + 1))
 93 |             serializers.save_npz('{}.model'.format(name), net)
 94 |     train_iterator.finalize()
 95 | 
 96 | if __name__ == '__main__':
 97 | 
 98 |     parser = argparse.ArgumentParser(description='Variational Auto Encoder M2 sample')
 99 |     parser.add_argument('--gpu', '-g', type=int, default=-1, help='GPU device index, -1 indicates CPU')
100 |     parser.add_argument('--iter', '-i', type=int, default=30000, help='Number of iterations')
101 |     parser.add_argument('--batch-size', '-b', type=int, default=100, help='Mini batch size')
102 |     parser.add_argument('--name', '-n', type=str, default='image/vae_m2', help='saved file name')
103 |     args = parser.parse_args()
104 | 
105 |     batch_size = args.batch_size
106 |     net = VAEM2()
107 |     gpu_device = args.gpu
108 |     if gpu_device >= 0:
109 |         chainer.cuda.get_device(gpu_device).use()
110 |         net.to_gpu(gpu_device)
111 |         xp = cuda.cupy
112 |     else:
113 |         xp = np
114 |     optimizer = optimizers.Adam()
115 |     optimizer.setup(net)
116 | 
117 |     train_data, test_data = chainer.datasets.get_mnist()
118 |     train_iterator = chainer.iterators.SerialIterator(train_data, batch_size)
119 |     train(net, optimizer, train_iterator, args.iter, test_data, args.name)
120 | 


--------------------------------------------------------------------------------