├── redes.png
├── LICENSE
├── README.md
├── codes
│   ├── feedforward_multclass.py
│   ├── cnn.py
│   ├── rbm.py
│   ├── feedforward_binary.py
│   ├── lstm.py
│   ├── gan.py
│   └── autoencoder.py
├── libraries.txt
└── deepLearning_LSTM.ipynb
/redes.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hfarruda/deeplearningtutorial/HEAD/redes.png
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
1 | Deep learning tutorial (c) by Henrique Ferraz de Arruda, Alexandre Benatti, César Henrique Comin, and Luciano da Fontoura Costa
2 |
3 | Deep learning tutorial is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
4 |
5 | You should have received a copy of the license along with this work.
6 | If not, see .
7 |
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | # Deep Learning Tutorial
2 |
3 | This tutorial is part of the didactic text: [Learning Deep Learning](https://www.scielo.br/j/rbef/a/hMZfS8hRwMvVktkbCZtjJff/?format=html), authored by Henrique Ferraz de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa.
4 |
5 | The purpose of this tutorial is to provide simple didactic examples of deep learning architectures and their use in problem solving. The code included here is based on toy datasets and restricted to parameters that allow short processing times. It is therefore not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code comes with absolutely no warranty.
6 |
7 | For all the code presented here, we use [Keras](https://keras.io/) as the deep learning library. Keras is a useful and straightforward framework that can be employed for both simple and complex tasks. It is written in Python, which leads to largely self-explanatory code, and has the additional advantage of running on the [TensorFlow](https://www.tensorflow.org/) backend. We also employ [Scikit-learn](https://scikit-learn.org/), a library devoted to machine learning.
8 |
9 | 
10 |
11 | More details are available at [Learning Deep Learning](https://www.scielo.br/j/rbef/a/hMZfS8hRwMvVktkbCZtjJff/?format=html).
12 |
13 |
14 | ## Feedforward networks
15 |
16 | ### Binary Classification
17 | This is the first example of deep learning implementation, in which we address binary classification of wine data. In this example, we consider one feedforward network with 5 hidden layers and with 30 neurons in each layer. The provided networks were built only for a didactic purpose and are not appropriate for real applications.
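
As a rough illustration of how a binary output is interpreted, the sketch below applies a sigmoid to the last-layer output, thresholds the result at 0.5, and computes the binary cross-entropy loss. It is written in plain Python so that it runs without Keras; the `logits` and `labels` values are hypothetical and are not taken from the wine data.

```python
import math

def sigmoid(z):
    """Logistic function mapping any real number to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def binary_cross_entropy(y_true, y_prob, eps=1e-7):
    """Mean binary cross-entropy; eps clips probabilities to avoid log(0)."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)

# Hypothetical pre-activation outputs of the last layer for four samples.
logits = [2.0, -1.0, 0.5, -3.0]
# Hypothetical true classes of the same four samples.
labels = [1, 0, 0, 0]

probabilities = [sigmoid(z) for z in logits]
# Values higher than 0.5 are assigned to class 1, otherwise to class 0.
predicted = [1 if p > 0.5 else 0 for p in probabilities]
accuracy = sum(int(a == b) for a, b in zip(predicted, labels)) / len(labels)
loss = binary_cross_entropy(labels, probabilities)

print("Predicted classes:", predicted)
print("Accuracy =", accuracy)
print("Binary cross-entropy =", loss)
```

The third sample is misclassified because its probability is just above the threshold, which is why the accuracy and the loss are reported together.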
18 |
19 | ### Multiclass Classification
20 | In this example, we illustrate multiclass classification using a wine dataset with three classes, each corresponding to a different cultivar. We employ the same dataset presented above, but here we consider all three classes. To do so, we use the *softmax* activation function in the output layer.
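
The following minimal sketch shows what softmax does with the raw scores of a three-class output layer, and how the predicted class and the categorical cross-entropy follow from it. It uses plain Python rather than Keras, and the `scores` vector is a hypothetical example, not an actual network output.

```python
import math

def softmax(scores):
    """Convert raw scores into probabilities that sum to one."""
    highest = max(scores)
    # Subtracting the maximum keeps exp() numerically stable.
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def one_hot(label, number_of_classes):
    """Encode an integer class label as a one-hot vector."""
    return [1.0 if i == label else 0.0 for i in range(number_of_classes)]

def categorical_cross_entropy(target_one_hot, probabilities, eps=1e-7):
    """Cross-entropy between a one-hot target and predicted probabilities."""
    return -sum(t * math.log(max(p, eps))
                for t, p in zip(target_one_hot, probabilities))

# Hypothetical raw outputs of the last layer for one sample.
scores = [1.0, 3.0, 0.5]
probabilities = softmax(scores)

# The predicted class is the index of the largest probability.
predicted_class = probabilities.index(max(probabilities))

# Hypothetical true class (1) encoded as a one-hot vector.
target = one_hot(1, 3)
loss = categorical_cross_entropy(target, probabilities)

print("Probabilities:", probabilities)
print("Predicted class:", predicted_class)
print("Categorical cross-entropy =", loss)
```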
21 |
22 | [](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_feedforward.ipynb)
23 |
24 |
25 | ## Convolutional Neural Network (CNN)
26 | This tutorial is the second example of deep learning implementation, in which we illustrate an image classification task. More specifically, we consider the ten classes of color images in the CIFAR10 dataset.
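
To give a feeling for how the image dimensions shrink along the network, the sketch below works through the arithmetic for 32x32 inputs, 3x3 kernels, 2x2 pooling, and 128 filters, the values used in `codes/cnn.py`. It assumes three blocks, each made of a convolution followed by pooling, with the Keras defaults of no padding, a convolution stride of 1, and a pooling stride equal to the pool size. The helper names are hypothetical.

```python
def conv_output_size(size, kernel, stride=1):
    """Output side length of a convolution without padding."""
    return (size - kernel) // stride + 1

def pool_output_size(size, pool):
    """Output side length of max pooling with stride equal to the pool size."""
    return (size - pool) // pool + 1

size = 32                 # CIFAR10 images are 32x32 pixels
filters = 128             # number of filters in every convolutional layer
number_of_cnn_layers = 3  # assumed number of convolution + pooling blocks

shapes = []
for _ in range(number_of_cnn_layers):
    size = conv_output_size(size, kernel=3)
    shapes.append(("conv", size, size, filters))
    size = pool_output_size(size, pool=2)
    shapes.append(("pool", size, size, filters))

# Number of values handed to the dense layers after flattening.
flattened_units = size * size * filters

for shape in shapes:
    print(shape)
print("Flattened units:", flattened_units)
```

Batch normalization does not change the shape, so it is left out of the calculation.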
27 |
28 | [](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_CNN.ipynb)
29 |
30 |
31 | ## Long Short-Term Memory (LSTM)
32 |
33 | This is the third example of deep learning implementation. Here we use an LSTM network to predict Bitcoin prices over time, using a time series as input.
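
A common way to feed a time series to a recurrent network is to cut it into fixed-length windows, each paired with the value that follows it. The sketch below shows this idea in plain Python. It is an assumption about the preprocessing, not necessarily the exact procedure used in the notebook; `make_windows`, the window size, and the `prices` list are hypothetical.

```python
def make_windows(series, window_size):
    """Split a series into (input window, next value) training pairs."""
    inputs, targets = [], []
    for start in range(len(series) - window_size):
        inputs.append(series[start:start + window_size])
        targets.append(series[start + window_size])
    return inputs, targets

# Hypothetical price series, used only to illustrate the windowing.
prices = [10.0, 11.0, 12.0, 13.0, 14.0, 15.0]
inputs, targets = make_windows(prices, window_size=3)

for window, target in zip(inputs, targets):
    print(window, "->", target)
```

Each window becomes one training sample, and the network learns to predict the target value from the preceding values.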
34 |
35 | [](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_LSTM.ipynb)
36 |
37 |
38 | ## Restricted Boltzmann Machine (RBM)
39 |
40 | This is the fourth example of deep learning implementation. Here we use an RBM network to provide a recommendation system of musical instruments.
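
The recommendation step can be sketched as follows: the ratings of a user are mapped to hidden-unit probabilities, which are then projected back onto the items to obtain one score per item. The sketch mirrors the computation in `codes/rbm.py` (hidden activations multiplied by the transposed weight matrix) but uses plain Python. The weights, biases, and the `recommend` helper are hypothetical toy values, not a trained model.

```python
import math

def sigmoid(z):
    """Logistic function mapping any real number to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def hidden_probabilities(visible, weights, hidden_bias):
    """Activation probability of each hidden unit given the visible vector.

    weights[j][i] connects hidden unit j to visible unit i, the same
    (n_components, n_features) layout as BernoulliRBM.components_.
    """
    return [sigmoid(sum(w * v for w, v in zip(row, visible)) + bias)
            for row, bias in zip(weights, hidden_bias)]

def visible_scores(hidden, weights):
    """Project the hidden activations back onto the items."""
    number_of_items = len(weights[0])
    return [sum(weights[j][i] * hidden[j] for j in range(len(hidden)))
            for i in range(number_of_items)]

def recommend(visible, weights, hidden_bias, top_k=2):
    """Return the top_k best-scored items that the user has not acquired."""
    hidden = hidden_probabilities(visible, weights, hidden_bias)
    scores = visible_scores(hidden, weights)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [i for i in ranked if visible[i] == 0][:top_k]

# Hypothetical toy model with two hidden units and four items.
weights = [[2.0, 1.5, -1.0, 0.5],
           [-1.0, -1.0, 2.0, 0.0]]
hidden_bias = [0.0, 0.0]

# A user who acquired only the first item.
user = [1, 0, 0, 0]
print("Recommended items:", recommend(user, weights, hidden_bias))
```

Like the tutorial code, this sketch ignores the visible bias when scoring the items.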
41 |
42 | [](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_RBM.ipynb)
43 |
44 |
45 | ## Autoencoders
46 | This example uses the Autoencoder model to illustrate a possible application. Here we show how to use the resulting codes to reduce the dimensionality of the data. We also project our data by using Principal Component Analysis (PCA).
47 |
48 | [](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_autoencoder.ipynb)
49 |
50 |
51 | ## Generative Adversarial Networks (GAN)
52 | This example builds a network that can generate handwritten characters automatically.
53 |
54 | [](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_GAN.ipynb)
55 |
56 |
57 | ## Libraries
58 | All of these codes were developed and executed with the environment described in "libraries.txt".
59 |
60 | ## Citation Request
61 | If you publish a paper related to this material, please cite:
62 |
63 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning deep learning." Revista Brasileira de Ensino de Física 44, 2022.
64 |
65 |
66 | ## Acknowledgments
67 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). H. F. de Arruda also thanks Soremartec S.A. and Soremartec Italia, Ferrero Group, for partial financial support (from 1st July 2021). His funders had no role in study design, data collection, and analysis, decision to publish, or manuscript preparation. Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and FAPESP (proc. 15/22308-2) for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 2018/09125-4 and 2021/12354-8) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 15/22308-2.
68 |
--------------------------------------------------------------------------------
/codes/feedforward_multclass.py:
--------------------------------------------------------------------------------
1 | # -*- coding: utf-8 -*-
2 | """deepLearning_feedforward.ipynb
3 |
4 | Automatically generated by Colaboratory.
5 |
6 | Original file is located at
7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_feedforward.ipynb
8 |
9 | #Feedforward networks
10 |
11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. These codes have absolutely no warranty.
12 |
13 | If you publish a paper related to this material, please cite:
14 |
15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019.
16 |
17 | ##Multiclass Classification
18 | In this example, we illustrate multiclass classification using a wine dataset with three classes, each corresponding to a different cultivar. We employ the same dataset presented above, but here we consider all three classes. To do so, we use the *softmax* activation function in the output layer.
19 |
20 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend).
21 | """
22 |
23 | import numpy as np
24 | import keras
25 | from keras.utils import np_utils
26 | from keras.models import Sequential
27 | from keras.layers import Dense, Dropout
28 | from sklearn.datasets import load_wine
29 | from sklearn.model_selection import train_test_split
30 | from sklearn.metrics import accuracy_score
31 | from sklearn.preprocessing import LabelEncoder
32 | from sklearn.metrics import confusion_matrix
33 |
34 | """If you have a GPU, you can use the following code to allocate processing to it. Otherwise, proceed to (*)."""
35 |
36 | import tensorflow as tf
37 | from keras import backend as K
38 |
39 | print(K.tensorflow_backend._get_available_gpus())
40 |
41 | number_of_cpu_cores = 8
42 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores})
43 | session = tf.Session(config=config)
44 | keras.backend.set_session(session)
45 |
46 | """(*) This example uses the Wine dataset, which is available in the Scikit-learn library at [sklearn-datasets-wine](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_wine.html). For more information, see [wine-UCI](https://archive.ics.uci.edu/ml/datasets/Wine).
47 |
48 | These data show the results of a chemical analysis of wines grown in Italy, derived from three different cultivars in the same region, and can be loaded as follows.
49 | """
50 |
51 | wine = load_wine()
52 | data = wine['data']
53 | target = wine['target']
54 | target_names = wine['target_names']
55 |
56 | label_encoder = LabelEncoder()
57 | target = label_encoder.fit_transform(target)
58 | target_one_hot_encoding = np_utils.to_categorical(target)
59 |
60 | #Here, we divide our dataset into training and test sets.
61 | test_size = 0.25 #fraction
62 | training_data,test_data,training_target,test_target = train_test_split(data,
63 | target_one_hot_encoding, test_size=test_size)
64 |
65 | """In the following, we configure the neural network. There is no need to enable the bias explicitly, because Keras `Dense` layers use a bias by default (`use_bias=True`)."""
66 |
67 | #Set of parameters
68 | input_dim = data.shape[1]
69 | kernel_initializer = 'random_uniform'
70 | bias_initializer='zeros'
71 | activation_function_hidden = 'relu'
72 | activation_function_output = 'softmax'
73 | optimizer = 'adam'
74 | loss = 'categorical_crossentropy'
75 | metrics = ['categorical_accuracy']
76 | number_of_layers = 5
77 | number_of_units_hidden = 30
78 | number_of_units_output = len(set(target_names))
79 | dropout_percentage = 0.25
80 |
81 |
82 | #Creating model
83 | ff_model = Sequential()
84 | ff_model.add(Dense(units = number_of_units_hidden,
85 | activation = activation_function_hidden,
86 | kernel_initializer = kernel_initializer,
87 | input_dim = input_dim))
88 |
89 | for i in range(number_of_layers-1):
90 |     #Inserting a dense hidden layer
91 |     ff_model.add(Dense(units = number_of_units_hidden,
92 |                        activation = activation_function_hidden,
93 |                        kernel_initializer = kernel_initializer,
94 |                        input_dim = number_of_units_hidden))
95 |     #Inserting dropout
96 |     ff_model.add(Dropout(dropout_percentage))
97 |
98 | ff_model.add(Dense(units = number_of_units_output,
99 | activation = activation_function_output))
100 | ff_model.compile(optimizer = optimizer, loss = loss, metrics = metrics)
101 | ff_model.summary()
102 |
103 | """The training step is executed as follows."""
104 |
105 | batch_size = 10
106 | epochs = 250
107 | ff_model.fit(training_data,training_target, batch_size = batch_size,
108 | epochs = epochs)
109 |
110 | """Because there are three classes, we show the classification results through a confusion matrix."""
111 |
112 | predictions = ff_model.predict(test_data)
113 |
114 | found_target = predictions.argmax(axis=1)
115 | categorical_test_target = test_target.argmax(axis=1)
116 |
117 | accuracy = accuracy_score(categorical_test_target, found_target)
118 | print("Accuracy =", accuracy)
119 |
120 | print("Confusion matrix:")
121 | matrix = confusion_matrix(categorical_test_target, found_target)
122 | print(matrix)
123 |
124 | """
125 | ## License
126 |
127 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License.
128 |
129 | ## Acknowledgments
130 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
131 | """
132 |
--------------------------------------------------------------------------------
/codes/cnn.py:
--------------------------------------------------------------------------------
1 | # -*- coding: utf-8 -*-
2 | """deepLearning_CNN.ipynb
3 |
4 | Automatically generated by Colaboratory.
5 |
6 | Original file is located at
7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_CNN.ipynb
8 |
9 | # Convolutional Neural Network (CNN)
10 |
11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty.
12 |
13 | If you publish a paper related to this material, please cite:
14 |
15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019.
16 |
17 | This tutorial is the second example of deep learning implementation, in which we exemplify a classification task. More specifically, we considered ten classes of color pictures.
18 |
19 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend).
20 | """
21 |
22 | import keras
23 | from keras.utils import np_utils
24 | from keras.models import Sequential
25 | from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout
26 | from keras.layers.normalization import BatchNormalization
27 | from keras.preprocessing.image import ImageDataGenerator
28 | from keras.preprocessing import image
29 | from keras.datasets import cifar10
30 | from keras.utils.vis_utils import plot_model
31 | from sklearn.metrics import accuracy_score
32 | import numpy as np
33 | import matplotlib.pyplot as plt
34 | from sklearn.metrics import classification_report, confusion_matrix
35 |
36 | """If you have a GPU, you can use the following code to allocate processing to it. Otherwise, proceed to (*)."""
37 |
38 | import tensorflow as tf
39 | from keras import backend as K
40 |
41 | print(K.tensorflow_backend._get_available_gpus())
42 |
43 | number_of_cpu_cores = 8
44 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores})
45 | session = tf.Session(config=config)
46 | keras.backend.set_session(session)
47 |
48 | """(*) In this example, we use CIFAR10, a dataset of color images. It is available in the Keras library at [keras-datasets](https://keras.io/datasets/).
49 | This dataset is organized into two parts. The first, called x_train/x_test, comprises RGB images with dimensions of 32x32x3. The second, called y_train/y_test, holds the targets, which are arrays of category labels from 0 to 9.
50 |
51 | The following command is used to load the data set.
52 | """
53 |
54 | (train_data, train_target), (test_data, test_target) = cifar10.load_data()
55 |
56 | train_target_one_hot_encoding = np_utils.to_categorical(train_target)
57 |
58 | """In order to visualize a given figure, the following code can be executed."""
59 |
60 | image_id = 700
61 | plt.imshow(test_data[image_id])
62 | plt.title("Test image: " + str(image_id))
63 | plt.show()
64 |
65 | """In the following, we define the network topology. In this case, because of the redundancy typically found in images, we do not employ dropout in the convolutional layers."""
66 |
67 | input_shape = train_data.shape[1:]
68 | filters = 128
69 | kernel_size = (3,3)
70 | pool_size = (2,2)
71 |
72 | optimizer = 'adam'
73 | loss = 'categorical_crossentropy'
74 | metrics = ['categorical_accuracy']
75 | activation = 'relu'
76 | activation_function_output = 'softmax'
77 | number_of_cnn_layers = 3
78 | number_of_ff_layers = 3
79 | number_of_units_output = train_target_one_hot_encoding.shape[1]
80 |
81 | cnn_model = Sequential()
82 | cnn_model.add(Conv2D(filters, kernel_size, input_shape = input_shape,
83 | activation = activation))
84 |
85 | cnn_model.add(BatchNormalization())
86 | cnn_model.add(MaxPooling2D(pool_size = pool_size))
87 |
88 | for i in range(number_of_cnn_layers-1):
89 |     cnn_model.add(Conv2D(filters, kernel_size, activation = activation))
90 |     cnn_model.add(BatchNormalization())
91 |     cnn_model.add(MaxPooling2D(pool_size = pool_size))
92 |
93 | cnn_model.add(Flatten())
94 |
95 | #Feedforward network
96 | for i in range(number_of_ff_layers):
97 |     cnn_model.add(Dense(units = 128, activation = activation))
98 |     cnn_model.add(Dropout(0.3))
99 |
100 | cnn_model.add(Dense(units = number_of_units_output,
101 | activation = activation_function_output))
102 |
103 | cnn_model.compile(optimizer = optimizer, loss = loss, metrics = metrics)
104 |
105 | """We can use the following command to see the network topology."""
106 |
107 | cnn_model.summary()
108 | #Saving the resultant figure as 'cnn_model.png'.
109 | plot_model(cnn_model, to_file='cnn_model.png', show_shapes=True,
110 | show_layer_names=True)
111 |
112 | """The training step is executed as follows. Because this network demands a high computational power, we can use a small number of epochs."""
113 |
114 | batch_size = 30
115 | epochs = 50
116 |
117 | cnn_model.fit(train_data, train_target_one_hot_encoding,
118 | batch_size = batch_size, epochs = epochs)
119 |
120 | """Since there are more than two classes, we show the classification results through a confusion matrix."""
121 |
122 | predictions = cnn_model.predict(test_data)
123 | found_target = predictions.argmax(axis=1)
124 |
125 | accuracy = accuracy_score(test_target, found_target)
126 | print("Accuracy =", accuracy)
127 |
128 | print("Confusion matrix:")
129 | matrix = confusion_matrix(test_target, found_target)
130 |
131 | plt.title("Confusion matrix:")
132 | plt.xticks(np.linspace(0,9,10))
133 | plt.yticks(np.linspace(0,9,10))
134 | plt.imshow(matrix)
135 | plt.show()
136 |
137 | """
138 | ## License
139 |
140 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License.
141 |
142 | ## Acknowledgments
143 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
144 | """
145 |
--------------------------------------------------------------------------------
/codes/rbm.py:
--------------------------------------------------------------------------------
1 | # -*- coding: utf-8 -*-
2 | """deepLearning_RBM.ipynb
3 |
4 | Automatically generated by Colaboratory.
5 |
6 | Original file is located at
7 | https://colab.research.google.com/drive/1R7ZfTxrtIIG_22IlApzXgWxuilbScLLJ
8 |
9 | # Restricted Boltzmann Machine (RBM)
10 |
11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty.
12 |
13 | If you publish a paper related to this material, please cite:
14 |
15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019.
16 |
17 | This is the fourth example of deep learning implementation. Here we use an RBM network to build a recommendation system for CDs and vinyl records.
18 |
19 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend).
20 | """
21 |
22 | import numpy as np
23 | import pandas as pd
24 | from sklearn.neural_network import BernoulliRBM
25 | import matplotlib.pyplot as plt
26 | import urllib.request
27 | from keras.utils import np_utils
28 | from sklearn.preprocessing import LabelEncoder
29 | import matplotlib.pyplot as plt
30 |
31 | """The following code downloads a dataset of ratings of CDs and vinyl records from the Amazon website ([link](http://snap.stanford.edu/data/amazon/productGraph/)).
32 | These data are divided into four columns, as follows: user id, item id, rating, and timestamp. The latter is removed from our analysis.
33 | """
34 |
35 | main_url = "http://snap.stanford.edu/data/amazon/productGraph/categoryFiles/"
36 | file_name = "ratings_CDs_and_Vinyl.csv"
37 | url = main_url + file_name
38 | col_names = ["user", "item", "rating", "timestamp"]
39 | urllib.request.urlretrieve(url, file_name)
40 | musical_instruments_reviews = pd.read_csv(file_name, names = col_names)
41 |
42 | """In the following, we preprocess the dataset."""
43 |
44 | #Defining dataset variables
45 | #Series.get_values() was removed in pandas 1.0; .values works in old and new versions.
46 | rating = musical_instruments_reviews["rating"].values
47 | rating /= np.max(rating)
48 |
49 | users = musical_instruments_reviews["user"].values
50 |
51 | label_encoder = LabelEncoder()
52 | items = musical_instruments_reviews["item"].values
52 | items = label_encoder.fit_transform(items)
53 |
54 | """In order to reduce the time for running this tutorial, we reduced the number of items."""
55 |
56 | number_of_items = 6
57 |
58 | #Finding lines to erase
59 | unique_items, item_counts = np.unique(items, return_counts=True)
60 | item2count = dict(zip(unique_items, item_counts))
61 | item_count = sorted(item2count.items(), key=lambda x: x[1])[::-1]
62 | item_count = item_count[0:number_of_items]
63 | selected_items = [item for item, count in item_count]
64 |
65 | #keeping only the most frequent items
66 | keep_lines = [i for i,item in enumerate(items) if item in selected_items]
67 | keep_lines = np.array(keep_lines)
68 |
69 | rating = rating[keep_lines]
70 | users = users[keep_lines]
71 | items = items[keep_lines]
72 |
73 | #Converting the categorical data into a matrix
74 | item2new_code = {item:i for i,item in enumerate(set(items))}
75 | items = [item2new_code[item] for item in items]
76 | items_one_hot_encoding = np_utils.to_categorical(items)
77 |
78 | items_one_hot_encoding.shape
79 |
80 | """In the following, we weight and merge the codings of the selected columns."""
81 |
82 | items_weighted = [items_one_hot_encoding[i] * rating[i]
83 | for i in range(len(rating))]
84 |
85 | items_weighted = np.array(items_weighted)
86 |
87 | user2matrix_lines = {user: np.argwhere(user == users).T[0]
88 | for user in set(users)}
89 |
90 | user2purchases = {user:np.max(items_weighted[user2matrix_lines[user]],axis = 0)
91 | for user in set(users)}
92 |
93 | """In the next step, we eliminate the data from users that bought zero or one item. In our analysis, we do not consider the user names."""
94 |
95 | data = list(user2purchases.values())
96 | data = np.array(data)
97 |
98 | items_per_line = np.count_nonzero(data, axis=1)
99 | keep_lines = np.argwhere(items_per_line >= 2).T[0]
100 |
101 | data = data[keep_lines,:]
102 |
103 | """The code below defines the neural network."""
104 |
105 | batch_size = 10
106 | learning_rate = 0.01
107 | n_components = 10 #Number of binary hidden units.
108 | n_iter = 5000
109 | verbose = 1
110 |
111 | rbm_model = BernoulliRBM(batch_size = batch_size, learning_rate = learning_rate,
112 | n_components = n_components, n_iter = n_iter,
113 | verbose = verbose)
114 |
115 | """Next, we train the network."""
116 |
117 | rbm_model = rbm_model.fit(data)
118 |
119 | """Finally, we test the network.
120 | For that, we first analyze some inputs to know if the output makes sense. We test the output as a person that bought only the first product (1). So, we show the matrix lines for others that bought the same product, as follows.
121 | """
122 |
123 | product_test = 0
124 | selected_lines = np.argwhere(data[:,product_test] > 0).T[0]
125 | plt.imshow(data[selected_lines])
126 | plt.colorbar()
127 | plt.show()
128 |
129 | """The following code tests the network by recommending the most relevant products for a given user. More specifically, given a vector of scores for the acquired products, the RBM returns the products that this user might like. Here, we select the first two recommendations, excluding the already acquired products."""
130 |
131 | test_set = np.zeros(number_of_items)
132 | test_set[product_test] = 1
133 |
134 | test_set = [0,1,0,0,0,0]
135 |
136 | #Here we test a single sample
137 | result_hidden_layer = rbm_model.transform([test_set])[0]
138 | weight_matrix = rbm_model.components_
139 |
140 | result = np.matmul(weight_matrix.T,result_hidden_layer)
141 | recomended_products = np.argsort(result)[::-1]
142 | recomended_products = [product for product in recomended_products
143 | if product != product_test]
144 | print ("The two recommended products, in decreasing order, are: " +
145 |        str(recomended_products[0:2]) + '.')
146 |
147 | """
148 | ## License
149 |
150 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License.
151 |
152 | ## Acknowledgments
153 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
154 | """
155 |
--------------------------------------------------------------------------------
/codes/feedforward_binary.py:
--------------------------------------------------------------------------------
1 | # -*- coding: utf-8 -*-
2 | """deepLearning_feedforward.ipynb
3 |
4 | Automatically generated by Colaboratory.
5 |
6 | Original file is located at
7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_feedforward.ipynb
8 |
9 | #Feedforward networks
10 |
11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. These codes have absolutely no warranty.
12 |
13 | If you publish a paper related to this material, please cite:
14 |
15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019.
16 |
17 | ## Binary Classification
18 | This is the first example of deep learning implementation, in which we address binary classification of wine data. In this example, we consider one feedforward network with 5 hidden layers and with 30 neurons in each layer. The provided networks were built only for didactic purposes and are not appropriate for real applications.
19 |
20 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend).
21 | """
22 |
23 | import numpy as np
24 | import keras
25 | from keras.models import Sequential
26 | from keras.layers import Dense, Dropout
27 | from keras.utils.vis_utils import plot_model
28 | from keras.models import model_from_json
29 | from sklearn.datasets import load_wine
30 | from sklearn.model_selection import train_test_split
31 | from sklearn.metrics import accuracy_score
32 |
33 | """If you have a GPU, you can use the following code to allocate processing to it. Otherwise, proceed to (*)."""
34 |
35 | import tensorflow as tf
36 | from keras import backend as K
37 |
38 | print(K.tensorflow_backend._get_available_gpus())
39 |
40 | number_of_cpu_cores = 8
41 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores})
42 | session = tf.Session(config=config)
43 | keras.backend.set_session(session)
44 |
45 | """(*) Here, we use the Wine dataset, which is available in the Scikit-learn library at [sklearn-datasets-wine](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_wine.html). For more information, see [wine-UCI](https://archive.ics.uci.edu/ml/datasets/Wine).
46 | Because this dataset comprises three classes and here we illustrate a binary classification, we consider only the first two classes.
47 | """
48 |
49 | wine = load_wine()
50 | data = wine['data']
51 | target = wine['target']
52 | target_names = wine['target_names']
53 |
54 | #The selected items are stored in the variable called "hold".
55 | hold = np.argwhere(target!=2).T[0]
56 | data = data[hold]
57 | target = target[hold]
58 | target_names = target_names[0:2]
59 |
60 | #Here, we divide our dataset into training and test sets.
61 | test_size = 0.25 #fraction
62 | training_data,test_data,training_target,test_target = train_test_split(data,
63 | target, test_size=test_size)
64 |
65 | """In the following, we configure the neural network. There is no need to enable the bias explicitly, because Keras `Dense` layers use a bias by default (`use_bias=True`)."""
66 |
67 | #Set of parameters
68 | input_dim = data.shape[1]
69 | kernel_initializer = 'random_uniform'
70 | bias_initializer='zeros'
71 | activation_function_hidden = 'relu'
72 | activation_function_output = 'sigmoid'
73 | optimizer = 'adam'
74 | loss = 'binary_crossentropy'
75 | metrics = ['binary_accuracy']
76 | number_of_layers = 5
77 | number_of_units_hidden = 30
78 | number_of_units_output = 1
79 | dropout_percentage = 0.25
80 |
81 |
82 | #Creating model
83 | ff_model = Sequential()
84 | ff_model.add(Dense(units = number_of_units_hidden,
85 | activation = activation_function_hidden,
86 | kernel_initializer = kernel_initializer,
87 | input_dim = input_dim))
88 |
89 | for i in range(number_of_layers-1):
90 |     #Inserting a dense hidden layer
91 |     ff_model.add(Dense(units = number_of_units_hidden,
92 |                        activation = activation_function_hidden,
93 |                        kernel_initializer = kernel_initializer,
94 |                        input_dim = number_of_units_hidden))
95 |     #Inserting dropout
96 |     ff_model.add(Dropout(dropout_percentage))
97 |
98 | ff_model.add(Dense(units = number_of_units_output,
99 | activation = activation_function_output))
100 | ff_model.compile(optimizer = optimizer, loss = loss, metrics = metrics)
101 |
102 | """In order to check the network topology, you can use the subsequent command."""
103 |
104 | ff_model.summary()
105 |
106 | """Another option is to visualize the topology as a figure."""
107 |
108 | #Saving the resultant figure as 'ff_model.png'.
109 | plot_model(ff_model, to_file='ff_model.png', show_shapes=True,
110 | show_layer_names=True)
111 |
112 | """Next, we train the network"""
113 |
114 | batch_size = 10
115 | epochs = 200
116 | ff_model.fit(training_data,training_target, batch_size = batch_size,
117 | epochs = epochs)
118 |
119 | """In order to create an application, it is possible to save the network and the respective trained weights as follows."""
120 |
121 | #Saving the network model
122 | ff_model_json = ff_model.to_json()
123 | with open('ff_model.json', 'w') as file:
124 |     file.write(ff_model_json)
125 |
126 | #Saving weights
127 | ff_model.save_weights('ff_model.h5')
128 |
129 | """The following code can be employed to open a pre-trained model."""
130 |
131 | with open('ff_model.json', 'r') as file:
132 |     ff_model_json = file.read()
133 |
134 | ff_model = model_from_json(ff_model_json)
135 | ff_model.load_weights('ff_model.h5')
136 |
137 | """Different analyses can be used to assess the quality of the results. Here, we consider only the accuracy."""
138 |
139 | predictions = ff_model.predict(test_data)
140 | #Because it is a binary classification, we consider the values higher than 0.5
141 | #as being part of class 1 otherwise 0.
142 | predictions = (predictions > 0.5)
143 | accuracy = accuracy_score(test_target, predictions)
144 | print("Accuracy =", accuracy)
145 |
146 |
147 | """
148 | ## License
149 |
150 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License.
151 |
152 | ## Acknowledgments
153 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
154 | """
155 |
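The last step of the script above turns sigmoid outputs into class labels and measures accuracy. The following is a minimal sketch of that step in plain Python, without Keras or Scikit-learn. The helper names `threshold_predictions` and `accuracy` and the toy values are assumptions made for illustration only; the tutorial itself relies on `accuracy_score`.

```python
def threshold_predictions(probabilities, threshold=0.5):
    """Map sigmoid outputs to class labels: 1 if above the threshold, otherwise 0."""
    return [1 if p > threshold else 0 for p in probabilities]


def accuracy(targets, predictions):
    """Fraction of positions where the predicted label equals the target."""
    if len(targets) != len(predictions):
        raise ValueError("targets and predictions must have the same length")
    correct = sum(1 for t, p in zip(targets, predictions) if t == p)
    return correct / len(targets)


# Hypothetical network outputs and true labels (not taken from the tutorial data).
example_outputs = [0.91, 0.08, 0.55, 0.49, 0.73]
example_targets = [1, 0, 0, 0, 1]

example_labels = threshold_predictions(example_outputs)
example_accuracy = accuracy(example_targets, example_labels)

print("Labels =", example_labels)
print("Accuracy =", example_accuracy)
```

In the script, the probabilities would come from `ff_model.predict(test_data)` and the targets from `test_target`.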
--------------------------------------------------------------------------------
/codes/lstm.py:
--------------------------------------------------------------------------------
1 | # -*- coding: utf-8 -*-
2 | """deepLearning_LSTM.ipynb
3 |
4 | Automatically generated by Colaboratory.
5 |
6 | Original file is located at
7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_LSTM.ipynb
8 |
9 | #Long Short-Term Memory (LSTM)
10 |
11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty.
12 |
13 | If you publish a paper related to this material, please cite:
14 |
15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019.
16 |
17 | This is the third example of deep learning implementation. Here we use an LSTM network to predict Bitcoin prices over time, using a time series as input.
18 |
19 |
20 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend).
21 | """
22 |
23 | import numpy as np
24 | import keras
25 | from keras.models import Sequential
26 | from keras.utils.vis_utils import plot_model
27 | from keras.layers import Dense, Dropout, LSTM
28 | from keras.callbacks import EarlyStopping, ReduceLROnPlateau, ModelCheckpoint
29 | from sklearn.model_selection import train_test_split
30 | from sklearn.metrics import accuracy_score
31 | from sklearn.preprocessing import MinMaxScaler
32 | import matplotlib.pyplot as plt
33 | import pandas as pd
34 | import pandas_datareader
35 |
36 | """If you have a GPU, you can use the following code to allocate processing into it. Otherwise, proceed to (*)."""
37 |
38 | import tensorflow as tf
39 | from keras import backend as K
40 |
41 | print(K.tensorflow_backend._get_available_gpus())
42 |
43 | number_of_cpu_cores = 8
44 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores})
45 | session = tf.Session(config=config)
46 | keras.backend.set_session(session)
47 |
48 | """(*) Here, we use the Bitcoin daily prices dataset, which is available at
49 | [yahoo-stock-market](https://finance.yahoo.com/). The data contains seven columns, organized as follows: date, opening price, daily high price, daily low price, closing price, the volume traded on the day, and the adjusted closing price.
50 | """
51 |
52 | train_size = 1200
53 | start_date = '2015-01-01'# Bitcoin started on '2010-07-16'
54 |
55 | dataset = pandas_datareader.data.get_data_yahoo("BTC-USD", start = start_date)
56 | data_order = ['Open', 'High', 'Low', 'Close', 'Volume', 'Adj Close']
57 | dataset = dataset[data_order]
58 |
59 |
60 | train_dataset = dataset.iloc[0:train_size, 1::].values
61 | test_dataset = dataset.iloc[train_size::, 1::].values
62 |
63 | min_max_scaler = MinMaxScaler(feature_range=(0,1))
64 | normalized_train_dataset = min_max_scaler.fit_transform(train_dataset)
65 |
66 | min_max_scaler_train = MinMaxScaler(feature_range=(0,1))
67 | normalized_train_price = min_max_scaler_train.fit_transform(train_dataset[:,0:1])
68 |
69 | """In the following, we define the network topology."""
70 |
71 | window_size = 50
72 | number_of_lstm_layers = 3
73 | activation = 'sigmoid'
74 | return_sequences = True
75 | units_first_layer = 100
76 | units = 50
77 |
78 | data = []
79 | train_price = []
80 | for i in range(window_size, train_size):
81 |     data.append(normalized_train_dataset[i-window_size:i, 0:6])
82 |     train_price.append(normalized_train_dataset[i, 0])
83 | data, train_price = np.array(data), np.array(train_price)
84 |
85 | lstm_model = Sequential()
86 | lstm_model.add(LSTM(units = units_first_layer,
87 | return_sequences = return_sequences,
88 | input_shape = (data.shape[1], 5)))
89 | lstm_model.add(Dropout(0.2))
90 |
91 | for i in range(number_of_lstm_layers-2):
92 |     lstm_model.add(LSTM(units = units, return_sequences = return_sequences))
93 |     lstm_model.add(Dropout(0.2))
94 |
95 | lstm_model.add(LSTM(units = units))
96 | lstm_model.add(Dropout(0.2))
97 |
98 | #Output layer
99 | lstm_model.add(Dense(units = 1, activation = activation))
100 |
101 | """In order to check the network topology, the subsequent command can be used."""
102 |
103 | lstm_model.summary()
104 | #Saving the resultant figure as 'lstm_model.png'.
105 | plot_model(lstm_model, to_file='lstm_model.png', show_shapes=True,
106 | show_layer_names=True)
107 |
108 | """The training step is executed as follows."""
109 |
110 | #Here we set verbose as true
111 | verbose = 1
112 |
113 | batch_size = 32
114 | epochs = 10
115 | filepath = 'weights.h5' #name of the file with the network weights
116 | monitor = 'loss'
117 | optimizer = 'adam'
118 | loss = 'mean_squared_error'
119 | metrics = ['mean_absolute_error']
120 |
121 | lstm_model.compile(optimizer = optimizer, loss = loss, metrics = metrics)
122 |
123 | early_stopping = EarlyStopping(monitor = monitor, min_delta = 1e-15,
124 | patience = 10, verbose = verbose)
125 | reduce_learning_rate_on_plateau = ReduceLROnPlateau(monitor = monitor,
126 | factor = 0.2, patience = 5,
127 | verbose = verbose)
128 | model_checkpoint = ModelCheckpoint(filepath = filepath, monitor = monitor,
129 | save_best_only = True, verbose = verbose)
130 | lstm_model.fit(data, train_price, epochs = epochs, batch_size = batch_size,
131 | callbacks = [early_stopping, reduce_learning_rate_on_plateau,
132 | model_checkpoint])
133 |
134 | """The following code evaluates the trained network on the test data."""
135 |
136 | test_price = test_dataset[:, 0:1]
137 | complete_dataset = dataset.iloc[:,1::]
138 |
139 | train_data = complete_dataset[len(complete_dataset) - len(test_dataset) -
140 | window_size:].values
141 | train_data = min_max_scaler.transform(train_data)
142 |
143 |
144 | X_test = []
145 | for i in range(window_size,len(train_data)):
146 |     X_test.append(train_data[i-window_size:i, 0:6])
147 | X_test = np.array(X_test)
148 |
149 | calculated_prices = lstm_model.predict(X_test)
150 | calculated_prices = min_max_scaler_train.inverse_transform(calculated_prices)
151 |
152 | train_price = min_max_scaler_train.inverse_transform(train_price.reshape(-1, 1))
153 | train_price = train_price[:, 0]
154 |
155 | """In the following, we plot the train set, as well as the prediction and the expected values."""
156 |
157 | plt.plot(np.linspace(0,len(train_price)-1,len(train_price)),
158 | train_price, label = 'Real price train', color = 'k')
159 | plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1,
160 | len(test_price)), test_price, label = 'Real price')
161 | plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1,
162 |          len(test_price)), calculated_prices, label = 'Prediction')
163 | plt.title('BTC prediction')
164 | plt.xlabel('Time')
165 | plt.ylabel('Value')
166 | plt.legend()
167 | plt.show()
168 |
169 | """
170 | ## License
171 |
172 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License.
173 |
174 | ## Acknowledgments
175 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
176 | """
177 |
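The LSTM script above scales the series to the range [0, 1], cuts it into sliding windows of `window_size` past values, and later maps the predictions back to prices. The following is a minimal sketch of these three steps in plain Python on a single toy series. The helper names and the toy prices are assumptions made for illustration; the tutorial uses `MinMaxScaler` and NumPy arrays with several feature columns.

```python
def min_max_fit(values):
    """Return the minimum and maximum used for scaling (fitted on training data only)."""
    return min(values), max(values)


def min_max_transform(values, low, high):
    """Scale values linearly so that low maps to 0 and high maps to 1."""
    return [(v - low) / (high - low) for v in values]


def min_max_inverse(values, low, high):
    """Undo min_max_transform, recovering the original units."""
    return [v * (high - low) + low for v in values]


def make_windows(series, window_size):
    """Pair each window of `window_size` past values with the value that follows it."""
    inputs, targets = [], []
    for i in range(window_size, len(series)):
        inputs.append(series[i - window_size:i])
        targets.append(series[i])
    return inputs, targets


# Hypothetical daily prices, not real Bitcoin data.
toy_prices = [10.0, 12.0, 11.0, 15.0, 14.0, 18.0]

low, high = min_max_fit(toy_prices)
scaled_prices = min_max_transform(toy_prices, low, high)
windows, targets = make_windows(scaled_prices, window_size=3)
restored_targets = min_max_inverse(targets, low, high)

print("Number of windows:", len(windows))
print("First window:", windows[0], "-> target:", targets[0])
print("Targets in original units:", restored_targets)
```

A series of length n yields n - `window_size` training pairs, which is why the loop in the script starts at `window_size`.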
--------------------------------------------------------------------------------
/codes/gan.py:
--------------------------------------------------------------------------------
1 | # -*- coding: utf-8 -*-
2 | """deepLearning_GAN.ipynb
3 |
4 | Automatically generated by Colaboratory.
5 |
6 | Original file is located at
7 | https://colab.research.google.com/drive/1CbRNDN25uaN2WCeyklwPkbZZkXM_IDge
8 |
9 | # Generative Adversarial Networks
10 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty.
11 |
12 | If you publish a paper related to this material, please cite:
13 |
14 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019.
15 |
16 | This example builds a network that can automatically generate handwritten digits.
17 |
18 |
19 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend).
20 | """
21 |
22 | import numpy as np
23 | import pandas as pd
24 | import keras
25 | from keras.models import Sequential, model_from_json
26 | from keras.utils.vis_utils import plot_model
27 | from keras.datasets import mnist
28 | from keras.layers import InputLayer, Dense, Flatten, Reshape, Input, Dropout
29 | from keras.layers.advanced_activations import LeakyReLU
30 | from keras.layers import BatchNormalization
31 | from keras.models import Model,Sequential
32 | from keras.regularizers import L1L2
33 | from sklearn.model_selection import train_test_split
34 | from sklearn.metrics import accuracy_score
35 | from sklearn.preprocessing import MinMaxScaler
36 | import matplotlib.pyplot as plt
37 | import cv2
38 |
39 | """If you have a GPU, you can use the following code to allocate processing into it. Otherwise, proceed to (*)."""
40 |
41 | import tensorflow as tf
42 | from keras import backend as K
43 |
44 | print(K.tensorflow_backend._get_available_gpus())
45 |
46 | number_of_cpu_cores = 8
47 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores})
48 | session = tf.Session(config=config)
49 | keras.backend.set_session(session)
50 |
51 | """(*) In this example, we use the MNIST database, which is composed of grayscale images of the 10 handwritten digits. It is available in the Keras library at [keras-datasets](https://keras.io/datasets/).
52 |
53 | The following command is used to load the data set.
54 | """
55 |
56 | (train_data_raw, train_target_raw), (_, _) = mnist.load_data()
57 |
58 | """Because this code is computationally expensive, here we consider only the zeros and ones."""
59 |
60 | train_data = [img for i, img in enumerate(train_data_raw)
61 | if train_target_raw[i] == 0 or train_target_raw[i] == 1]
62 | train_data = np.array(train_data)
63 |
64 | """In order to visualize a given figure, the following code can be executed."""
65 |
66 | image_id = 1000
67 | plt.figure(figsize = (1,1))
68 | plt.imshow(train_data[image_id], cmap='gray')
69 | plt.title("Test image: " + str(image_id))
70 | #plt.axis('off')
71 | plt.show()
72 |
73 | """Definition of the used variables."""
74 |
75 | input_shape = train_data.shape[1::]
76 | activation_output_generator = 'sigmoid'
77 | activation_output_discrimninator = 'sigmoid'
78 | input_dim = 50
79 | number_of_epochs = 1000
80 | batch_size = 100
81 | train_data = train_data.astype('float32') / 255
82 |
83 | """In the following, we present the generator model."""
84 |
85 | generator_model = Sequential()
86 |
87 | generator_model.add(Dense(units=64,input_dim = input_dim,
88 | kernel_regularizer = L1L2(1e-5, 1e-5)))
89 | generator_model.add(BatchNormalization())
90 | generator_model.add(LeakyReLU(alpha=0.3))
91 |
92 | generator_model.add(Dense(units=128, kernel_regularizer = L1L2(1e-5, 1e-5)))
93 | generator_model.add(BatchNormalization())
94 | generator_model.add(LeakyReLU(alpha=0.3))
95 |
96 | generator_model.add(Dense(units=256, kernel_regularizer = L1L2(1e-5, 1e-5)))
97 | generator_model.add(BatchNormalization())
98 | generator_model.add(LeakyReLU(alpha=0.3))
99 |
100 | generator_model.add(Dense(units = input_shape[0] * input_shape[1],
101 | activation = activation_output_generator))
102 |
103 | generator_model.add(Reshape(input_shape))
104 |
105 | generator_model.compile(loss='binary_crossentropy', optimizer="adam")
106 |
107 | """The summary of the generator model is shown by employing the following code."""
108 |
109 | generator_model.summary()
110 |
111 | """The following code represents the discriminator model."""
112 |
113 | discriminator_model = Sequential()
114 | discriminator_model.add(InputLayer(input_shape = input_shape))
115 | discriminator_model.add(Flatten())
116 |
117 | discriminator_model.add(Dense(units=256,kernel_regularizer = L1L2(1e-5, 1e-5)))
118 | discriminator_model.add(LeakyReLU(alpha=0.3))
119 | discriminator_model.add(Dropout(0.2))
120 |
121 |
122 | discriminator_model.add(Dense(units=128,kernel_regularizer = L1L2(1e-5, 1e-5)))
123 | discriminator_model.add(LeakyReLU(alpha=0.3))
124 | discriminator_model.add(Dropout(0.2))
125 |
126 | discriminator_model.add(Dense(units=64,kernel_regularizer = L1L2(1e-5, 1e-5)))
127 | discriminator_model.add(LeakyReLU(alpha=0.3))
128 |
129 | discriminator_model.add(Dense(units=1,
130 | activation = activation_output_discrimninator))
131 |
132 | discriminator_model.compile(loss='binary_crossentropy',
133 | optimizer = "adam")
134 |
135 | """The summary of the discriminator model is shown by using the following code."""
136 |
137 | discriminator_model.summary()
138 |
139 | """The following code assembles the complete GAN model."""
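140 | discriminator_model.trainable = False #Keras fixes trainability at compile time, so we freeze the discriminator inside the GAN here
141 | gan_input = Input(shape = (input_dim,))
142 | gan_output = discriminator_model(generator_model(gan_input))
143 | gan = Model(inputs = gan_input, outputs = gan_output)
144 | gan.compile(loss = 'binary_crossentropy', optimizer = 'adam')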
145 |
146 | """The summary of the gan model is shown by using the following code."""
147 |
148 | gan.summary()
149 |
150 | """Next, we train the GAN."""
151 |
152 | y = np.ones(batch_size)
153 |
154 | #Parameters of the noise distribution
155 | mu = 0
156 | sigma = 1
157 |
158 | #We create this array of indices to sample images without repetition within a batch
159 | train_indices = np.arange(train_data.shape[0])
160 | np.random.shuffle(train_indices)
161 |
162 | #Here we define the labels used to train the gan
163 | train_labels = np.zeros(2*batch_size,dtype = int)
164 | train_labels[0:batch_size] = 1 #real images come first in each batch; generated images keep label 0
165 |
166 | for epoch in range(number_of_epochs):
167 |     print("\rEpoch:", epoch + 1, "of", number_of_epochs, end = '')
168 |     for _ in range(batch_size):
169 |         input_noise = np.random.normal(loc = mu, scale = sigma,
170 |                                        size = [batch_size, input_dim])
171 |         generated_images = generator_model.predict(input_noise)
172 |         np.random.shuffle(train_indices)
173 |         image_batch = train_data[train_indices[0:batch_size]]
174 |         train_images = np.concatenate((image_batch, generated_images))
175 |         #Training the discriminator
176 |         discriminator_model.trainable = True
177 |         discriminator_model.train_on_batch(train_images, train_labels)
178 |         #Training the gan
179 |         discriminator_model.trainable = False
180 |         train_noise = np.random.normal(loc = mu, scale = sigma,
181 |                                        size = [batch_size, input_dim])
182 |         gan.train_on_batch(train_noise, y)
183 |
184 |     #In order to visualize the training progress, we employ the following code.
185 |     if epoch % 100 == 0:
186 |         n_examples = 10
187 |         scale_image = 1 * n_examples
188 |         noise= np.random.normal(loc = mu, scale = sigma,
189 |                                 size = (n_examples, input_dim))
190 |         generated_images = generator_model.predict(noise)
191 |         n_pixels = generated_images.shape[1]
192 |         n_pixels_col = int(np.sqrt(n_pixels))
193 |         fig, axes = plt.subplots(1,n_examples,
194 |                                  figsize = (scale_image,
195 |                                             scale_image * n_examples))
196 |         for i in range(generated_images.shape[0]):
197 |             axes[i].imshow(generated_images[i], cmap = "gray")
198 |             axes[i].axis('off')
199 |         plt.show()
200 | print("")
201 |
202 | """The following code can be used to generate new images with the trained generator."""
203 |
204 | n_examples = 5
205 | scale_image = 5
206 | noise= np.random.normal(loc = mu, scale = sigma, size = (n_examples, input_dim))
207 | generated_images = generator_model.predict(noise)
208 |
209 | fig, axes = plt.subplots(1,n_examples,
210 | figsize = (scale_image, scale_image * n_examples))
211 | for i in range(generated_images.shape[0]):
212 |     axes[i].imshow(generated_images[i], cmap = "gray")
213 |     axes[i].axis('off')
214 |
215 | plt.show()
216 |
217 | """
218 | ## License
219 |
220 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License.
221 |
222 | ## Acknowledgments
223 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
224 | """
225 |
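The GAN training loop above relies on two different sets of labels. The discriminator sees a batch of real images followed by generated images, labelled 1 and 0 respectively, while the combined model is trained with all labels equal to 1 so that the generator is pushed to fool the discriminator. The following is a minimal sketch of how these batches are assembled, in plain Python. The helper names and the placeholder "images" are assumptions made for illustration, not part of the tutorial code.

```python
import random


def sample_noise(batch_size, input_dim, mu=0.0, sigma=1.0, rng=random):
    """Draw a batch of Gaussian noise vectors that would feed the generator."""
    return [[rng.gauss(mu, sigma) for _ in range(input_dim)]
            for _ in range(batch_size)]


def discriminator_batch(real_images, generated_images):
    """Concatenate real and generated samples; real ones get label 1, generated ones 0."""
    images = list(real_images) + list(generated_images)
    labels = [1] * len(real_images) + [0] * len(generated_images)
    return images, labels


def generator_targets(batch_size):
    """Targets for the combined model: the generator wants every sample judged as real."""
    return [1] * batch_size


# Placeholder strings stand in for image arrays in this sketch.
real_batch = ["real_0", "real_1"]
fake_batch = ["generated_0", "generated_1"]

batch_images, batch_labels = discriminator_batch(real_batch, fake_batch)
noise = sample_noise(batch_size=4, input_dim=3, rng=random.Random(0))
targets = generator_targets(len(fake_batch))

print("Discriminator labels:", batch_labels)
print("Generator targets:", targets)
print("Noise shape:", (len(noise), len(noise[0])))
```

The order of the labels has to match the order used when concatenating the two halves of the batch, which in the script is real images first.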
--------------------------------------------------------------------------------
/codes/autoencoder.py:
--------------------------------------------------------------------------------
1 | # -*- coding: utf-8 -*-
2 | """deepLearning_autoencoder.ipynb
3 |
4 | Automatically generated by Colaboratory.
5 |
6 | Original file is located at
7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_autoencoder.ipynb
8 |
9 | # Autoencoders
10 |
11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty.
12 |
13 | If you publish a paper related to this material, please cite:
14 |
15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019.
16 |
17 | This example uses the Autoencoder model to illustrate a possible application concerning image clustering. Here we show how to use the resulting codes to reduce the dimensionality. We also project our data by using a Principal Component Analysis (PCA).
18 |
19 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend).
20 | """
21 |
22 | import numpy as np
23 | import matplotlib.pyplot as plt
24 | import pandas as pd
25 | import keras
26 | from keras.models import Sequential, model_from_json, Model
27 | from keras.utils import np_utils
28 | from keras.utils.vis_utils import plot_model
29 | from keras.datasets import fashion_mnist
30 | from keras.callbacks import EarlyStopping, ReduceLROnPlateau, ModelCheckpoint
31 | from sklearn.model_selection import train_test_split
32 | from sklearn.metrics import accuracy_score
33 | from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Reshape
34 | from keras.layers import UpSampling2D
35 | from sklearn.preprocessing import MinMaxScaler
36 | import sklearn.decomposition
37 | from sklearn.preprocessing import StandardScaler
38 |
39 | """If you have a GPU, you can use the following code to allocate processing into it. Otherwise, proceed to (*)."""
40 |
41 | import tensorflow as tf
42 | from keras import backend as K
43 |
44 | print(K.tensorflow_backend._get_available_gpus())
45 |
46 | number_of_cpu_cores = 8
47 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores})
48 | session = tf.Session(config=config)
49 | keras.backend.set_session(session)
50 |
51 | """(*) In this example, we use the Fashion-MNIST database, composed of grayscale images of 10 categories of fashion items (T-shirt/top, trouser, pullover, dress, coat, sandal, shirt, sneaker, bag, and ankle boot). It is available in the Keras library at [keras-datasets](https://keras.io/datasets/)."""
52 |
53 | (train_data, train_target), (test_data, test_target) = fashion_mnist.load_data()
54 |
55 | train_target_one_hot_encoding = np_utils.to_categorical(train_target)
56 |
57 | #Divide by the maximum value of a pixel (255) to have the values between 0 and 1
58 | train_data = train_data.astype('float32') / 255.
59 | test_data = test_data.astype('float32') / 255.
60 |
61 | """For the sake of simplicity, we pad the images with zeros so that they have shape 32x32. Because 32 is a power of 2, it is easier to configure the decoder layers."""
62 |
63 | train_data_auxiliar = []
64 | for data in train_data:
65 |     new_image = np.zeros((32,32))
66 |     new_image[2:data.shape[0]+2, 2:data.shape[1]+2] = data
67 |     train_data_auxiliar.append(new_image)
68 |
69 | test_data_auxiliar = []
70 | for data in test_data:
71 |     new_image = np.zeros((32,32))
72 |     new_image[2:data.shape[0]+2, 2:data.shape[1]+2] = data
73 |     test_data_auxiliar.append(new_image)
74 |
75 | train_data = np.array(train_data_auxiliar)
76 | test_data = np.array(test_data_auxiliar)
77 |
78 | train_data = train_data.reshape(train_data.shape[0], train_data.shape[1],
79 | train_data.shape[2], 1)
80 | test_data = test_data.reshape(test_data.shape[0], test_data.shape[1],
81 | test_data.shape[2], 1)
82 |
83 | """In order to visualize a given figure, the following code can be executed."""
84 |
85 | image_id = 700
86 | image = test_data[image_id]
87 | image = image[:,:,0]
88 | plt.imshow(image, cmap = 'gray')
89 | plt.title("Test image: " + str(image_id))
90 | plt.show()
91 |
92 | """In the following, we define the network topology. As in the CNN example, we do not employ dropout after the convolutional layers. Because this network is computationally demanding, the variable epochs can be set to a smaller number (e.g., 5). However, in this case, the resulting accuracy tends to be much lower.
93 |
94 | First, we define some necessary variables.
95 | """
96 |
97 | input_shape = train_data.shape[1::]
98 | #if len(input_shape) == 2:
99 | # input_shape = (input_shape[0], input_shape[1], 1)
100 | filters_first_layer = 64
101 | filters = 32
102 | kernel_size = (3,3)
103 | pool_size = (2,2)
104 |
105 | activation = 'relu'
106 | activation_function_output = 'sigmoid' #the output should be between 0 and 1
107 | number_of_cnn_layers = 2
108 | number_of_units_output = train_target_one_hot_encoding.shape[1]
109 | padding = 'same'
110 | strides = (2,2)
111 |
112 | optimizer = 'adam'
113 | loss = 'binary_crossentropy'
114 | metrics = ['accuracy']
115 | epochs = 50
116 | batch_size = 128
117 |
118 | #Network model
119 | autoencoder_model = Sequential()
120 |
121 | """We configure the encoder layers. Normally, for images, the coding is represented by a 2D matrix, but here we
122 | flatten it in order to be able to plot the respective PCA projection.
123 | """
124 |
125 | autoencoder_model.add(Conv2D(filters = filters_first_layer,
126 | kernel_size = kernel_size,
127 | input_shape = input_shape,
128 | activation = activation, padding = padding ))
129 |
130 | autoencoder_model.add(MaxPooling2D(pool_size = pool_size, padding = padding))
131 |
132 |
133 | for i in range(number_of_cnn_layers-1):
134 |     autoencoder_model.add(Conv2D(filters = filters, kernel_size = kernel_size,
135 |                                  activation = activation, padding = padding,
136 |                                  strides = strides))
137 |     autoencoder_model.add(MaxPooling2D(pool_size = pool_size,
138 |                                        padding = padding))
139 |
140 |
141 | #This is the coding
142 | autoencoder_model.add(Flatten())
143 | flatten_layer_name = autoencoder_model.output_names[0]
144 |
145 | """Here, we define the decoder."""
146 |
147 | #First we define the input size
148 | output_len = autoencoder_model.output_shape[1]
149 | height = int(np.sqrt(output_len/filters))
150 |
151 | #Find the shape of the decoder input
152 | autoencoder_model.add(Reshape((height, height, filters)))
153 |
154 | for i in range(number_of_cnn_layers):
155 |     autoencoder_model.add(Conv2D(filters = filters, kernel_size = kernel_size,
156 |                                  activation = activation, padding = padding))
157 |     autoencoder_model.add(UpSampling2D(size = pool_size))
158 |
159 | autoencoder_model.add(Conv2D(filters = filters_first_layer,
160 | kernel_size = kernel_size,
161 | activation = activation, padding = padding))
162 | autoencoder_model.add(UpSampling2D(size = pool_size))
163 | autoencoder_model.add(Conv2D(filters = 1, kernel_size = kernel_size,
164 | activation = activation_function_output,
165 | padding = padding))
166 |
167 | """We can use the following command to see the network topology."""
168 |
169 | autoencoder_model.summary()
170 | #Saving the resultant figure as 'autoencoder_model.png'.
171 | plot_model(autoencoder_model, to_file='autoencoder_model.png', show_shapes=True,
172 | show_layer_names=True)
173 |
174 | """The entire configuration is then used to train the encoder and the decoder."""
175 |
176 | autoencoder_model.compile(optimizer = optimizer, loss = loss, metrics = metrics)
177 | autoencoder_model.fit(train_data, train_data, epochs = epochs,
178 | batch_size = batch_size)
179 |
180 | """The following code shows how to use the already trained coding."""
181 |
182 | output_model = autoencoder_model.get_layer(flatten_layer_name).output
183 | encoder = Model(inputs = autoencoder_model.input,
184 | outputs = output_model)
185 | encoder.summary()
186 |
187 | """The following code is used to compute the codings."""
188 |
189 | codings = encoder.predict(test_data)
190 |
191 | """By employing the codings and the known classes, we plot a PCA (principal component analysis) of the test data."""
192 |
193 | X = codings.copy()
194 | targets = test_target
195 |
196 | #Standardization
197 | X = StandardScaler().fit_transform(X)
198 | decomposition = sklearn.decomposition.PCA(n_components=2)
199 | pca = decomposition.fit(X)
200 | transform = pca.transform(X)
201 |
202 | plt.figure(figsize = (6,4))
203 | classes = []
204 | for target in set(targets):
205 |     classes.append(target)
206 |     pos = np.argwhere(targets == target).T[0]
207 |     plt.scatter([transform[pos,0]],[transform[pos,1]], alpha = 0.3)
208 |
209 |
210 | label = "PC1 ({:1.2f}%)".format(pca.explained_variance_ratio_[0]*100)
211 | plt.xlabel(label)
212 | label = "PC2 ({:1.2f}%)".format(pca.explained_variance_ratio_[1]*100)
213 | plt.ylabel(label)
214 |
215 | plt.margins(0.05,0.05)
216 | plt.legend(classes, loc = 'best')
217 | plt.tight_layout()
218 |
219 | plt.show()
220 |
221 | """
222 | ## License
223 |
224 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License.
225 |
226 | ## Acknowledgments
227 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
228 | """
229 |
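The autoencoder script above pads each 28x28 image to 32x32 and then halves the spatial size several times before flattening. The following is a minimal sketch, in plain Python, of the padding step and of the shape arithmetic that leads to the length of the coding. It assumes that every layer uses 'same' padding (so the output size is the input size divided by the stride, rounded up) and that pooling uses a stride equal to the pool size. The helper names are hypothetical and used only for illustration.

```python
import math


def pad_image(image, target_size=32, offset=2):
    """Place `image` (a list of rows) inside a zero-filled square of side `target_size`."""
    padded = [[0.0] * target_size for _ in range(target_size)]
    for row_index, row in enumerate(image):
        for col_index, value in enumerate(row):
            padded[row_index + offset][col_index + offset] = value
    return padded


def same_padding_output(size, stride):
    """Spatial output size of a layer with 'same' padding: ceil(size / stride)."""
    return math.ceil(size / stride)


# A toy 28x28 image filled with ones stands in for a Fashion-MNIST sample.
toy_image = [[1.0] * 28 for _ in range(28)]
padded_image = pad_image(toy_image)

filters = 32
size = len(padded_image)             # padded input side
size = same_padding_output(size, 1)  # first convolution, stride 1
size = same_padding_output(size, 2)  # first max pooling, pool size 2
size = same_padding_output(size, 2)  # strided convolution, stride 2
size = same_padding_output(size, 2)  # second max pooling, pool size 2

code_length = size * size * filters             # length of the flattened coding
height = int(math.sqrt(code_length / filters))  # side recovered by the decoder
decoded_size = height * 2 * 2 * 2               # three upsampling steps of factor 2

print("Coding length:", code_length)
print("Decoder input side:", height)
print("Reconstructed side:", decoded_size)
```

Under these assumptions the reconstructed side equals the padded input side, which is what allows the network to be trained with the input images as targets.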
--------------------------------------------------------------------------------
/libraries.txt:
--------------------------------------------------------------------------------
1 | All of these codes were developed and executed in Python 3, with the following libraries:
2 |
3 | Package Version
4 | ------------------------ ---------------------
5 | absl-py 0.7.1
6 | alabaster 0.7.12
7 | albumentations 0.1.12
8 | altair 3.2.0
9 | astor 0.8.0
10 | astropy 3.0.5
11 | atari-py 0.1.15
12 | atomicwrites 1.3.0
13 | attrs 19.1.0
14 | audioread 2.1.8
15 | autograd 1.3
16 | Babel 2.7.0
17 | backcall 0.1.0
18 | backports.tempfile 1.0
19 | backports.weakref 1.0.post1
20 | beautifulsoup4 4.6.3
21 | bleach 3.1.0
22 | blis 0.2.4
23 | bokeh 1.0.4
24 | boto 2.49.0
25 | boto3 1.9.216
26 | botocore 1.12.216
27 | Bottleneck 1.2.1
28 | branca 0.3.1
29 | bs4 0.0.1
30 | bz2file 0.98
31 | cachetools 3.1.1
32 | certifi 2019.6.16
33 | cffi 1.12.3
34 | chainer 5.4.0
35 | chardet 3.0.4
36 | Click 7.0
37 | cloudpickle 0.6.1
38 | cmake 3.12.0
39 | colorlover 0.3.0
40 | community 1.0.0b1
41 | contextlib2 0.5.5
42 | convertdate 2.1.3
43 | coverage 3.7.1
44 | coveralls 0.5
45 | crcmod 1.7
46 | cufflinks 0.14.6
47 | cvxopt 1.2.3
48 | cvxpy 1.0.25
49 | cycler 0.10.0
50 | cymem 2.0.2
51 | Cython 0.29.13
52 | daft 0.0.4
53 | dask 1.1.5
54 | dataclasses 0.6
55 | datascience 0.10.6
56 | decorator 4.4.0
57 | defusedxml 0.6.0
58 | descartes 1.1.0
59 | dill 0.3.0
60 | distributed 1.25.3
61 | Django 2.2.4
62 | dlib 19.16.0
63 | dm-sonnet 1.34
64 | docopt 0.6.2
65 | docutils 0.15.2
66 | dopamine-rl 1.0.5
67 | easydict 1.9
68 | ecos 2.0.7.post1
69 | editdistance 0.5.3
70 | en-core-web-sm 2.1.0
71 | entrypoints 0.3
72 | ephem 3.7.7.0
73 | et-xmlfile 1.0.1
74 | fa2 0.3.5
75 | fancyimpute 0.4.3
76 | fastai 1.0.57
77 | fastcache 1.1.0
78 | fastdtw 0.3.2
79 | fastprogress 0.1.21
80 | fastrlock 0.4
81 | fbprophet 0.5
82 | feather-format 0.4.0
83 | featuretools 0.4.1
84 | filelock 3.0.12
85 | fix-yahoo-finance 0.0.22
86 | Flask 1.1.1
87 | folium 0.8.3
88 | fsspec 0.4.1
89 | future 0.16.0
90 | gast 0.2.2
91 | GDAL 2.2.2
92 | gdown 3.6.4
93 | gensim 3.6.0
94 | geographiclib 1.49
95 | geopy 1.17.0
96 | gevent 1.4.0
97 | gin-config 0.2.0
98 | glob2 0.7
99 | google 2.0.2
100 | google-api-core 1.14.2
101 | google-api-python-client 1.7.11
102 | google-auth 1.4.2
103 | google-auth-httplib2 0.0.3
104 | google-auth-oauthlib 0.4.0
105 | google-cloud-bigquery 1.14.0
106 | google-cloud-core 1.0.3
107 | google-cloud-datastore 1.8.0
108 | google-cloud-language 1.2.0
109 | google-cloud-storage 1.16.1
110 | google-cloud-translate 1.5.0
111 | google-colab 1.0.0
112 | google-pasta 0.1.7
113 | google-resumable-media 0.3.3
114 | googleapis-common-protos 1.6.0
115 | googledrivedownloader 0.4
116 | graph-nets 1.0.4
117 | graphviz 0.10.1
118 | greenlet 0.4.15
119 | grpcio 1.15.0
120 | gspread 3.0.1
121 | gspread-dataframe 3.0.3
122 | gunicorn 19.9.0
123 | gym 0.10.11
124 | h5py 2.8.0
125 | HeapDict 1.0.0
126 | holidays 0.9.11
127 | html5lib 1.0.1
128 | httpimport 0.5.16
129 | httplib2 0.11.3
130 | humanize 0.5.1
131 | hyperopt 0.1.2
132 | ideep4py 2.0.0.post3
133 | idna 2.8
134 | image 1.5.27
135 | imageio 2.4.1
136 | imagesize 1.1.0
137 | imbalanced-learn 0.4.3
138 | imblearn 0.0
139 | imgaug 0.2.9
140 | importlib-metadata 0.19
141 | imutils 0.5.3
142 | inflect 2.1.0
143 | intel-openmp 2019.0
144 | intervaltree 2.1.0
145 | ipykernel 4.6.1
146 | ipython 5.5.0
147 | ipython-genutils 0.2.0
148 | ipython-sql 0.3.9
149 | ipywidgets 7.5.1
150 | itsdangerous 1.1.0
151 | jax 0.1.43
152 | jaxlib 0.1.26
153 | jdcal 1.4.1
154 | jedi 0.15.1
155 | jieba 0.39
156 | Jinja2 2.10.1
157 | jmespath 0.9.4
158 | joblib 0.13.2
159 | jpeg4py 0.1.4
160 | jsonschema 2.6.0
161 | jupyter 1.0.0
162 | jupyter-client 5.3.1
163 | jupyter-console 5.2.0
164 | jupyter-core 4.5.0
165 | kaggle 1.5.5
166 | kapre 0.1.3.1
167 | Keras 2.2.5
168 | Keras-Applications 1.0.8
169 | Keras-Preprocessing 1.1.0
170 | keras-vis 0.4.1
171 | kiwisolver 1.1.0
172 | knnimpute 0.1.0
173 | librosa 0.6.3
174 | lightgbm 2.2.3
175 | llvmlite 0.29.0
176 | lmdb 0.97
177 | lucid 0.3.8
178 | lunardate 0.2.0
179 | lxml 4.2.6
180 | magenta 0.3.19
181 | Markdown 3.1.1
182 | MarkupSafe 1.1.1
183 | matplotlib 3.0.3
184 | matplotlib-venn 0.11.5
185 | mesh-tensorflow 0.0.5
186 | mido 1.2.6
187 | mir-eval 0.5
188 | missingno 0.4.2
189 | mistune 0.8.4
190 | mizani 0.5.4
191 | mkl 2019.0
192 | mlxtend 0.14.0
193 | more-itertools 7.2.0
194 | moviepy 0.2.3.5
195 | mpi4py 3.0.2
196 | mpmath 1.1.0
197 | msgpack 0.5.6
198 | multiprocess 0.70.8
199 | multitasking 0.0.9
200 | murmurhash 1.0.2
201 | music21 5.5.0
202 | natsort 5.5.0
203 | nbconvert 5.6.0
204 | nbformat 4.4.0
205 | networkx 2.3
206 | nibabel 2.3.3
207 | nltk 3.2.5
208 | nose 1.3.7
209 | notebook 5.2.2
210 | np-utils 0.5.11.1
211 | numba 0.40.1
212 | numexpr 2.7.0
213 | numpy 1.16.4
214 | nvidia-ml-py3 7.352.0
215 | oauth2client 4.1.3
216 | oauthlib 3.1.0
217 | okgrade 0.4.3
218 | olefile 0.46
219 | opencv-contrib-python 3.4.3.18
220 | opencv-python 3.4.5.20
221 | openpyxl 2.5.9
222 | opt-einsum 3.0.1
223 | osqp 0.5.0
224 | packaging 19.1
225 | palettable 3.2.0
226 | pandas 0.24.2
227 | pandas-datareader 0.7.4
228 | pandas-gbq 0.4.1
229 | pandas-profiling 1.4.1
230 | pandocfilters 1.4.2
231 | parso 0.5.1
232 | pathlib 1.0.1
233 | patsy 0.5.1
234 | pexpect 4.7.0
235 | pickleshare 0.7.5
236 | Pillow 4.3.0
237 | pip 19.2.3
238 | pip-tools 3.9.0
239 | plac 0.9.6
240 | plotly 3.6.1
241 | plotnine 0.5.1
242 | pluggy 0.7.1
243 | portpicker 1.2.0
244 | prefetch-generator 1.0.1
245 | preshed 2.0.1
246 | pretty-midi 0.2.8
247 | prettytable 0.7.2
248 | progressbar2 3.38.0
249 | prometheus-client 0.7.1
250 | promise 2.2.1
251 | prompt-toolkit 1.0.16
252 | protobuf 3.7.1
253 | psutil 5.4.8
254 | psycopg2 2.7.6.1
255 | ptyprocess 0.6.0
256 | py 1.8.0
257 | pyarrow 0.14.1
258 | pyasn1 0.4.6
259 | pyasn1-modules 0.2.6
260 | pycocotools 2.0.0
261 | pycparser 2.19
262 | pydot 1.3.0
263 | pydot-ng 2.0.0
264 | pydotplus 2.0.2
265 | pyemd 0.5.1
266 | pyglet 1.4.2
267 | Pygments 2.1.3
268 | pygobject 3.26.1
269 | pymc3 3.7
270 | pymongo 3.9.0
271 | pymystem3 0.2.0
272 | PyOpenGL 3.1.0
273 | pyparsing 2.4.2
274 | pyrsistent 0.15.4
275 | pysndfile 1.3.7
276 | PySocks 1.7.0
277 | pystan 2.19.0.0
278 | pytest 3.6.4
279 | python-apt 1.6.4
280 | python-chess 0.23.11
281 | python-dateutil 2.5.3
282 | python-louvain 0.13
283 | python-rtmidi 1.3.0
284 | python-slugify 3.0.3
285 | python-utils 2.3.0
286 | pytz 2018.9
287 | PyWavelets 1.0.3
288 | PyYAML 3.13
289 | pyzmq 17.0.0
290 | qtconsole 4.5.4
291 | requests 2.21.0
292 | requests-oauthlib 1.2.0
293 | resampy 0.2.2
294 | retrying 1.3.3
295 | rpy2 2.9.5
296 | rsa 4.0
297 | s3fs 0.3.3
298 | s3transfer 0.2.1
299 | scikit-image 0.15.0
300 | scikit-learn 0.21.3
301 | scipy 1.3.1
302 | screen-resolution-extra 0.0.0
303 | scs 2.1.1.post2
304 | seaborn 0.9.0
305 | semantic-version 2.6.0
306 | Send2Trash 1.5.0
307 | setuptools 41.2.0
308 | setuptools-git 1.2
309 | Shapely 1.6.4.post2
310 | simplegeneric 0.8.1
311 | six 1.12.0
312 | sklearn 0.0
313 | sklearn-pandas 1.8.0
314 | smart-open 1.8.4
315 | snowballstemmer 1.9.0
316 | sortedcontainers 2.1.0
317 | spacy 2.1.8
318 | Sphinx 1.8.5
319 | sphinxcontrib-websupport 1.1.2
320 | SQLAlchemy 1.3.7
321 | sqlparse 0.3.0
322 | srsly 0.1.0
323 | stable-baselines 2.2.1
324 | statsmodels 0.10.1
325 | sympy 1.1.1
326 | tables 3.4.4
327 | tabulate 0.8.3
328 | tblib 1.4.0
329 | tensor2tensor 1.11.0
330 | tensorboard 1.14.0
331 | tensorboardcolab 0.0.22
332 | tensorflow 1.14.0
333 | tensorflow-estimator 1.14.0
334 | tensorflow-hub 0.5.0
335 | tensorflow-metadata 0.14.0
336 | tensorflow-probability 0.7.0
337 | termcolor 1.1.0
338 | terminado 0.8.2
339 | testpath 0.4.2
340 | text-unidecode 1.2
341 | textblob 0.15.3
342 | textgenrnn 1.4.1
343 | tfds-nightly 1.2.0.dev201908260105
344 | tflearn 0.3.2
345 | Theano 1.0.4
346 | thinc 7.0.8
347 | toolz 0.10.0
348 | torch 1.1.0
349 | torchsummary 1.5.1
350 | torchtext 0.3.1
351 | torchvision 0.3.0
352 | tornado 4.5.3
353 | tqdm 4.28.1
354 | traitlets 4.3.2
355 | tweepy 3.6.0
356 | typing 3.7.4.1
357 | tzlocal 1.5.1
358 | umap-learn 0.3.10
359 | uritemplate 3.0.0
360 | urllib3 1.24.3
361 | vega-datasets 0.7.0
362 | wasabi 0.2.2
363 | wcwidth 0.1.7
364 | webencodings 0.5.1
365 | Werkzeug 0.15.5
366 | wheel 0.33.6
367 | widgetsnbextension 3.5.1
368 | wordcloud 1.5.0
369 | wrapt 1.11.2
370 | xarray 0.11.3
371 | xgboost 0.90
372 | xkit 0.0.0
373 | xlrd 1.1.0
374 | xlwt 1.3.0
375 | yellowbrick 0.9.1
376 | zict 1.0.0
377 | zipp 0.6.0
378 | zmq 0.0.0
379 |
--------------------------------------------------------------------------------
/deepLearning_LSTM.ipynb:
--------------------------------------------------------------------------------
1 | {
2 | "nbformat": 4,
3 | "nbformat_minor": 0,
4 | "metadata": {
5 | "colab": {
6 | "name": "deepLearning_LSTM.ipynb",
7 | "provenance": [],
8 | "collapsed_sections": [],
9 | "toc_visible": true,
10 | "include_colab_link": true
11 | },
12 | "kernelspec": {
13 | "name": "python3",
14 | "display_name": "Python 3"
15 | },
16 | "accelerator": "GPU"
17 | },
18 | "cells": [
19 | {
20 | "cell_type": "markdown",
21 | "metadata": {
22 | "id": "view-in-github",
23 | "colab_type": "text"
24 | },
25 | "source": [
26 | ""
27 | ]
28 | },
29 | {
30 | "cell_type": "markdown",
31 | "metadata": {
32 | "id": "dTQnbI5ALRnR"
33 | },
34 | "source": [
35 | "# Long Short-Term Memory (LSTM)\n",
36 | "\n",
37 | "This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty.\n",
38 | "\n",
39 | "If you publish a paper related to this material, please cite:\n",
40 | "\n",
41 | "H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, \"Learning Deep Learning (CDT-15),\" 2019.\n",
42 | "\n",
43 | "This is the third example of a deep learning implementation. Here we use an LSTM network to predict Bitcoin prices over time, taking a time series as input.\n",
44 | "\n",
45 | "\n",
46 | "First, we import the necessary libraries. Here we use Keras with the TensorFlow backend."
47 | ]
48 | },
49 | {
50 | "cell_type": "code",
51 | "source": [
52 | "!pip install yfinance"
53 | ],
54 | "metadata": {
55 | "id": "rV_VpYzoxiji"
56 | },
57 | "execution_count": null,
58 | "outputs": []
59 | },
60 | {
61 | "cell_type": "code",
62 | "metadata": {
63 | "id": "Z_tGqT1mKu_4",
64 | "colab": {
65 | "base_uri": "https://localhost:8080/"
66 | },
67 | "outputId": "f83cedc2-f626-4df5-d333-6a875972154e"
68 | },
69 | "source": [
70 | "%tensorflow_version 1.x\n",
71 | "import numpy as np\n",
72 | "import keras\n",
73 | "from keras.models import Sequential\n",
74 | "from keras.utils.vis_utils import plot_model\n",
75 | "from keras.layers import Dense, Dropout, LSTM\n",
76 | "from keras.callbacks import EarlyStopping, ReduceLROnPlateau, ModelCheckpoint\n",
77 | "from sklearn.model_selection import train_test_split\n",
78 | "from sklearn.metrics import accuracy_score\n",
79 | "from sklearn.preprocessing import MinMaxScaler\n",
80 | "import matplotlib.pyplot as plt\n",
81 | "import pandas as pd\n",
82 | "#import pandas_datareader\n",
83 | "import yfinance as yf"
84 | ],
85 | "execution_count": null,
86 | "outputs": [
87 | {
88 | "output_type": "stream",
89 | "name": "stdout",
90 | "text": [
91 | "TensorFlow 1.x selected.\n"
92 | ]
93 | },
94 | {
95 | "output_type": "stream",
96 | "name": "stderr",
97 | "text": [
98 | "Using TensorFlow backend.\n"
99 | ]
100 | }
101 | ]
102 | },
103 | {
104 | "cell_type": "markdown",
105 | "metadata": {
106 | "id": "DBk2wzXpMb7i"
107 | },
108 | "source": [
109 | "If you have a GPU, you can use the following code to allocate processing to it. Otherwise, proceed to (*)."
110 | ]
111 | },
112 | {
113 | "cell_type": "code",
114 | "metadata": {
115 | "id": "rBk-5FD3Mf5j",
116 | "colab": {
117 | "base_uri": "https://localhost:8080/"
118 | },
119 | "outputId": "f079924d-f08a-4123-c63f-914a472e81d4"
120 | },
121 | "source": [
122 | "import tensorflow as tf \n",
123 | "from keras import backend as K\n",
124 | "\n",
125 | "print(K.tensorflow_backend._get_available_gpus())\n",
126 | "\n",
127 | "number_of_cpu_cores = 8\n",
128 | "config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) \n",
129 | "session = tf.Session(config=config) \n",
130 | "keras.backend.set_session(session)"
131 | ],
132 | "execution_count": null,
133 | "outputs": [
134 | {
135 | "output_type": "stream",
136 | "name": "stdout",
137 | "text": [
138 | "['/job:localhost/replica:0/task:0/device:GPU:0']\n"
139 | ]
140 | }
141 | ]
142 | },
143 | {
144 | "cell_type": "markdown",
145 | "metadata": {
146 | "id": "SkpEIvMCMhc0"
147 | },
148 | "source": [
149 | "(*) Here, we use the Bitcoin daily prices dataset, which is available at\n",
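150 | "[Yahoo Finance](https://finance.yahoo.com/). The data contains seven columns, organized as follows: date, opening price, daily high price, daily low price, closing price, the volume traded on the day, and the adjusted closing price. The code below keeps the `Open`, `High`, `Low`, `Close`, and `Volume` columns, then drops `Open` when building the training and test arrays, so the network receives four features and its target is the first of them (`High`)."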
151 | ]
152 | },
153 | {
154 | "cell_type": "code",
155 | "metadata": {
156 | "id": "7XsGYIwMb_aI"
157 | },
158 | "source": [
159 | "train_size = 1500\n",
160 | "start_date = '2015-01-01'# Bitcoin started on '2010-07-16'\n",
161 | "end_date = '2020-04-01'\n",
162 | "\n",
163 | "\n",
164 | "tickerData = yf.Ticker(\"BTC-USD\")\n",
165 | "dataset = tickerData.history(period='max', interval='1d', start=start_date, end=end_date)\n",
166 | "data_order = ['Open', 'High', 'Low', 'Close', 'Volume']\n",
167 | "dataset = dataset[data_order]\n",
168 | "\n",
169 | "\n",
170 | "train_dataset = dataset.iloc[0:train_size, 1::].values\n",
171 | "test_dataset = dataset.iloc[train_size::, 1::].values\n",
172 | "\n",
173 | "min_max_scaler = MinMaxScaler(feature_range=(0,1))\n",
174 | "normalized_train_dataset = min_max_scaler.fit_transform(train_dataset)\n",
175 | "\n",
176 | "min_max_scaler_train = MinMaxScaler(feature_range=(0,1))\n",
177 | "normalized_train_price = min_max_scaler_train.fit_transform(train_dataset[:,0:1])\n"
178 | ],
179 | "execution_count": null,
180 | "outputs": []
181 | },
182 | {
183 | "cell_type": "markdown",
184 | "metadata": {
185 | "id": "xndlWJv8Oi0N"
186 | },
187 | "source": [
188 | "In the following, we define the network topology."
189 | ]
190 | },
191 | {
192 | "cell_type": "code",
193 | "metadata": {
194 | "id": "imLL0-wBOf8o"
195 | },
196 | "source": [
197 | "window_size = 50\n",
198 | "number_of_lstm_layers = 3\n",
199 | "activation = 'sigmoid' \n",
200 | "return_sequences = True\n",
201 | "units_first_layer = 100\n",
202 | "units = 50\n",
203 | "\n",
204 | "data = []\n",
205 | "train_price = []\n",
206 | "for i in range(window_size, train_size):\n",
207 | " data.append(normalized_train_dataset[i-window_size:i, :])\n",
208 | " train_price.append(normalized_train_dataset[i, 0])\n",
209 | "data, train_price = np.array(data), np.array(train_price)\n",
210 | "\n",
211 | "lstm_model = Sequential()\n",
212 | "lstm_model.add(LSTM(units = units_first_layer, \n",
213 | " return_sequences = return_sequences, \n",
214 | " input_shape = (data.shape[1], 4)))\n",
215 | "lstm_model.add(Dropout(0.2))\n",
216 | "\n",
217 | "for i in range(number_of_lstm_layers-2):\n",
218 | " lstm_model.add(LSTM(units = units, return_sequences = return_sequences))\n",
219 | " lstm_model.add(Dropout(0.2))\n",
220 | "\n",
221 | "lstm_model.add(LSTM(units = units))\n",
222 | "lstm_model.add(Dropout(0.2))\n",
223 | "\n",
224 | "#Output layer\n",
225 | "lstm_model.add(Dense(units = 1, activation = activation))"
226 | ],
227 | "execution_count": null,
228 | "outputs": []
229 | },
230 | {
231 | "cell_type": "markdown",
232 | "metadata": {
233 | "id": "lgj_GKsJVms-"
234 | },
235 | "source": [
236 | "In order to check the network topology, the subsequent command can be used."
237 | ]
238 | },
239 | {
240 | "cell_type": "code",
241 | "metadata": {
242 | "id": "ZDE5FFHQTD2c",
243 | "outputId": "f87b32e1-9de9-42d6-a30f-8ebb89d00c2c",
244 | "colab": {
245 | "base_uri": "https://localhost:8080/",
246 | "height": 1000
247 | }
248 | },
249 | "source": [
250 | "lstm_model.summary() \n",
251 | "# Saving the resulting figure as 'lstm_model.png'.\n",
252 | "plot_model(lstm_model, to_file='lstm_model.png', show_shapes=True, \n",
253 | " show_layer_names=True)"
254 | ],
255 | "execution_count": null,
256 | "outputs": [
257 | {
258 | "output_type": "stream",
259 | "name": "stdout",
260 | "text": [
261 | "Model: \"sequential_7\"\n",
262 | "_________________________________________________________________\n",
263 | "Layer (type) Output Shape Param # \n",
264 | "=================================================================\n",
265 | "lstm_19 (LSTM) (None, 50, 100) 42000 \n",
266 | "_________________________________________________________________\n",
267 | "dropout_19 (Dropout) (None, 50, 100) 0 \n",
268 | "_________________________________________________________________\n",
269 | "lstm_20 (LSTM) (None, 50, 50) 30200 \n",
270 | "_________________________________________________________________\n",
271 | "dropout_20 (Dropout) (None, 50, 50) 0 \n",
272 | "_________________________________________________________________\n",
273 | "lstm_21 (LSTM) (None, 50) 20200 \n",
274 | "_________________________________________________________________\n",
275 | "dropout_21 (Dropout) (None, 50) 0 \n",
276 | "_________________________________________________________________\n",
277 | "dense_7 (Dense) (None, 1) 51 \n",
278 | "=================================================================\n",
279 | "Total params: 92,451\n",
280 | "Trainable params: 92,451\n",
281 | "Non-trainable params: 0\n",
282 | "_________________________________________________________________\n"
283 | ]
284 | },
285 | {
286 | "output_type": "execute_result",
287 | "data": {
288 | "image/png": "\n",
289 | "text/plain": [
290 | ""
291 | ]
292 | },
293 | "metadata": {},
294 | "execution_count": 62
295 | }
296 | ]
297 | },
298 | {
299 | "cell_type": "markdown",
300 | "metadata": {
301 | "id": "wYn2dnj2TCm3"
302 | },
303 | "source": [
304 | "The training step is executed as follows.\n",
305 | "\n"
306 | ]
307 | },
308 | {
309 | "cell_type": "code",
310 | "metadata": {
311 | "id": "i-npHi6vVdQB",
312 | "outputId": "e5286a2f-e0b2-4e2d-cc52-f2eb9c15f2af",
313 | "colab": {
314 | "base_uri": "https://localhost:8080/"
315 | }
316 | },
317 | "source": [
318 | "#Here we set verbose as true\n",
319 | "verbose = 1\n",
320 | "\n",
321 | "batch_size = 32\n",
322 | "epochs = 10\n",
323 | "filepath = 'weights.h5' #name of the file with the network weights\n",
324 | "monitor = 'loss'\n",
325 | "optimizer = 'adam'\n",
326 | "loss = 'mean_squared_error'\n",
327 | "metrics = ['mean_absolute_error'] \n",
328 | "\n",
329 | "lstm_model.compile(optimizer = optimizer, loss = loss, metrics = metrics)\n",
330 | "\n",
331 | "early_stopping = EarlyStopping(monitor = monitor, min_delta = 1e-15, \n",
332 | " patience = 10, verbose = verbose)\n",
333 | "reduce_learning_rate_on_plateau = ReduceLROnPlateau(monitor = monitor, \n",
334 | " factor = 0.2, patience = 5, \n",
335 | " verbose = verbose)\n",
336 | "model_checkpoint = ModelCheckpoint(filepath = filepath, monitor = monitor, \n",
337 | " save_best_only = True, verbose = verbose)\n",
338 | "lstm_model.fit(data, train_price, epochs = epochs, batch_size = batch_size,\n",
339 | " callbacks = [early_stopping, reduce_learning_rate_on_plateau, \n",
340 | " model_checkpoint])"
341 | ],
342 | "execution_count": null,
343 | "outputs": [
344 | {
345 | "output_type": "stream",
346 | "name": "stdout",
347 | "text": [
348 | "Epoch 1/10\n",
349 | "1450/1450 [==============================] - 6s 4ms/step - loss: 0.0567 - mean_absolute_error: 0.1899\n",
350 | "\n",
351 | "Epoch 00001: loss improved from inf to 0.05668, saving model to weights.h5\n",
352 | "Epoch 2/10\n",
353 | "1450/1450 [==============================] - 7s 4ms/step - loss: 0.0074 - mean_absolute_error: 0.0533\n",
354 | "\n",
355 | "Epoch 00002: loss improved from 0.05668 to 0.00736, saving model to weights.h5\n",
356 | "Epoch 3/10\n",
357 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0032 - mean_absolute_error: 0.0331\n",
358 | "\n",
359 | "Epoch 00003: loss improved from 0.00736 to 0.00316, saving model to weights.h5\n",
360 | "Epoch 4/10\n",
361 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0032 - mean_absolute_error: 0.0344\n",
362 | "\n",
363 | "Epoch 00004: loss did not improve from 0.00316\n",
364 | "Epoch 5/10\n",
365 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0023 - mean_absolute_error: 0.0291\n",
366 | "\n",
367 | "Epoch 00005: loss improved from 0.00316 to 0.00232, saving model to weights.h5\n",
368 | "Epoch 6/10\n",
369 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0018 - mean_absolute_error: 0.0252\n",
370 | "\n",
371 | "Epoch 00006: loss improved from 0.00232 to 0.00178, saving model to weights.h5\n",
372 | "Epoch 7/10\n",
373 | "1450/1450 [==============================] - 5s 4ms/step - loss: 0.0016 - mean_absolute_error: 0.0247\n",
374 | "\n",
375 | "Epoch 00007: loss improved from 0.00178 to 0.00164, saving model to weights.h5\n",
376 | "Epoch 8/10\n",
377 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0015 - mean_absolute_error: 0.0242\n",
378 | "\n",
379 | "Epoch 00008: loss improved from 0.00164 to 0.00149, saving model to weights.h5\n",
380 | "Epoch 9/10\n",
381 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0016 - mean_absolute_error: 0.0257\n",
382 | "\n",
383 | "Epoch 00009: loss did not improve from 0.00149\n",
384 | "Epoch 10/10\n",
385 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0014 - mean_absolute_error: 0.0234\n",
386 | "\n",
387 | "Epoch 00010: loss improved from 0.00149 to 0.00143, saving model to weights.h5\n"
388 | ]
389 | },
390 | {
391 | "output_type": "execute_result",
392 | "data": {
393 | "text/plain": [
394 | ""
395 | ]
396 | },
397 | "metadata": {},
398 | "execution_count": 63
399 | }
400 | ]
401 | },
402 | {
403 | "cell_type": "markdown",
404 | "metadata": {
405 | "id": "dvFufFaJMjXB"
406 | },
407 | "source": [
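408 | "The following code applies the trained network to the test data: it builds the test windows, predicts the prices, and maps the predictions back to the original price scale."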
409 | ]
410 | },
411 | {
412 | "cell_type": "code",
413 | "metadata": {
414 | "id": "0yBjjCoCjCo3"
415 | },
416 | "source": [
417 | "test_price = test_dataset[:, 0:1]\n",
418 | "complete_dataset = dataset.iloc[:,1::]\n",
419 | "\n",
420 | "test_inputs = complete_dataset[len(complete_dataset) - len(test_dataset) - \n",
421 | " window_size:].values\n",
422 | "test_inputs = min_max_scaler.transform(test_inputs)\n",
423 | "\n",
424 | "\n",
425 | "X_test = []\n",
426 | "for i in range(window_size, len(test_inputs)):\n",
427 | " X_test.append(test_inputs[i-window_size:i, :])\n",
428 | "X_test = np.array(X_test)\n",
429 | "\n",
430 | "calculated_prices = lstm_model.predict(X_test)\n",
431 | "calculated_prices = min_max_scaler_train.inverse_transform(calculated_prices)\n",
432 | "\n",
433 | "train_price = min_max_scaler_train.inverse_transform([train_price])\n",
434 | "train_price = train_price[0]"
435 | ],
436 | "execution_count": null,
437 | "outputs": []
438 | },
439 | {
440 | "cell_type": "markdown",
441 | "metadata": {
442 | "id": "IY8n4My_X4HO"
443 | },
444 | "source": [
445 | "In the following, we plot the training set, together with the predicted and expected values for the test period."
446 | ]
447 | },
448 | {
449 | "cell_type": "code",
450 | "metadata": {
451 | "id": "HXWZWul0X3hH",
452 | "outputId": "00aaf3a5-fad8-4981-d96b-1d05eab6fddd",
453 | "colab": {
454 | "base_uri": "https://localhost:8080/",
455 | "height": 295
456 | }
457 | },
458 | "source": [
459 | "plt.plot(np.linspace(0,len(train_price)-1,len(train_price)), \n",
460 | " train_price, label = 'Real price train', color = 'k')\n",
461 | "plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1,\n",
462 | " len(test_price)), test_price, label = 'Real price')\n",
463 | "plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1,\n",
464 | " len(test_price)), calculated_prices, label = 'Prediction')\n",
465 | "plt.title('BTC prediction')\n",
466 | "plt.xlabel('Time')\n",
467 | "plt.ylabel('Value')\n",
468 | "plt.legend()\n",
469 | "plt.show()"
470 | ],
471 | "execution_count": null,
472 | "outputs": [
473 | {
474 | "output_type": "display_data",
475 | "data": {
476 | "image/png": "\n",
477 | "text/plain": [
478 | ""
479 | ]
480 | },
481 | "metadata": {
482 | "needs_background": "light"
483 | }
484 | }
485 | ]
486 | },
487 | {
488 | "cell_type": "markdown",
489 | "metadata": {
490 | "id": "EmXldp6aSUwa"
491 | },
492 | "source": [
493 | "## License\n",
494 | "\n",
495 | "This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License."
496 | ]
497 | },
498 | {
499 | "cell_type": "markdown",
500 | "metadata": {
501 | "id": "IGfUENRWu4Pm"
502 | },
503 | "source": [
504 | "## Acknowledgments\n",
505 | "Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). H. F. de Arruda also thanks Soremartec S.A. and Soremartec Italia, Ferrero Group, for partial financial support (from 1st July 2021). His funders had no role in study design, data collection and analysis, decision to publish, or manuscript preparation. Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and FAPESP (proc. 15/22308-2) for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 2018/09125-4 and 2021/12354-8) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 15/22308-2."
506 | ]
507 | }
508 | ]
509 | }
--------------------------------------------------------------------------------
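The notebook above builds its LSTM inputs by sliding a fixed-length window over the normalized series: each sample holds the previous `window_size` rows, and the target is the first column of the row that follows. A minimal NumPy-only sketch of that step is given below. It assumes a hypothetical helper name, `make_windows`, and uses synthetic data rather than the Bitcoin prices; it is an illustration of the idea, not the notebook's own code.

```python
import numpy as np


def make_windows(series, window_size, target_column=0):
    """Split a (time, features) array into LSTM-style samples.

    Hypothetical helper (not part of the notebook): sample k holds rows
    k .. k + window_size - 1, and its target is the value of
    `target_column` in row k + window_size.
    """
    series = np.asarray(series, dtype=float)
    if series.ndim != 2:
        raise ValueError("series must have shape (time, features)")
    if len(series) <= window_size:
        raise ValueError("series must be longer than window_size")

    inputs, targets = [], []
    for i in range(window_size, len(series)):
        inputs.append(series[i - window_size:i, :])  # the preceding window
        targets.append(series[i, target_column])     # the value to predict
    return np.array(inputs), np.array(targets)


# Synthetic stand-in for the normalized data: 10 time steps, 4 features.
toy_series = np.arange(40, dtype=float).reshape(10, 4)
X, y = make_windows(toy_series, window_size=3)
print(X.shape, y.shape)
```

With the notebook's settings (1500 training rows and `window_size = 50`), the same loop yields 1450 samples, which agrees with the `1450/1450` counter shown in the training log.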