├── redes.png ├── LICENSE ├── README.md ├── codes ├── feedforward_multclass.py ├── cnn.py ├── rbm.py ├── feedforward_binary.py ├── lstm.py ├── gan.py └── autoencoder.py ├── libraries.txt └── deepLearning_LSTM.ipynb /redes.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/hfarruda/deeplearningtutorial/HEAD/redes.png -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- 1 | Deep learning tutorial (c) by Henrique Ferraz de Arruda, Alexandre Benatti, César Henrique Comin, and Luciano da Fontoura Costa 2 | 3 | Deep learning tutorial is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. 4 | 5 | You should have received a copy of the license along with this work. 6 | If not, see . 7 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Deep Learning Tutorial 2 | 3 | This tutorial is part of the didactic text: [Learning Deep Learning](https://www.scielo.br/j/rbef/a/hMZfS8hRwMvVktkbCZtjJff/?format=html), authored by Henrique Ferraz de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. 4 | 5 | The purpose of this tutorial is to provide simple didactic examples of deep learning architectures and problem solution. The codes included here are based on toy datasets, and restricted to parameters allowing short processing time. So, these codes are not suitable for other data and/or applications, which will require modifications in the structure and parameters. These codes have absolutely no warranty. 6 | 7 | For all the codes presented here, we use [Keras](https://keras.io/) as the deep learning library. Keras is a useful and straightforward framework, which can be employed for simple and complex tasks. 
Keras is written in the Python language, providing self-explanatory codes, with the additional advantage of being executed under [TensorFlow](https://www.tensorflow.org/) backend. We also employ the [Scikit-learn](https://scikit-learn.org/), which is devoted to machine learning. 8 | 9 | ![](./redes.png) 10 | 11 | More details are available at [Learning Deep Learning](https://www.scielo.br/j/rbef/a/hMZfS8hRwMvVktkbCZtjJff/?format=html). 12 | 13 | 14 | ## Feedforward networks 15 | 16 | ### Binary Classification 17 | This is the first example of deep learning implementation, in which we address binary classification of wine data. In this example, we consider one feedforward network with 5 hidden layers and with 30 neurons in each layer. The provided networks were built only for a didactic purpose and are not appropriate for real applications. 18 | 19 | ### Multiclass Classification 20 | In this example, we illustrate a multiclass classification through a wine dataset, in which there are three classes, which were defined according to their regions. We employed the same dataset presented above, but here we considered the three classes. To do so, we use the *softmax* activation function. 21 | 22 | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_feedforward.ipynb) 23 | 24 | 25 | ## Convolutional Neural Network (CNN) 26 | This tutorial is the second example of deep learning implementation, in which we exemplify a classification task. More specifically, we considered ten classes of colored pictures. 27 | 28 | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_CNN.ipynb) 29 | 30 | 31 | ## Long Short-Term Memory (LSTM) 32 | 33 | This is the third example of deep learning implementation. 
Here we use an LSTM network to predict Bitcoin prices over time, taking a time series as input. 34 | 35 | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_LSTM.ipynb) 36 | 37 | 38 | ## Restricted Boltzmann Machine (RBM) 39 | 40 | This is the fourth example of deep learning implementation. Here we use an RBM network to provide a recommendation system for musical instruments. 41 | 42 | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_RBM.ipynb) 43 | 44 | 45 | ## Autoencoders 46 | This example uses the Autoencoder model to illustrate a possible application. Here we show how to use the resulting codes to reduce the dimensionality. We also project our data by using Principal Component Analysis (PCA). 47 | 48 | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_autoencoder.ipynb) 49 | 50 | 51 | ## Generative Adversarial Networks (GAN) 52 | This example builds a network that can generate handwritten characters automatically. 53 | 54 | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_GAN.ipynb) 55 | 56 | 57 | ## Libraries 58 | All of these codes were developed and executed with the environment described in "libraries.txt". 59 | 60 | ## Citation Request 61 | If you publish a paper related to this material, please cite: 62 | 63 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning deep learning." Revista Brasileira de Ensino de Física 44, 2022. 64 | 65 | 66 | ## Acknowledgments 67 | Henrique F.
de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). H. F. de Arruda also thanks Soremartec S.A. and Soremartec Italia, Ferrero Group, for partial financial support (from 1st July 2021). His funders had no role in study design, data collection and analysis, decision to publish, or manuscript preparation. Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and FAPESP (proc. 15/22308-2) for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 2018/09125-4 and 2021/12354-8) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 15/22308-2. 68 | -------------------------------------------------------------------------------- /codes/feedforward_multclass.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | """deepLearning_feedforward.ipynb 3 | 4 | Automatically generated by Colaboratory. 5 | 6 | Original file is located at 7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_feedforward.ipynb 8 | 9 | #Feedforward networks 10 | 11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. These codes have absolutely no warranty. 12 | 13 | If you publish a paper related to this material, please cite: 14 | 15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019. 16 | 17 | ##Multiclass Classification 18 | In this example, we illustrate a multiclass classification through a wine dataset, in which there are three classes, which were defined according to their regions.
We employed the same dataset presented above, but here we considered the three classes. To do so, we use the *softmax* activation function. 19 | 20 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend). 21 | """ 22 | 23 | import numpy as np 24 | import keras 25 | from keras.utils import np_utils 26 | from keras.models import Sequential 27 | from keras.layers import Dense, Dropout 28 | from sklearn.datasets import load_wine 29 | from sklearn.model_selection import train_test_split 30 | from sklearn.metrics import accuracy_score 31 | from sklearn.preprocessing import LabelEncoder 32 | from sklearn.metrics import confusion_matrix 33 | 34 | """If you have a GPU, you can use the following code to allocate processing into it. Otherwise, proceed to (*).""" 35 | 36 | import tensorflow as tf 37 | from keras import backend as K 38 | 39 | print(K.tensorflow_backend._get_available_gpus()) 40 | 41 | number_of_cpu_cores = 8 42 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) 43 | session = tf.Session(config=config) 44 | keras.backend.set_session(session) 45 | 46 | """(*) In this example the dataset used is Wine. It is available at Sklearn library on [sklearn-datasets-wine](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_wine.html). For more information [wine-UCI](https://archive.ics.uci.edu/ml/datasets/Wine). 47 | 48 | These data show the results of a chemical analysis of wines grown in Italy, derived from three different cultivars in the same region, and can be loaded as follows. 49 | """ 50 | 51 | wine = load_wine() 52 | data = wine['data'] 53 | target = wine['target'] 54 | target_names = wine['target_names'] 55 | 56 | label_encoder = LabelEncoder() 57 | target = label_encoder.fit_transform(target) 58 | target_one_hot_encoding = np_utils.to_categorical(target) 59 | 60 | #Here, we divide our dataset into training and test sets. 
61 | test_size = 0.25 #fraction 62 | training_data,test_data,training_target,test_target = train_test_split(data, 63 | target_one_hot_encoding, test_size=test_size) 64 | 65 | """In the following, we configure the neuronal network. It is not necessary to include bias because this parameter is set as true by default.""" 66 | 67 | #Set of parameters 68 | input_dim = data.shape[1] 69 | kernel_initializer = 'random_uniform' 70 | bias_initializer='zeros' 71 | activation_function_hidden = 'relu' 72 | activation_function_output = 'softmax' 73 | optimizer = 'adam' 74 | loss = 'categorical_crossentropy' 75 | metrics = ['categorical_accuracy'] 76 | number_of_layers = 5 77 | number_of_units_hidden = 30 78 | number_of_units_output = len(set(target_names)) 79 | dropout_percentage = 0.25 80 | 81 | 82 | #Creating model 83 | ff_model = Sequential() 84 | ff_model.add(Dense(units = number_of_units_hidden, 85 | activation = activation_function_hidden, 86 | kernel_initializer = kernel_initializer, 87 | input_dim = input_dim)) 88 | 89 | for i in range(number_of_layers-1): 90 | #Inserting a dense hidden layer 91 | ff_model.add(Dense(units = number_of_units_hidden, 92 | activation = activation_function_hidden, 93 | kernel_initializer = kernel_initializer, 94 | input_dim = number_of_units_hidden)) 95 | #Inserting dropout 96 | ff_model.add(Dropout(dropout_percentage)) 97 | 98 | ff_model.add(Dense(units = number_of_units_output, 99 | activation = activation_function_output)) 100 | ff_model.compile(optimizer = optimizer, loss = loss, metrics = metrics) 101 | ff_model.summary() 102 | 103 | """The training step is executed as follows.""" 104 | 105 | batch_size = 10 106 | epochs = 250 107 | ff_model.fit(training_data,training_target, batch_size = batch_size, 108 | epochs = epochs) 109 | 110 | """Because there are three classes, we show the classification results through a confusion matrix.""" 111 | 112 | predictions = ff_model.predict(test_data) 113 | 114 | found_target = 
predictions.argmax(axis=1) 115 | categorical_test_target = test_target.argmax(axis=1) 116 | 117 | accuracy = accuracy_score(categorical_test_target, found_target) 118 | print("Accuracy =", accuracy) 119 | 120 | print("Confusion matrix:") 121 | matrix = confusion_matrix(categorical_test_target, found_target) #rows: true classes; columns: predicted classes 122 | print(matrix) 123 | 124 | """ 125 | ## License 126 | 127 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License. 128 | 129 | ## Acknowledgments 130 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2. 131 | """ 132 | -------------------------------------------------------------------------------- /codes/cnn.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | """deepLearning_CNN.ipynb 3 | 4 | Automatically generated by Colaboratory. 5 | 6 | Original file is located at 7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_CNN.ipynb 8 | 9 | # Convolutional Neural Network (CNN) 10 | 11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty. 12 | 13 | If you publish a paper related to this material, please cite: 14 | 15 | H. F. de Arruda, A.
Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019. 16 | 17 | This tutorial is the second example of deep learning implementation, in which we exemplify a classification task. More specifically, we considered ten classes of color pictures. 18 | 19 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend). 20 | """ 21 | 22 | import keras 23 | from keras.utils import np_utils 24 | from keras.models import Sequential 25 | from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout 26 | from keras.layers.normalization import BatchNormalization 27 | from keras.preprocessing.image import ImageDataGenerator 28 | from keras.preprocessing import image 29 | from keras.datasets import cifar10 30 | from keras.utils.vis_utils import plot_model 31 | from sklearn.metrics import accuracy_score 32 | import numpy as np 33 | import matplotlib.pyplot as plt 34 | from sklearn.metrics import classification_report, confusion_matrix 35 | 36 | """If you have a GPU, you can use the following code to allocate processing into it. Otherwise, proceed to (*).""" 37 | 38 | import tensorflow as tf 39 | from keras import backend as K 40 | 41 | print(K.tensorflow_backend._get_available_gpus()) 42 | 43 | number_of_cpu_cores = 8 44 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) 45 | session = tf.Session(config=config) 46 | keras.backend.set_session(session) 47 | 48 | """(*) In this example, we use CIFAR10, a dataset of color images. It is available in the Keras library; see [keras-datasets](https://keras.io/datasets/). 49 | This dataset is organized into two parts, where the first is called x_train/x_test and comprises RGB images with dimensions of 32x32x3. The second represents the targets, and the variables are called y_train/y_test, which are represented by arrays of category tags from 0 to 9.
50 | 51 | The following command is used to load the data set. 52 | """ 53 | 54 | (train_data, train_target), (test_data, test_target) = cifar10.load_data() 55 | 56 | train_target_one_hot_encoding = np_utils.to_categorical(train_target) 57 | 58 | """In order to visualize a given figure, the following code can be executed.""" 59 | 60 | image_id = 700 61 | plt.imshow(test_data[image_id]) 62 | plt.title("Test image: " + str(image_id)) 63 | plt.show() 64 | 65 | """In the following, we define the network topology. In this case, because of the redundancy typically found in images, we do not employ dropout in the convolutional layers.""" 66 | 67 | input_shape = train_data.shape[1:] 68 | filters = 128 69 | kernel_size = (3,3) 70 | pool_size = (2,2) 71 | 72 | optimizer = 'adam' 73 | loss = 'categorical_crossentropy' 74 | metrics = ['categorical_accuracy'] 75 | activation = 'relu' 76 | activation_function_output = 'softmax' 77 | number_of_cnn_layers = 3 78 | number_of_ff_layers = 3 79 | number_of_units_output = train_target_one_hot_encoding.shape[1] 80 | 81 | cnn_model = Sequential() 82 | cnn_model.add(Conv2D(filters, kernel_size, input_shape = input_shape, 83 | activation = activation)) 84 | 85 | cnn_model.add(BatchNormalization()) 86 | cnn_model.add(MaxPooling2D(pool_size = pool_size)) 87 | 88 | for i in range(number_of_cnn_layers-1): 89 | cnn_model.add(Conv2D(filters, kernel_size, activation = activation)) 90 | cnn_model.add(BatchNormalization()) 91 | cnn_model.add(MaxPooling2D(pool_size = pool_size)) 92 | 93 | cnn_model.add(Flatten()) 94 | 95 | #Feedforward network 96 | for i in range(number_of_ff_layers): 97 | cnn_model.add(Dense(units = 128, activation = activation)) 98 | cnn_model.add(Dropout(0.3)) 99 | 100 | cnn_model.add(Dense(units = number_of_units_output, 101 | activation = activation_function_output)) 102 | 103 | cnn_model.compile(optimizer = optimizer, loss = loss, metrics = metrics) 104 | 105 | """We can use the following command to see the network topology.""" 
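The spatial sizes that the topology summary should report can be worked out by hand. The sketch below is not part of the original tutorial: it assumes the Keras defaults for the layers defined above (`padding='valid'` and stride 1 for `Conv2D`, pooling stride equal to `pool_size`), and the helper names `conv_output_size`, `pool_output_size`, and `feature_map_sizes` are hypothetical.

```python
def conv_output_size(size, kernel, stride=1):
    # 'valid' convolution: the kernel must fit entirely inside the input.
    return (size - kernel) // stride + 1


def pool_output_size(size, pool):
    # Max pooling with stride equal to the pool size and no padding.
    return (size - pool) // pool + 1


def feature_map_sizes(input_size=32, kernel=3, pool=2, number_of_cnn_layers=3):
    # Side length of the feature maps after each Conv2D + MaxPooling2D block.
    # BatchNormalization does not change the shape, so it is skipped here.
    sizes = []
    size = input_size
    for _ in range(number_of_cnn_layers):
        size = conv_output_size(size, kernel)
        size = pool_output_size(size, pool)
        sizes.append(size)
    return sizes


filters = 128  # same value as in the tutorial
sizes = feature_map_sizes()
flattened_units = sizes[-1] ** 2 * filters  # input size of the first Dense layer
print(sizes, flattened_units)
```

With a 32x32 input this gives 15, 6, and 2 pixels per side after the three blocks, so `Flatten` passes 2 x 2 x 128 = 512 values to the first dense layer.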
106 | 107 | cnn_model.summary() 108 | #Saving the resultant figure as 'cnn_model.png'. 109 | plot_model(cnn_model, to_file='cnn_model.png', show_shapes=True, 110 | show_layer_names=True) 111 | 112 | """The training step is executed as follows. Because this network demands substantial computational power, we use a small number of epochs.""" 113 | 114 | batch_size = 30 115 | epochs = 50 116 | 117 | cnn_model.fit(train_data, train_target_one_hot_encoding, 118 | batch_size = batch_size, epochs = epochs) 119 | 120 | """Since there are more than two classes, we show the classification results through a confusion matrix.""" 121 | 122 | predictions = cnn_model.predict(test_data) 123 | found_target = predictions.argmax(axis=1) 124 | 125 | accuracy = accuracy_score(test_target, found_target) 126 | print("Accuracy =", accuracy) 127 | 128 | print("Confusion matrix:") 129 | matrix = confusion_matrix(test_target, found_target) #rows: true classes; columns: predicted classes 130 | 131 | plt.title("Confusion matrix:") 132 | plt.xticks(np.linspace(0,9,10)) 133 | plt.yticks(np.linspace(0,9,10)) 134 | plt.imshow(matrix) 135 | plt.show() 136 | 137 | """ 138 | ## License 139 | 140 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License. 141 | 142 | ## Acknowledgments 143 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2.
144 | """ 145 | -------------------------------------------------------------------------------- /codes/rbm.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | """deepLearning_RBM.ipynb 3 | 4 | Automatically generated by Colaboratory. 5 | 6 | Original file is located at 7 | https://colab.research.google.com/drive/1R7ZfTxrtIIG_22IlApzXgWxuilbScLLJ 8 | 9 | # Restricted Boltzmann Machine (RBM) 10 | 11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty. 12 | 13 | If you publish a paper related to this material, please cite: 14 | 15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019. 16 | 17 | This is the fourth example of deep learning implementation. Here we use an RBM network to provide a recommendation system for CDs and vinyls. 18 | 19 | First of all, we import the necessary libraries. Here the RBM itself comes from Scikit-learn (BernoulliRBM); Keras (using TensorFlow backend) is used only for the one-hot encoding. 20 | """ 21 | 22 | import numpy as np 23 | import pandas as pd 24 | from sklearn.neural_network import BernoulliRBM 25 | import matplotlib.pyplot as plt 26 | import urllib.request 27 | from keras.utils import np_utils 28 | from sklearn.preprocessing import LabelEncoder 29 | import matplotlib.pyplot as plt 30 | 31 | """The following code downloads a dataset regarding the ratings of CDs and vinyls from the Amazon website ([link](http://snap.stanford.edu/data/amazon/productGraph/)). 32 | These data are divided into four columns, as follows: user id, item id, rating, and timestamp. The latter was removed from our analysis.
33 | """ 34 | 35 | main_url = "http://snap.stanford.edu/data/amazon/productGraph/categoryFiles/" 36 | file_name = "ratings_CDs_and_Vinyl.csv" 37 | url = main_url + file_name 38 | col_names = ["user", "item", "rating", "timestamp"] 39 | urllib.request.urlretrieve(url, file_name) 40 | musical_instruments_reviews = pd.read_csv(file_name, names = col_names) 41 | 42 | """In the following, we preprocess the dataset.""" 43 | 44 | #Defining dataset variables 45 | rating = musical_instruments_reviews["rating"].values.astype(float) 46 | rating /= np.max(rating) 47 | 48 | users = musical_instruments_reviews["user"].values 49 | 50 | label_encoder = LabelEncoder() 51 | items = musical_instruments_reviews["item"].values 52 | items = label_encoder.fit_transform(items) 53 | 54 | """In order to reduce the time for running this tutorial, we reduced the number of items.""" 55 | 56 | number_of_items = 6 57 | 58 | #Finding the most frequent items 59 | unique_items, item_counts = np.unique(items, return_counts=True) 60 | item2count = dict(zip(unique_items, item_counts)) 61 | item_count = sorted(item2count.items(), key=lambda x: x[1])[::-1] 62 | item_count = item_count[0:number_of_items] 63 | selected_items = [item for item, count in item_count] 64 | 65 | #keeping only the most frequent items 66 | keep_lines = [i for i,item in enumerate(items) if item in selected_items] 67 | keep_lines = np.array(keep_lines) 68 | 69 | rating = rating[keep_lines] 70 | users = users[keep_lines] 71 | items = items[keep_lines] 72 | 73 | #Converting the categorical data into a matrix 74 | item2new_code = {item:i for i,item in enumerate(set(items))} 75 | items = [item2new_code[item] for item in items] 76 | items_one_hot_encoding = np_utils.to_categorical(items) 77 | 78 | items_one_hot_encoding.shape 79 | 80 | """In the following, we weight and merge the codings of the selected columns.""" 81 | 82 | items_weighted = [items_one_hot_encoding[i] * rating[i] 83 | for i in range(len(rating))] 84 | 85 | items_weighted =
np.array(items_weighted) 86 | 87 | user2matrix_lines = {user: np.argwhere(user == users).T[0] 88 | for user in set(users)} 89 | 90 | user2purchases = {user:np.max(items_weighted[user2matrix_lines[user]],axis = 0) 91 | for user in set(users)} 92 | 93 | """In the next step, we eliminate the data from users who bought zero or one item. In our analysis, we do not consider the user names.""" 94 | 95 | data = list(user2purchases.values()) 96 | data = np.array(data) 97 | 98 | items_per_line = np.count_nonzero(data, axis=1) 99 | keep_lines = np.argwhere(items_per_line >= 2).T[0] 100 | 101 | data = data[keep_lines,:] 102 | 103 | """The following code defines the neuronal network.""" 104 | 105 | batch_size = 10 106 | learning_rate = 0.01 107 | n_components = 10 #Number of binary hidden units. 108 | n_iter = 5000 109 | verbose = 1 110 | 111 | rbm_model = BernoulliRBM(batch_size = batch_size, learning_rate = learning_rate, 112 | n_components = n_components, n_iter = n_iter, 113 | verbose = verbose) 114 | 115 | """Next, we train the network.""" 116 | 117 | rbm_model = rbm_model.fit(data) 118 | 119 | """Finally, we test the network. 120 | For that, we first inspect some inputs to check whether the output makes sense. We simulate a person who bought only the first product (index 0). So, we show the matrix lines of the other users who bought the same product, as follows. 121 | """ 122 | 123 | product_test = 0 124 | selected_lines = np.argwhere(data[:,product_test] > 0).T[0] 125 | plt.imshow(data[selected_lines]) 126 | plt.colorbar() 127 | plt.show() 128 | 129 | """The following code tests the network in order to recommend the most relevant products for a given user. More specifically, for a vector of scores of the acquired products, the RBM returns the products that this user could like.
Here, we select the top two recommendations, excluding the already acquired products.""" 130 | 131 | test_set = np.zeros(number_of_items) 132 | test_set[product_test] = 1 133 | 134 | 135 | 136 | #Here we test a single sample 137 | result_hidden_layer = rbm_model.transform([test_set])[0] 138 | weight_matrix = rbm_model.components_ 139 | 140 | result = np.matmul(weight_matrix.T,result_hidden_layer) 141 | recommended_products = np.argsort(result)[::-1] 142 | recommended_products = [product for product in recommended_products 143 | if product != product_test] 144 | print ("The two recommended products, in decreasing order, are: " + 145 | str(recommended_products[0:2]) + '.') 146 | 147 | """ 148 | ## License 149 | 150 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License. 151 | 152 | ## Acknowledgments 153 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2. 154 | """ 155 | -------------------------------------------------------------------------------- /codes/feedforward_binary.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | """deepLearning_feedforward.ipynb 3 | 4 | Automatically generated by Colaboratory.
5 | 6 | Original file is located at 7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_feedforward.ipynb 8 | 9 | #Feedforward networks 10 | 11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. These codes have absolutely no warranty. 12 | 13 | If you publish a paper related to this material, please cite: 14 | 15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019. 16 | 17 | ## Binary Classification 18 | This is the first example of deep learning implementation, in which we address binary classification of wine data. In this example, we consider one feedforward network with 5 hidden layers and with 30 neurons in each layer. The provided networks were built only for didactic purposes and are not appropriate for real applications. 19 | 20 | First of all, we import the necessary libraries. Here we opt for using Keras (using TensorFlow backend). 21 | """ 22 | 23 | import numpy as np 24 | import keras 25 | from keras.models import Sequential 26 | from keras.layers import Dense, Dropout 27 | from keras.utils.vis_utils import plot_model 28 | from keras.models import model_from_json 29 | from sklearn.datasets import load_wine 30 | from sklearn.model_selection import train_test_split 31 | from sklearn.metrics import accuracy_score 32 | 33 | """If you have a GPU, you can use the following code to run the computation on it.
Otherwise, proceed to (*).""" 34 | 35 | import tensorflow as tf 36 | from keras import backend as K 37 | 38 | print(K.tensorflow_backend._get_available_gpus()) 39 | 40 | number_of_cpu_cores = 8 41 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) 42 | session = tf.Session(config=config) 43 | keras.backend.set_session(session) 44 | 45 | """(*) Here, we use the Wine dataset. It is available in the Scikit-learn library at [sklearn-datasets-wine](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_wine.html). For more information, see [wine-UCI](https://archive.ics.uci.edu/ml/datasets/Wine). 46 | Because this dataset comprises three classes and here we exemplify a binary classification, we considered only the first two classes. 47 | """ 48 | 49 | wine = load_wine() 50 | data = wine['data'] 51 | target = wine['target'] 52 | target_names = wine['target_names'] 53 | 54 | #The selected items are stored in the variable called "hold". 55 | hold = np.argwhere(target!=2).T[0] 56 | data = data[hold] 57 | target = target[hold] 58 | target_names = target_names[0:2] 59 | 60 | #Here, we divide our dataset into training and test sets. 61 | test_size = 0.25 #fraction 62 | training_data,test_data,training_target,test_target = train_test_split(data, 63 | target, test_size=test_size) 64 | 65 | """In the following, we configure the neuronal network.
It is not necessary to include bias because this parameter is set as true by default.""" 66 | 67 | #Set of parameters 68 | input_dim = data.shape[1] 69 | kernel_initializer = 'random_uniform' 70 | bias_initializer='zeros' 71 | activation_function_hidden = 'relu' 72 | activation_function_output = 'sigmoid' 73 | optimizer = 'adam' 74 | loss = 'binary_crossentropy' 75 | metrics = ['binary_accuracy'] 76 | number_of_layers = 5 77 | number_of_units_hidden = 30 78 | number_of_units_output = 1 79 | dropout_percentage = 0.25 80 | 81 | 82 | #Creating model 83 | ff_model = Sequential() 84 | ff_model.add(Dense(units = number_of_units_hidden, 85 | activation = activation_function_hidden, 86 | kernel_initializer = kernel_initializer, 87 | input_dim = input_dim)) 88 | 89 | for i in range(number_of_layers-1): 90 | #Inserting a dense hidden layer 91 | ff_model.add(Dense(units = number_of_units_hidden, 92 | activation = activation_function_hidden, 93 | kernel_initializer = kernel_initializer, 94 | input_dim = number_of_units_hidden)) 95 | #Inserting dropout 96 | ff_model.add(Dropout(dropout_percentage)) 97 | 98 | ff_model.add(Dense(units = number_of_units_output, 99 | activation = activation_function_output)) 100 | ff_model.compile(optimizer = optimizer, loss = loss, metrics = metrics) 101 | 102 | """In order to check the network topology, you can use the subsequent command.""" 103 | 104 | ff_model.summary() 105 | 106 | """Another option is to visualize the topology as a figure.""" 107 | 108 | #Saving the resultant figure as 'ff_model.png'. 
109 | plot_model(ff_model, to_file='ff_model.png', show_shapes=True, 110 | show_layer_names=True) 111 | 112 | """Next, we train the network.""" 113 | 114 | batch_size = 10 115 | epochs = 200 116 | ff_model.fit(training_data,training_target, batch_size = batch_size, 117 | epochs = epochs) 118 | 119 | """In order to create an application, it is possible to save the network and the respective trained weights as follows.""" 120 | 121 | #Saving the network model 122 | ff_model_json = ff_model.to_json() 123 | with open('ff_model.json', 'w') as file: 124 |     file.write(ff_model_json) 125 | 126 | #Saving weights 127 | ff_model.save_weights('ff_model.h5') 128 | 129 | """The following code can be employed to open a pre-trained model.""" 130 | 131 | with open('ff_model.json', 'r') as file: 132 |     ff_model_json = file.read() 133 | 134 | ff_model = model_from_json(ff_model_json) 135 | ff_model.load_weights('ff_model.h5') 136 | 137 | """Several analyses can be used to assess the quality of the results. Here, we consider only the accuracy.""" 138 | 139 | predictions = ff_model.predict(test_data) 140 | #Because it is a binary classification, we consider the values higher than 0.5 141 | #as being part of class 1, otherwise 0. 142 | predictions = (predictions > 0.5) 143 | accuracy = accuracy_score(test_target, predictions) 144 | print("Accuracy =", accuracy) 145 | 146 | 147 | """ 148 | ## License 149 | 150 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License. 151 | 152 | ## Acknowledgments 153 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos.
15/18942-8 and 18/09125-4) for financial support. This work has also been supported by FAPESP grants 11/50761-2 and 2015/22308-2. 154 | """ 155 | -------------------------------------------------------------------------------- /codes/lstm.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | """deepLearning_LSTM.ipynb 3 | 4 | Automatically generated by Colaboratory. 5 | 6 | Original file is located at 7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_LSTM.ipynb 8 | 9 | # Long Short-Term Memory (LSTM) 10 | 11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty. 12 | 13 | If you publish a paper related to this material, please cite: 14 | 15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019. 16 | 17 | This is the third example of deep learning implementation. Here we use an LSTM network to predict Bitcoin prices over time, taking a time series as input. 18 | 19 | 20 | First of all, we import the necessary libraries. Here we opt for Keras (with the TensorFlow backend). 
21 | """ 22 | 23 | import numpy as np 24 | import keras 25 | from keras.models import Sequential 26 | from keras.utils.vis_utils import plot_model 27 | from keras.layers import Dense, Dropout, LSTM 28 | from keras.callbacks import EarlyStopping, ReduceLROnPlateau, ModelCheckpoint 29 | from sklearn.model_selection import train_test_split 30 | from sklearn.metrics import accuracy_score 31 | from sklearn.preprocessing import MinMaxScaler 32 | import matplotlib.pyplot as plt 33 | import pandas as pd 34 | import pandas_datareader 35 | 36 | """If you have a GPU, you can use the following code to allocate processing to it. Otherwise, proceed to (*).""" 37 | 38 | import tensorflow as tf 39 | from keras import backend as K 40 | 41 | print(K.tensorflow_backend._get_available_gpus()) 42 | 43 | number_of_cpu_cores = 8 44 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) 45 | session = tf.Session(config=config) 46 | keras.backend.set_session(session) 47 | 48 | """(*) Here, we use the Bitcoin daily prices dataset, which is available at 49 | [Yahoo Finance](https://finance.yahoo.com/). The data contains seven columns, organized as follows: date, opening price, daily high price, daily low price, closing price, the currency volume traded on the day, and the adjusted closing price. 
50 | """ 51 | 52 | train_size = 1200 53 | start_date = '2015-01-01'# Bitcoin started on '2010-07-16' 54 | 55 | dataset = pandas_datareader.data.get_data_yahoo("BTC-USD", start = start_date) 56 | data_order = ['Open','High', 'Low', 'Close', 'Volume', 'Adj Close'] 57 | dataset = dataset[data_order] 58 | 59 | 60 | train_dataset = dataset.iloc[0:train_size, 1::].values 61 | test_dataset = dataset.iloc[train_size::, 1::].values 62 | 63 | min_max_scaler = MinMaxScaler(feature_range=(0,1)) 64 | normalized_train_dataset = min_max_scaler.fit_transform(train_dataset) 65 | 66 | min_max_scaler_train = MinMaxScaler(feature_range=(0,1)) 67 | normalized_train_price = min_max_scaler_train.fit_transform(train_dataset[:,0:1]) 68 | 69 | """In the following, we define the network topology.""" 70 | 71 | window_size = 50 72 | number_of_lstm_layers = 3 73 | activation = 'sigmoid' 74 | return_sequences = True 75 | units_first_layer = 100 76 | units = 50 77 | 78 | data = [] 79 | train_price = [] 80 | for i in range(window_size, train_size): 81 | data.append(normalized_train_dataset[i-window_size:i, 0:6]) 82 | train_price.append(normalized_train_dataset[i, 0]) 83 | data, train_price = np.array(data), np.array(train_price) 84 | 85 | lstm_model = Sequential() 86 | lstm_model.add(LSTM(units = units_first_layer, 87 | return_sequences = return_sequences, 88 | input_shape = (data.shape[1], 5))) 89 | lstm_model.add(Dropout(0.2)) 90 | 91 | for i in range(number_of_lstm_layers-2): 92 | lstm_model.add(LSTM(units = units, return_sequences = return_sequences)) 93 | lstm_model.add(Dropout(0.2)) 94 | 95 | lstm_model.add(LSTM(units = units)) 96 | lstm_model.add(Dropout(0.2)) 97 | 98 | #Output layer 99 | lstm_model.add(Dense(units = 1, activation = activation)) 100 | 101 | """In order to check the network topology, the subsequent command can be used.""" 102 | 103 | lstm_model.summary() 104 | #Saving the resultant figure as 'lstm_model.png'. 
105 | plot_model(lstm_model, to_file='lstm_model.png', show_shapes=True, 106 | show_layer_names=True) 107 | 108 | """The training step is executed as follows.""" 109 | 110 | #Here we set verbose as true 111 | verbose = 1 112 | 113 | batch_size = 32 114 | epochs = 10 115 | filepath = 'weights.h5' #name of the file with the network weights 116 | monitor = 'loss' 117 | optimizer = 'adam' 118 | loss = 'mean_squared_error' 119 | metrics = ['mean_absolute_error'] 120 | 121 | lstm_model.compile(optimizer = optimizer, loss = loss, metrics = metrics) 122 | 123 | early_stopping = EarlyStopping(monitor = monitor, min_delta = 1e-15, 124 | patience = 10, verbose = verbose) 125 | reduce_learning_rate_on_plateau = ReduceLROnPlateau(monitor = monitor, 126 | factor = 0.2, patience = 5, 127 | verbose = verbose) 128 | model_checkpoint = ModelCheckpoint(filepath = filepath, monitor = monitor, 129 | save_best_only = True, verbose = verbose) 130 | lstm_model.fit(data, train_price, epochs = epochs, batch_size = batch_size, 131 | callbacks = [early_stopping, reduce_learning_rate_on_plateau, 132 | model_checkpoint]) 133 | 134 | """The following code verifies the data in the network.""" 135 | 136 | test_price = test_dataset[:, 0:1] 137 | complete_dataset = dataset.iloc[:,1::] 138 | 139 | train_data = complete_dataset[len(complete_dataset) - len(test_dataset) - 140 | window_size:].values 141 | train_data = min_max_scaler.transform(train_data) 142 | 143 | 144 | X_test = [] 145 | for i in range(window_size,len(train_data)): 146 | X_test.append(train_data[i-window_size:i, 0:6]) 147 | X_test = np.array(X_test) 148 | 149 | calculated_prices = lstm_model.predict(X_test) 150 | calculated_prices = min_max_scaler_train.inverse_transform(calculated_prices) 151 | 152 | train_price = min_max_scaler_train.inverse_transform([train_price]) 153 | train_price = train_price[0] 154 | 155 | """In the following, we plot the train set, as well as the prediction and the expected values.""" 156 | 157 | 
plt.plot(np.linspace(0,len(train_price)-1,len(train_price)), 158 | train_price, label = 'Real price (train)', color = 'k') 159 | plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1, 160 | len(test_price)), test_price, label = 'Real price (test)') 161 | plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1, 162 | len(test_price)), calculated_prices, label = 'Prediction') 163 | plt.title('BTC prediction') 164 | plt.xlabel('Time') 165 | plt.ylabel('Value') 166 | plt.legend() 167 | plt.show() 168 | 169 | """ 170 | ## License 171 | 172 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License. 173 | 174 | ## Acknowledgments 175 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has also been supported by FAPESP grants 11/50761-2 and 2015/22308-2. 176 | """ 177 | -------------------------------------------------------------------------------- /codes/gan.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | """deepLearning_GAN.ipynb 3 | 4 | Automatically generated by Colaboratory. 5 | 6 | Original file is located at 7 | https://colab.research.google.com/drive/1CbRNDN25uaN2WCeyklwPkbZZkXM_IDge 8 | 9 | # Generative Adversarial Networks 10 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. 
This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty. 11 | 12 | If you publish a paper related to this material, please cite: 13 | 14 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019. 15 | 16 | This example creates a network that can generate handwritten digits automatically. 17 | 18 | 19 | First of all, we import the necessary libraries. Here we opt for Keras (with the TensorFlow backend). 20 | """ 21 | 22 | import numpy as np 23 | import pandas as pd 24 | import keras 25 | from keras.models import Sequential, model_from_json 26 | from keras.utils.vis_utils import plot_model 27 | from keras.datasets import mnist 28 | from keras.layers import InputLayer, Dense, Flatten, Reshape, Input, Dropout 29 | from keras.layers.advanced_activations import LeakyReLU 30 | from keras.layers import BatchNormalization 31 | from keras.models import Model,Sequential 32 | from keras.regularizers import L1L2 33 | from sklearn.model_selection import train_test_split 34 | from sklearn.metrics import accuracy_score 35 | from sklearn.preprocessing import MinMaxScaler 36 | import matplotlib.pyplot as plt 37 | import cv2 38 | 39 | """If you have a GPU, you can use the following code to allocate processing to it. Otherwise, proceed to (*).""" 40 | 41 | import tensorflow as tf 42 | from keras import backend as K 43 | 44 | print(K.tensorflow_backend._get_available_gpus()) 45 | 46 | number_of_cpu_cores = 8 47 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) 48 | session = tf.Session(config=config) 49 | keras.backend.set_session(session) 50 | 51 | """(*) In this example, we use the MNIST database, which is composed of grayscale images of the 10 handwritten digits. It is available in the Keras library at [keras-datasets](https://keras.io/datasets/). 
52 | 53 | The following command is used to load the data set. 54 | """ 55 | 56 | (train_data_raw, train_target_raw), (_, _) = mnist.load_data() 57 | 58 | """Because this code consumes too much of processing time, here we considered only the zeros and ones.""" 59 | 60 | train_data = [img for i, img in enumerate(train_data_raw) 61 | if train_target_raw[i] == 0 or train_target_raw[i] == 1] 62 | train_data = np.array(train_data) 63 | 64 | """In order to visualize a given figure, the following code can be executed.""" 65 | 66 | image_id = 1000 67 | plt.figure(figsize = (1,1)) 68 | plt.imshow(train_data[image_id], cmap='gray') 69 | plt.title("Test image: " + str(image_id)) 70 | #plt.axis('off') 71 | plt.show() 72 | 73 | """Definition of the used variables.""" 74 | 75 | input_shape = train_data.shape[1::] 76 | activation_output_generator = 'sigmoid' 77 | activation_output_discrimninator = 'sigmoid' 78 | input_dim = 50 79 | number_of_epochs = 1000 80 | batch_size = 100 81 | train_data = train_data.astype('float32') / 255 82 | 83 | """In the following, we present the generator model.""" 84 | 85 | generator_model = Sequential() 86 | 87 | generator_model.add(Dense(units=64,input_dim = input_dim, 88 | kernel_regularizer = L1L2(1e-5, 1e-5))) 89 | generator_model.add(BatchNormalization()) 90 | generator_model.add(LeakyReLU(alpha=0.3)) 91 | 92 | generator_model.add(Dense(units=128, kernel_regularizer = L1L2(1e-5, 1e-5))) 93 | generator_model.add(BatchNormalization()) 94 | generator_model.add(LeakyReLU(alpha=0.3)) 95 | 96 | generator_model.add(Dense(units=256, kernel_regularizer = L1L2(1e-5, 1e-5))) 97 | generator_model.add(BatchNormalization()) 98 | generator_model.add(LeakyReLU(alpha=0.3)) 99 | 100 | generator_model.add(Dense(units = input_shape[0] * input_shape[1], 101 | activation = activation_output_generator)) 102 | 103 | generator_model.add(Reshape(input_shape)) 104 | 105 | generator_model.compile(loss='binary_crossentropy', optimizer="adam") 106 | 107 | """The summary of 
the generator model is shown by employing the following code.""" 108 | 109 | generator_model.summary() 110 | 111 | """The following code represents the discriminator model.""" 112 | 113 | discriminator_model = Sequential() 114 | discriminator_model.add(InputLayer(input_shape = input_shape)) 115 | discriminator_model.add(Flatten()) 116 | 117 | discriminator_model.add(Dense(units=256,kernel_regularizer = L1L2(1e-5, 1e-5))) 118 | discriminator_model.add(LeakyReLU(alpha=0.3)) 119 | discriminator_model.add(Dropout(0.2)) 120 | 121 | 122 | discriminator_model.add(Dense(units=128,kernel_regularizer = L1L2(1e-5, 1e-5))) 123 | discriminator_model.add(LeakyReLU(alpha=0.3)) 124 | discriminator_model.add(Dropout(0.2)) 125 | 126 | discriminator_model.add(Dense(units=64,kernel_regularizer = L1L2(1e-5, 1e-5))) 127 | discriminator_model.add(LeakyReLU(alpha=0.3)) 128 | 129 | discriminator_model.add(Dense(units=1, 130 | activation = activation_output_discrimninator)) 131 | 132 | discriminator_model.compile(loss='binary_crossentropy', 133 | optimizer = "adam") 134 | 135 | """The summary of the discriminator model is shown by using the following code.""" 136 | 137 | discriminator_model.summary() 138 | 139 | """The following code incorporates the complete gan model.""" 140 | 141 | gan_input = Input(shape = (input_dim,)) 142 | gan_output= discriminator_model(generator_model(gan_input)) 143 | gan = Model(inputs = gan_input, outputs = gan_output) 144 | gan.compile(loss = 'binary_crossentropy', optimizer = 'adam') 145 | 146 | """The summary of the gan model is shown by using the following code.""" 147 | 148 | gan.summary() 149 | 150 | """Next, we train the GAN.""" 151 | 152 | y = np.ones(batch_size) 153 | 154 | #Parameters of the noise distribution 155 | mu = 0 156 | sigma = 1 157 | 158 | #We created this array to avoid number repetitions 159 | train_indices = np.arange(train_data.shape[0]) 160 | np.random.shuffle(train_indices) 161 | 162 | #Here we define the labels used to train the gan 
163 | train_labels = np.zeros(2*batch_size,dtype = int) 164 | train_labels[0:batch_size] = 1#real images (they come first in train_images); generated images keep label 0 165 | 166 | for epoch in range(number_of_epochs): 167 | print("\rEpoch:", epoch + 1, "of", number_of_epochs, end = '') 168 | for _ in range(batch_size): 169 | input_noise = np.random.normal(loc = mu, scale = sigma, 170 | size = [batch_size, input_dim]) 171 | generated_images = generator_model.predict(input_noise) 172 | np.random.shuffle(train_indices) 173 | image_batch = train_data[train_indices[0:batch_size]] 174 | train_images = np.concatenate((image_batch, generated_images)) 175 | #Training the discriminator 176 | discriminator_model.trainable = True 177 | discriminator_model.train_on_batch(train_images, train_labels) 178 | #Training the gan 179 | discriminator_model.trainable = False 180 | train_noise = np.random.normal(loc = mu, scale = sigma, 181 | size = [batch_size, input_dim]) 182 | gan.train_on_batch(train_noise, y) 183 | 184 | #In order to visualize the training progress, we employ the following code. 
185 | if epoch % 100 == 0: 186 | n_examples = 10 187 | scale_image = 1 * n_examples 188 | noise= np.random.normal(loc = mu, scale = sigma, 189 | size = (n_examples, input_dim)) 190 | generated_images = generator_model.predict(noise) 191 | n_pixels = generated_images.shape[1] 192 | n_pixels_col = int(np.sqrt(n_pixels)) 193 | fig, axes = plt.subplots(1,n_examples, 194 | figsize = (scale_image, 195 | scale_image * n_examples)) 196 | for i in range(generated_images.shape[0]): 197 | axes[i].imshow(generated_images[i], cmap = "gray") 198 | axes[i].axis('off') 199 | plt.show() 200 | print("") 201 | 202 | """In order to generate the figures, the following code can be employed.""" 203 | 204 | n_examples = 5 205 | scale_image = 5 206 | noise= np.random.normal(loc = mu, scale = sigma, size = (n_examples, input_dim)) 207 | generated_images = generator_model.predict(noise) 208 | 209 | fig, axes = plt.subplots(1,n_examples, 210 | figsize = (scale_image, scale_image * n_examples)) 211 | for i in range(generated_images.shape[0]): 212 | axes[i].imshow(generated_images[i], cmap = "gray") 213 | axes[i].axis('off') 214 | 215 | plt.show() 216 | 217 | """ 218 | ## License 219 | 220 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License. 221 | 222 | ## Acknowledgments 223 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has also been supported by FAPESP grants 11/50761-2 and 2015/22308-2. 
224 | """ 225 | -------------------------------------------------------------------------------- /codes/autoencoder.py: -------------------------------------------------------------------------------- 1 | # -*- coding: utf-8 -*- 2 | """deepLearning_autoencoder.ipynb 3 | 4 | Automatically generated by Colaboratory. 5 | 6 | Original file is located at 7 | https://colab.research.google.com/github/hfarruda/deeplearningtutorial/blob/master/deepLearning_autoencoder.ipynb 8 | 9 | # Autoencoders 10 | 11 | This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data and/or applications, which will require modifications in the structure and parameters. This code has absolutely no warranty. 12 | 13 | If you publish a paper related to this material, please cite: 14 | 15 | H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, "Learning Deep Learning (CDT-15)," 2019. 16 | 17 | This example uses an autoencoder to illustrate a possible application concerning image clustering. Here we show how to use the resulting codes to reduce the dimensionality. We also project our data by using Principal Component Analysis (PCA). 18 | 19 | First of all, we import the necessary libraries. Here we opt for Keras (with the TensorFlow backend). 
20 | """ 21 | 22 | import numpy as np 23 | import matplotlib.pyplot as plt 24 | import pandas as pd 25 | import keras 26 | from keras.models import Sequential, model_from_json, Model 27 | from keras.utils import np_utils 28 | from keras.utils.vis_utils import plot_model 29 | from keras.datasets import fashion_mnist 30 | from keras.callbacks import EarlyStopping, ReduceLROnPlateau, ModelCheckpoint 31 | from sklearn.model_selection import train_test_split 32 | from sklearn.metrics import accuracy_score 33 | from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Reshape 34 | from keras.layers import UpSampling2D 35 | from sklearn.preprocessing import MinMaxScaler 36 | import sklearn.decomposition 37 | from sklearn.preprocessing import StandardScaler 38 | 39 | """If you have a GPU, you can use the following code to allocate processing into it. Otherwise, proceed to (*).""" 40 | 41 | import tensorflow as tf 42 | from keras import backend as K 43 | 44 | print(K.tensorflow_backend._get_available_gpus()) 45 | 46 | number_of_cpu_cores = 8 47 | config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) 48 | session = tf.Session(config=config) 49 | keras.backend.set_session(session) 50 | 51 | """(*) In this example, we used the Fashion-MNIST database, composed by grayscale images of 10 categories of fashion items (trouser, pullover, dress, coat, sandal, shirt, sneaker, bag, and ankle boot). It is available at Keras library on [keras-datasets](https://keras.io/datasets/).""" 52 | 53 | (train_data, train_target), (test_data, test_target) = fashion_mnist.load_data() 54 | 55 | train_target_one_hot_encoding = np_utils.to_categorical(train_target) 56 | 57 | #Divide by the maximun value of a pixel (255) to have the values between 0 and 1 58 | train_data = train_data.astype('float32') / 255. 59 | test_data = test_data.astype('float32') / 255. 60 | 61 | """For the sake of simplicity, we add zeros to the images to have shape 32x32. 
Because 32 is a power of 2 it is easier to configure the decoder layer.""" 62 | 63 | train_data_auxiliar = [] 64 | for data in train_data: 65 | new_image = np.zeros((32,32)) 66 | new_image[2:data.shape[0]+2, 2:data.shape[1]+2] = data 67 | train_data_auxiliar.append(new_image) 68 | 69 | test_data_auxiliar = [] 70 | for data in test_data: 71 | new_image = np.zeros((32,32)) 72 | new_image[2:data.shape[0]+2, 2:data.shape[1]+2] = data 73 | test_data_auxiliar.append(new_image) 74 | 75 | train_data = np.array(train_data_auxiliar) 76 | test_data = np.array(test_data_auxiliar) 77 | 78 | train_data = train_data.reshape(train_data.shape[0], train_data.shape[1], 79 | train_data.shape[2], 1) 80 | test_data = test_data.reshape(test_data.shape[0], test_data.shape[1], 81 | test_data.shape[2], 1) 82 | 83 | """In order to visualize a given figure, the following code can be executed.""" 84 | 85 | image_id = 700 86 | image = test_data[image_id] 87 | image = image[:,:,0] 88 | plt.imshow(image, cmap = 'gray') 89 | plt.title("Test image: " + str(image_id)) 90 | plt.show() 91 | 92 | """In the following, we define the network topology. Similar to what was adopted for the CNN case, here we do not employ dropout after the convolutional layers. Because this network demands a high computational power, the variable epochs can receive a smaller number (e.g., 5). However, in this case, the resulting accuracy tends to be much lower. 93 | 94 | First, we define some necessary variables. 
95 | """ 96 | 97 | input_shape = train_data.shape[1::] 98 | #if len(input_shape) == 2: 99 | # input_shape = (input_shape[0], input_shape[1], 1) 100 | filters_first_layer = 64 101 | filters = 32 102 | kernel_size = (3,3) 103 | pool_size = (2,2) 104 | 105 | activation = 'relu' 106 | activation_function_output = 'sigmoid' #the output should be between 0 and 1 107 | number_of_cnn_layers = 2 108 | number_of_units_output = train_target_one_hot_encoding.shape[1] 109 | padding = 'same' 110 | strides = (2,2) 111 | 112 | optimizer = 'adam' 113 | loss = 'binary_crossentropy' 114 | metrics = ['accuracy'] 115 | epochs = 50 116 | batch_size = 128 117 | 118 | #Network model 119 | autoencoder_model = Sequential() 120 | 121 | """We configure the encoder layers. Normally, for images the autoencoder is represented by a 2D matrix, but here we 122 | adopt flattening in order to be able to plot the respective PCA projection. 123 | """ 124 | 125 | autoencoder_model.add(Conv2D(filters = filters_first_layer, 126 | kernel_size = kernel_size, 127 | input_shape = input_shape, 128 | activation = activation, padding = padding )) 129 | 130 | autoencoder_model.add(MaxPooling2D(pool_size = pool_size, padding = padding)) 131 | 132 | 133 | for i in range(number_of_cnn_layers-1): 134 | autoencoder_model.add(Conv2D(filters = filters, kernel_size = kernel_size, 135 | activation = activation, padding = padding, 136 | strides = strides)) 137 | autoencoder_model.add(MaxPooling2D(pool_size = pool_size, 138 | padding = padding)) 139 | 140 | 141 | #This is the coding 142 | autoencoder_model.add(Flatten()) 143 | flatten_layer_name = autoencoder_model.output_names[0] 144 | 145 | """Here, we define the decoder.""" 146 | 147 | #First we define the input size 148 | output_len = autoencoder_model.output_shape[1] 149 | height = int(np.sqrt(output_len/filters)) 150 | 151 | #Find the shape of the decoder input 152 | autoencoder_model.add(Reshape((height, height, filters))) 153 | 154 | for i in 
range(number_of_cnn_layers): 155 | autoencoder_model.add(Conv2D(filters = filters, kernel_size = kernel_size, 156 | activation = activation, padding = padding)) 157 | autoencoder_model.add(UpSampling2D(size = pool_size)) 158 | 159 | autoencoder_model.add(Conv2D(filters = filters_first_layer, 160 | kernel_size = kernel_size, 161 | activation = activation, padding = padding)) 162 | autoencoder_model.add(UpSampling2D(size = pool_size)) 163 | autoencoder_model.add(Conv2D(filters = 1, kernel_size = kernel_size, 164 | activation = activation_function_output, 165 | padding = padding)) 166 | 167 | """We can use the following command to see the network topology.""" 168 | 169 | autoencoder_model.summary() 170 | #Saving the resultant figure as 'autoencoder_model.png'. 171 | plot_model(autoencoder_model, to_file='autoencoder_model.png', show_shapes=True, 172 | show_layer_names=True) 173 | 174 | """The entire configuration is then used to train the coding and decoding.""" 175 | 176 | autoencoder_model.compile(optimizer = optimizer, loss = loss, metrics = metrics) 177 | autoencoder_model.fit(train_data, train_data, epochs = epochs, 178 | batch_size = batch_size) 179 | 180 | """The following code shows how to use the already trained coding.""" 181 | 182 | output_model = autoencoder_model.get_layer(flatten_layer_name).output 183 | encoder = Model(inputs = autoencoder_model.input, 184 | outputs = output_model) 185 | encoder.summary() 186 | 187 | """The following code is used to compute the codings.""" 188 | 189 | codings = encoder.predict(test_data) 190 | 191 | """By employing the codings and the known classes, we plot a PCA (principal component analysis) of the test data.""" 192 | 193 | X = codings.copy() 194 | targets = test_target 195 | 196 | #Standardization 197 | X = StandardScaler().fit_transform(X) 198 | decomposition = sklearn.decomposition.PCA(n_components=2) 199 | pca = decomposition.fit(X) 200 | transform = pca.transform(X) 201 | 202 | plt.figure(figsize = (6,4)) 203 | 
classes = [] 204 | for target in set(targets): 205 | classes.append(target) 206 | pos = np.argwhere(targets == target).T[0] 207 | plt.scatter([transform[pos,0]],[transform[pos,1]], alpha = 0.3) 208 | 209 | 210 | label = "PC1 ({:1.2f}%)".format(pca.explained_variance_ratio_[0]*100) 211 | plt.xlabel(label) 212 | label = "PC2 ({:1.2f}%)".format(pca.explained_variance_ratio_[1]*100) 213 | plt.ylabel(label) 214 | 215 | plt.margins(0.05,0.05) 216 | plt.legend(classes, loc = 'best') 217 | plt.tight_layout() 218 | 219 | plt.show() 220 | 221 | """ 222 | ## License 223 | 224 | This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0)International License. 225 | 226 | ## Acknowledgments 227 | Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 15/18942-8 and 18/09125-4) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 2015/22308-2. 
228 | """ 229 | -------------------------------------------------------------------------------- /libraries.txt: -------------------------------------------------------------------------------- 1 | All of these codes were developed and executed in Python 3, with the following libraries: 2 | 3 | Package Version 4 | ------------------------ --------------------- 5 | absl-py 0.7.1 6 | alabaster 0.7.12 7 | albumentations 0.1.12 8 | altair 3.2.0 9 | astor 0.8.0 10 | astropy 3.0.5 11 | atari-py 0.1.15 12 | atomicwrites 1.3.0 13 | attrs 19.1.0 14 | audioread 2.1.8 15 | autograd 1.3 16 | Babel 2.7.0 17 | backcall 0.1.0 18 | backports.tempfile 1.0 19 | backports.weakref 1.0.post1 20 | beautifulsoup4 4.6.3 21 | bleach 3.1.0 22 | blis 0.2.4 23 | bokeh 1.0.4 24 | boto 2.49.0 25 | boto3 1.9.216 26 | botocore 1.12.216 27 | Bottleneck 1.2.1 28 | branca 0.3.1 29 | bs4 0.0.1 30 | bz2file 0.98 31 | cachetools 3.1.1 32 | certifi 2019.6.16 33 | cffi 1.12.3 34 | chainer 5.4.0 35 | chardet 3.0.4 36 | Click 7.0 37 | cloudpickle 0.6.1 38 | cmake 3.12.0 39 | colorlover 0.3.0 40 | community 1.0.0b1 41 | contextlib2 0.5.5 42 | convertdate 2.1.3 43 | coverage 3.7.1 44 | coveralls 0.5 45 | crcmod 1.7 46 | cufflinks 0.14.6 47 | cvxopt 1.2.3 48 | cvxpy 1.0.25 49 | cycler 0.10.0 50 | cymem 2.0.2 51 | Cython 0.29.13 52 | daft 0.0.4 53 | dask 1.1.5 54 | dataclasses 0.6 55 | datascience 0.10.6 56 | decorator 4.4.0 57 | defusedxml 0.6.0 58 | descartes 1.1.0 59 | dill 0.3.0 60 | distributed 1.25.3 61 | Django 2.2.4 62 | dlib 19.16.0 63 | dm-sonnet 1.34 64 | docopt 0.6.2 65 | docutils 0.15.2 66 | dopamine-rl 1.0.5 67 | easydict 1.9 68 | ecos 2.0.7.post1 69 | editdistance 0.5.3 70 | en-core-web-sm 2.1.0 71 | entrypoints 0.3 72 | ephem 3.7.7.0 73 | et-xmlfile 1.0.1 74 | fa2 0.3.5 75 | fancyimpute 0.4.3 76 | fastai 1.0.57 77 | fastcache 1.1.0 78 | fastdtw 0.3.2 79 | fastprogress 0.1.21 80 | fastrlock 0.4 81 | fbprophet 0.5 82 | feather-format 0.4.0 83 | featuretools 0.4.1 84 | filelock 3.0.12 85 | 
fix-yahoo-finance 0.0.22 86 | Flask 1.1.1 87 | folium 0.8.3 88 | fsspec 0.4.1 89 | future 0.16.0 90 | gast 0.2.2 91 | GDAL 2.2.2 92 | gdown 3.6.4 93 | gensim 3.6.0 94 | geographiclib 1.49 95 | geopy 1.17.0 96 | gevent 1.4.0 97 | gin-config 0.2.0 98 | glob2 0.7 99 | google 2.0.2 100 | google-api-core 1.14.2 101 | google-api-python-client 1.7.11 102 | google-auth 1.4.2 103 | google-auth-httplib2 0.0.3 104 | google-auth-oauthlib 0.4.0 105 | google-cloud-bigquery 1.14.0 106 | google-cloud-core 1.0.3 107 | google-cloud-datastore 1.8.0 108 | google-cloud-language 1.2.0 109 | google-cloud-storage 1.16.1 110 | google-cloud-translate 1.5.0 111 | google-colab 1.0.0 112 | google-pasta 0.1.7 113 | google-resumable-media 0.3.3 114 | googleapis-common-protos 1.6.0 115 | googledrivedownloader 0.4 116 | graph-nets 1.0.4 117 | graphviz 0.10.1 118 | greenlet 0.4.15 119 | grpcio 1.15.0 120 | gspread 3.0.1 121 | gspread-dataframe 3.0.3 122 | gunicorn 19.9.0 123 | gym 0.10.11 124 | h5py 2.8.0 125 | HeapDict 1.0.0 126 | holidays 0.9.11 127 | html5lib 1.0.1 128 | httpimport 0.5.16 129 | httplib2 0.11.3 130 | humanize 0.5.1 131 | hyperopt 0.1.2 132 | ideep4py 2.0.0.post3 133 | idna 2.8 134 | image 1.5.27 135 | imageio 2.4.1 136 | imagesize 1.1.0 137 | imbalanced-learn 0.4.3 138 | imblearn 0.0 139 | imgaug 0.2.9 140 | importlib-metadata 0.19 141 | imutils 0.5.3 142 | inflect 2.1.0 143 | intel-openmp 2019.0 144 | intervaltree 2.1.0 145 | ipykernel 4.6.1 146 | ipython 5.5.0 147 | ipython-genutils 0.2.0 148 | ipython-sql 0.3.9 149 | ipywidgets 7.5.1 150 | itsdangerous 1.1.0 151 | jax 0.1.43 152 | jaxlib 0.1.26 153 | jdcal 1.4.1 154 | jedi 0.15.1 155 | jieba 0.39 156 | Jinja2 2.10.1 157 | jmespath 0.9.4 158 | joblib 0.13.2 159 | jpeg4py 0.1.4 160 | jsonschema 2.6.0 161 | jupyter 1.0.0 162 | jupyter-client 5.3.1 163 | jupyter-console 5.2.0 164 | jupyter-core 4.5.0 165 | kaggle 1.5.5 166 | kapre 0.1.3.1 167 | Keras 2.2.5 168 | Keras-Applications 1.0.8 169 | Keras-Preprocessing 1.1.0 170 | 
keras-vis 0.4.1 171 | kiwisolver 1.1.0 172 | knnimpute 0.1.0 173 | librosa 0.6.3 174 | lightgbm 2.2.3 175 | llvmlite 0.29.0 176 | lmdb 0.97 177 | lucid 0.3.8 178 | lunardate 0.2.0 179 | lxml 4.2.6 180 | magenta 0.3.19 181 | Markdown 3.1.1 182 | MarkupSafe 1.1.1 183 | matplotlib 3.0.3 184 | matplotlib-venn 0.11.5 185 | mesh-tensorflow 0.0.5 186 | mido 1.2.6 187 | mir-eval 0.5 188 | missingno 0.4.2 189 | mistune 0.8.4 190 | mizani 0.5.4 191 | mkl 2019.0 192 | mlxtend 0.14.0 193 | more-itertools 7.2.0 194 | moviepy 0.2.3.5 195 | mpi4py 3.0.2 196 | mpmath 1.1.0 197 | msgpack 0.5.6 198 | multiprocess 0.70.8 199 | multitasking 0.0.9 200 | murmurhash 1.0.2 201 | music21 5.5.0 202 | natsort 5.5.0 203 | nbconvert 5.6.0 204 | nbformat 4.4.0 205 | networkx 2.3 206 | nibabel 2.3.3 207 | nltk 3.2.5 208 | nose 1.3.7 209 | notebook 5.2.2 210 | np-utils 0.5.11.1 211 | numba 0.40.1 212 | numexpr 2.7.0 213 | numpy 1.16.4 214 | nvidia-ml-py3 7.352.0 215 | oauth2client 4.1.3 216 | oauthlib 3.1.0 217 | okgrade 0.4.3 218 | olefile 0.46 219 | opencv-contrib-python 3.4.3.18 220 | opencv-python 3.4.5.20 221 | openpyxl 2.5.9 222 | opt-einsum 3.0.1 223 | osqp 0.5.0 224 | packaging 19.1 225 | palettable 3.2.0 226 | pandas 0.24.2 227 | pandas-datareader 0.7.4 228 | pandas-gbq 0.4.1 229 | pandas-profiling 1.4.1 230 | pandocfilters 1.4.2 231 | parso 0.5.1 232 | pathlib 1.0.1 233 | patsy 0.5.1 234 | pexpect 4.7.0 235 | pickleshare 0.7.5 236 | Pillow 4.3.0 237 | pip 19.2.3 238 | pip-tools 3.9.0 239 | plac 0.9.6 240 | plotly 3.6.1 241 | plotnine 0.5.1 242 | pluggy 0.7.1 243 | portpicker 1.2.0 244 | prefetch-generator 1.0.1 245 | preshed 2.0.1 246 | pretty-midi 0.2.8 247 | prettytable 0.7.2 248 | progressbar2 3.38.0 249 | prometheus-client 0.7.1 250 | promise 2.2.1 251 | prompt-toolkit 1.0.16 252 | protobuf 3.7.1 253 | psutil 5.4.8 254 | psycopg2 2.7.6.1 255 | ptyprocess 0.6.0 256 | py 1.8.0 257 | pyarrow 0.14.1 258 | pyasn1 0.4.6 259 | pyasn1-modules 0.2.6 260 | pycocotools 2.0.0 261 | pycparser 
2.19 262 | pydot 1.3.0 263 | pydot-ng 2.0.0 264 | pydotplus 2.0.2 265 | pyemd 0.5.1 266 | pyglet 1.4.2 267 | Pygments 2.1.3 268 | pygobject 3.26.1 269 | pymc3 3.7 270 | pymongo 3.9.0 271 | pymystem3 0.2.0 272 | PyOpenGL 3.1.0 273 | pyparsing 2.4.2 274 | pyrsistent 0.15.4 275 | pysndfile 1.3.7 276 | PySocks 1.7.0 277 | pystan 2.19.0.0 278 | pytest 3.6.4 279 | python-apt 1.6.4 280 | python-chess 0.23.11 281 | python-dateutil 2.5.3 282 | python-louvain 0.13 283 | python-rtmidi 1.3.0 284 | python-slugify 3.0.3 285 | python-utils 2.3.0 286 | pytz 2018.9 287 | PyWavelets 1.0.3 288 | PyYAML 3.13 289 | pyzmq 17.0.0 290 | qtconsole 4.5.4 291 | requests 2.21.0 292 | requests-oauthlib 1.2.0 293 | resampy 0.2.2 294 | retrying 1.3.3 295 | rpy2 2.9.5 296 | rsa 4.0 297 | s3fs 0.3.3 298 | s3transfer 0.2.1 299 | scikit-image 0.15.0 300 | scikit-learn 0.21.3 301 | scipy 1.3.1 302 | screen-resolution-extra 0.0.0 303 | scs 2.1.1.post2 304 | seaborn 0.9.0 305 | semantic-version 2.6.0 306 | Send2Trash 1.5.0 307 | setuptools 41.2.0 308 | setuptools-git 1.2 309 | Shapely 1.6.4.post2 310 | simplegeneric 0.8.1 311 | six 1.12.0 312 | sklearn 0.0 313 | sklearn-pandas 1.8.0 314 | smart-open 1.8.4 315 | snowballstemmer 1.9.0 316 | sortedcontainers 2.1.0 317 | spacy 2.1.8 318 | Sphinx 1.8.5 319 | sphinxcontrib-websupport 1.1.2 320 | SQLAlchemy 1.3.7 321 | sqlparse 0.3.0 322 | srsly 0.1.0 323 | stable-baselines 2.2.1 324 | statsmodels 0.10.1 325 | sympy 1.1.1 326 | tables 3.4.4 327 | tabulate 0.8.3 328 | tblib 1.4.0 329 | tensor2tensor 1.11.0 330 | tensorboard 1.14.0 331 | tensorboardcolab 0.0.22 332 | tensorflow 1.14.0 333 | tensorflow-estimator 1.14.0 334 | tensorflow-hub 0.5.0 335 | tensorflow-metadata 0.14.0 336 | tensorflow-probability 0.7.0 337 | termcolor 1.1.0 338 | terminado 0.8.2 339 | testpath 0.4.2 340 | text-unidecode 1.2 341 | textblob 0.15.3 342 | textgenrnn 1.4.1 343 | tfds-nightly 1.2.0.dev201908260105 344 | tflearn 0.3.2 345 | Theano 1.0.4 346 | thinc 7.0.8 347 | toolz 0.10.0 
348 | torch 1.1.0 349 | torchsummary 1.5.1 350 | torchtext 0.3.1 351 | torchvision 0.3.0 352 | tornado 4.5.3 353 | tqdm 4.28.1 354 | traitlets 4.3.2 355 | tweepy 3.6.0 356 | typing 3.7.4.1 357 | tzlocal 1.5.1 358 | umap-learn 0.3.10 359 | uritemplate 3.0.0 360 | urllib3 1.24.3 361 | vega-datasets 0.7.0 362 | wasabi 0.2.2 363 | wcwidth 0.1.7 364 | webencodings 0.5.1 365 | Werkzeug 0.15.5 366 | wheel 0.33.6 367 | widgetsnbextension 3.5.1 368 | wordcloud 1.5.0 369 | wrapt 1.11.2 370 | xarray 0.11.3 371 | xgboost 0.90 372 | xkit 0.0.0 373 | xlrd 1.1.0 374 | xlwt 1.3.0 375 | yellowbrick 0.9.1 376 | zict 1.0.0 377 | zipp 0.6.0 378 | zmq 0.0.0 379 | -------------------------------------------------------------------------------- /deepLearning_LSTM.ipynb: -------------------------------------------------------------------------------- 1 | { 2 | "nbformat": 4, 3 | "nbformat_minor": 0, 4 | "metadata": { 5 | "colab": { 6 | "name": "deepLearning_LSTM.ipynb", 7 | "provenance": [], 8 | "collapsed_sections": [], 9 | "toc_visible": true, 10 | "include_colab_link": true 11 | }, 12 | "kernelspec": { 13 | "name": "python3", 14 | "display_name": "Python 3" 15 | }, 16 | "accelerator": "GPU" 17 | }, 18 | "cells": [ 19 | { 20 | "cell_type": "markdown", 21 | "metadata": { 22 | "id": "view-in-github", 23 | "colab_type": "text" 24 | }, 25 | "source": [ 26 | "\"Open" 27 | ] 28 | }, 29 | { 30 | "cell_type": "markdown", 31 | "metadata": { 32 | "id": "dTQnbI5ALRnR" 33 | }, 34 | "source": [ 35 | "# Long Short-Term Memory (LSTM)\n", 36 | "\n", 37 | "This example is part of the [*Deep Learning Tutorial*](https://github.com/hfarruda/deeplearningtutorial), authored by Henrique F. de Arruda, Alexandre Benatti, César Comin, and Luciano da Fontoura Costa. This code is not suitable for other data or applications without modifications to the structure and parameters. 
This code has absolutely no warranty.\n", 38 | "\n", 39 | "If you publish a paper related to this material, please cite:\n", 40 | "\n", 41 | "H. F. de Arruda, A. Benatti, C. H. Comin, L. da F. Costa, \"Learning Deep Learning (CDT-15),\" 2019.\n", 42 | "\n", 43 | "This is the third example of a deep learning implementation. Here we use an LSTM network to predict Bitcoin prices over time, treating the input as a time series.\n", 44 | "\n", 45 | "\n", 46 | "First of all, we import the necessary libraries. Here we use Keras with the TensorFlow backend." 47 | ] 48 | }, 49 | { 50 | "cell_type": "code", 51 | "source": [ 52 | "!pip install yfinance" 53 | ], 54 | "metadata": { 55 | "id": "rV_VpYzoxiji" 56 | }, 57 | "execution_count": null, 58 | "outputs": [] 59 | }, 60 | { 61 | "cell_type": "code", 62 | "metadata": { 63 | "id": "Z_tGqT1mKu_4", 64 | "colab": { 65 | "base_uri": "https://localhost:8080/" 66 | }, 67 | "outputId": "f83cedc2-f626-4df5-d333-6a875972154e" 68 | }, 69 | "source": [ 70 | "%tensorflow_version 1.x\n", 71 | "import numpy as np\n", 72 | "import keras\n", 73 | "from keras.models import Sequential\n", 74 | "from keras.utils.vis_utils import plot_model\n", 75 | "from keras.layers import Dense, Dropout, LSTM\n", 76 | "from keras.callbacks import EarlyStopping, ReduceLROnPlateau, ModelCheckpoint\n", 77 | "from sklearn.model_selection import train_test_split\n", 78 | "from sklearn.metrics import accuracy_score\n", 79 | "from sklearn.preprocessing import MinMaxScaler\n", 80 | "import matplotlib.pyplot as plt\n", 81 | "import pandas as pd\n", 82 | "#import pandas_datareader\n", 83 | "import yfinance as yf" 84 | ], 85 | "execution_count": null, 86 | "outputs": [ 87 | { 88 | "output_type": "stream", 89 | "name": "stdout", 90 | "text": [ 91 | "TensorFlow 1.x selected.\n" 92 | ] 93 | }, 94 | { 95 | "output_type": "stream", 96 | "name": "stderr", 97 | "text": [ 98 | "Using TensorFlow backend.\n" 99 | ] 100 | } 101 | ] 102 | }, 103 | { 104 | "cell_type": 
"markdown", 105 | "metadata": { 106 | "id": "DBk2wzXpMb7i" 107 | }, 108 | "source": [ 109 | "If you have a GPU, you can use the following code to run the computation on it. Otherwise, proceed to (*)." 110 | ] 111 | }, 112 | { 113 | "cell_type": "code", 114 | "metadata": { 115 | "id": "rBk-5FD3Mf5j", 116 | "colab": { 117 | "base_uri": "https://localhost:8080/" 118 | }, 119 | "outputId": "f079924d-f08a-4123-c63f-914a472e81d4" 120 | }, 121 | "source": [ 122 | "import tensorflow as tf \n", 123 | "from keras import backend as K\n", 124 | "\n", 125 | "print(K.tensorflow_backend._get_available_gpus())\n", 126 | "\n", 127 | "number_of_cpu_cores = 8\n", 128 | "config = tf.ConfigProto(device_count = {'GPU': 1 , 'CPU': number_of_cpu_cores}) \n", 129 | "session = tf.Session(config=config) \n", 130 | "keras.backend.set_session(session)" 131 | ], 132 | "execution_count": null, 133 | "outputs": [ 134 | { 135 | "output_type": "stream", 136 | "name": "stdout", 137 | "text": [ 138 | "['/job:localhost/replica:0/task:0/device:GPU:0']\n" 139 | ] 140 | } 141 | ] 142 | }, 143 | { 144 | "cell_type": "markdown", 145 | "metadata": { 146 | "id": "SkpEIvMCMhc0" 147 | }, 148 | "source": [ 149 | "(*) Here, we use the Bitcoin daily prices dataset, which is available at\n", 150 | "[Yahoo Finance](https://finance.yahoo.com/). The data contains seven columns, organized as follows: date, opening stock price, high daily price, low daily price, closing stock price, the currency volume traded on the day, and the adjusted closing price." 
151 | ] 152 | }, 153 | { 154 | "cell_type": "code", 155 | "metadata": { 156 | "id": "7XsGYIwMb_aI" 157 | }, 158 | "source": [ 159 | "train_size = 1500\n", 160 | "start_date = '2015-01-01' # Bitcoin started on '2010-07-16'\n", 161 | "end_date = '2020-04-01'\n", 162 | "\n", 163 | "\n", 164 | "tickerData = yf.Ticker(\"BTC-USD\")\n", 165 | "dataset = tickerData.history(period='max', interval='1d', start=start_date, end=end_date)\n", 166 | "data_order = ['Open','High', 'Low', 'Close', 'Volume']\n", 167 | "dataset = dataset[data_order]\n", 168 | "\n", 169 | "# The date is the index, so columns 1:: are High, Low, Close, Volume ('Open' is dropped).\n", 170 | "train_dataset = dataset.iloc[0:train_size, 1::].values\n", 171 | "test_dataset = dataset.iloc[train_size::, 1::].values\n", 172 | "\n", 173 | "min_max_scaler = MinMaxScaler(feature_range=(0,1))\n", 174 | "normalized_train_dataset = min_max_scaler.fit_transform(train_dataset)\n", 175 | "# Separate scaler for the target column (column 0), used later to invert the predictions.\n", 176 | "min_max_scaler_train = MinMaxScaler(feature_range=(0,1))\n", 177 | "normalized_train_price = min_max_scaler_train.fit_transform(train_dataset[:,0:1])\n" 178 | ], 179 | "execution_count": null, 180 | "outputs": [] 181 | }, 182 | { 183 | "cell_type": "markdown", 184 | "metadata": { 185 | "id": "xndlWJv8Oi0N" 186 | }, 187 | "source": [ 188 | "In the following, we define the network topology." 
189 | ] 190 | }, 191 | { 192 | "cell_type": "code", 193 | "metadata": { 194 | "id": "imLL0-wBOf8o" 195 | }, 196 | "source": [ 197 | "window_size = 50\n", 198 | "number_of_lstm_layers = 3\n", 199 | "activation = 'sigmoid' \n", 200 | "return_sequences = True\n", 201 | "units_first_layer = 100\n", 202 | "units = 50\n", 203 | "\n", 204 | "data = []\n", 205 | "train_price = []\n", 206 | "for i in range(window_size, train_size):\n", 207 | " data.append(normalized_train_dataset[i-window_size:i, 0:5])\n", 208 | " train_price.append(normalized_train_dataset[i, 0])\n", 209 | "data, train_price = np.array(data), np.array(train_price)\n", 210 | "\n", 211 | "lstm_model = Sequential()\n", 212 | "lstm_model.add(LSTM(units = units_first_layer, \n", 213 | " return_sequences = return_sequences, \n", 214 | " input_shape = (data.shape[1], 4)))\n", 215 | "lstm_model.add(Dropout(0.2))\n", 216 | "\n", 217 | "for i in range(number_of_lstm_layers-2):\n", 218 | " lstm_model.add(LSTM(units = units, return_sequences = return_sequences))\n", 219 | " lstm_model.add(Dropout(0.2))\n", 220 | "\n", 221 | "lstm_model.add(LSTM(units = units))\n", 222 | "lstm_model.add(Dropout(0.2))\n", 223 | "\n", 224 | "#Output layer\n", 225 | "lstm_model.add(Dense(units = 1, activation = activation))" 226 | ], 227 | "execution_count": null, 228 | "outputs": [] 229 | }, 230 | { 231 | "cell_type": "markdown", 232 | "metadata": { 233 | "id": "lgj_GKsJVms-" 234 | }, 235 | "source": [ 236 | "In order to check the network topology, the subsequent command can be used." 
237 | ] 238 | }, 239 | { 240 | "cell_type": "code", 241 | "metadata": { 242 | "id": "ZDE5FFHQTD2c", 243 | "outputId": "f87b32e1-9de9-42d6-a30f-8ebb89d00c2c", 244 | "colab": { 245 | "base_uri": "https://localhost:8080/", 246 | "height": 1000 247 | } 248 | }, 249 | "source": [ 250 | "lstm_model.summary() \n", 251 | "# Save the resulting figure as 'lstm_model.png'.\n", 252 | "plot_model(lstm_model, to_file='lstm_model.png', show_shapes=True, \n", 253 | " show_layer_names=True)" 254 | ], 255 | "execution_count": null, 256 | "outputs": [ 257 | { 258 | "output_type": "stream", 259 | "name": "stdout", 260 | "text": [ 261 | "Model: \"sequential_7\"\n", 262 | "_________________________________________________________________\n", 263 | "Layer (type) Output Shape Param # \n", 264 | "=================================================================\n", 265 | "lstm_19 (LSTM) (None, 50, 100) 42000 \n", 266 | "_________________________________________________________________\n", 267 | "dropout_19 (Dropout) (None, 50, 100) 0 \n", 268 | "_________________________________________________________________\n", 269 | "lstm_20 (LSTM) (None, 50, 50) 30200 \n", 270 | "_________________________________________________________________\n", 271 | "dropout_20 (Dropout) (None, 50, 50) 0 \n", 272 | "_________________________________________________________________\n", 273 | "lstm_21 (LSTM) (None, 50) 20200 \n", 274 | "_________________________________________________________________\n", 275 | "dropout_21 (Dropout) (None, 50) 0 \n", 276 | "_________________________________________________________________\n", 277 | "dense_7 (Dense) (None, 1) 51 \n", 278 | "=================================================================\n", 279 | "Total params: 92,451\n", 280 | "Trainable params: 92,451\n", 281 | "Non-trainable params: 0\n", 282 | "_________________________________________________________________\n" 283 | ] 284 | }, 285 | { 286 | "output_type": "execute_result", 287 | "data": { 288 | "image/png": 
"\n", 289 | "text/plain": [ 290 | "" 291 | ] 292 | }, 293 | "metadata": {}, 294 | "execution_count": 62 295 | } 296 | ] 297 | }, 298 | { 299 | "cell_type": "markdown", 300 | "metadata": { 301 | "id": "wYn2dnj2TCm3" 302 | }, 303 | "source": [ 304 | "The training step is executed as follows.\n", 305 | "\n" 306 | ] 307 | }, 308 | { 309 | "cell_type": "code", 310 | "metadata": { 311 | "id": "i-npHi6vVdQB", 312 | "outputId": "e5286a2f-e0b2-4e2d-cc52-f2eb9c15f2af", 313 | "colab": { 314 | "base_uri": "https://localhost:8080/" 315 | } 316 | }, 317 | "source": [ 318 | "#Here we set verbose as true\n", 319 | "verbose = 1\n", 320 | "\n", 321 | "batch_size = 32\n", 322 | "epochs = 10\n", 323 | "filepath = 'weights.h5' #name of the file with the network weights\n", 324 | "monitor = 'loss'\n", 325 | "optimizer = 'adam'\n", 326 | "loss = 'mean_squared_error'\n", 327 | "metrics = ['mean_absolute_error'] \n", 328 | "\n", 329 | "lstm_model.compile(optimizer = optimizer, loss = loss, metrics = metrics)\n", 330 | "\n", 331 | "early_stopping = EarlyStopping(monitor = monitor, min_delta = 1e-15, \n", 332 | " patience = 10, verbose = verbose)\n", 333 | "reduce_learning_rate_on_plateau = ReduceLROnPlateau(monitor = monitor, \n", 334 | " factor = 0.2, patience = 5, \n", 335 | " verbose = verbose)\n", 336 | "model_checkpoint = ModelCheckpoint(filepath = filepath, monitor = monitor, \n", 337 | " save_best_only = True, verbose = verbose)\n", 338 | "lstm_model.fit(data, train_price, epochs = epochs, batch_size = batch_size,\n", 339 | " callbacks = [early_stopping, reduce_learning_rate_on_plateau, \n", 340 | " model_checkpoint])" 341 | ], 342 | "execution_count": null, 343 | "outputs": [ 344 | { 345 | "output_type": "stream", 346 | "name": "stdout", 347 | "text": [ 348 | "Epoch 1/10\n", 349 | "1450/1450 [==============================] - 6s 4ms/step - loss: 0.0567 - mean_absolute_error: 0.1899\n", 350 | "\n", 351 | "Epoch 00001: loss improved from inf to 0.05668, saving model to weights.h5\n", 
352 | "Epoch 2/10\n", 353 | "1450/1450 [==============================] - 7s 4ms/step - loss: 0.0074 - mean_absolute_error: 0.0533\n", 354 | "\n", 355 | "Epoch 00002: loss improved from 0.05668 to 0.00736, saving model to weights.h5\n", 356 | "Epoch 3/10\n", 357 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0032 - mean_absolute_error: 0.0331\n", 358 | "\n", 359 | "Epoch 00003: loss improved from 0.00736 to 0.00316, saving model to weights.h5\n", 360 | "Epoch 4/10\n", 361 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0032 - mean_absolute_error: 0.0344\n", 362 | "\n", 363 | "Epoch 00004: loss did not improve from 0.00316\n", 364 | "Epoch 5/10\n", 365 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0023 - mean_absolute_error: 0.0291\n", 366 | "\n", 367 | "Epoch 00005: loss improved from 0.00316 to 0.00232, saving model to weights.h5\n", 368 | "Epoch 6/10\n", 369 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0018 - mean_absolute_error: 0.0252\n", 370 | "\n", 371 | "Epoch 00006: loss improved from 0.00232 to 0.00178, saving model to weights.h5\n", 372 | "Epoch 7/10\n", 373 | "1450/1450 [==============================] - 5s 4ms/step - loss: 0.0016 - mean_absolute_error: 0.0247\n", 374 | "\n", 375 | "Epoch 00007: loss improved from 0.00178 to 0.00164, saving model to weights.h5\n", 376 | "Epoch 8/10\n", 377 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0015 - mean_absolute_error: 0.0242\n", 378 | "\n", 379 | "Epoch 00008: loss improved from 0.00164 to 0.00149, saving model to weights.h5\n", 380 | "Epoch 9/10\n", 381 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0016 - mean_absolute_error: 0.0257\n", 382 | "\n", 383 | "Epoch 00009: loss did not improve from 0.00149\n", 384 | "Epoch 10/10\n", 385 | "1450/1450 [==============================] - 5s 3ms/step - loss: 0.0014 - mean_absolute_error: 0.0234\n", 386 | "\n", 387 | "Epoch 00010: 
loss improved from 0.00149 to 0.00143, saving model to weights.h5\n" 388 | ] 389 | }, 390 | { 391 | "output_type": "execute_result", 392 | "data": { 393 | "text/plain": [ 394 | "" 395 | ] 396 | }, 397 | "metadata": {}, 398 | "execution_count": 63 399 | } 400 | ] 401 | }, 402 | { 403 | "cell_type": "markdown", 404 | "metadata": { 405 | "id": "dvFufFaJMjXB" 406 | }, 407 | "source": [ 408 | "The following code applies the trained network to the test data." 409 | ] 410 | }, 411 | { 412 | "cell_type": "code", 413 | "metadata": { 414 | "id": "0yBjjCoCjCo3" 415 | }, 416 | "source": [ 417 | "test_price = test_dataset[:, 0:1]\n", 418 | "complete_dataset = dataset.iloc[:,1::]\n", 419 | "# Despite its name, train_data holds the last window_size training rows plus all test rows.\n", 420 | "train_data = complete_dataset[len(complete_dataset) - len(test_dataset) - \n", 421 | " window_size:].values\n", 422 | "train_data = min_max_scaler.transform(train_data)\n", 423 | "\n", 424 | "\n", 425 | "X_test = []\n", 426 | "for i in range(window_size,len(train_data)):\n", 427 | " X_test.append(train_data[i-window_size:i, 0:5])\n", 428 | "X_test = np.array(X_test)\n", 429 | "\n", 430 | "calculated_prices = lstm_model.predict(X_test)\n", 431 | "calculated_prices = min_max_scaler_train.inverse_transform(calculated_prices)\n", 432 | "\n", 433 | "train_price = min_max_scaler_train.inverse_transform([train_price])\n", 434 | "train_price = train_price[0]" 435 | ], 436 | "execution_count": null, 437 | "outputs": [] 438 | }, 439 | { 440 | "cell_type": "markdown", 441 | "metadata": { 442 | "id": "IY8n4My_X4HO" 443 | }, 444 | "source": [ 445 | "In the following, we plot the training set, together with the predicted and expected values for the test period." 
446 | ] 447 | }, 448 | { 449 | "cell_type": "code", 450 | "metadata": { 451 | "id": "HXWZWul0X3hH", 452 | "outputId": "00aaf3a5-fad8-4981-d96b-1d05eab6fddd", 453 | "colab": { 454 | "base_uri": "https://localhost:8080/", 455 | "height": 295 456 | } 457 | }, 458 | "source": [ 459 | "plt.plot(np.linspace(0,len(train_price)-1,len(train_price)), \n", 460 | " train_price, label = 'Real price train', color = 'k')\n", 461 | "plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1,\n", 462 | " len(test_price)), test_price, label = 'Real price')\n", 463 | "plt.plot(np.linspace(len(train_price),len(train_price)+len(test_price)-1,\n", 464 | " len(test_price)), calculated_prices, label = 'Prediction')\n", 465 | "plt.title('BTC prediction')\n", 466 | "plt.xlabel('Time')\n", 467 | "plt.ylabel('Value')\n", 468 | "plt.legend()\n", 469 | "plt.show()" 470 | ], 471 | "execution_count": null, 472 | "outputs": [ 473 | { 474 | "output_type": "display_data", 475 | "data": { 476 | "image/png": "\n", 477 | "text/plain": [ 478 | "
" 479 | ] 480 | }, 481 | "metadata": { 482 | "needs_background": "light" 483 | } 484 | } 485 | ] 486 | }, 487 | { 488 | "cell_type": "markdown", 489 | "metadata": { 490 | "id": "EmXldp6aSUwa" 491 | }, 492 | "source": [ 493 | "## License\n", 494 | "\n", 495 | "This Deep Learning Tutorial is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 (CC BY-NC-ND 4.0) International License." 496 | ] 497 | }, 498 | { 499 | "cell_type": "markdown", 500 | "metadata": { 501 | "id": "IGfUENRWu4Pm" 502 | }, 503 | "source": [ 504 | "## Acknowledgments\n", 505 | "Henrique F. de Arruda acknowledges FAPESP for sponsorship (grant no. 2018/10489-0). H. F. de Arruda also thanks Soremartec S.A. and Soremartec Italia, Ferrero Group, for partial financial support (from 1st July 2021). His funders had no role in study design, data collection, and analysis, decision to publish, or manuscript preparation. Alexandre Benatti thanks Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and FAPESP (proc. 15/22308-2) for sponsorship. César H. Comin thanks FAPESP (Grant Nos. 2018/09125-4 and 2021/12354-8) for financial support. This work has been supported also by FAPESP grants 11/50761-2 and 15/22308-2." 506 | ] 507 | } 508 | ] 509 | } --------------------------------------------------------------------------------
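The scaling and windowing steps used in the LSTM notebook above can be sketched without Keras or NumPy. This is a minimal illustration under stated assumptions, not the authors' code: `min_max_scale` and `make_windows` are hypothetical helper names, the price values are made up, and only one feature is used, whereas the notebook feeds four features per time step and fits its scaler on the training rows only.

```python
# Illustrative sketch only: plain-Python versions of min-max scaling and
# sliding-window construction. Helper names and data are hypothetical.

def min_max_scale(values):
    """Scale a list of numbers linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant series carries no range information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def make_windows(series, window_size):
    """Pair each run of `window_size` past values with the value that follows it."""
    inputs, targets = [], []
    for i in range(window_size, len(series)):
        inputs.append(series[i - window_size:i])  # the previous window_size values
        targets.append(series[i])                 # the value to be predicted
    return inputs, targets


prices = [10.0, 12.0, 11.0, 15.0, 14.0, 20.0]  # made-up toy values
scaled = min_max_scale(prices)
X, y = make_windows(scaled, window_size=3)
print(len(X), len(y))
```

A series of length n yields n - window_size samples, which matches the 1450 training samples reported in the notebook output for train_size = 1500 and window_size = 50.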