└── README.md /README.md: -------------------------------------------------------------------------------- 1 | Machine Learning Explorations 2 | === 3 | 4 | A list of machine learning resources published between 2011 and 2021. More recent resources can be found in my repositories [Awesome Generative AI](https://github.com/steven2358/awesome-generative-ai) and [AI in 2023](https://github.com/steven2358/AI_in_2023). 5 | 6 | 2021-12-20 7 | --- 8 | - [High-Resolution Image Synthesis with Latent Diffusion Models](https://arxiv.org/abs/2112.10752) A paper by Rombach et al. on diffusion models for image generation, later used as the model behind [Stable Diffusion](https://stability.ai/blog/stable-diffusion-public-release). #synthesis 9 | 10 | 2021-07-07 11 | --- 12 | - [Evaluating Large Language Models Trained on Code.](https://arxiv.org/abs/2107.03374) A paper by OpenAI introducing Codex, a GPT language model that translates language into code. [Accompanying blog post.](https://openai.com/blog/openai-codex/) #nlp 13 | 14 | 2021-02-26 15 | --- 16 | - [Learning Transferable Visual Models From Natural Language Supervision](https://arxiv.org/abs/2103.00020) A paper by OpenAI on CLIP (Contrastive Language-Image Pre-Training). [Accompanying blog post.](https://openai.com/blog/clip/) #nlp #vision 17 | 18 | 2020-05-28 19 | --- 20 | - [Language Models are Few-Shot Learners](https://arxiv.org/abs/2005.14165) A paper by OpenAI on the training of GPT-3, an autoregressive language model with 175 billion parameters. #nlp 21 | 22 | 2019-05-28 23 | --- 24 | - [EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.](https://arxiv.org/abs/1905.11946) A paper by Mingxing Tan and Quoc V. Le on a scaling method to achieve small and fast neural networks with high classification accuracy. 25 | 26 | 2019-03-13 27 | --- 28 | - [The Bitter Lesson.](http://www.incompleteideas.net/IncIdeas/BitterLesson.html) An essay by Richard Sutton on general-purpose methods, the leveraging of computation, and the fallacy of domain knowledge. #ai #reinforcementlearning 29 | 30 | 2019-02-14 31 | --- 32 | - [Language Models are Unsupervised Multitask Learners.](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) A paper by OpenAI's A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever introducing the GPT-2 model for word prediction. [Accompanying blog post.](https://openai.com/blog/better-language-models/) #nlp 33 | 34 | 2018-10-11 35 | --- 36 | - [BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.](https://arxiv.org/abs/1810.04805v1) A paper by Google AI Language on a state-of-the-art language model for a wide variety of NLP tasks. Model weights available at https://github.com/google-research/bert. #nlp 37 | 38 | 2018-06-18 39 | --- 40 | - [Neural Ordinary Differential Equations.](https://arxiv.org/abs/1806.07366) A paper by [R. T. Q. Chen](https://github.com/rtqichen), [Y. Rubanova](https://github.com/YuliaRubanova), [J. Bettencourt](https://github.com/jessebett), and [D. Duvenaud](https://github.com/duvenaud) that defines neural networks as continuously evolving systems by parameterizing the derivative of their hidden states, using ODE solvers to find their final state. #dnn #ode 41 | 42 | 2018-04-19 43 | --- 44 | - [Artificial Intelligence — The Revolution Hasn’t Happened Yet.](https://medium.com/@mijordan3/artificial-intelligence-the-revolution-hasnt-happened-yet-5e1d5812e1e7) An essay by Michael I. Jordan on the current state of AI. #ai 45 | 46 | 2017-12-15 47 | --- 48 | - [Faceswap.](https://github.com/deepfakes/faceswap) Faceswap is a tool that utilizes deep learning to recognize and swap faces in pictures and videos. #deeplearning #video #fake 49 | 50 | 2017-10-26 51 | --- 52 | - [A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.](http://science.sciencemag.org/content/early/2017/10/25/science.aag2612) A paper by Vicarious introducing the recursive cortical network, a generative model with prior structure that facilitates model building, authored by D. George, W. Lehrach, K. Kansky, M. Lázaro-Gredilla, C. Laan, B. Marthi, X. Lou, Z. Meng, Y. Liu, H. Wang, A. Lavin and D. S. Pho. #captcha #vision 53 | 54 | 2017-07-20 55 | --- 56 | - [ShortScience.org](https://www.shortscience.org/) Summaries of machine learning papers, provided by the community. [Acompanying paper.](https://arxiv.org/abs/1707.06684) #websites 57 | 58 | 2017-07-13 59 | --- 60 | - [Foolbox v0.8.0: A Python toolbox to benchmark the robustness of machine learning models.](https://arxiv.org/abs/1707.04131) A Python toolbox to create adversarial examples that fool neural networks. [Code repo.](https://github.com/bethgelab/foolbox) #adversarial #deeplearning #toolboxes #python 61 | 62 | 2017-07-06 63 | --- 64 | - [Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks.](https://arxiv.org/abs/1707.01836) A paper by Pranav Rajpurkar, Awni Y. Hannun et al. that uses a 34-layer convolutional neural network which maps a sequence of ECG samples to a sequence of rhythm classes. #biomedical #cnn #deeplearning 65 | 66 | 2017-06-12 67 | --- 68 | - [Attention Is All You Need.](https://arxiv.org/abs/1706.03762) A paper by Google Brain introducing Transformer networks for sequence transduction. #nlp 69 | 70 | 2017-06-07 71 | --- 72 | - [Functional neuroimaging of high-risk 6-month-old infants predicts a diagnosis of autism at 24 months of age.](http://stm.sciencemag.org/content/9/393/eaag2882) A paper by Robert W. Emerson et al. that uses pattern recognition techniques (SVM) on fMRI data. #autism #fmri #matlab #biomedical 73 | 74 | 2017-06-15 75 | --- 76 | - [Supercharge your Computer Vision models with the TensorFlow Object Detection API](https://research.googleblog.com/2017/06/supercharge-your-computer-vision-models.html) Release of TensorFlow Object Detection API. #Google #tensorflow #detection 77 | 78 | 2017-05-22 79 | --- 80 | - [pix2code: Generating Code from a Graphical User Interface Screenshot.](https://arxiv.org/abs/1705.07962) A paper by Tony Beltramelli that converts a graphical user interface screenshot created by a designer into computer code. #gui #deeplearning #design 81 | 82 | 2017-04-19 83 | --- 84 | - [The GAN Zoo.](https://github.com/hindupuravinash/the-gan-zoo) A list of all named GANs. [Accompanying blog post.](https://deephunt.in/the-gan-zoo-79597dc8c347) #GAN #zoo 85 | 86 | 2017-03-30 87 | --- 88 | - [Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks](https://arxiv.org/abs/1703.10593) CycleGAN paper by J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros. [Official Torch implementation.](https://github.com/junyanz/CycleGAN) #GAN #paper 89 | 90 | 2017-03-20 91 | --- 92 | - [Distill.pub](http://distill.pub/) A web-based peer-reviewed journal dedicated to clear explanations of machine learning. #journals 93 | 94 | 2017-03-15 95 | --- 96 | - [Learning to Discover Cross-Domain Relations with Generative Adversarial Networks.](https://arxiv.org/abs/1703.05192) DiscoGAN paper by T. Kim, M. Cha, H. Kim, J. Lee, and J. Kim. [Official Torch implementation.](https://github.com/SKTBrain/DiscoGAN) #GAN 97 | 98 | 2017-03-07 99 | --- 100 | - [Machine Learning and Misinformation.](http://paulsoulos.com/editorial/2017/03/07/machine-learning-and-misinformation.html) A blog post by Paul Soulos about the destabilizing effects that disruptive technologies have when taken in a fragile social and economic climate that is slow to adapt. #GAN #psychology #beliefechos #confirmationbias 101 | 102 | 2017-02-02 103 | --- 104 | - [Dermatologist-level classification of skin cancer with deep neural networks.](https://www.nature.com/nature/journal/v542/n7639/full/nature21056.html) A paper by Andre Esteva on using convolutional neural networks to classify skin lesions. #cnn #deeplearning #tensorflow #biomedical 105 | 106 | 2016-11-06 107 | --- 108 | - [Deep Learning: The Unreasonable Effectiveness of Randomness.](https://medium.com/intuitionmachine/deep-learning-the-unreasonable-effectiveness-of-randomness-14d5aef13f87) A blog post by [Carlos E. Perez](https://medium.com/@IntuitMachine) about meta-learning (including a machine that learns variants of the LSTM node), thoughts on models memorizing the training data, and networks with random weights. #metalearning #randomness 109 | 110 | 2016-10-13 111 | --- 112 | - [Uncertainty in Deep Learning.](http://mlg.eng.cam.ac.uk/yarin/blog_2248.html) A blog post by Yarin Gal about his PhD Thesis on Bayesian Deep Learning through dropout. #dropout #bayesian #dl 113 | 114 | 2016-10-06 115 | --- 116 | - [Convolutional Variational Autoencoder, trained on MNIST.](https://transcranial.github.io/keras-js/#/mnist-vae) Interactive demo of convolutional variational autoencoder. #interactive #visualizations #cnn #vae 117 | 118 | 2016-10-03 119 | --- 120 | - [How to Use t-SNE Effectively.](http://distill.pub/2016/misread-tsne/) An interactive exploration of the t-SNE algorithm. #visualizations #interactive #tSNE 121 | - [cleverhans v2.0.0: an adversarial machine learning library.](https://arxiv.org/abs/1610.00768) Cleverhans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. #adversarial #deeplearning #toolboxes #python 122 | 123 | 2016-09-30 124 | --- 125 | - [Open Sourcing a Deep Learning Solution for Detecting NSFW Images.](https://yahooeng.tumblr.com/post/151148689421/open-sourcing-a-deep-learning-solution-for) A Caffe DNN (ResNet 50) for detecting NSFW images, by Yahoo. #cnn 126 | - [Synthesizing the preferred inputs for neurons in neural networks via deep generator networks.](http://www.evolvingai.org/synthesizing) A paper by A. Nguyen, A. Dosovitskiy, J. Yosinski, T. Brox, and J. Clune. #papers #deeplearning #synthesis 127 | 128 | 2016-09-26 129 | --- 130 | - [Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.](https://arxiv.org/abs/1609.08144) A Google Research paper on end-to-end automated translation. [Accompanying blog post](https://research.googleblog.com/2016/11/zero-shot-translation-with-googles.html) #lstm 131 | 132 | 2016-07-21 133 | --- 134 | - [Approaching (Almost) Any Machine Learning Problem.](http://blog.kaggle.com/2016/07/21/approaching-almost-any-machine-learning-problem-abhishek-thakur/) A post on practical ML pipelines by Abhishek Thakur. #pipelines #python 135 | 136 | 2016-07-13 137 | --- 138 | - [Matching Networks for One Shot Learning.](https://arxiv.org/abs/1606.04080) A paper by Oriol Vinyals, Charles Blundell, Timothy Lillicrap, Koray Kavukcuoglu, and Daan Wierstra. #papers #oneshot 139 | 140 | 2016-07-05 141 | --- 142 | - [Gradient Boosting Interactive Playground.](http://arogozhnikov.github.io/2016/07/05/gradient_boosting_playground.html) An interactive demonstration-explanation of gradient boosting algorithm applied to classification problem. #visualizations #interactive #GB 143 | 144 | 2016-07-03 145 | --- 146 | - [Awesome Deep Learning Papers.](https://github.com/terryum/awesome-deep-learning-papers) A curated list of the most cited deep learning papers since 2012. #dl #papers 147 | 148 | 2016-06-21 149 | --- 150 | - [ML + AI advances 2016.](https://vzn1.wordpress.com/2016/06/21/ml-ai-advances-2016/) An exhaustive list of ML and AI headlines of the past year. #overview #2016 151 | 152 | 2016-05-22 153 | --- 154 | - [Magenta: Music and Art Generation with Machine Intelligence.](http://magenta.tensorflow.org/) A Google project to make art with TensorFlow. #art #tensorflow 155 | 156 | 2016-04-28 157 | --- 158 | - [Interactive demonstrations for ML courses.](http://arogozhnikov.github.io/2016/04/28/demonstrations-for-ml-courses.html) A list of interactive demos: ROC curve, convnet.js (RandomForest, SVM, Neural network), Decision tree, Gradient Boosting (and decision tree for regression), Mini-course on reinforcement learning, TensorFlow NN, Image Reconstruction, t-SNE, Variational AutoEncoder, Generative adversarial networks, etc. #visualizations #interactive #list 159 | - [Movidius Fathom, a Neural Network Compute Framework embedded on a USB stick.](http://www.movidius.com/solutions/machine-vision-algorithms/machine-learning) #usb #hardware #deeplearning 160 | 161 | 2016-04-27 162 | --- 163 | - [OpenAI Gym.](https://gym.openai.com) A toolkit for developing and comparing reinforcement learning algorithms. #reinforcementlearning #openai #python 164 | 165 | 2016-04-13 166 | --- 167 | - [CreativeAi.](http://www.creativeai.net/) A space to share research and experiments that deal with Creativity and A.I. #art #ai 168 | 169 | 2016-04-12 170 | --- 171 | - [A Neural Network Playground.](http://playground.tensorflow.org/) Play with neural networks. Brought by the TensorFlow team. #visualizations #interactive #tensorflow 172 | 173 | 2016-03-23 174 | --- 175 | - [Convolution arithmetic.](https://github.com/vdumoulin/conv_arithmetic) A set of animations illustrating different convolutions, from the paper "A technical report on convolution arithmetic in the context of deep learning" by Vincent Dumoulin and Francesco Visin. #cnn #deeplearning 176 | 177 | 2016-03-09 178 | --- 179 | - [XGBoost: A Scalable Tree Boosting System.](http://arxiv.org/abs/1603.02754) A paper by Tianqi Chen and Carlos Guestrin. #papers #boosting 180 | 181 | 2016-02-17 182 | --- 183 | - [Communication-Efficient Learning of Deep Networks from Decentralized Data](https://arxiv.org/abs/1602.05629) A paper by researchers from Google introducing federated learning. #papers #federatedlearning 184 | 185 | 2016-01-22 186 | --- 187 | - [An Interactive Node-Link Visualization of Convolutional Neural Networks.](http://scs.ryerson.ca/~aharley/vis/) A 3D interface by Adam W. Harley. #visualizations #interactive #cnn 188 | 189 | 2016-01-11 190 | --- 191 | - [Anthony Goldbloom gives you the secret to winning Kaggle competitions.](https://www.import.io/post/how-to-win-a-kaggle-competition/) A blog post reporting that XGBoost and Neural Networks are winning the majority of Kaggle competitions. #kaggle #xgboost #nn 192 | 193 | 2015-12-11 194 | --- 195 | - [Human-level concept learning through probabilistic program induction.](http://m.sciencemag.org/content/350/6266/1332.full.pdf) A paper by Brenden M. Lake, Ruslan Salakhutdinov, and Joshua B. Tenenbaum. #papers #one-shot #bayes 196 | 197 | 2015-12-10 198 | --- 199 | - [Deep Residual Learning for Image Recognition.](https://arxiv.org/abs/1512.03385) A paper by Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun introducing the ResNet architecture. #deeplearning 200 | 201 | 2015-12-03 202 | --- 203 | - [A mathematical motivation for complex-valued convolutional networks.](http://arxiv.org/abs/1503.03438) A paper by Joan Bruna, Soumith Chintala, Yann LeCun, Serkan Piantino, Arthur Szlam, and Mark Tygert. #cnn #wavelets 204 | 205 | 2015-11-09 206 | --- 207 | - [TensorFlow, an open source software library for machine intelligence by Google.](http://tensorflow.org/) #opensource #google #deeplearning #tensorflow 208 | 209 | 2015-11-05 210 | --- 211 | - [Deepart.io.](https://deepart.io/) Generate images styled like your favorite artist #art 212 | 213 | 2015-10-12 214 | --- 215 | - [Grasp-and-Lift EEG Detection Winners' Interview: 1st place, Cat & Dog.](http://blog.kaggle.com/2015/10/12/grasp-and-lift-eeg-winners-interview-1st-place-cat-dog/) #kaggle #timeseries #eeg 216 | - [Code and documentation for the winning sollution at the Grasp-and-Lift EEG Detection challenge.](https://github.com/alexandrebarachant/Grasp-and-lift-EEG-challenge) #code #python #github #kaggle 217 | 218 | 2015-10-05 219 | --- 220 | - [Grasp-and-Lift EEG Detection Winners' Interview: 3rd place, Team HEDJ.](http://blog.kaggle.com/2015/10/05/grasp-and-lift-eeg-detection-winners-interview-3rd-place-team-hedj/) #kaggle #timeseries #eeg 221 | 222 | 2015-09-17 223 | --- 224 | - [Recurrent Neural Networks Tutorial, Part 1 – Introduction to RNNs.](http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/) #rnn 225 | 226 | 2015-09-10 227 | --- 228 | - [Spatial Transformer Networks.](http://arxiv.org/abs/1506.02025) A paper by Max Jaderberg, Karen Simonyan, Andrew Zisserman, Koray Kavukcuoglu. And here's a demo on traffic sign recognition: [The power of Spatial Transformer Networks](http://torch.ch/blog/2015/09/07/spatial_transformers.html) #papers #code 229 | 230 | 2015-08-27 231 | --- 232 | - [Understanding LSTM Networks.](http://colah.github.io/posts/2015-08-Understanding-LSTMs/) A blog post tutorial on LSTMs by Christopher Olah. #lstm #rnn 233 | 234 | 2015-08-26 235 | --- 236 | - [A Neural Algorithm of Artistic Style.](http://arxiv.org/abs/1508.06576) A paper by Leon A. Gatys, Alexander S. Ecker, Matthias Bethge. #papers #art 237 | 238 | 2015-08-03 239 | --- 240 | - [GitXiv: Collaborative Open Computer Science.](http://gitxiv.com/) A space to share links to open computer science projects. Each project is presented as arXiv + Github + Links + Discussion. #github #arxiv 241 | 242 | 2015-07-20 243 | --- 244 | - [Caffe Model Zoo.](http://caffe.berkeleyvision.org/model_zoo.html) Caffe models for different tasks with all kinds of architectures and data. More on the [GitHub Wiki](https://github.com/BVLC/caffe/wiki/Model-Zoo) #caffe #models #pretrained 245 | 246 | 2015-06-25 247 | --- 248 | - [Why Deep Learning Is a Hindrance to Progress Toward True AI.](http://rebelscience.blogspot.fr/2015/06/why-deep-learning-is-hindrance-to.html) Blog post on the role of time in unsupervised learning. #unsupervisedlearning 249 | 250 | 2015-06-17 251 | --- 252 | - [Inceptionism: Going Deeper into Neural Networks.](http://googleresearch.blogspot.co.uk/2015/06/inceptionism-going-deeper-into-neural.html) Blog post creating feature-enhanced images by turning recurrent neural networks upside down. #rnn 253 | 254 | 2015-06-11 255 | --- 256 | - [The Holy Grail of Deep Learning: Modelling Invariances.](http://www.inference.vc/the-holy-gr/) A blog post by Ferenc Huszár. #deeplearning 257 | 258 | 2015-06-08 259 | --- 260 | - [Neural Turing Machines.](http://arxiv.org/abs/1410.5401) A paper by Alex Graves, Greg Wayne and Ivo Danihelka. #papers 261 | - [How to Evaluate Machine Learning Models: Hyperparameter Tuning.](http://blog.dato.com/how-to-evaluate-machine-learning-models-part-4-hyperparameter-tuning) #hyperparameters 262 | 263 | 2015-06-04 264 | --- 265 | - [Competing in a data science contest without reading the data.](http://blog.mrtz.org/2015/03/09/competition.html) Blog post that introduces the wacky boosting algorithm. #kaggle #boosting 266 | 267 | 2015-05-27 268 | --- 269 | - [Model-Based Machine Learning (Early Access): an online book.](http://www.mbmlbook.com/) #books 270 | - [Yann LeCun, Yoshua Bengio & Geoffrey Hinton - Deep Learning.](http://www.nature.com/nature/journal/v521/n7553/full/nature14539.html) Article in nature (paywalled). #papers #deeplearning 271 | 272 | 2015-05-25 273 | --- 274 | - [Siamese Neural Networks for One-Shot Image Recognition](https://www.cs.cmu.edu/~rsalakhu/papers/oneshot1.pdf) A paper by Gregory Koch, Richard Zemel, and Ruslan Salakhutdinov. #papers #deeplearning 275 | 276 | 2015-05-21 277 | --- 278 | - [The Unreasonable Effectiveness of Recurrent Neural Networks.](http://karpathy.github.io/2015/05/21/rnn-effectiveness/) Blog post by Andrej Karpathy. #rnn 279 | 280 | 2015-05-18 281 | --- 282 | - [U-Net: Convolutional Networks for Biomedical Image Segmentation.](https://lmb.informatik.uni-freiburg.de/people/ronneber/u-net/) A paper by O. Ronneberger, P. Fischer, and T. Brox on U-Net image segmentation, including source code. #papers #matlab #caffe 283 | 284 | 2015-04-06 285 | --- 286 | - [Early Stopping is Nonparametric Variational Inference.](http://arxiv.org/abs/1504.01344) A Bayesian interpretation of early-stopping stochastic gradient descent. #papers 287 | 288 | 2015-03-29 289 | --- 290 | - [CS224d: Deep Learning for Natural Language Processing.](http://cs224d.stanford.edu/) #courses 291 | - [Data Mining Courses.](https://datayo.wordpress.com/2015/03/29/courses/) Overview of online machine learning and data mining courses. #courses 292 | 293 | 2015-02-14 294 | --- 295 | - [Tutorial to configure an AWS instance to run Theano.](https://www.kaggle.com/c/facial-keypoints-detection/details/deep-learning-tutorial) #gpu #theano #aws 296 | 297 | 2015-02-13 298 | --- 299 | - [Using convolutional neural nets to detect facial keypoints tutorial.](http://danielnouri.org/notes/2014/12/17/using-convolutional-neural-nets-to-detect-facial-keypoints-tutorial/) Tutorial by Daniel Nouri on using Theano and Lasagne for detecting facial keypoints through deep convolutional regression. #deeplearning #cnn #theano #pretrained #regression 300 | 301 | 2015-02-11 302 | --- 303 | - [Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift.](http://arxiv.org/abs/1502.03167) A paper by Sergey Ioffe and Christian Szegedy. #papers #deeplearning 304 | 305 | 2015-01-24 306 | --- 307 | - [Visualizing DBSCAN Clustering.](https://www.naftaliharris.com/blog/visualizing-dbscan-clustering/) Interactive demo of DBSCAN clustering algorithm. #clustering #dbscan #interactive #visualization 308 | 309 | 2015-01-01 310 | --- 311 | - [CS231n: Convolutional Neural Networks for Visual Recognition.](http://vision.stanford.edu/teaching/cs231n/) #courses 312 | 313 | 2014-12-21 314 | --- 315 | - [Do Deep Nets Really Need to be Deep?](http://arxiv.org/abs/1312.6184) "Once a deep network is trained, a shallow network can learn the same function from the outputs of the deep network. The shallow network can't learn the same function directly from the data. This indicates that deep learning could be an optimization/learning trick". Via [Nuit Blanche](http://nuit-blanche.blogspot.com.es/2014/12/sunday-morning-insight-regularization.html) #papers #deeplearning 316 | 317 | 2014-12-17 318 | --- 319 | - [Learning Deep Architectures for AI.](http://www.iro.umontreal.ca/~bengioy/papers/ftml.pdf) A paper on the motivation for deep architectures, by Yoshua Bengio. #papers #ai 320 | - ["Using convolutional neural nets to detect facial keypoints tutorial."](http://danielnouri.org/notes/2014/12/17/using-convolutional-neural-nets-to-detect-facial-keypoints-tutorial/) A tutorial by Daniel Nouri on how to perform regression using Theano + Lasagne + nolearn to find keypoints in pictures of face. #tutorial #theano #deeplearning 321 | - ["Deep Learning Tutorial".](https://www.kaggle.com/c/facial-keypoints-detection/details/deep-learning-tutorial) A tutorial on creating an AWS instance to run Theano. #kaggle #deeplearning #aws 322 | 323 | 2014-12-09 324 | --- 325 | - [Do We Need Hundreds of Classifiers to Solve Real World Classification Problems?](http://jmlr.csail.mit.edu/papers/volume15/delgado14a/delgado14a.pdf) A paper by Manuel Fernández-Delgado, Eva Cernadas and Senén Barro. #paper #classification 326 | 327 | 2014-12-03 328 | --- 329 | - [So You Wanna Try Deep Learning?](http://snippyhollow.github.io/blog/2014/08/09/so-you-wanna-try-deep-learning/) Crash-course into deep learning including a one-file python implementation of a deep neural network. #code 330 | 331 | 2014-09-29 332 | --- 333 | - [Tom M. Mitchell - The Discipline of Machine Learning.](http://www.cs.cmu.edu/~tom/pubs/MachineLearning.pdf) #papers 334 | - [An interview with Judea Pearl](http://www.cambridge.org/gb/knowledge/features/featureitem/item6977116/?site_locale=en_GB) #interviews #ai 335 | 336 | 2014-09-28 337 | --- 338 | - [A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models.](http://crow.ee.washington.edu/people/bulyko/papers/em.pdf) #papers 339 | 340 | 2014-09-18 341 | --- 342 | - [Long short term memory.](http://en.wikipedia.org/wiki/Long_short_term_memory) Wikipedia entry. #lstm #rnn 343 | 344 | 2014-09-15 345 | --- 346 | - [Deep Learning: Past, Present and Future.](https://drive.google.com/file/d/0B16RwCMQqrtdb05qdDFnSXprM0E/view?sle=true) Slides by Kyunghyun Cho. #tutorial #deeplearning 347 | 348 | 2014-09-08 349 | - [Accelerate Machine Learning with the cuDNN Deep Neural Network Library](http://devblogs.nvidia.com/parallelforall/accelerate-machine-learning-cudnn-deep-neural-network-library/) #deeplearning #toolboxes 350 | 351 | 2014-09-05 352 | --- 353 | - [Intro to Artificial Intelligence.](https://www.udacity.com/course/cs271) Course by Sebastian Thrun and Peter Norvig on Udacity. #courses #online_courses 354 | - [Using deep learning to listen for whales.](http://danielnouri.org/notes/2014/01/10/using-deep-learning-to-listen-for-whales/) Daniel Nouri's notes on the Kaggle Whale Detection Challenge. #deeplearning #kaggle 355 | 356 | 2014-09-04 357 | --- 358 | - [Kernel Methods Match Deep Neural Networks On TIMIT.](http://www.ifp.illinois.edu/~huang146/papers/Kernel_DNN_ICASSP2014.pdf) Via [Reddit](http://www.reddit.com/r/MachineLearning/comments/2fbw7i/kernel_methods_match_deep_neural_networks_on_timit/) #kernelmethods #deeplearning #papers 359 | 360 | 2014-08-31 361 | --- 362 | - [A Deep Learning Tutorial: From Perceptrons to Deep Networks.](http://www.toptal.com/machine-learning/an-introduction-to-deep-learning-from-perceptrons-to-deep-networks) High-level overview of deep learning. #tutorials #deeplearning 363 | - [Stochastic Pooling for Regularization of Deep Convolutional Neural Networks.](http://techtalks.tv/talks/stochastic-pooling-for-regularization-of-deep-convolutional-neural-networks/58106/) A talk by Matthew D. Zeiler. Stochastic pooling: treat activations as probabilities and pick one with the corresponding probability. At test time: weight activations with probabilities. Implemented using [GPUmat](https://sourceforge.net/projects/gpumat/) with [cuda-convnet](https://code.google.com/p/cuda-convnet/) #videos #deeplearning 364 | 365 | 2014-08-30 366 | --- 367 | - [The History of AI.](http://arnetminer.org/event/aihistory) #AI #charts 368 | 369 | 2014-08-29 370 | --- 371 | - [Who is the best at X?](http://rodrigob.github.io/are_we_there_yet/build/) Benchmark visual recognition tests and comparison of results by state-of-the-art algorithms. #benchmarks #vision 372 | 373 | 2014-08-26 374 | --- 375 | - [Deep-er Kernels.](http://videolectures.net/roks2013_shawe_taylor_kernels/) A talk by John Shawe-Taylor #videos #kernelmethods #deeplearning 376 | 377 | 2014-08-25 378 | --- 379 | - [Recommending music on Spotify with deep learning.](http://benanne.github.io/2014/08/05/spotify-cnns.html) A blog post by Sander Dieleman #deeplearning 380 | 381 | 2014-08-24 382 | --- 383 | - [Deep Learning.](http://www.iro.umontreal.ca/~bengioy/dlbook/) An MIT Press book by Yoshua Bengio, Ian Goodfellow and Aaron Courville. #books #pdf #deeplearning 384 | - [Signal and Image Classification - Stephane Mallat Technion lecture.](https://www.youtube.com/watch?v=wHhYvtnY2zI) Via [Nuit Blanche](http://nuit-blanche.blogspot.fr/2014/08/saturday-morning-video-signal-and-image.html) #videos #wavelets #deeplearning 385 | - [Transforming Auto-encoders.](http://www.cs.toronto.edu/~fritz/absps/transauto6.pdf) Paper by G. E. Hinton, A. Krizhevsky & S. D. Wang, introducing the idea of "capsules". #papers #deeplearning #capsules 386 | - [The Data Scientist on a Quest to Turn Computers Into Doctors.](http://www.wired.com/2014/08/enlitic/) Article on Jeremy Howard's company [Enlitic](http://www.enlitic.com/) #deeplearning #vision #medicine #companies #articles 387 | - [Indolent or aggressive? A computerised pathologist that can outperform its human counterparts could transform the field of cancer diagnosis.](http://www.economist.com/node/21540387) Article on Daphne Koller's Computational Pathologist. #articles #medicine #vision 388 | 389 | 2014-08-19 390 | --- 391 | - [Timeseries Classification: KNN & DTW.](http://nbviewer.ipython.org/github/markdregan/K-Nearest-Neighbors-with-Dynamic-Time-Warping/blob/master/K_Nearest_Neighbor_Dynamic_Time_Warping.ipynb) An iPython notebook by Mark Regan. #python #dtw #knn #timeseries 392 | 393 | 2014-08-14 394 | --- 395 | - [Neural Networks course by Hugo Larochelle.](http://info.usherbrooke.ca/hlarochelle/neural_networks/content.html) #courses #videos #deeplearning 396 | 397 | 2014-08-10 398 | --- 399 | - [Neural Networks and Deep Learning.](http://neuralnetworksanddeeplearning.com/) Online book by Michael Nielsen. #books #deeplearning 400 | 401 | 2014-08-07 402 | --- 403 | - [Winning solution for the DecMeg2014 challenge.](https://github.com/alexandrebarachant/DecMeg2014) #code #matlab #github #kaggle 404 | 405 | 2014-08-04 406 | --- 407 | - [Deep Learning and Convolutional Kernel Networks.](http://nuit-blanche.blogspot.com/2014/08/deep-learning-and-convolutional-kernel.html) #deeplearning #kernelmethods 408 | 409 | 2014-07-31 410 | --- 411 | - [UFLDL Tutorial.](http://ufldl.stanford.edu/wiki/index.php/UFLDL_Tutorial) Stanford tutorial on deep learning. My solutions on Github: [Sparse Autoencoder.](https://github.com/steven2358/SparseAutoencoder), [Sparse Autoencode Vectorized](https://github.com/steven2358/SparseAutoencoderVectorized), [Softmax Regression](https://github.com/steven2358/SoftmaxRegression), [Self-Taught Learning](https://github.com/steven2358/SelfTaughtLearning), [Linear Decoder](https://github.com/steven2358/LinearDecoder), [Stacked Autoencoder](https://github.com/steven2358/StackedAutoencoder) and [Convolutional Neural Network](https://github.com/steven2358/cnn_ufldl). #courses #deeplearning 412 | 413 | 2014-07-29 414 | --- 415 | - [Kaggle Competition Past Solutions.](http://www.chioka.in/kaggle-competition-solutions/) #kaggle #code 416 | 417 | 2014-06-21 418 | --- 419 | - [Towards End-to-End Speech Recognition with Recurrent Neural Networks.](http://www.jmlr.org/proceedings/papers/v32/graves14.pdf) A paper by Alex Graves and Navdeep Jaitly. #rnn #lstm 420 | 421 | 2014-06-10 422 | --- 423 | - [Generative Adversarial Networks.](https://arxiv.org/abs/1406.2661) A paper by Ian J. Goodfellow et al. that proposes a framework for estimating generative models via an adversarial process. #gan 424 | 425 | 2014-02-14 426 | --- 427 | - [ConvNetJS: Deep Learning in your browser.](http://cs.stanford.edu/people/karpathy/convnetjs/) Javascript machine learning #code #deeplearning 428 | 429 | 2013-10-25 430 | --- 431 | - ["Learning Hierarchies Of Invariant Features".](http://www.slideshare.net/yandex/yann-le-cun) Slides by Yann LeCun. #tutorial #deeplearning 432 | 433 | 2012-12-03 434 | --- 435 | - [ImageNet Classification with Deep Convolutional Neural Networks.](http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf) A paper by Alex Krizhevsky, Ilya Sutskever and Geoffrey E. Hinton. #deeplearning #cnn 436 | 437 | 2011-03-11 438 | --- 439 | - [How to Grow a Mind: Statistics, Structure, and Abstraction.](http://web.mit.edu/cocosci/Papers/tkgg-science11-reprint.pdf) A paper by Joshua B. Tenenbaum, Charles Kemp, Thomas L. Griffiths, and Noah D. Goodman. #bayes #brain #science 440 | --------------------------------------------------------------------------------