├── Duplicate detection with LSH
    └── Duplicate detection with LSH.ipynb
├── README.md
├── Rating prediction
    ├── Rating prediction with matrix factorization.ipynb
    └── ratings.npy
├── Restaurant -PageRank
    ├── Restaurant_ranking.ipynb
    └── file
├── Review generation with HMM
    ├── data_HMM.npy
    └── task_03_hidden_markov_model.ipynb
└── Spectral clustering
    ├── Spectral Clustering.ipynb
    └── data.zip


/Duplicate detection with LSH/Duplicate detection with LSH.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Duplicate detection with LSH/Duplicate detection with LSH.ipynb
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/README.md
--------------------------------------------------------------------------------
/Rating prediction/Rating prediction with matrix factorization.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Rating prediction/Rating prediction with matrix factorization.ipynb
--------------------------------------------------------------------------------
/Rating prediction/ratings.npy:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Rating prediction/ratings.npy
--------------------------------------------------------------------------------
/Restaurant -PageRank/Restaurant_ranking.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Restaurant -PageRank/Restaurant_ranking.ipynb
--------------------------------------------------------------------------------
/Restaurant -PageRank/file:
--------------------------------------------------------------------------------
1 |
2 |
--------------------------------------------------------------------------------
/Review generation with HMM/data_HMM.npy:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Review generation with HMM/data_HMM.npy
--------------------------------------------------------------------------------
/Review generation with HMM/task_03_hidden_markov_model.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Review generation with HMM/task_03_hidden_markov_model.ipynb
--------------------------------------------------------------------------------
/Spectral clustering/Spectral Clustering.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Spectral clustering/Spectral Clustering.ipynb
--------------------------------------------------------------------------------
/Spectral clustering/data.zip:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PiotrTa/Mining-Massive-Datasets/HEAD/Spectral clustering/data.zip
--------------------------------------------------------------------------------