├── .gitignore ├── Install Spark On Mac.txt ├── LICENSE ├── README.md ├── data └── filter.txt ├── images ├── Top20ClosestWordsToChritstmas.png ├── Top30WordToChristmasVis.png ├── pyspark-shell.png ├── w2v-ibm-design.png └── w2v-visual-via-pca.png ├── ml-scripts ├── .ipynb_checkpoints │ └── Word2Vec with Tweets-checkpoint.ipynb ├── README.md ├── Word2Vec with Tweets.ipynb ├── w2vAndKmeans.py ├── word2vecUtilities.py └── word2vecUtilities.pyc └── mllib-scripts ├── .ipynb_checkpoints └── Word2Vec with Twitter Data using Spark RDDs-checkpoint.ipynb ├── README.md ├── Word2Vec with Twitter Data using Spark RDDs.ipynb ├── cluster-words.py ├── dist-to-words.py ├── tweets-to-w2v.py └── visualize-words.py /.gitignore: -------------------------------------------------------------------------------- 1 | *.npy 2 | *.gz 3 | -------------------------------------------------------------------------------- /Install Spark On Mac.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/Install Spark On Mac.txt -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/README.md -------------------------------------------------------------------------------- /data/filter.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/data/filter.txt -------------------------------------------------------------------------------- /images/Top20ClosestWordsToChritstmas.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/images/Top20ClosestWordsToChritstmas.png -------------------------------------------------------------------------------- /images/Top30WordToChristmasVis.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/images/Top30WordToChristmasVis.png -------------------------------------------------------------------------------- /images/pyspark-shell.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/images/pyspark-shell.png -------------------------------------------------------------------------------- /images/w2v-ibm-design.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/images/w2v-ibm-design.png -------------------------------------------------------------------------------- /images/w2v-visual-via-pca.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/images/w2v-visual-via-pca.png -------------------------------------------------------------------------------- /ml-scripts/.ipynb_checkpoints/Word2Vec with Tweets-checkpoint.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/ml-scripts/.ipynb_checkpoints/Word2Vec with Tweets-checkpoint.ipynb -------------------------------------------------------------------------------- /ml-scripts/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/ml-scripts/README.md -------------------------------------------------------------------------------- /ml-scripts/Word2Vec with Tweets.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/ml-scripts/Word2Vec with Tweets.ipynb -------------------------------------------------------------------------------- /ml-scripts/w2vAndKmeans.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/ml-scripts/w2vAndKmeans.py -------------------------------------------------------------------------------- /ml-scripts/word2vecUtilities.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/ml-scripts/word2vecUtilities.py -------------------------------------------------------------------------------- /ml-scripts/word2vecUtilities.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/ml-scripts/word2vecUtilities.pyc -------------------------------------------------------------------------------- /mllib-scripts/.ipynb_checkpoints/Word2Vec with Twitter Data using Spark RDDs-checkpoint.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/mllib-scripts/.ipynb_checkpoints/Word2Vec with Twitter Data using Spark RDDs-checkpoint.ipynb -------------------------------------------------------------------------------- /mllib-scripts/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/mllib-scripts/README.md -------------------------------------------------------------------------------- /mllib-scripts/Word2Vec with Twitter Data using Spark RDDs.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/mllib-scripts/Word2Vec with Twitter Data using Spark RDDs.ipynb -------------------------------------------------------------------------------- /mllib-scripts/cluster-words.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/mllib-scripts/cluster-words.py -------------------------------------------------------------------------------- /mllib-scripts/dist-to-words.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/mllib-scripts/dist-to-words.py -------------------------------------------------------------------------------- /mllib-scripts/tweets-to-w2v.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/mllib-scripts/tweets-to-w2v.py -------------------------------------------------------------------------------- /mllib-scripts/visualize-words.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/castanan/w2v/HEAD/mllib-scripts/visualize-words.py --------------------------------------------------------------------------------