├── README.md └── .gitignore /README.md: -------------------------------------------------------------------------------- 1 | # E-commerce Recommendation engine 2 | 3 | 4 | This repository contains the code for basic kind of E-commerce recommendation engine. By using the concept of TF-IDF and cosine similarity, we have built this recommendation engine. 5 | 6 | ## Data 7 | For this project we are using [this dataset](https://www.kaggle.com/cclark/product-item-data/data). This dataset contains data from the 500 actual SKUs from an outdoor apparel brand's product catalog. 8 | 9 | ## Dependencies and Installation 10 | We need the following dependencies. 11 | 12 | * Pandas: `sudo pip install pandas` 13 | * Sci-kit Learn (sklearn): `sudo pip install -U scikit-learn` 14 | * Jupyter Notebook: `$ sudo pip install jupyter ` 15 | 16 | ## Usage 17 | You need to run the jupyter notebook `Basic_content_based_recommendation_system.ipynb` 18 | 19 | ## credit 20 | The code of is this repository is inspired by the [Groveco's](https://github.com/groveco/content-engine) content based recommendation engine. -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- 1 | # Created by .ignore support plugin (hsz.mobi) 2 | ### Python template 3 | # Byte-compiled / optimized / DLL files 4 | __pycache__/ 5 | *.py[cod] 6 | *$py.class 7 | 8 | # C extensions 9 | *.so 10 | 11 | # Distribution / packaging 12 | .Python 13 | build/ 14 | develop-eggs/ 15 | dist/ 16 | downloads/ 17 | eggs/ 18 | .eggs/ 19 | lib/ 20 | lib64/ 21 | parts/ 22 | sdist/ 23 | var/ 24 | wheels/ 25 | *.egg-info/ 26 | .installed.cfg 27 | *.egg 28 | MANIFEST 29 | .idea/ 30 | 31 | # PyInstaller 32 | # Usually these files are written by a python script from a template 33 | # before PyInstaller builds the exe, so as to inject date/other infos into it. 34 | *.manifest 35 | *.spec 36 | 37 | # Installer logs 38 | pip-log.txt 39 | pip-delete-this-directory.txt 40 | 41 | # Unit test / coverage reports 42 | htmlcov/ 43 | .tox/ 44 | .coverage 45 | .coverage.* 46 | .cache 47 | nosetests.xml 48 | coverage.xml 49 | *.cover 50 | .hypothesis/ 51 | 52 | # Translations 53 | *.mo 54 | *.pot 55 | 56 | # Django stuff: 57 | *.log 58 | .static_storage/ 59 | .media/ 60 | local_settings.py 61 | 62 | # Flask stuff: 63 | instance/ 64 | .webassets-cache 65 | 66 | # Scrapy stuff: 67 | .scrapy 68 | 69 | # Sphinx documentation 70 | docs/_build/ 71 | 72 | # PyBuilder 73 | target/ 74 | 75 | # Jupyter Notebook 76 | .ipynb_checkpoints 77 | 78 | # pyenv 79 | .python-version 80 | 81 | # celery beat schedule file 82 | celerybeat-schedule 83 | 84 | # SageMath parsed files 85 | *.sage.py 86 | 87 | # Environments 88 | .env 89 | .venv 90 | env/ 91 | venv/ 92 | ENV/ 93 | env.bak/ 94 | venv.bak/ 95 | 96 | # Spyder project settings 97 | .spyderproject 98 | .spyproject 99 | 100 | # Rope project settings 101 | .ropeproject 102 | 103 | # mkdocs documentation 104 | /site 105 | 106 | # mypy 107 | .mypy_cache/ 108 | 109 | --------------------------------------------------------------------------------