├── .gitbook
│   └── assets
│       ├── dbx_cred_passthrough.png
│       ├── glow_ref_arch_genomics.png
│       ├── mountblob_1.png
│       ├── mountblob_2.png
│       ├── sparkitecture.png
│       └── sparkitecture_logo.png
├── .github
│   ├── FUNDING.yml
│   └── ISSUE_TEMPLATE
│       ├── bug_report.md
│       └── feature_request.md
├── .gitignore
├── LICENSE
├── README.md
├── SUMMARY.md
├── _config.yml
├── bioinformatics-and-genomics
│   └── glow.md
├── cloud-service-integration
│   ├── azure-data-factory.md
│   ├── azure-sql-data-warehouse.md
│   └── azure-storage.md
├── data-preparation
│   ├── other-common-tasks.md
│   ├── reading-and-writing-data.md
│   └── shaping-data-with-pipelines.md
├── img
│   ├── Sparkitecture.ai
│   ├── Sparkitecture.pdf
│   ├── Sparkitecture.svg
│   ├── Sparkitecture_logo.png
│   ├── Sparkitecture_star.png
│   ├── logo.pptx
│   ├── mountblob_1.png
│   └── mountblob_2.png
├── machine-learning
│   ├── about-spark-mllib.md
│   ├── classification
│   │   ├── README.md
│   │   ├── decision-tree.md
│   │   ├── gradient-boosted-trees.md
│   │   ├── logistic-regression.md
│   │   ├── naive-bayes.md
│   │   └── random-forest.md
│   ├── feature-importance.md
│   ├── mlflow.md
│   ├── model-evaluation.md
│   ├── model-saving-and-loading.md
│   └── regression
│       ├── README.md
│       ├── decision-tree.md
│       ├── gradient-boosted-trees.md
│       ├── linear-regression.md
│       └── random-forest.md
├── natural-language-processing
│   ├── data-preparation.md
│   └── model-evaluation.md
├── operationalization
│   ├── api-serving.md
│   └── batch-scoring.md
└── streaming-data
    └── structured-streaming.md

/.gitbook/assets/dbx_cred_passthrough.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.gitbook/assets/dbx_cred_passthrough.png
--------------------------------------------------------------------------------
/.gitbook/assets/glow_ref_arch_genomics.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.gitbook/assets/glow_ref_arch_genomics.png
--------------------------------------------------------------------------------
/.gitbook/assets/mountblob_1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.gitbook/assets/mountblob_1.png
--------------------------------------------------------------------------------
/.gitbook/assets/mountblob_2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.gitbook/assets/mountblob_2.png
--------------------------------------------------------------------------------
/.gitbook/assets/sparkitecture.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.gitbook/assets/sparkitecture.png
--------------------------------------------------------------------------------
/.gitbook/assets/sparkitecture_logo.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.gitbook/assets/sparkitecture_logo.png
--------------------------------------------------------------------------------
/.github/FUNDING.yml:
--------------------------------------------------------------------------------
custom: https://www.buymeacoffee.com/colbyford
--------------------------------------------------------------------------------
/.github/ISSUE_TEMPLATE/bug_report.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.github/ISSUE_TEMPLATE/bug_report.md
--------------------------------------------------------------------------------
/.github/ISSUE_TEMPLATE/feature_request.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.github/ISSUE_TEMPLATE/feature_request.md
--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/.gitignore
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/README.md
--------------------------------------------------------------------------------
/SUMMARY.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/SUMMARY.md
--------------------------------------------------------------------------------
/_config.yml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/_config.yml
--------------------------------------------------------------------------------
/bioinformatics-and-genomics/glow.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/bioinformatics-and-genomics/glow.md
--------------------------------------------------------------------------------
/cloud-service-integration/azure-data-factory.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/cloud-service-integration/azure-data-factory.md
--------------------------------------------------------------------------------
/cloud-service-integration/azure-sql-data-warehouse.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/cloud-service-integration/azure-sql-data-warehouse.md
--------------------------------------------------------------------------------
/cloud-service-integration/azure-storage.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/cloud-service-integration/azure-storage.md
--------------------------------------------------------------------------------
/data-preparation/other-common-tasks.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/data-preparation/other-common-tasks.md
--------------------------------------------------------------------------------
/data-preparation/reading-and-writing-data.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/data-preparation/reading-and-writing-data.md
--------------------------------------------------------------------------------
/data-preparation/shaping-data-with-pipelines.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/data-preparation/shaping-data-with-pipelines.md
--------------------------------------------------------------------------------
/img/Sparkitecture.ai:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/Sparkitecture.ai
--------------------------------------------------------------------------------
/img/Sparkitecture.pdf:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/Sparkitecture.pdf
--------------------------------------------------------------------------------
/img/Sparkitecture.svg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/Sparkitecture.svg
--------------------------------------------------------------------------------
/img/Sparkitecture_logo.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/Sparkitecture_logo.png
--------------------------------------------------------------------------------
/img/Sparkitecture_star.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/Sparkitecture_star.png
--------------------------------------------------------------------------------
/img/logo.pptx:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/logo.pptx
--------------------------------------------------------------------------------
/img/mountblob_1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/mountblob_1.png
--------------------------------------------------------------------------------
/img/mountblob_2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/img/mountblob_2.png
--------------------------------------------------------------------------------
/machine-learning/about-spark-mllib.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/about-spark-mllib.md
--------------------------------------------------------------------------------
/machine-learning/classification/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/classification/README.md
--------------------------------------------------------------------------------
/machine-learning/classification/decision-tree.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/classification/decision-tree.md
--------------------------------------------------------------------------------
/machine-learning/classification/gradient-boosted-trees.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/classification/gradient-boosted-trees.md
--------------------------------------------------------------------------------
/machine-learning/classification/logistic-regression.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/classification/logistic-regression.md
--------------------------------------------------------------------------------
/machine-learning/classification/naive-bayes.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/classification/naive-bayes.md
--------------------------------------------------------------------------------
/machine-learning/classification/random-forest.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/classification/random-forest.md
--------------------------------------------------------------------------------
/machine-learning/feature-importance.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/feature-importance.md
--------------------------------------------------------------------------------
/machine-learning/mlflow.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/mlflow.md
--------------------------------------------------------------------------------
/machine-learning/model-evaluation.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/model-evaluation.md
--------------------------------------------------------------------------------
/machine-learning/model-saving-and-loading.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/model-saving-and-loading.md
--------------------------------------------------------------------------------
/machine-learning/regression/README.md:
--------------------------------------------------------------------------------
# Regression
--------------------------------------------------------------------------------
/machine-learning/regression/decision-tree.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/regression/decision-tree.md
--------------------------------------------------------------------------------
/machine-learning/regression/gradient-boosted-trees.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/regression/gradient-boosted-trees.md
--------------------------------------------------------------------------------
/machine-learning/regression/linear-regression.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/regression/linear-regression.md
--------------------------------------------------------------------------------
/machine-learning/regression/random-forest.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/machine-learning/regression/random-forest.md
--------------------------------------------------------------------------------
/natural-language-processing/data-preparation.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/natural-language-processing/data-preparation.md
--------------------------------------------------------------------------------
/natural-language-processing/model-evaluation.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/natural-language-processing/model-evaluation.md
--------------------------------------------------------------------------------
/operationalization/api-serving.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/operationalization/api-serving.md
--------------------------------------------------------------------------------
/operationalization/batch-scoring.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/operationalization/batch-scoring.md
--------------------------------------------------------------------------------
/streaming-data/structured-streaming.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/colbyford/sparkitecture/HEAD/streaming-data/structured-streaming.md
--------------------------------------------------------------------------------