├── Classification Using Pyspark_Home_Quote - v3.ipynb
├── Classification_Using _Pyspark.py
├── Image
    ├── Decision_Tree.png
    ├── Decision_Tree_Gini_LogLoss.png
    ├── Decision_Tree_ROC.png
    ├── Decision_Tree_confusion_matrix.png
    ├── Decision_Tree_ev1.png
    ├── EDA1.jpg
    ├── EDA2.jpg
    ├── EDA3.jpg
    ├── EDA4.jpg
    ├── EDA5.png
    ├── EDA6.png
    ├── Random_Forest.png
    ├── Random_Forest_Gini_LogLoss.png
    ├── Random_Forest_ROC.png
    ├── Random_Forest_confusion_matrix.png
    ├── Random_Forest_ev1.png
    ├── call_function_feature_engineering.png
    ├── call_insignificant_categories_function.jpg
    ├── callfunction_compare_categorical_variables.jpg
    ├── check_data.png
    ├── check_missing_values.png
    ├── check_missing_values2.png
    ├── check_missing_values3.png
    ├── define_categorical_numerical_variables1.png
    ├── define_categorical_numerical_variables2.png
    ├── feature_engineering.png
    ├── feature_engineering2.png
    ├── function_compare_categorical_variables.jpg
    ├── gradient_boosting.png
    ├── gradient_boosting_Gini_LogLoss.png
    ├── gradient_boosting_ROC.png
    ├── gradient_boosting_ROC_confusion_matrix.png
    ├── gradient_boosting_confusion_matrix.png
    ├── gradient_boosting_ev1.png
    ├── handle_missing_values.jpg
    ├── handle_missing_values2.jpg
    ├── handle_outlier.png
    ├── handle_outlier2.png
    ├── handle_outlier3.png
    ├── hyper_parameter_Random_Forest.png
    ├── hyper_parameter_tuning_DecisionTree.png
    ├── hyper_parameter_tuning_GradientBoost.png
    ├── hyper_parameter_tuning_LogisticRegression.png
    ├── implement_to_data_test.png
    ├── implement_to_data_test2.png
    ├── insignificant_categories_function.jpg
    ├── insignificant_categories_function3.jpg
    ├── insignificant_categories_function4.jpg
    ├── load_dataset_function.png
    ├── load_libraries.png
    ├── logistic_regression.png
    ├── logistic_regression_Gini_LogLoss.png
    ├── logistic_regression_ROC.png
    ├── logistic_regression_ROC_confusion_matrix.png
    ├── logistic_regression_confusion_matrix.png
    ├── logistic_regression_ev1.png
    ├── split_data_train.png
    └── test.txt
├── README.md
├── my_submission.csv
├── my_submission2.csv
└── sample_submission.csv

/Classification Using Pyspark_Home_Quote - v3.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Classification Using Pyspark_Home_Quote - v3.ipynb
--------------------------------------------------------------------------------
/Classification_Using _Pyspark.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Classification_Using _Pyspark.py
--------------------------------------------------------------------------------
/Image/Decision_Tree.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Decision_Tree.png
--------------------------------------------------------------------------------
/Image/Decision_Tree_Gini_LogLoss.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Decision_Tree_Gini_LogLoss.png
--------------------------------------------------------------------------------
/Image/Decision_Tree_ROC.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Decision_Tree_ROC.png
--------------------------------------------------------------------------------
/Image/Decision_Tree_confusion_matrix.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Decision_Tree_confusion_matrix.png
--------------------------------------------------------------------------------
/Image/Decision_Tree_ev1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Decision_Tree_ev1.png
--------------------------------------------------------------------------------
/Image/EDA1.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/EDA1.jpg
--------------------------------------------------------------------------------
/Image/EDA2.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/EDA2.jpg
--------------------------------------------------------------------------------
/Image/EDA3.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/EDA3.jpg
--------------------------------------------------------------------------------
/Image/EDA4.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/EDA4.jpg
--------------------------------------------------------------------------------
/Image/EDA5.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/EDA5.png
--------------------------------------------------------------------------------
/Image/EDA6.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/EDA6.png
--------------------------------------------------------------------------------
/Image/Random_Forest.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Random_Forest.png
--------------------------------------------------------------------------------
/Image/Random_Forest_Gini_LogLoss.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Random_Forest_Gini_LogLoss.png
--------------------------------------------------------------------------------
/Image/Random_Forest_ROC.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Random_Forest_ROC.png
--------------------------------------------------------------------------------
/Image/Random_Forest_confusion_matrix.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Random_Forest_confusion_matrix.png
--------------------------------------------------------------------------------
/Image/Random_Forest_ev1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/Random_Forest_ev1.png
--------------------------------------------------------------------------------
/Image/call_function_feature_engineering.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/call_function_feature_engineering.png
--------------------------------------------------------------------------------
/Image/call_insignificant_categories_function.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/call_insignificant_categories_function.jpg
--------------------------------------------------------------------------------
/Image/callfunction_compare_categorical_variables.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/callfunction_compare_categorical_variables.jpg
--------------------------------------------------------------------------------
/Image/check_data.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/check_data.png
--------------------------------------------------------------------------------
/Image/check_missing_values.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/check_missing_values.png
--------------------------------------------------------------------------------
/Image/check_missing_values2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/check_missing_values2.png
--------------------------------------------------------------------------------
/Image/check_missing_values3.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/check_missing_values3.png
--------------------------------------------------------------------------------
/Image/define_categorical_numerical_variables1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/define_categorical_numerical_variables1.png
--------------------------------------------------------------------------------
/Image/define_categorical_numerical_variables2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/define_categorical_numerical_variables2.png
--------------------------------------------------------------------------------
/Image/feature_engineering.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/feature_engineering.png
--------------------------------------------------------------------------------
/Image/feature_engineering2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/feature_engineering2.png
--------------------------------------------------------------------------------
/Image/function_compare_categorical_variables.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/function_compare_categorical_variables.jpg
--------------------------------------------------------------------------------
/Image/gradient_boosting.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/gradient_boosting.png
--------------------------------------------------------------------------------
/Image/gradient_boosting_Gini_LogLoss.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/gradient_boosting_Gini_LogLoss.png
--------------------------------------------------------------------------------
/Image/gradient_boosting_ROC.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/gradient_boosting_ROC.png
--------------------------------------------------------------------------------
/Image/gradient_boosting_ROC_confusion_matrix.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/gradient_boosting_ROC_confusion_matrix.png
--------------------------------------------------------------------------------
/Image/gradient_boosting_confusion_matrix.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/gradient_boosting_confusion_matrix.png
--------------------------------------------------------------------------------
/Image/gradient_boosting_ev1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/gradient_boosting_ev1.png
--------------------------------------------------------------------------------
/Image/handle_missing_values.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/handle_missing_values.jpg
--------------------------------------------------------------------------------
/Image/handle_missing_values2.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/handle_missing_values2.jpg
--------------------------------------------------------------------------------
/Image/handle_outlier.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/handle_outlier.png
--------------------------------------------------------------------------------
/Image/handle_outlier2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/handle_outlier2.png
--------------------------------------------------------------------------------
/Image/handle_outlier3.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/handle_outlier3.png
--------------------------------------------------------------------------------
/Image/hyper_parameter_Random_Forest.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/hyper_parameter_Random_Forest.png
--------------------------------------------------------------------------------
/Image/hyper_parameter_tuning_DecisionTree.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/hyper_parameter_tuning_DecisionTree.png
--------------------------------------------------------------------------------
/Image/hyper_parameter_tuning_GradientBoost.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/hyper_parameter_tuning_GradientBoost.png
--------------------------------------------------------------------------------
/Image/hyper_parameter_tuning_LogisticRegression.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/hyper_parameter_tuning_LogisticRegression.png
--------------------------------------------------------------------------------
/Image/implement_to_data_test.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/implement_to_data_test.png
--------------------------------------------------------------------------------
/Image/implement_to_data_test2.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/implement_to_data_test2.png
--------------------------------------------------------------------------------
/Image/insignificant_categories_function.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/insignificant_categories_function.jpg
--------------------------------------------------------------------------------
/Image/insignificant_categories_function3.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/insignificant_categories_function3.jpg
--------------------------------------------------------------------------------
/Image/insignificant_categories_function4.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/insignificant_categories_function4.jpg
--------------------------------------------------------------------------------
/Image/load_dataset_function.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/load_dataset_function.png
--------------------------------------------------------------------------------
/Image/load_libraries.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/load_libraries.png
--------------------------------------------------------------------------------
/Image/logistic_regression.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/logistic_regression.png
--------------------------------------------------------------------------------
/Image/logistic_regression_Gini_LogLoss.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/logistic_regression_Gini_LogLoss.png
--------------------------------------------------------------------------------
/Image/logistic_regression_ROC.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/logistic_regression_ROC.png
--------------------------------------------------------------------------------
/Image/logistic_regression_ROC_confusion_matrix.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/logistic_regression_ROC_confusion_matrix.png
--------------------------------------------------------------------------------
/Image/logistic_regression_confusion_matrix.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/logistic_regression_confusion_matrix.png
--------------------------------------------------------------------------------
/Image/logistic_regression_ev1.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/logistic_regression_ev1.png
--------------------------------------------------------------------------------
/Image/split_data_train.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/Image/split_data_train.png
--------------------------------------------------------------------------------
/Image/test.txt:
--------------------------------------------------------------------------------
test
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/README.md
--------------------------------------------------------------------------------
/my_submission.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/my_submission.csv
--------------------------------------------------------------------------------
/my_submission2.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/my_submission2.csv
--------------------------------------------------------------------------------
/sample_submission.csv:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/elsyifa/Classification-Pyspark/HEAD/sample_submission.csv
--------------------------------------------------------------------------------