├── README.md ├── assets ├── chapter1_preview.jpg ├── contents.jpg ├── db-logo.png ├── diagrams.jpg ├── experience.jpg ├── insightful_markdown.JPG ├── jobs_view.JPG ├── mode diagrams-client mode.png ├── mode diagrams-cluster mode.png ├── mode diagrams-no spark application.png ├── scenario_based_excercise.JPG ├── solutions.JPG ├── step1.png ├── step2.png └── step3.png └── dbc └── databricks-spark-certification.dbc /README.md: -------------------------------------------------------------------------------- 1 | # A Guide to Databricks Spark Certification 2 | 3 | [![Stars](https://img.shields.io/github/stars/Realsid/databricks-spark-certification?style=social)](https://github.com/Realsid/databricks-spark-certification) 4 |

5 | 6 |

7 | 8 | Welcome to guide to databricks spark certification ! This repository will help you: 9 | 1. Learn about [Apache Spark framework](https://spark.apache.org/) 10 | 2. Learn to use [pyspark](https://spark.apache.org/docs/latest/api/python/index.html) in databricks enviroment 11 | 3. Learn about the topics that are required study for the clear [CRT020](https://academy.databricks.com/category/certifications) examination 12 | 13 | Throughout the guide more emphesis will be given to a code first methodology with minimal theory when covering topics. Following are the pre-requisites to start using this guide: 14 | 15 | 1. Basic knowledge of python 16 | 2. Basic knowledge of SQL 17 | 2. [Databricks community edition account](https://community.cloud.databricks.com/) 18 | 19 | Once you setup the account download the .DBC files and upload it to your databricks account as shown below: 20 | 21 | 1.Log into databricks community edition and click on import![](https://github.com/Realsid/databricks-spark-certification/blob/master/assets/step1.png) 22 | 2.Click on file and browse![](https://github.com/Realsid/databricks-spark-certification/blob/master/assets/step2.png) 23 | 3. Upload the databricks-spark-certification.dbc file![](https://github.com/Realsid/databricks-spark-certification/blob/master/assets/step3.png) 24 | 25 | This guide is suplemented with a [google sheet](https://docs.google.com/spreadsheets/d/1QOymDzCGG0LgzoFbSdwMPMEfysVwQs2oyTiIagSBSKM/edit?usp=sharing) where you can find topic wise breakup of material provided in the guide. 26 | 27 | # Snippets 28 | 29 | ## Contents 30 | 31 | >

32 | > 33 | >

34 | 35 | ## Chapters 36 | 37 | >

38 | > 39 | >

40 | 41 | ## Diagrams 42 | 43 | >

44 | > 45 | >

46 | 47 | ## Insightful Markdown 48 | 49 | >

50 | > 51 | >

52 | 53 | ## Tricks and tips 54 | 55 | >

56 | > 57 | >

58 | 59 | ## Scenario based exercises 60 | 61 | >

62 | > 63 | >

64 | 65 | ## Solutions 66 | 67 | >

68 | > 69 | >

70 | -------------------------------------------------------------------------------- /assets/chapter1_preview.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/chapter1_preview.jpg -------------------------------------------------------------------------------- /assets/contents.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/contents.jpg -------------------------------------------------------------------------------- /assets/db-logo.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/db-logo.png -------------------------------------------------------------------------------- /assets/diagrams.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/diagrams.jpg -------------------------------------------------------------------------------- /assets/experience.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/experience.jpg -------------------------------------------------------------------------------- /assets/insightful_markdown.JPG: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/insightful_markdown.JPG -------------------------------------------------------------------------------- /assets/jobs_view.JPG: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/jobs_view.JPG -------------------------------------------------------------------------------- /assets/mode diagrams-client mode.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/mode diagrams-client mode.png -------------------------------------------------------------------------------- /assets/mode diagrams-cluster mode.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/mode diagrams-cluster mode.png -------------------------------------------------------------------------------- /assets/mode diagrams-no spark application.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/mode diagrams-no spark application.png -------------------------------------------------------------------------------- /assets/scenario_based_excercise.JPG: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/scenario_based_excercise.JPG -------------------------------------------------------------------------------- /assets/solutions.JPG: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/solutions.JPG -------------------------------------------------------------------------------- /assets/step1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/step1.png -------------------------------------------------------------------------------- /assets/step2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/step2.png -------------------------------------------------------------------------------- /assets/step3.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/assets/step3.png -------------------------------------------------------------------------------- /dbc/databricks-spark-certification.dbc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Realsid/databricks-spark-certification/d8509b2162029f480e8daf480809a161925c3afc/dbc/databricks-spark-certification.dbc --------------------------------------------------------------------------------