├── .gitignore ├── CCA175 Scala ├── 02 Data Processing - Overview.ipynb ├── 04 Processing Column Data.ipynb └── 05 Basic Transformations.ipynb ├── Data Engineering Bootcamp ├── 30 Basics of Programming using Python │ ├── .idea │ │ └── workspace.xml │ ├── 01 Getting Started.ipynb │ ├── 02 Basic Programming Constructs.ipynb │ ├── 03 All about Functions.ipynb │ ├── 04 Collections and Tuples.ipynb │ ├── 05 Processing Collections using Loops.ipynb │ ├── 06 Development of Map Reduce APIs.ipynb │ ├── 07 Data Processing using Pandas.ipynb │ ├── 08 Data Visualization.ipynb │ ├── 09 Database Programming.ipynb │ ├── 09 Overview of Docker for Development.md │ ├── 10 Exception Handling and Logging.ipynb │ ├── 11 01 Web Scraping using BeautifulSoup.ipynb │ ├── 11 02 Solution - Web Scraping using BeautifulSoup.ipynb │ ├── 11 Exercise - Web Scraping - Airports Data.html │ ├── 21 Downloading Stock Data.ipynb │ └── 99 Dates.ipynb ├── 40 Big Data ecosystem - Overview │ ├── 01 Big Data Cluster Management Tools – Overview.ipynb │ ├── 02 Understanding Big Data File Systems - HDFS and DBFS.ipynb │ ├── 03 Yet Another Resource Negotiator (YARN).ipynb │ └── 04 Setup Development Environment.md └── 46 Apache Spark using Python │ ├── 00 Quick Overview of Pyspark.ipynb │ ├── 00 Setup Development Environment.md │ ├── 01 Getting Started.ipynb │ ├── 02 Quick Recap of Python.ipynb │ ├── 03 Data Processing - Overview.ipynb │ ├── 04 Processing Column Data.ipynb │ ├── 05 Basic Transformations.ipynb │ ├── 06 Joining Data Sets.ipynb │ ├── 07 Windowing Functions.ipynb │ ├── 08 Spark Metastore.ipynb │ ├── 09 Spark Metastore (Contd).ipynb │ └── requirements.txt ├── README.md └── starterkits └── spark ├── python ├── Python - 01 Quick Recap.ipynb └── Python - 02 Getting Started with Spark.ipynb └── scala ├── Scala - 01 Quick Recap.ipynb └── Scala - 02 Getting Started with Spark.ipynb /.gitignore: -------------------------------------------------------------------------------- 1 | .ipynb_checkpoints/ 2 | itversity-books-env 3 | -------------------------------------------------------------------------------- /CCA175 Scala/02 Data Processing - Overview.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/CCA175 Scala/02 Data Processing - Overview.ipynb -------------------------------------------------------------------------------- /CCA175 Scala/04 Processing Column Data.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/CCA175 Scala/04 Processing Column Data.ipynb -------------------------------------------------------------------------------- /CCA175 Scala/05 Basic Transformations.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/CCA175 Scala/05 Basic Transformations.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/.idea/workspace.xml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/.idea/workspace.xml -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/01 Getting Started.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/01 Getting Started.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/02 Basic Programming Constructs.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/02 Basic Programming Constructs.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/03 All about Functions.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/03 All about Functions.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/04 Collections and Tuples.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/04 Collections and Tuples.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/05 Processing Collections using Loops.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/05 Processing Collections using Loops.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/06 Development of Map Reduce APIs.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/06 Development of Map Reduce APIs.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/07 Data Processing using Pandas.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/07 Data Processing using Pandas.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/08 Data Visualization.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/08 Data Visualization.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/09 Database Programming.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/09 Database Programming.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/09 Overview of Docker for Development.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/09 Overview of Docker for Development.md -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/10 Exception Handling and Logging.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/10 Exception Handling and Logging.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/11 01 Web Scraping using BeautifulSoup.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/11 01 Web Scraping using BeautifulSoup.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/11 02 Solution - Web Scraping using BeautifulSoup.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/11 02 Solution - Web Scraping using BeautifulSoup.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/11 Exercise - Web Scraping - Airports Data.html: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/11 Exercise - Web Scraping - Airports Data.html -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/21 Downloading Stock Data.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/21 Downloading Stock Data.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/30 Basics of Programming using Python/99 Dates.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/30 Basics of Programming using Python/99 Dates.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/40 Big Data ecosystem - Overview/01 Big Data Cluster Management Tools – Overview.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/40 Big Data ecosystem - Overview/01 Big Data Cluster Management Tools – Overview.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/40 Big Data ecosystem - Overview/02 Understanding Big Data File Systems - HDFS and DBFS.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/40 Big Data ecosystem - Overview/02 Understanding Big Data File Systems - HDFS and DBFS.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/40 Big Data ecosystem - Overview/03 Yet Another Resource Negotiator (YARN).ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/40 Big Data ecosystem - Overview/03 Yet Another Resource Negotiator (YARN).ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/40 Big Data ecosystem - Overview/04 Setup Development Environment.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/40 Big Data ecosystem - Overview/04 Setup Development Environment.md -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/00 Quick Overview of Pyspark.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/00 Quick Overview of Pyspark.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/00 Setup Development Environment.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/00 Setup Development Environment.md -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/01 Getting Started.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/01 Getting Started.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/02 Quick Recap of Python.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/02 Quick Recap of Python.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/03 Data Processing - Overview.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/03 Data Processing - Overview.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/04 Processing Column Data.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/04 Processing Column Data.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/05 Basic Transformations.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/05 Basic Transformations.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/06 Joining Data Sets.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/06 Joining Data Sets.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/07 Windowing Functions.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/07 Windowing Functions.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/08 Spark Metastore.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/08 Spark Metastore.ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/09 Spark Metastore (Contd).ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/Data Engineering Bootcamp/46 Apache Spark using Python/09 Spark Metastore (Contd).ipynb -------------------------------------------------------------------------------- /Data Engineering Bootcamp/46 Apache Spark using Python/requirements.txt: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/README.md -------------------------------------------------------------------------------- /starterkits/spark/python/Python - 01 Quick Recap.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/starterkits/spark/python/Python - 01 Quick Recap.ipynb -------------------------------------------------------------------------------- /starterkits/spark/python/Python - 02 Getting Started with Spark.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/starterkits/spark/python/Python - 02 Getting Started with Spark.ipynb -------------------------------------------------------------------------------- /starterkits/spark/scala/Scala - 01 Quick Recap.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/starterkits/spark/scala/Scala - 01 Quick Recap.ipynb -------------------------------------------------------------------------------- /starterkits/spark/scala/Scala - 02 Getting Started with Spark.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dgadiraju/itversity-books/HEAD/starterkits/spark/scala/Scala - 02 Getting Started with Spark.ipynb --------------------------------------------------------------------------------