├── Chapter 1 ├── Ch 1 Exercise.ipynb └── Chapter 1.ipynb ├── Chapter 2 ├── Ch 2 Probability.ipynb └── Ch_2_Exercise.ipynb ├── Chapter 3 ├── Ch 3 Descriptive and Inferential Statistics.ipynb └── Ch 3 Exercise.ipynb ├── Chapter 4 ├── Ch 4 Exercise.ipynb └── Ch 4 Linear Algebra.ipynb ├── Chapter 5 ├── Chapter 5 Exercise.ipynb └── Chapter 5.ipynb ├── Chapter 6 ├── Ch 6.ipynb └── Chapter 6 Exercise.ipynb └── README.md /Chapter 1/Ch 1 Exercise.ipynb: -------------------------------------------------------------------------------- 1 | {"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Ch 1: Exercise.ipynb","provenance":[],"authorship_tag":"ABX9TyOM76TTaSZgQH7orcGacZS8"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"markdown","source":["$\\text {1. Is the value 62.6738 rational or irrational? Why or why not?}$\n","\n","Ans: It is a rational number. A number is rational when:\n","* It can be written as a ratio $p/q$ of two integers, where $q \\neq 0$.\n","* Equivalently, its decimal form either terminates or repeats.\n","\n","62.6738 has four decimal places, so it can be written as a ratio of integers:\n","\n","\\begin{equation}\n"," 62.6738 = \\frac{p}{q} = \\frac{626738}{10000}\n","\\end{equation}"],"metadata":{"id":"QCOu6O6BzBgl"}},{"cell_type":"markdown","source":["$\\text {2. 
Evaluate expression} \\, {10^710^{-5}}$"],"metadata":{"id":"Lemu7y1V2F3c"}},{"cell_type":"code","execution_count":1,"metadata":{"id":"ZhXoW4tgysnE","executionInfo":{"status":"ok","timestamp":1655892106967,"user_tz":-360,"elapsed":10,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}}},"outputs":[],"source":["from sympy import *"]},{"cell_type":"code","source":["x = symbols('x')\n","expr2=(x**7)*(x**-5)\n","expr2.subs(x,10)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":37},"id":"DB2kxlKV3CEr","executionInfo":{"status":"ok","timestamp":1655892168083,"user_tz":-360,"elapsed":3,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"45cb6d03-fe13-44a3-c0d5-574903569e28"},"execution_count":3,"outputs":[{"output_type":"execute_result","data":{"text/plain":["100"],"text/latex":"$\\displaystyle 100$"},"metadata":{},"execution_count":3}]},{"cell_type":"markdown","source":["$\\text {3. Evaluate expression} \\, {81^{1/2}}$"],"metadata":{"id":"jjk07Tjd3cgy"}},{"cell_type":"code","source":["expr3 = x**(1/2)\n","expr3.subs(x,81)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":37},"id":"mfVgk6UD3Rd3","executionInfo":{"status":"ok","timestamp":1655892420322,"user_tz":-360,"elapsed":6,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"426f439d-a8e4-45e4-f386-e32636e3d275"},"execution_count":7,"outputs":[{"output_type":"execute_result","data":{"text/plain":["9.00000000000000"],"text/latex":"$\\displaystyle 9.0$"},"metadata":{},"execution_count":7}]},{"cell_type":"markdown","source":["$\\text {4. 
Evaluate expression} \\, {25^{\\frac 32}}$"],"metadata":{"id":"u_K6ubUU39d3"}},{"cell_type":"code","source":["expr4=x**(3/2)\n","expr4.subs(x,25)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":37},"id":"Y1Gwrxss305K","executionInfo":{"status":"ok","timestamp":1655892438954,"user_tz":-360,"elapsed":378,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"f4ab348d-8c73-47f6-f901-5051b50c1231"},"execution_count":9,"outputs":[{"output_type":"execute_result","data":{"text/plain":["125.000000000000"],"text/latex":"$\\displaystyle 125.0$"},"metadata":{},"execution_count":9}]},{"cell_type":"markdown","source":["$\\text {5. Assuming no payments are made, how much would a \\$1,000 be worth at 5% interest compounded monthly after 3 years?}$\n","Ans:\n"," Say- \n"," * A = balance\n"," * P = Starting investment($1000)\n"," * r = Interest rate (5%)\n"," * t = Number of years (3)\n"," * n = Periods (12)\n"," \\begin{equation} A = P × (1+ \\frac {r}{n})^{nt} \\end{equation}\n"," \\begin{equation} A = 1000 × (1+ \\frac {0.05}{12})^{12×3} = ? \\end{equation}\n"],"metadata":{"id":"bhORmnCV4qUP"}},{"cell_type":"code","source":["p, r , n, t = symbols('p r n t')\n","a = p * (1+(r/n))**(n*t)\n","a.evalf(subs={p:1000,r:5/100, n:12, t:3})"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":37},"id":"eDihiLUC4VN7","executionInfo":{"status":"ok","timestamp":1655893207611,"user_tz":-360,"elapsed":369,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"afc8f09d-2c88-40ce-92d5-a04f45ddde4e"},"execution_count":16,"outputs":[{"output_type":"execute_result","data":{"text/plain":["1161.47223133347"],"text/latex":"$\\displaystyle 1161.47223133347$"},"metadata":{},"execution_count":16}]},{"cell_type":"markdown","source":["$\\text {6. 
Assuming no payments are made, how much would a \\$1,000 be worth at 5% interest compounded continuously after 3 years?}$\n","Ans:\n"," Say- \n"," * A = balance\n"," * P = Starting investment($1000)\n"," * r = Interest rate (5%)\n"," * t = Number of years (3)\n"," * n = Periods (12)\n","\\begin{equation} A = P × e^{rt} \\\\ A = 1000 × e^{0.05×3} = ? \\end{equation}"],"metadata":{"id":"wcIqnwXL_D91"}},{"cell_type":"code","source":["a = p*exp(r*t)\n","a.evalf(subs={p:1000,r:5/100,t:3})"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":37},"id":"y4cdg5J0_R4L","executionInfo":{"status":"ok","timestamp":1655894330651,"user_tz":-360,"elapsed":354,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"53668741-157d-45f7-cda1-1782e0f374ef"},"execution_count":29,"outputs":[{"output_type":"execute_result","data":{"text/plain":["1161.83424272828"],"text/latex":"$\\displaystyle 1161.83424272828$"},"metadata":{},"execution_count":29}]},{"cell_type":"markdown","source":["$\\text{7. For the function }f(x) = 3x^2+1, \\text{what is the slope at x=3?}$"],"metadata":{"id":"kH-J7tt97vBe"}},{"cell_type":"code","source":["f, x = symbols('f x')\n","f = 3*(x**2)+1\n","dx_f=diff(f)\n","dx_f.subs(x,3)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":37},"id":"4ZC4YgOL6RRz","executionInfo":{"status":"ok","timestamp":1655894487376,"user_tz":-360,"elapsed":358,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"5b31865f-b60f-41e0-f490-6262898928d9"},"execution_count":31,"outputs":[{"output_type":"execute_result","data":{"text/plain":["18"],"text/latex":"$\\displaystyle 18$"},"metadata":{},"execution_count":31}]},{"cell_type":"markdown","source":["$\\text{8. 
For the function }f(x) = 3x^2+1, \\text{what is the area under the curve for } x \\text{ between 0 and 2?}$"],"metadata":{"id":"uAIyK-ZJ9Wv_"}},{"cell_type":"code","source":["x = symbols('x')\n","area = integrate(f,(x,0,2)) # f is the same as problem 7\n","area"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":37},"id":"2VW8dBuU8TgM","executionInfo":{"status":"ok","timestamp":1655894063393,"user_tz":-360,"elapsed":8,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"87532923-808a-411e-a394-ecefef5d7a4c"},"execution_count":28,"outputs":[{"output_type":"execute_result","data":{"text/plain":["10"],"text/latex":"$\\displaystyle 10$"},"metadata":{},"execution_count":28}]}]} -------------------------------------------------------------------------------- /Chapter 2/Ch 2 Probability.ipynb: -------------------------------------------------------------------------------- 1 | {"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Ch 2: Probability.ipynb","provenance":[],"authorship_tag":"ABX9TyPGaQXbcNbQYVbBF93Io+bi"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"markdown","source":["# Most people take probability for granted, but in reality it is more nuanced and complicated."],"metadata":{"id":"D76YOvAeChza"}},{"cell_type":"markdown","source":["A report says that 85% of Cancer patients were Coffee drinkers. 
\n","\n","Say, in the US, Cancer patients make up 0.5% of the population\n","and about 65% of the population drinks Coffee.\n","\n","The report gives the probability of drinking Coffee given Cancer. What we actually want is the reverse: the probability of Cancer given that someone drinks Coffee.\n","\n","Let's say, \n","\n","\\begin{align} P(Coffee|Cancer) = 0.85 \\\\ P(Coffee) = 0.65 \\\\ P(Cancer) = 0.005 \\\\ P(Cancer|Coffee) = ?\\end{align}\n","\n","Applying Bayes' Theorem we get:\n","\n","\\begin{equation}\n","P(Cancer|Coffee)= \\frac {P(Coffee|Cancer) × P(Cancer)}{P(Coffee)}\n","\\end{equation}"],"metadata":{"id":"doZJN6VhH6j-"}},{"cell_type":"code","execution_count":1,"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"tfEmVwJvCd-H","executionInfo":{"status":"ok","timestamp":1656061107387,"user_tz":-360,"elapsed":24,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"2617cd73-a951-4a19-de4e-3142f227bb1f"},"outputs":[{"output_type":"execute_result","data":{"text/plain":["0.006538461538461539"]},"metadata":{},"execution_count":1}],"source":["p_coffee_drinker=0.65\n","p_cancer=0.005\n","p_coffee_given_cancer=0.85\n","p_cancer_given_coffee_drinker=(p_coffee_given_cancer*p_cancer)/p_coffee_drinker\n","p_cancer_given_coffee_drinker"]},{"cell_type":"markdown","source":[""],"metadata":{"id":"U2U23akaOluG"}},{"cell_type":"markdown","source":["># **Binomial Distributions**\n","___\n","\n","The binomial distribution measures how likely $k$ successes are out of $n$ trials, given a success probability $p$.\n","\n","\\begin{equation}\n","{n \\choose k}p^k (1-p)^{n-k}\n","\\end{equation}\n","\n"],"metadata":{"id":"nJ-y91fAyLlI"}},{"cell_type":"code","source":["import matplotlib.pyplot as plt\n","from scipy.stats import binom\n","import seaborn as sns\n","import numpy as np"],"metadata":{"id":"HSfBDInFK2TZ","executionInfo":{"status":"ok","timestamp":1656061107390,"user_tz":-360,"elapsed":17,"user":{"displayName":"Ziaul 
Karim","userId":"08392030995732291798"}}},"execution_count":2,"outputs":[]},{"cell_type":"code","source":["n=10\n","p=0.9\n","ks=np.arange(n+1)\n","total=0\n","for k in ks:\n","    probability=binom.pmf(k,n,p)\n","    #print(f\"{k} - {probability}\")\n","    if k<=8:\n","        total+=probability\n","    else:\n","        break\n","print(total) # total probability of 8 or fewer successes"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"wj7MOKOE0A3P","executionInfo":{"status":"ok","timestamp":1656061107391,"user_tz":-360,"elapsed":16,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"64135e0e-cd4c-4200-bfd2-c285b0207657"},"execution_count":3,"outputs":[{"output_type":"stream","name":"stdout","text":["0.2639010708999999\n"]}]},{"cell_type":"markdown","source":["So, there is a 26% chance we would see eight or fewer successes even if the underlying success rate is 90%."],"metadata":{"id":"MOBJhrvI4Xds"}},{"cell_type":"markdown","source":[">## ***Building Binomial Distribution from scratch.***\n","___\n","\n","From equation: \n","\n","\\begin{equation}\n","{n \\choose k}p^k (1-p)^{n-k}\\\\ \\text{Binomial Coefficient: }{n \\choose k} = \\frac{n!}{k!×(n-k)!}\n","\\end{equation}\n","```python\n","def factorial(n: int):\n","    f = 1\n","    for i in range(n):\n","        f*=(i+1)\n","    return f\n","def binomial_coefficient(n: int, k: int):\n","    return factorial(n) // (factorial(k)*factorial(n-k)) # integer division keeps the coefficient exact\n","def binomial_distribution(k: int, n: int, p: float):\n","    return binomial_coefficient(n,k) * (p**k) * (1.0 - p)**(n-k) # the exponent n-k needs parentheses; refer to the equation above\n","```"],"metadata":{"id":"7tucdr4Q48i6"}},{"cell_type":"markdown","source":["># **Beta Distribution**\n","\n","\n","---\n","\n","So far, we've been creating a myriad of distributions to answer the question of whether or not we are going to see 8 successes out of 10 tests in the engine testing model.\n","* What if there are more underlying rates of success that yield 8/10 successes besides 90%?\n","* What if 70% or 30% or 80% underlying 
success rate yields 8/10 success result?\n","* When we fix 8/10 successes, can we explore the probabilities of those probabilities?\n","\n","A simple approach to that is a new type of distribution, _the beta distribution_. It allows us to see the likelihood of different underlying probabilities for an event to occur, given $\\alpha$ successes and $\\beta$ failures.\n","\n"],"metadata":{"id":"s0ut8j2E-XRv"}},{"cell_type":"code","source":["from scipy.stats import beta\n","\n","a=8 # successes\n","b=2 # failures\n","p=beta.cdf(.90, a, b) # probability that the underlying success rate is 90% or less\n","print(p)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"wCjZCB779-vl","executionInfo":{"status":"ok","timestamp":1656061806409,"user_tz":-360,"elapsed":363,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"6130799d-26f3-4682-8634-90730b36dfab"},"execution_count":4,"outputs":[{"output_type":"stream","name":"stdout","text":["0.7748409780000001\n"]}]},{"cell_type":"markdown","source":["So, according to our calculation, there is a 77.48% chance the underlying probability of success is 90% or less.\n","\n","How do we calculate the probability of success being 90% or more? Pretty simple: just subtract it from `1.0`."],"metadata":{"id":"HcvqaEoe-nO1"}},{"cell_type":"code","source":["1.0-p"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"_upmrr1c-cQ9","executionInfo":{"status":"ok","timestamp":1656061967163,"user_tz":-360,"elapsed":333,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"502b8241-8c54-482a-c9ca-e4be8573005b"},"execution_count":7,"outputs":[{"output_type":"execute_result","data":{"text/plain":["0.22515902199999993"]},"metadata":{},"execution_count":7}]},{"cell_type":"markdown","source":["This means there is only a 22.5% chance that the underlying success rate is 90% or higher, and a 77.5% chance that it is less than 90%. 
Could we gamble on that 22.5% chance of a 90% or higher underlying success rate? I don't think so. If we run more tests and end up with 30 successes and 6 failures, then: "],"metadata":{"id":"4c1qfKtP_Msg"}},{"cell_type":"code","source":["a = 30\n","b= 6\n","p= 1.0 - beta.cdf(.90, a, b)\n","print(p)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"kA7YhvQV_MRV","executionInfo":{"status":"ok","timestamp":1656062201502,"user_tz":-360,"elapsed":16,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"b62c4b66-ac6f-4ffe-e3f1-2225d481f603"},"execution_count":8,"outputs":[{"output_type":"stream","name":"stdout","text":["0.13163577484183708\n"]}]},{"cell_type":"markdown","source":["At this point, it might be a good idea to walk away from the tests, unless you want to keep gambling against the 13.16% chance and hope the peak moves to the right. \n","\n","* The question arises: how can we calculate an area in the middle? Say, the chance that the success rate is between 80% and 90%?\n","\n","- Subtract the area to the left of 80% from the area to the left of 90%. "],"metadata":{"id":"cELaYppiABzP"}},{"cell_type":"code","source":["p = beta.cdf(.90, a, b) - beta.cdf(.80, a , b)\n","print(p)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"9lKaC0Zi-eq5","executionInfo":{"status":"ok","timestamp":1656062751094,"user_tz":-360,"elapsed":378,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"317e3e7b-0b59-4ad4-898b-1b051d394606"},"execution_count":9,"outputs":[{"output_type":"stream","name":"stdout","text":["0.5962725311986745\n"]}]},{"cell_type":"markdown","source":["So, the probability that our underlying success rate is between 80% and 90% is 59.6%.\n"],"metadata":{"id":"UP6NtUZmCZ9s"}}]} -------------------------------------------------------------------------------- /Chapter 2/Ch_2_Exercise.ipynb: -------------------------------------------------------------------------------- 1 | 
{"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Ch_2_Exercise.ipynb","provenance":[],"authorship_tag":"ABX9TyNyEOuVsLoW2KrcM7bTqOyI"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"markdown","source":["# **Exercise 1:**\n","\n","There is a 30% chance of rain today, and a 40% chance your umbrella order will arrive on time. You are eager to walk in the rain today and cannot do so without either!\n","\n","What is the probability it will rain AND your umbrella will arrive?\n","\n","\\begin{equation}\n","P(Rain\\space AND\\space Umbrella) = {P(Umbrella)\\times P(Rain)}\n","= 0.4 × 0.3 = 0.12\n","\\end{equation}"],"metadata":{"id":"KIRW5WjyD-Uy"}},{"cell_type":"markdown","source":["# **Exercise 2:**\n","There is a 30% chance of rain today, and a 40% chance your umbrella order will arrive on time. You will be able to run errands only if it doesn't rain or your umbrella arrives.\n","What is the probability it will not rain OR your umbrella arrives?\n","\n","\\begin{align}\n","P(NR \\space \\text{OR} \\space U) = P(NR) + P(U) - P(NR \\space AND \\space U) \\\\\n","P(NR \\space \\text{OR} \\space U) = (1.0 - 0.3) + 0.4 - (0.7 × 0.4) = 0.7 + 0.4 - 0.28 = 0.82\n","\\end{align}\n","\n"],"metadata":{"id":"VFiYo2zFGQ7h"}},{"cell_type":"markdown","source":["# **Exercise 3:**\n","\n","There is a 30% chance of rain today, and a 40% chance your umbrella order will arrive on time.\n","\n","However, you found out that if it rains there is only a 20% chance your umbrella will arrive on time. \n","\n","What is the probability that it will rain AND your umbrella will arrive on time?\n","\n","\\begin{equation}\n","P(rain \\space AND \\space umbrella) = P(Rain) × P(U|Rain) = 0.3 × 0.2 = 0.06\n","\\end{equation}"],"metadata":{"id":"Nkq21jn-KhZA"}},{"cell_type":"markdown","source":["# **Exercise 4:**\n","You have 137 passengers booked on a flight from Las Vegas to Dallas. 
However, it is Las Vegas on a Sunday morning and you estimate each passenger is 40% likely to not show up.\n","\n","You are trying to figure out how many seats to overbook, so the plane doesn't fly empty.\n","\n","How likely is it that at least 50 passengers will not show up?"],"metadata":{"id":"L9NbM5UILr-n"}},{"cell_type":"code","source":["from scipy.stats import binom\n","n= 137\n","p=0.4\n","p_50_or_more_no_shows=0.0\n","for x in range(50,138):\n"," p_50_or_more_no_shows+=binom.pmf(x,n,p)\n","print(p_50_or_more_no_shows)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"2xEQz_UNGHxo","executionInfo":{"status":"ok","timestamp":1656065692341,"user_tz":-360,"elapsed":15,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"032c56c1-72c6-4605-d59e-77bc57c8e2d3"},"execution_count":2,"outputs":[{"output_type":"stream","name":"stdout","text":["0.8220955881474781\n"]}]},{"cell_type":"markdown","source":["# Exercise 5:\n","You flipped a coin 19 times and got heads 15 times and tails 4 times.\n","Do you think this coin has any good probability of being fair? Why or why not?"],"metadata":{"id":"bmEQ131fNlHR"}},{"cell_type":"code","execution_count":4,"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"boGUMy3qD3VC","executionInfo":{"status":"ok","timestamp":1656065993568,"user_tz":-360,"elapsed":361,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"a819d7d1-e9d1-43d6-8f6e-6145e9f3c6f4"},"outputs":[{"output_type":"stream","name":"stdout","text":["0.9962310791015625\n"]}],"source":["from scipy.stats import beta\n","\n","a = 15\n","b = 4\n","\n","dist= 1.0 - beta.cdf(0.5,a,b)\n","print(dist)"]},{"cell_type":"markdown","source":["The coin is very unlikely to be fair. Given 15 heads and 4 tails, the beta distribution puts about 99.6% of the probability on an underlying heads rate above 0.5, and less than 1% on a rate of 0.5 or below. 
"],"metadata":{"id":"DdGuKSHnOt34"}}]} -------------------------------------------------------------------------------- /Chapter 3/Ch 3 Exercise.ipynb: -------------------------------------------------------------------------------- 1 | {"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Ch:3 Exercise.ipynb","provenance":[],"authorship_tag":"ABX9TyOOr4ribZDxXsB6XPHZv1xi"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"markdown","source":["Chapter-3\n","#**Exercises**"],"metadata":{"id":"2Q4x74NK7a2a"}},{"cell_type":"markdown","source":["#### **Ex-1:**\n","---\n","You bought a spool of 1.75 mm filament for your 3D printer. You want to measure how close the filament diameter really is to 1.75 mm. You use a caliper tool and sample the diameter five times on the spool:\n","\n","1.78, 1.75, 1.72, 1.74, 1.77\n","\n","`Calculate the mean and standard deviation for this set of values.`"],"metadata":{"id":"tT7f73En7ilf"}},{"cell_type":"code","execution_count":2,"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"RMkZ9Bf-7H0F","executionInfo":{"status":"ok","timestamp":1657102475481,"user_tz":-360,"elapsed":3,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"abb2f76c-5c0c-4ccb-cc1b-377529ce1625"},"outputs":[{"output_type":"stream","name":"stdout","text":["Mean: 1.752\n","Standard Deviation: 0.02135415650406264\n"]}],"source":["from math import sqrt\n","sample=[1.78, 1.75, 1.72, 1.74, 1.77]\n","\n","def mean(values):\n"," return sum(values)/len(values)\n","def variance_sample(values):\n"," mean = sum(values)/len(values)\n"," var = sum((v-mean)**2 for v in values)/len(values)\n"," return var\n","def std_dev_sample(values):\n"," return sqrt(variance_sample(values))\n","mean = mean(sample)\n","std_dev = std_dev_sample(sample)\n","print(\"Mean:\",mean)\n","print(\"Standard Deviation:\", std_dev)"]},{"cell_type":"markdown","source":["#### 
**Ex-2:**\n","---\n","A manufacturer says Z-Phone smart phone has a mean consumer life of 42 months with a standard deviation of 8 months, Assuming a normal distribution, what is the probability a given random Z-Phone will last between 20 and 30 months?"],"metadata":{"id":"PFG8Fz6T7-gY"}},{"cell_type":"code","source":["from scipy.stats import norm\n","mean = 42\n","std_dev = 8\n","x = norm.cdf(30, mean, std_dev) - norm.cdf(20, mean, std_dev)\n","print(x)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"yehI2Mkf8dDF","executionInfo":{"status":"ok","timestamp":1657102752158,"user_tz":-360,"elapsed":400,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"51083c6f-1cbd-468c-f1ef-8d6d29db14e2"},"execution_count":3,"outputs":[{"output_type":"stream","name":"stdout","text":["0.06382743803380352\n"]}]},{"cell_type":"markdown","source":["#### **Ex-3:**\n","---\n","I am skeptical that my 3D printer filament is not 1.75 mm in average diameter as advertised. I sampled 34 measurements with my tool. The sample mean is 1.715588 and the sample standard deviation is 0.029252. 
\n","\n","What is the 99% confidence interval for the mean of my entire spool of filament?"],"metadata":{"id":"LD7ZLiAc8eTi"}},{"cell_type":"markdown","source":["\\begin{equation}\n","E = ±z_c \\frac{s}{\\sqrt n}\n","\\end{equation}\n","$\\text{Confidence Interval=(sample mean ± }E)$"],"metadata":{"id":"C2CyyuCFHJHG"}},{"cell_type":"code","source":["from math import sqrt\n","from scipy.stats import norm\n","\n","def critical_z_value(p, mean=0.0, std=1.0):\n"," norm_dist=norm(loc=mean, scale=std)\n"," left_area= (1.0-p)/2.0\n"," right_area= 1.0 - ((1.0-p)/2.0)\n"," return norm_dist.ppf(left_area),norm_dist.ppf(right_area)\n","\n","def ci_large_sample(p, sample_mean, sample_std, n):\n"," lower,upper = critical_z_value(p)\n"," lower_ci = lower*(sample_std/sqrt(n)) # lower margin of error\n"," upper_ci = upper*(sample_std/sqrt(n)) # upper margin of error\n"," return sample_mean + lower_ci, sample_mean + upper_ci # mean+margin of error\n","\n","print(ci_large_sample(p=.99, sample_mean=1.715588, sample_std=0.029252, n=34))"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"PuzFhEbp84JU","executionInfo":{"status":"ok","timestamp":1657103547672,"user_tz":-360,"elapsed":353,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"c37ab0d3-3235-4b92-dcd4-0c02e20b10bc"},"execution_count":4,"outputs":[{"output_type":"stream","name":"stdout","text":["(1.7026658973748656, 1.7285101026251342)\n"]}]},{"cell_type":"markdown","source":["#### **Ex-4:**\n","---\n","\n","Your marketing department has started a new advertising campaign and wants to know if it affected sales, which in the past averaged \\$10,345 a day with a standard deviation of \\$552. The new advertising campaign ran for 45 days and averaged \\$11,641 in sales.\n","Did the campaign affect sales? Why or why not? 
(Use a two-tailed test for more reliable significance.)"],"metadata":{"id":"i2TREy-89DKW"}},{"cell_type":"code","source":["from scipy.stats import norm \n","\n","mean = 10345\n","std_dev = 552\n","\n","p1 = 1.0 - norm.cdf(11641,mean, std_dev)\n","\n","p2 = p1\n","\n","p_value = p1+p2\n","print(\"Two tailed P-value\",p_value)\n","if p_value<=0.05:\n"," print(\"Passed!\")\n","else:\n"," print(\"Failed\")"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"lG64Na3w-IDQ","executionInfo":{"status":"ok","timestamp":1657103773671,"user_tz":-360,"elapsed":15,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"f0b10ed0-08ba-4bdc-8425-695418f2192f"},"execution_count":5,"outputs":[{"output_type":"stream","name":"stdout","text":["Two tailed P-value 0.01888333596496139\n","Passed!\n"]}]},{"cell_type":"markdown","source":["Since it passes the two tailed test, we can say that the campaign worked and affected the sales."],"metadata":{"id":"0I-KfpjIFP4p"}}]} 2 | -------------------------------------------------------------------------------- /Chapter 4/Ch 4 Exercise.ipynb: -------------------------------------------------------------------------------- 1 | {"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Ch 4: Exercise.ipynb","provenance":[],"collapsed_sections":[],"authorship_tag":"ABX9TyNcKCl0Ai5vJ7tq+j2nV+r/"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"markdown","source":["####**Imports**"],"metadata":{"id":"8I_ptMAEK3Ep"}},{"cell_type":"code","source":["import numpy as np\n","from numpy import array\n","from numpy.linalg import det,inv"],"metadata":{"id":"j2IByj0WK76r","executionInfo":{"status":"ok","timestamp":1657625603278,"user_tz":-360,"elapsed":408,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}}},"execution_count":5,"outputs":[]},{"cell_type":"markdown","source":["### **Exercise 1**\n","---\n","Vector $\\vec{v}$ has a 
value of $[1,2]$ but then a transformation happens. $\\hat{i}$ lands at $[2,0]$ and $\\hat{j}$ lands at $[0,1.5]$. Where does $\\vec{v}$ land?"],"metadata":{"id":"kxHlXu_gIBVP"}},{"cell_type":"code","execution_count":3,"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"aXFE5aGPH6xW","executionInfo":{"status":"ok","timestamp":1657625481208,"user_tz":-360,"elapsed":11,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"4c759dda-4776-4974-c12c-89c41224e5d2"},"outputs":[{"output_type":"stream","name":"stdout","text":["[2. 3.]\n"]}],"source":["v=array([1,2])\n","i_hat = array([2,0])\n","j_hat = array([0, 1.5])\n","#fix this line \n","basis = array([i_hat, j_hat])\n","\n","# transform vector v into w \n","w = basis.dot(v)\n","print(w)"]},{"cell_type":"markdown","source":["###**Exercise 2**\n","---\n","Vector $\\vec{v}$ has a value of $[1,2]$ but then a transformation happens. $\\hat{i}$ lands at $[-2,1]$ and $\\hat{j}$ lands at $[1,-2]$. Where does $\\vec{v}$ land?"],"metadata":{"id":"lVzNNLIbIr7o"}},{"cell_type":"code","source":["v=array([1,2])\n","i_hat = array([-2,1])\n","j_hat = array([1, -2])\n","#fix this line \n","basis = array([i_hat, j_hat])\n","\n","# transform vector v into w \n","w = basis.dot(v)\n","print(w)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"ChiFTrtxJbqx","executionInfo":{"status":"ok","timestamp":1657625489758,"user_tz":-360,"elapsed":1039,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"5c83d470-3e07-42dd-9dd1-a33844e57bf5"},"execution_count":4,"outputs":[{"output_type":"stream","name":"stdout","text":["[ 0 -3]\n"]}]},{"cell_type":"markdown","source":["#### **Exercise 3**\n","---\n","A transformation $\\hat{i}$ lands at $[1,0]$ and $\\hat{j}$ lands at $[2,2]$. 
What is the determinant of this transformation?"],"metadata":{"id":"X9LSR-3eIvtD"}},{"cell_type":"code","source":["i_hat = array([1,0])\n","j_hat = array([2,2])\n","basis = array([i_hat, j_hat]).transpose()\n","determinant = det(basis)\n","print(determinant)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"WePh_-XqJcKx","executionInfo":{"status":"ok","timestamp":1657625617696,"user_tz":-360,"elapsed":3,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"ee743588-a619-4ffe-fe6f-4dc3b4502ac3"},"execution_count":6,"outputs":[{"output_type":"stream","name":"stdout","text":["2.0\n"]}]},{"cell_type":"markdown","source":["#### **Exercise 4**\n","---\n","Can two or more linear transformations be done in single linear transformation? Why or why not? "],"metadata":{"id":"qI4MkX5mJivH"}},{"cell_type":"markdown","source":["Yes, because matrix multiplication allows us to combine several matrices into a single matrix representing one consolidated transformation."],"metadata":{"id":"QBs2vap4MCFy"}},{"cell_type":"markdown","source":["#### **Exercise 5**\n","---\n","Solve the system equations for x,y and z:\n","\\begin{equation}\n","3x + 1y + 0z = 54\\\\\n","2x + 4y + 1z = 12\\\\\n","3x + 1y + 8z = 6\n","\\end{equation}"],"metadata":{"id":"3OSgftBoJwNb"}},{"cell_type":"code","source":["A = array([\n"," [3,1,0],\n"," [2,4,1],\n"," [3,1,8]\n","])\n","B = array([\n"," 54,\n"," 12,\n"," 6\n","]) \n","X = inv(A).dot(B)\n","print(X)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"3mlE2tO1KLze","executionInfo":{"status":"ok","timestamp":1657625807435,"user_tz":-360,"elapsed":433,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"226ac1de-9865-44e4-b46c-7296c31d54f8"},"execution_count":7,"outputs":[{"output_type":"stream","name":"stdout","text":["[19.8 -5.4 -6. ]\n"]}]},{"cell_type":"markdown","source":["#### **Exercise 6**\n","---\n","Is the following matrix linearly dependent? 
Why or why not?\n","\\begin{bmatrix}2\\,\\,\\,\\,1\\\\6\\,\\,\\,\\,3\\end{bmatrix}"],"metadata":{"id":"ihUY7C1lKYDF"}},{"cell_type":"code","source":["i_hat = array([2,6])\n","j_hat = array([1,3])\n","basis = array([i_hat, j_hat]).transpose()\n","determinant = det(basis)\n","print(determinant)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"aK5rKOGlKs_c","executionInfo":{"status":"ok","timestamp":1657625864241,"user_tz":-360,"elapsed":4,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"8381518d-a0c5-4e40-ad52-b343878f215d"},"execution_count":8,"outputs":[{"output_type":"stream","name":"stdout","text":["0.0\n"]}]}]} -------------------------------------------------------------------------------- /Chapter 4/Ch 4 Linear Algebra.ipynb: -------------------------------------------------------------------------------- 1 | {"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Ch:4 Linear Algebra.ipynb","provenance":[],"authorship_tag":"ABX9TyNM3eCYkPgNoq64+ghaEWOd"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"markdown","source":["Chapter 4\n","# **Linear Algebra**\n"],"metadata":{"id":"nY0K7Hartvpp"}},{"cell_type":"markdown","source":["#### What is a Vector?\n","\n","Simply put, a vector is an arrow in space with a specific direction and length, often representing a piece of data. It has no concept of location, so always imagine its tail starts at the origin of a Cartesian plane (0,0). A vector $v$ is denoted like this: $\\vec{v}$\n"],"metadata":{"id":"S2oOq9z7t5tZ"}},{"cell_type":"markdown","source":["To declare a vector, you can use NumPy's array() function and pass a collection of numbers to it as below: \n","\n"],"metadata":{"id":"PMzSH03k2-Iz"}},{"cell_type":"code","execution_count":null,"metadata":{"id":"rUTLlPH5tRxt","colab":{"base_uri":"https://localhost:8080/"},"executionInfo":{"status":"ok","timestamp":1657183952765,"user_tz":-360,"elapsed":3,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"4c1d3b29-2bf7-419d-cf01-ff4285be4077"},"outputs":[{"output_type":"stream","name":"stdout","text":["[3 2]\n"]}],"source":["import numpy as np\n","v= np.array([3,2])\n","print(v)"]},{"cell_type":"markdown","source":["A three dimensional vector:\n","$\\vec{v} = \\begin{bmatrix} x\\\\y\\\\z \\end{bmatrix} = \\begin{bmatrix} 4\\\\1\\\\2 \\end{bmatrix} $\n","\n","can be expressed in Python like this:"],"metadata":{"id":"OW9sH90P3C-d"}},{"cell_type":"code","source":["v=np.array([4,1,2])\n","print(v)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"ZUh8B0lW3Bbb","executionInfo":{"status":"ok","timestamp":1657184325041,"user_tz":-360,"elapsed":1857,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"bf507643-876e-49b3-e30e-35b2ead87b54"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[4 1 2]\n"]}]},{"cell_type":"markdown","source":["#### **Adding and Combining vectors**\n","\n","If we have two vectors $\\vec{v}$ and $\\vec{w}$, how do we add these vectors?\n","\n","$\\vec{v}=\\begin{bmatrix}3\\\\2\\end{bmatrix}$\n","\n","$\\vec{w}=\\begin{bmatrix}2\\\\-1\\end{bmatrix}$\n","\n","$\\vec{v}+\\vec{w}=\\begin{bmatrix}3+2\\\\2+(-1)\\end{bmatrix}=\\begin{bmatrix}5\\\\1\\end{bmatrix}$\n","\n","In Python:"],"metadata":{"id":"z2ki13KQ4jjb"}},{"cell_type":"code","source":["from numpy import array\n","v = array([3, 2])\n","w = array([2,-1])\n","v_plus_w = v + 
w\n","print(v_plus_w)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"qpoh8f5x4co7","executionInfo":{"status":"ok","timestamp":1657184704022,"user_tz":-360,"elapsed":540,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"b0870ee7-2a56-4167-add2-c7c310d19c80"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[5 1]\n"]}]},{"cell_type":"markdown","source":["Vector addition is commutative: $\\vec{v}+\\vec{w} = \\vec{w}+\\vec{v}$"],"metadata":{"id":"dTRiLW5S6iVf"}},{"cell_type":"markdown","source":["#### **Scaling Vectors**\n","Scaling is growing or shrinking a vector's length. You can grow or shrink a vector by multiplying it by a single value, known as a $scalar$.\n","\n","$\\vec{v}=\\begin{bmatrix}3\\\\2\\end{bmatrix}$\n","\n","$2\\vec{v}=\\begin{bmatrix}3×2\\\\2×2\\end{bmatrix} = \\begin{bmatrix}6\\\\4\\end{bmatrix}$"],"metadata":{"id":"0jS4Dsne7KD-"}},{"cell_type":"code","source":["from numpy import array\n","v=array([3,2])\n","scaled_v= 2*v\n","print(scaled_v)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"RQTHgNXR58uo","executionInfo":{"status":"ok","timestamp":1657185217709,"user_tz":-360,"elapsed":596,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"eab21ccf-6d93-4386-ec46-09ef02b323c0"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[6 4]\n"]}]},{"cell_type":"markdown","source":["Scaling a vector by a positive number doesn't change its direction. But when you multiply a vector by a negative number, it flips the direction of the vector, as shown in the image.\n","\n","
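To make the flip concrete, here is a small sketch that assumes the same example vector [3, 2] and an arbitrarily chosen scalar of -1.5. The helper name cross_2d is hypothetical; it computes the 2D cross product, which is zero exactly when two vectors lie on the same line through the origin.

```python
import numpy as np

v = np.array([3, 2])
flipped = -1.5 * v  # negative scalar: reverses the direction and stretches by 1.5

def cross_2d(a, b):
    # 2D cross product; a result of zero means a and b are collinear
    return a[0] * b[1] - a[1] * b[0]

print(flipped)
print(cross_2d(v, flipped))  # 0.0
```

The zero cross product is a quick numerical check that the scaled vector never leaves the line the original vector sits on.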

\n","\n","The flipped vector still lies on the same line, however. This leads to a key concept called linear dependence."],"metadata":{"id":"-0xwZlA78LQW"}},{"cell_type":"markdown","source":["#### **Span and Linear Dependence**\n","\n","These two operations, adding and scaling vectors, bring about the idea that we can combine two vectors and scale them to create any resulting vector we want.\n","$\\vec{v}+\\vec{w}=\\overrightarrow{v+w}$\n","\n","Again, $\\vec{v}$ and $\\vec{w}$ are fixed in direction, except for flipping with negative scalars, but we can scale each of them freely and add the results to create any vector $\\overrightarrow{v+w}$. This whole space of possible vectors is called the $span$. When two vectors point in different directions and do not lie on the same line, they are $\\textit{linearly independent}$ and have this unlimited span.\n","\n","We are only limited in span when the two vectors lie on the same line, whether they point in the same or in opposite directions. No matter how we scale or combine them, the resulting vector always lies on that same line. This makes them $\\textit{linearly dependent}.$"],"metadata":{"id":"ZTpcWHhQ9Zyf"}},{"cell_type":"markdown","source":["### **Linear Transformations**\n","\n","#### **Basis Vectors**\n","\n","Imagine we have two simple vectors $\\hat{i}\\,\\text{and}\\,\\hat{j}$. These are known as basis vectors, which are used to describe transformations on other vectors. They typically have a length of 1 and point in perpendicular positive directions.\n","Think of the basis vectors as building blocks to build or transform any vector. 
Our basis vectors are expressed as a 2 × 2 matrix, where the first column is $\\hat{i}$ and the second column is $\\hat{j}$:\n","\n","$\\hat{i}$ = $\\begin{bmatrix}1\\\\0\\end{bmatrix}$\n","\n","$\\hat{j}$ = $\\begin{bmatrix}0\\\\1\\end{bmatrix}$\n","\n","$\\text{basis}$ = $\\begin{bmatrix}1\\,\\,\\,\\,\\,\\,0 \\\\0\\,\\,\\,\\,\\,\\,1\\end{bmatrix}$\n","\n","$\\vec{v}=\\hat{i}+\\hat{j}$\n","\n"],"metadata":{"id":"ktUixtNU_2xs"}},{"cell_type":"markdown","source":["#### **Matrix Vector Multiplication**\n","\n","The formula to transform a vector $\\vec{v}$ given basis vectors $\\hat{i}$ and $\\hat{j}$ packaged as a matrix is:\n","\n","$\\begin{bmatrix}x_{new}\\\\y_{new}\\end{bmatrix} = \\begin{bmatrix}a\\,\\,\\,b\\\\c\\,\\,\\,d \\end{bmatrix}\\begin{bmatrix}x\\\\y\\end{bmatrix}=\\begin{bmatrix}ax+by\\\\cx+dy\\end{bmatrix}$"],"metadata":{"id":"AkrxVIHOD-0m"}},{"cell_type":"code","source":["from numpy import array \n","basis = array([[3,0],\n"," [0,2]])\n","v=array([1,1])\n","new_v=basis.dot(v)\n","print(new_v)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"GQcE9Nfb76VG","executionInfo":{"status":"ok","timestamp":1657187705408,"user_tz":-360,"elapsed":529,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"17d610f1-8909-4e08-d690-22cea3f3992f"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[3 2]\n"]}]},{"cell_type":"code","source":["i_hat = array([2,0])\n","j_hat = array([0,3])\n","#convert rows into columns\n","basis = array([i_hat,j_hat]).transpose()\n","v = array([2,1])\n","\n","new_v=basis.dot(v)\n","print(new_v)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"TQo0GqqtFZFv","executionInfo":{"status":"ok","timestamp":1657187872720,"user_tz":-360,"elapsed":649,"user":{"displayName":"Ziaul 
Karim","userId":"08392030995732291798"}},"outputId":"a7db9424-5111-4b6a-85b2-8502f72af162"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[4 3]\n"]}]},{"cell_type":"markdown","source":["#### **Matrix Multiplication**\n","\n","Here is how we apply a rotation and then shear to any vector $\\vec{v}$ with value $[x,y]$:\n","\n","$\\begin{bmatrix}1\\,\\,\\,\\,1\\\\0\\,\\,\\,\\,1\\end{bmatrix}$$\\begin{bmatrix}0\\,\\,\\,\\,-1\\\\1\\,\\,\\,\\,\\,\\,\\,\\,\\,\\,0\\end{bmatrix}$$\\begin{bmatrix}x\\\\y\\end{bmatrix}$ - - - - - - - - - - - - -(1)\n","\n","$\\begin{bmatrix}a\\,\\,\\,\\,b\\\\c\\,\\,\\,\\,d\\end{bmatrix}$$\\begin{bmatrix}e\\,\\,\\,\\,f\\\\g\\,\\,\\,\\,h\\end{bmatrix}$= $\\begin{bmatrix}ae+bg\\,\\,\\,\\,\\,\\,af+bh\\\\ce+dg\\,\\,\\,\\,\\,\\,cf+dh\\end{bmatrix}$ - - - - - - - - - -(2)\n","\n","\n","Applying (2) on (1) we get:\n","\n","$\\begin{bmatrix}1\\,\\,\\,-1\\\\1\\,\\,\\,\\,\\,\\,\\,\\,\\,0\\end{bmatrix}$$\\begin{bmatrix}x\\\\y\\end{bmatrix}$\n"],"metadata":{"id":"jkxrQMiRq1bu"}},{"cell_type":"code","source":["from numpy import array\n","i_hat1 = array([0,1])\n","j_hat1=array([-1,0])\n","transform1=array([i_hat1, j_hat1]).transpose()\n","i_hat2 = array([1,0])\n","j_hat2 = array([1,1])\n","transform2=array([i_hat2, j_hat2]).transpose()\n","\n","#combine transformations\n","combined = transform2 @ transform1 # @ is the shorthand for matmul()\n","\n","#test\n","print(f\"Combined Matrix:\\n{combined}\")"],"metadata":{"id":"EwQElYqPGCdd","colab":{"base_uri":"https://localhost:8080/"},"executionInfo":{"status":"ok","timestamp":1657282543245,"user_tz":-360,"elapsed":8,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"77e6b3e3-4736-46d7-b75a-12bd049ef8a4"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["Combined Matrix:\n","[[ 1 -1]\n"," [ 1 0]]\n"]}]},{"cell_type":"markdown","source":["##### **Using dot() vs matmul() vs @**\n","Usually you want to prefer matmul() 
and its shorthand @ to combine matrices rather than dot() in NumPy. The former generally has a preferable policy for higher-dimensional arrays and for how the elements are broadcast."],"metadata":{"id":"MgRfnd1BwKsC"}},{"cell_type":"code","source":["from numpy import matmul\n","v = array([1,2])\n","print(combined.dot(v))\n","print(combined @ v)\n","print(matmul(combined,v))"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"S2gNCY_cutse","executionInfo":{"status":"ok","timestamp":1657282788178,"user_tz":-360,"elapsed":378,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"0e25e7b6-1f8a-45c3-83f6-17736d263e47"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[-1 1]\n","[-1 1]\n","[-1 1]\n"]}]},{"cell_type":"markdown","source":["Note that we could also have applied each transformation individually to vector $\\vec{v}$:"],"metadata":{"id":"JbSL7RHJwvSp"}},{"cell_type":"code","source":["rotated = transform1.dot(v)\n","sheared = transform2.dot(rotated)\n","print(sheared)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"tr26uRnyvQ-B","executionInfo":{"status":"ok","timestamp":1657283241229,"user_tz":-360,"elapsed":388,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"d1528799-3faf-4a1c-c8c4-10c1c56205c8"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[-1 1]\n"]}]},{"cell_type":"markdown","source":["Remember, the order of transformations matters!\n","If we flip the order as below, the result changes:"],"metadata":{"id":"cADMVBfzypUL"}},{"cell_type":"code","source":["combined = transform1 @ transform2 # shear and then rotate\n","print(combined.dot(v))"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"U6X2S80yx1_T","executionInfo":{"status":"ok","timestamp":1657283512891,"user_tz":-360,"elapsed":9,"user":{"displayName":"Ziaul 
Karim","userId":"08392030995732291798"}},"outputId":"cb30018b-3cda-44b2-928e-55bff1c9f58b"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[-2 3]\n"]}]},{"cell_type":"code","source":["transform2 #shear"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"wTDWGJ6vy4ZB","executionInfo":{"status":"ok","timestamp":1657283586347,"user_tz":-360,"elapsed":12,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"40573178-0922-44a3-ff8d-75b4dfee307c"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":["array([[1, 1],\n"," [0, 1]])"]},"metadata":{},"execution_count":14}]},{"cell_type":"code","source":["transform1 #rotation "],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"pmdMyENuzJ0Z","executionInfo":{"status":"ok","timestamp":1657283632598,"user_tz":-360,"elapsed":7,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"707f7e99-dbd7-492e-cba8-54ad036df3a8"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":["array([[ 0, -1],\n"," [ 1, 0]])"]},"metadata":{},"execution_count":15}]},{"cell_type":"markdown","source":["#### **Determinants**\n","\n","The determinant is the factor by which a sampled area of the vector space is expanded or squished by a linear transformation.\n","\n","If $\\hat{i},\\hat{j}$ are scaled to become $2\\hat{i},3\\hat{j}$, a unit area grows by a factor of $2×3$, so the determinant is $6$.\n","\n","Determinants describe how much a sampled area in a vector space changes in scale under a linear transformation, and this can provide helpful information about the transformation."],"metadata":{"id":"w8OPXUFAzt6S"}},{"cell_type":"code","source":["from numpy.linalg import det\n","from numpy import array \n","i_hat = array([3,0])\n","j_hat = array([0,2])\n","basis = array([i_hat,j_hat]).transpose()\n","determinant=det(basis)\n","print(determinant) 
"],"metadata":{"id":"A0fJw1Y1zVnv","colab":{"base_uri":"https://localhost:8080/"},"executionInfo":{"status":"ok","timestamp":1657287376521,"user_tz":-360,"elapsed":15,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"476be603-20e8-4d44-f003-caa590d4db52"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["6.0\n"]}]},{"cell_type":"markdown","source":["Simple shears and rotations should not affect the determinant, as the area will not change."],"metadata":{"id":"yUGOmzMMCAr4"}},{"cell_type":"code","source":["from numpy.linalg import det\n","from numpy import array \n","i_hat = array([1,0])\n","j_hat = array([1,1])\n","basis = array([i_hat,j_hat]).transpose()\n","determinant=det(basis)\n","print(determinant) "],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"YAEZ7lDFBnaF","executionInfo":{"status":"ok","timestamp":1657287543208,"user_tz":-360,"elapsed":389,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"4d0d3604-86c3-4e1d-cfef-520ca81273fd"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["1.0\n"]}]},{"cell_type":"markdown","source":["Scaling, however, will increase or decrease the determinant, and a negative determinant means the orientation of the space has flipped:"],"metadata":{"id":"Coet7IRwCThN"}},{"cell_type":"code","source":["from numpy.linalg import det\n","from numpy import array \n","i_hat = array([-2,1])\n","j_hat = array([1,2])\n","basis = array([i_hat,j_hat]).transpose()\n","determinant=det(basis)\n","print(determinant) "],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"N5QlJAgPBxwi","executionInfo":{"status":"ok","timestamp":1657287607338,"user_tz":-360,"elapsed":7,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"1a6352bb-0252-40a5-a93a-927efc90fd3b"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["-5.000000000000001\n"]}]},{"cell_type":"markdown","source":["The most critical piece of information 
that a determinant gives us is whether the transformation is linearly dependent. If you have a determinant of 0, that means all of the space has been squished into a lesser dimension."],"metadata":{"id":"_EpAenOvCvIL"}},{"cell_type":"code","source":["from numpy.linalg import det\n","from numpy import array \n","i_hat = array([-2,1])\n","j_hat = array([3,-1.5])\n","basis = array([i_hat,j_hat]).transpose()\n","determinant=det(basis)\n","print(determinant) "],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"rBiAZmotCf_m","executionInfo":{"status":"ok","timestamp":1657287909675,"user_tz":-360,"elapsed":7,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"57c9aa62-13d6-426f-a4e8-0ed91a9eca84"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["0.0\n"]}]},{"cell_type":"markdown","source":["#### **Systems of Equations and Inverse Matrices**"],"metadata":{"id":"33Gb-fWyEAKA"}},{"cell_type":"markdown","source":["Below are the operations for these equations:\n","\\begin{equation}\n","4x + 2y + 4z = 44\\\\\n","5x + 3y + 7z = 56\\\\ \n","9x + 3y + 6z = 72\\\\\n","\\end{equation}"],"metadata":{"id":"jcmA2CHCEQth"}},{"cell_type":"code","source":["from sympy import *\n","A = Matrix([\n"," [4,2,4],\n"," [5,3,7],\n"," [9,3,6]\n"," ])\n","\n","inverse = A.inv() #inverse matrix calculations\n","identity = inverse*A #identity matrix calculations \n","print(f\"Inverse:{inverse}\")\n","print(f\"Identity:{identity}\")"],"metadata":{"id":"FJIzyGz3DshB","colab":{"base_uri":"https://localhost:8080/"},"executionInfo":{"status":"ok","timestamp":1657623347455,"user_tz":-360,"elapsed":1066,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"59708188-0cf8-42af-cb39-1570506ad995"},"execution_count":1,"outputs":[{"output_type":"stream","name":"stdout","text":["Inverse:Matrix([[-1/2, 0, 1/3], [11/2, -2, -4/3], [-2, 1, 1/3]])\n","Identity:Matrix([[1, 0, 0], [0, 1, 0], [0, 0, 
1]])\n"]}]},{"cell_type":"code","source":["from numpy import array \n","from numpy.linalg import inv \n","A = array([[4,2,4],\n"," [5,3,7],\n"," [9,3,6]])\n","B= array([\n"," 44,\n"," 56,\n"," 72\n","])\n","X = inv(A).dot(B)\n","print(X)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"yU9iewsmDMGI","executionInfo":{"status":"ok","timestamp":1657623509489,"user_tz":-360,"elapsed":371,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"a773505f-83a4-43ba-ac7a-e9877b0d6240"},"execution_count":3,"outputs":[{"output_type":"stream","name":"stdout","text":["[ 2. 34. -8.]\n"]}]},{"cell_type":"code","source":["from sympy import * \n","A = Matrix([[4,2,4],\n"," [5,3,7],\n"," [9,3,6]])\n","B= Matrix([\n"," 44,\n"," 56,\n"," 72\n","])\n","X = A.inv() * B \n","print(X)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"OYxz-WdqDzac","executionInfo":{"status":"ok","timestamp":1657623579882,"user_tz":-360,"elapsed":404,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"64faea3d-439e-4c65-d224-63c5bce14445"},"execution_count":4,"outputs":[{"output_type":"stream","name":"stdout","text":["Matrix([[2], [34], [-8]])\n"]}]},{"cell_type":"markdown","source":["#### **Eigenvectors and Eigenvalues**\n","Matrix decomposition is breaking up a matrix into its basic components.\n","If we have a square matrix $A$, it has the following eigenvalue equation:\n","\n","$Av=λv$\n","\n","If $A$ is the original matrix, it is composed of eigenvector $v$ and eigenvalue $λ$. 
There is one eigenvector and eigenvalue for each dimension of the parent matrix and not all matrices can be decomposed into eigenvectors and eigenvalues."],"metadata":{"id":"PmHYrRcZFTDb"}},{"cell_type":"code","source":["from numpy import array,diag \n","from numpy.linalg import eig,inv \n","\n","A = array([\n"," [1,2],\n"," [4,5]\n"," ])\n","eigenvals, eigenvecs = eig(A)\n","print(\"Eigenvalues:\")\n","print(eigenvals)\n","print(\"Eigenvectors:\")\n","print(eigenvecs)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"O0P6ggb2EIWU","executionInfo":{"status":"ok","timestamp":1657624274296,"user_tz":-360,"elapsed":373,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"a58bfd34-3162-42d9-e190-8508ea793b3d"},"execution_count":7,"outputs":[{"output_type":"stream","name":"stdout","text":["Eigenvalues:\n","[-0.46410162 6.46410162]\n","Eigenvectors:\n","[[-0.80689822 -0.34372377]\n"," [ 0.59069049 -0.9390708 ]]\n"]}]},{"cell_type":"markdown","source":["Now we can rebuild the matrix from the eigenvalues and eigenvectors:"],"metadata":{"id":"w8ORN5oBG-Dt"}},{"cell_type":"code","source":["Q = eigenvecs \n","R = inv(Q)\n","\n","L = diag(eigenvals)\n","B = Q @ L @ R \n","print(B)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"sw-iF1sNGuVL","executionInfo":{"status":"ok","timestamp":1657624427395,"user_tz":-360,"elapsed":5,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"584c1d51-e9f6-4b89-bf45-478c09071470"},"execution_count":8,"outputs":[{"output_type":"stream","name":"stdout","text":["[[1. 2.]\n"," [4. 
5.]]\n"]}]}]} -------------------------------------------------------------------------------- /Chapter 5/Chapter 5 Exercise.ipynb: -------------------------------------------------------------------------------- 1 | {"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Chapter 5: Exercise.ipynb","provenance":[],"collapsed_sections":[],"authorship_tag":"ABX9TyMh6ScZ1Ir56R6dZM2JrZMI"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"}},"cells":[{"cell_type":"code","execution_count":19,"metadata":{"id":"phhXQI_nj6IS","executionInfo":{"status":"ok","timestamp":1659076030918,"user_tz":-360,"elapsed":516,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}}},"outputs":[],"source":["import pandas as pd\n","import matplotlib.pyplot as plt\n","from sklearn.linear_model import LinearRegression \n","from sklearn.model_selection import KFold, cross_val_score\n","from scipy.stats import t\n","from math import sqrt \n","import seaborn as sns"]},{"cell_type":"markdown","source":["#### **Read the dataset**"],"metadata":{"id":"Y5HFeui1kS6X"}},{"cell_type":"code","source":["link='http://bit.ly/3C8JzrM'\n","dataset=pd.read_csv(link)"],"metadata":{"id":"w5p40Wy1kRSV","executionInfo":{"status":"ok","timestamp":1659075498582,"user_tz":-360,"elapsed":1264,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}}},"execution_count":6,"outputs":[]},{"cell_type":"code","source":["dataset.head()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":206},"id":"4y0ra5dmkhZ8","executionInfo":{"status":"ok","timestamp":1659075500326,"user_tz":-360,"elapsed":5,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"a26a3fb4-a993-4667-d15e-ec52044d8264"},"execution_count":7,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" x y\n","0 1 -13.115843\n","1 2 25.806547\n","2 3 -5.017285\n","3 4 20.256415\n","4 5 4.075003"],"text/html":["\n","
\n"," "]},"metadata":{},"execution_count":7}]},{"cell_type":"markdown","source":["#### **Exercise 1**:\n","Perform a simple linear regression to find the $m$ and $b$ values that minimize the loss (sum of squares)."],"metadata":{"id":"9Yr5_jPCkrBR"}},{"cell_type":"code","source":["df = pd.read_csv(link,delimiter=',')\n","\n","X = df.values[:,:-1]\n","# the target is the last column (y); [:,:-1] would select x again\n","Y = df.values[:,-1]\n","fit = LinearRegression().fit(X,Y)\n","\n","m = fit.coef_.flatten()\n","b = fit.intercept_.flatten()\n","\n","print(f\"m: {m}\")\n","print(f\"b: {b}\")\n","\n","plt.plot(X,Y,'.')\n","plt.plot(X,m*X+b)\n","plt.legend(['points','line'])\n","plt.show()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":300},"id":"OSkWblhjkmPY","executionInfo":{"status":"ok","timestamp":1659075924564,"user_tz":-360,"elapsed":1356,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"b4995db4-8602-4c93-ce9b-3fee8f32d874"},"execution_count":18,"outputs":[{"output_type":"stream","name":"stdout","text":["m: [1.75919315]\n","b: [4.69359655]\n"]},{"output_type":"display_data","data":{"text/plain":["
"],"image/png":"\n"},"metadata":{"needs_background":"light"}}]},{"cell_type":"markdown","source":["#### **Exercise 2**:\n","Calculate the correlation coefficient and statistical significance of this data (at 95% confidence). Is the correlation useful?"],"metadata":{"id":"_oMjMwXYk676"}},{"cell_type":"code","source":["data = pd.read_csv(link,delimiter=',')\n","correlations=data.corr(method='pearson')\n","sns.heatmap(data=correlations,annot=True,cmap='YlGn_r')"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":286},"id":"F-AbC3oklF3x","executionInfo":{"status":"ok","timestamp":1659076037781,"user_tz":-360,"elapsed":1725,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"90d36961-c455-473b-8f3a-2100e7151f18"},"execution_count":20,"outputs":[{"output_type":"execute_result","data":{"text/plain":[""]},"metadata":{},"execution_count":20},{"output_type":"display_data","data":{"text/plain":["
"],"image/png":"\n"},"metadata":{"needs_background":"light"}}]},{"cell_type":"code","source":["n = df.shape[0]\n","lower_cv=t(n-1).ppf((1-0.95)/2)\n","upper_cv=t(n-1).ppf((1+0.95)/2)\n","r = correlations[\"y\"][\"x\"]\n","test_value = r/sqrt((1 - r**2)/(n-2))\n","print(f\"TEST VALUE:{test_value}\")\n","print(f\"CRITICAL RANGE:{lower_cv,upper_cv}\")\n","\n","# reject H0 when the test value falls outside the critical range\n","if test_value < lower_cv or test_value > upper_cv:\n"," print(\"CORRELATION PROVEN, REJECT H0\")\n","else:\n"," print(\"CORRELATION NOT PROVEN, FAILED TO REJECT H0\")\n","\n","if test_value > 0:\n"," p_value = 1.0 - t(n-1).cdf(test_value)\n","else:\n"," p_value = t(n-1).cdf(test_value)\n","#two tailed test so multiply by 2\n","p_value=p_value*2 \n","print(f\"P-VALUE:{p_value}\")"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"5WA9y6Luo9-4","executionInfo":{"status":"ok","timestamp":1659076388595,"user_tz":-360,"elapsed":518,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"dc6e6876-2941-4674-f907-435ea2121607"},"execution_count":25,"outputs":[{"output_type":"stream","name":"stdout","text":["TEST VALUE:23.835515323677328\n","CRITICAL RANGE:(-1.984467454426692, 1.984467454426692)\n","CORRELATION PROVEN, REJECT H0\n","P-VALUE:0.0\n"]}]},{"cell_type":"markdown","source":["#### **Exercise 3**:\n","\n","If I predict where x = 50, what is the 95% prediction interval for the predicted value of y?"],"metadata":{"id":"W4YFxMlolS8T"}},{"cell_type":"code","source":["import pandas as pd\n","from scipy.stats import t\n","from math import sqrt\n","\n","points = list(pd.read_csv(link,delimiter=',').itertuples())\n","\n","n = len(points)\n","m = 1.75919315\n","b = 4.69359655\n","\n","x_0 = 50\n","x_mean = sum(p.x for p in points)/len(points) \n","\n","t_value = t(n-2).ppf(.975)\n","\n","standard_error = sqrt(sum((p.y - (m * p.x + b))**2 for p in points)/(n-2))\n","margin_of_error = t_value * standard_error * sqrt(1+(1/n)+(n*((x_0+x_mean)**2))/(n*(sum(p.x**2 for p in 
points)-(sum(p.x for p in points))**2)))\n","\n","predicted_y = m*x_0 + b\n","\n","print(f'The prediction interval is between:{predicted_y-margin_of_error} and {predicted_y+margin_of_error}')"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"h-FCRtE_liCZ","executionInfo":{"status":"ok","timestamp":1659076634186,"user_tz":-360,"elapsed":557,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"46125bfc-76d7-4583-ab9c-863ad3f211a7"},"execution_count":28,"outputs":[{"output_type":"stream","name":"stdout","text":["The prediction interval is between:50.80065904646458 and 134.50584905353543\n"]}]},{"cell_type":"markdown","source":["#### **Exercise 4**:\n","\n","Start your regression over and do a train/test split. Feel free to experiment with cross-validation and random-fold validation. Does the linear regression perform well and consistently on the testing data?"],"metadata":{"id":"FCl0jve2lij2"}},{"cell_type":"code","source":["df = pd.read_csv(link,delimiter=',')\n","X = df.values[:,:-1]\n","Y = df.values[:,:-1]\n","kfold = KFold(n_splits=3,random_state=7, shuffle=True)\n","model = LinearRegression()\n","results = cross_val_score(model,X,Y, cv = kfold)\n","print(results) \n","print(f\"MSE: {results.mean()}, std_dev: {results.std()}\")"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"gcd_IUvQl1rG","executionInfo":{"status":"ok","timestamp":1659077018526,"user_tz":-360,"elapsed":1126,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"4bb73082-45fa-4ce9-d943-cc09136fc351"},"execution_count":33,"outputs":[{"output_type":"stream","name":"stdout","text":["[1. 1. 
1.]\n","MSE: 1.0, std_dev: 0.0\n"]}]}]} -------------------------------------------------------------------------------- /Chapter 6/Chapter 6 Exercise.ipynb: -------------------------------------------------------------------------------- 1 | {"nbformat":4,"nbformat_minor":0,"metadata":{"colab":{"name":"Chapter 6:Exercise.ipynb","provenance":[],"authorship_tag":"ABX9TyNfhRPE8QnakNKDKIpF1Xgm"},"kernelspec":{"name":"python3","display_name":"Python 3"},"language_info":{"name":"python"},"widgets":{"application/vnd.jupyter.widget-state+json":{"f1de93fd8e2c43c299c4df9200ccada1":{"model_module":"@jupyter-widgets/controls","model_name":"VBoxModel","model_module_version":"1.5.0","state":{"_dom_classes":["widget-interact"],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"VBoxModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"VBoxView","box_style":"","children":["IPY_MODEL_3f705e7820d64407ad62ad1c2418dc3e","IPY_MODEL_9b7bf8e85e7742c596bcdefc5c1802a2","IPY_MODEL_497cf8c9cbcb4d8dade0e991b3f7ae99","IPY_MODEL_f699b2ce7068468bbd310dd3298c0361"],"layout":"IPY_MODEL_2dc5ed70af2046348efb4f71ff2db3e3"}},"3f705e7820d64407ad62ad1c2418dc3e":{"model_module":"@jupyter-widgets/controls","model_name":"IntSliderModel","model_module_version":"1.5.0","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"IntSliderModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"IntSliderView","continuous_update":false,"description":"Red","description_tooltip":null,"disabled":false,"layout":"IPY_MODEL_51960180efe548b591b39cf217127006","max":255,"min":0,"orientation":"horizontal","readout":true,"readout_format":"d","step":1,"style":"IPY_MODEL_609f4c52d30149f084330d18bc235551","value":149}},"9b7bf8e85e7742c596bcdefc5c1802a2":{"model_module":"@jupyter-widgets/controls","model_name":"IntS
liderModel","model_module_version":"1.5.0","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"IntSliderModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"IntSliderView","continuous_update":false,"description":"Green","description_tooltip":null,"disabled":false,"layout":"IPY_MODEL_a172f1f1daf2431d942f6e3ffaff4fac","max":255,"min":0,"orientation":"horizontal","readout":true,"readout_format":"d","step":1,"style":"IPY_MODEL_22731951a628416182ac74b65405a498","value":188}},"497cf8c9cbcb4d8dade0e991b3f7ae99":{"model_module":"@jupyter-widgets/controls","model_name":"IntSliderModel","model_module_version":"1.5.0","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"IntSliderModel","_view_count":null,"_view_module":"@jupyter-widgets/controls","_view_module_version":"1.5.0","_view_name":"IntSliderView","continuous_update":false,"description":"Blue","description_tooltip":null,"disabled":false,"layout":"IPY_MODEL_86efe8a05549487da8fc1ce4d32417ea","max":255,"min":0,"orientation":"horizontal","readout":true,"readout_format":"d","step":1,"style":"IPY_MODEL_36068737de804c4abc83edd845414f60","value":104}},"f699b2ce7068468bbd310dd3298c0361":{"model_module":"@jupyter-widgets/output","model_name":"OutputModel","model_module_version":"1.0.0","state":{"_dom_classes":[],"_model_module":"@jupyter-widgets/output","_model_module_version":"1.0.0","_model_name":"OutputModel","_view_count":null,"_view_module":"@jupyter-widgets/output","_view_module_version":"1.0.0","_view_name":"OutputView","layout":"IPY_MODEL_187640de1883495581140fa3c604e8fb","msg_id":"","outputs":[{"output_type":"display_data","data":{"text/plain":"'Predicted Font Color: 
DARK'","application/vnd.google.colaboratory.intrinsic+json":{"type":"string"}},"metadata":{}}]}},"2dc5ed70af2046348efb4f71ff2db3e3":{"model_module":"@jupyter-widgets/base","model_name":"LayoutModel","model_module_version":"1.2.0","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"51960180efe548b591b39cf217127006":{"model_module":"@jupyter-widgets/base","model_name":"LayoutModel","model_module_version":"1.2.0","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"obje
ct_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"609f4c52d30149f084330d18bc235551":{"model_module":"@jupyter-widgets/controls","model_name":"SliderStyleModel","model_module_version":"1.5.0","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"SliderStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":"","handle_color":null}},"a172f1f1daf2431d942f6e3ffaff4fac":{"model_module":"@jupyter-widgets/base","model_name":"LayoutModel","model_module_version":"1.2.0","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"22731951a628416182ac74b65405a498":{"model_module":"@jupyter-widgets/controls","model_name":"SliderStyleModel","model_module_version":"1.5.0","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"SliderStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleVi
ew","description_width":"","handle_color":null}},"86efe8a05549487da8fc1ce4d32417ea":{"model_module":"@jupyter-widgets/base","model_name":"LayoutModel","model_module_version":"1.2.0","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_area":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}},"36068737de804c4abc83edd845414f60":{"model_module":"@jupyter-widgets/controls","model_name":"SliderStyleModel","model_module_version":"1.5.0","state":{"_model_module":"@jupyter-widgets/controls","_model_module_version":"1.5.0","_model_name":"SliderStyleModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"StyleView","description_width":"","handle_color":null}},"187640de1883495581140fa3c604e8fb":{"model_module":"@jupyter-widgets/base","model_name":"LayoutModel","model_module_version":"1.2.0","state":{"_model_module":"@jupyter-widgets/base","_model_module_version":"1.2.0","_model_name":"LayoutModel","_view_count":null,"_view_module":"@jupyter-widgets/base","_view_module_version":"1.2.0","_view_name":"LayoutView","align_content":null,"align_items":null,"align_self":null,"border":null,"bottom":null,"display":null,"flex":null,"flex_flow":null,"grid_a
rea":null,"grid_auto_columns":null,"grid_auto_flow":null,"grid_auto_rows":null,"grid_column":null,"grid_gap":null,"grid_row":null,"grid_template_areas":null,"grid_template_columns":null,"grid_template_rows":null,"height":null,"justify_content":null,"justify_items":null,"left":null,"margin":null,"max_height":null,"max_width":null,"min_height":null,"min_width":null,"object_fit":null,"object_position":null,"order":null,"overflow":null,"overflow_x":null,"overflow_y":null,"padding":null,"right":null,"top":null,"visibility":null,"width":null}}}}},"cells":[{"cell_type":"markdown","source":["# **Exercises**\n","A dataset of three input variables RED, GREEN, and BLUE as well as an output variable\n","LIGHT_OR_DARK_FONT_IND is provided [here](https://bit.ly/3imidqa). It will be used to predict whether a\n","light/dark font (0/1 respectively) will work for a given background color (specified by RGB\n","values).\n","1. Perform a logistic regression on the preceding data, using three-fold cross-validation\n","and accuracy as your metric.\n","2. Produce a confusion matrix comparing the predictions and actual data.\n","3. Pick a few different background colors (you can use an RGB tool like this one) and see\n","if the logistic regression sensibly chooses a light (0) or dark (1) font for each one.\n","4. 
Based on the preceding exercises, do you think logistic regression is effective for\n","predicting a light or dark font for a given background color?"],"metadata":{"id":"M21-dwPk-4dX"}},{"cell_type":"code","source":["dataset=pd.read_csv(\"https://bit.ly/3imidqa\", delimiter=\",\")\n","dataset.head()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":206},"id":"EIKcC9KoD3oY","executionInfo":{"status":"ok","timestamp":1659787790412,"user_tz":-360,"elapsed":21,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"ee10df23-0775-44ac-ae1b-b24b99d31a7c"},"execution_count":null,"outputs":[{"output_type":"execute_result","data":{"text/plain":[" RED GREEN BLUE LIGHT_OR_DARK_FONT_IND\n","0 0 0 0 0\n","1 0 0 128 0\n","2 0 0 139 0\n","3 0 0 205 0\n","4 0 0 238 0"],"text/html":["\n","
\n"," "]},"metadata":{},"execution_count":19}]},{"cell_type":"markdown","source":["#### **Exercise 1:**\n","___"],"metadata":{"id":"nFOSH_n1-7ab"}},{"cell_type":"code","execution_count":null,"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"w52VNJ85-xD-","executionInfo":{"status":"ok","timestamp":1659786753336,"user_tz":-360,"elapsed":753,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"dee57aea-7242-4e6f-e023-6c748310ad58"},"outputs":[{"output_type":"stream","name":"stdout","text":["Accuracy Mean: 1.000 (stdev=0.000)\n"]}],"source":["import pandas as pd\n","from sklearn.linear_model import LogisticRegression\n","from sklearn.metrics import confusion_matrix\n","from sklearn.model_selection import KFold, cross_val_score\n","# Load the data\n","df = pd.read_csv(\"https://bit.ly/3imidqa\", delimiter=\",\")\n","# Extract input variables (all rows, all columns but last column)\n","X = df.values[:, :-1]\n","# Extract output column (all rows, last column)\n","Y = df.values[:, -1]\n","kfold = KFold(n_splits=3, shuffle=True) # n_splits is the number of folds; here we use 3.\n","# No regularization. penalty=None needs scikit-learn >= 1.2; older versions use penalty='none'.\n","# 'l1', 'l2', or 'elasticnet' also work, given a solver that supports them.\n","model = LogisticRegression(penalty=None)\n","results = cross_val_score(model, X, Y, cv=kfold)\n","print(\"Accuracy Mean: %.3f (stdev=%.3f)\" % (results.mean(),\\\n","results.std()))"]},{"cell_type":"markdown","source":["#### **Exercise 2**\n","___"],"metadata":{"id":"N51GDUsOAR-E"}},{"cell_type":"code","source":["import pandas as pd\n","from sklearn.linear_model import LogisticRegression\n","from sklearn.metrics import confusion_matrix\n","from sklearn.model_selection import train_test_split\n","# Load the data\n","df = pd.read_csv(\"https://bit.ly/3imidqa\", delimiter=\",\")\n","# Extract input variables (all rows, all columns but last column)\n","X = df.values[:, :-1]\n","# Extract output column (all rows, last column)\n","Y = df.values[:, -1]\n","model = LogisticRegression(solver='liblinear')\n","X_train, X_test, Y_train, Y_test = train_test_split(X, Y,\n","test_size=.33)\n","model.fit(X_train, 
Y_train)\n","prediction = model.predict(X_test)\n","matrix = confusion_matrix(y_true=Y_test, y_pred=prediction)\n","print(matrix)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/"},"id":"H9nxpHzt_uht","executionInfo":{"status":"ok","timestamp":1659786862038,"user_tz":-360,"elapsed":605,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"15d17782-c66e-4eac-9a03-9f01d3cde81d"},"execution_count":null,"outputs":[{"output_type":"stream","name":"stdout","text":["[[168 8]\n"," [ 1 267]]\n"]}]},{"cell_type":"code","source":["import numpy as np\n","import seaborn as sns\n","import matplotlib.pyplot as plt \n","labels = ['True Negative','False Positive','False Negative','True Positive']\n","labels = np.asarray(labels).reshape(2,2)\n","ax = sns.heatmap(matrix, annot=labels, fmt='', cmap='Blues')\n","ax.set_title('Exercise 2 Confusion Matrix Plot \\n\\n',fontsize=18)\n","ax.set_xlabel('\\nPredicted Values')\n","ax.set_ylabel('Actual Values ')\n","ax.xaxis.set_ticklabels(['False','True'])\n","ax.yaxis.set_ticklabels(['False','True'])\n","plt.show()"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":353},"id":"SEYuiJe3ACw_","executionInfo":{"status":"ok","timestamp":1659787323078,"user_tz":-360,"elapsed":1649,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"bcfe0654-38fe-4c3d-9092-42c905e3ae1e"},"execution_count":null,"outputs":[{"output_type":"display_data","data":{"text/plain":["
"],"image/png":"\n"},"metadata":{"needs_background":"light"}}]},{"cell_type":"markdown","source":["#### **Exercise 3**\n","___"],"metadata":{"id":"GVgQoivZCi3x"}},{"cell_type":"code","source":["import numpy as np\n","import pandas as pd\n","import ipywidgets as widgets\n","from IPython.display import display,Latex\n","from sklearn.linear_model import LogisticRegression\n","from sklearn.model_selection import train_test_split\n","from ipywidgets import interact, interactive, fixed, interact_manual\n","\n","# Load the data\n","df = pd.read_csv(\"https://bit.ly/3imidqa\", delimiter=\",\")\n","# Extract input variables (all rows, all columns but last column)\n","X = df.values[:, :-1]\n","# Extract output column (all rows, last column)\n","Y = df.values[:, -1]\n","model = LogisticRegression(solver='liblinear')\n","X_train, X_test, Y_train, Y_test = train_test_split(X, Y,\\\n","    test_size=.33)\n","model.fit(X_train, Y_train)\n","prediction = model.predict(X_test)\n","\n","def colrs(r,g,b):\n","    # Predict once and reuse the result (0 = light font, 1 = dark font)\n","    x = model.predict(np.array([[int(r), int(g), int(b)]]))\n","    if x[0] == 0.0:\n","        return display(\"Predicted Font Color: LIGHT\")\n","    else:\n","        return display(\"Predicted Font Color: DARK\")\n","\n","def slider_maker(description):\n","    # Build one 0-255 integer slider for a single color channel\n","    slider = widgets.IntSlider(\n","        value=0,\n","        min=0,\n","        max=255,\n","        step=1,\n","        description=description,\n","        disabled=False,\n","        continuous_update=False,\n","        orientation='horizontal',\n","        readout=True,\n","        readout_format='d'\n","    )\n","    return slider\n","w = interactive(colrs,r=slider_maker('Red'),g=slider_maker('Green'),b=slider_maker('Blue'))\n","display(\"Choose RGB combinations from the sliders below: 
\")\n","display(w)"],"metadata":{"colab":{"base_uri":"https://localhost:8080/","height":150,"referenced_widgets":["f1de93fd8e2c43c299c4df9200ccada1","3f705e7820d64407ad62ad1c2418dc3e","9b7bf8e85e7742c596bcdefc5c1802a2","497cf8c9cbcb4d8dade0e991b3f7ae99","f699b2ce7068468bbd310dd3298c0361","2dc5ed70af2046348efb4f71ff2db3e3","51960180efe548b591b39cf217127006","609f4c52d30149f084330d18bc235551","a172f1f1daf2431d942f6e3ffaff4fac","22731951a628416182ac74b65405a498","86efe8a05549487da8fc1ce4d32417ea","36068737de804c4abc83edd845414f60","187640de1883495581140fa3c604e8fb"]},"id":"-oj8LhmXAlQo","executionInfo":{"status":"ok","timestamp":1659795697653,"user_tz":-360,"elapsed":1360,"user":{"displayName":"Ziaul Karim","userId":"08392030995732291798"}},"outputId":"fb905cd0-4258-4126-a29b-270d8681b674"},"execution_count":120,"outputs":[{"output_type":"display_data","data":{"text/plain":["'Choose RGB combinations from the sliders below: '"],"application/vnd.google.colaboratory.intrinsic+json":{"type":"string"}},"metadata":{}},{"output_type":"display_data","data":{"text/plain":["interactive(children=(IntSlider(value=0, continuous_update=False, description='Red', max=255), IntSlider(value…"],"application/vnd.jupyter.widget-view+json":{"version_major":2,"version_minor":0,"model_id":"f1de93fd8e2c43c299c4df9200ccada1"}},"metadata":{}}]},{"cell_type":"markdown","source":["#### **Exercise 4**\n","___\n","Yes, the logistic regression is very effective at predicting light or dark\n","fonts for a given background color. Not only is the accuracy extremely\n","high, but the confusion matrix has high numbers in the top-left to\n","bottom-right diagonal with lower numbers in the other cells."],"metadata":{"id":"qBuG_Bg-DHov"}}]} 2 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Math For Data Science. 
2 | 3 | The remarks here are personal, and this repository is for practice purposes only. Equations may look erratic in places because LaTeX syntax is used to separate the lines. The notebooks work best on [Colab](https://colab.research.google.com/). 4 | 5 | Lessons here are from the book titled: 6 | 7 |

Essential Math For Data Science.

8 | 9 | ___ 10 |

11 | Written by Thomas Nield 12 |

13 |

14 | © Published by O'REILLY Media® 15 |

16 |

17 | 18 |

19 |
20 | 21 | ### Important notes: 22 | ___ 23 | 24 | 1. **Chapter Length Consideration:** The chapters in this book are deliberately extensive, so take the time to absorb the content at your own pace. 25 | 26 | 2. **Utilization of Python's [SymPy](https://docs.sympy.org/latest/index.html) Library:** This book relies heavily on Python's SymPy library. 27 | 28 | 3. **Emphasis on Comprehensive Understanding:** It can be tempting to skim sections that involve intricate algorithms and long stretches of code. I strongly encourage you not to: working through them directly reinforces the material and builds a deeper understanding of the subject. 29 | 30 | 4. **Value of In-Depth Exploration:** Even seemingly straightforward algorithms such as Linear Regression demand a meticulous approach. Invest time in critically evaluating the results and formulating your own hypotheses. 31 | 32 | 5. **Importance of Practical Implementation:** There is no shortcut to mastering the content, but copying and running the code provided in the chapters can build your familiarity with coding practices. I advise against doing the same for the exercises, which are most valuable when you write the code yourself. 33 | 34 | Engaging with the material thoroughly will give you a firmer grasp of the subject. Thank you for your dedication to this learning journey. 35 | 36 | *⚠ This is a companion repository for the book.* 37 | 38 | 39 | --------------------------------------------------------------------------------