├── README.md
├── environment.yml
├── intro.ipynb
├── zreferences.ipynb
├── memoryless.ipynb
├── poisson.ipynb
├── generators.ipynb
├── kolmogorov_fwd.ipynb
├── kolmogorov_bwd.ipynb
├── uc_mc_semigroups.ipynb
└── ergodicity.ipynb


/README.md:
--------------------------------------------------------------------------------
1 | # continuous_time_mcs.notebooks
2 | 
3 | Notebooks for https://quantecon.github.io/continuous_time_mcs/intro.html
4 | 
5 | **Note:** This README should be edited [here](https://github.com/quantecon/continuous_time_mcs/_notebook_repo)
6 | 


--------------------------------------------------------------------------------
/environment.yml:
--------------------------------------------------------------------------------
 1 | name: quantecon
 2 | channels:
 3 |   - default
 4 | dependencies:
 5 |   - python=3.12
 6 |   - anaconda=2024.10
 7 |   - pip
 8 |   - pip:
 9 |     - jupyter-book==0.15.1
10 |     - quantecon-book-theme==0.7.2
11 |     - sphinx-tojupyter==0.6.0
12 |     - sphinxext-rediraffe==0.2.7
13 |     - sphinx-exercise==0.4.1
14 |     - ghp-import==2.1.0
15 |     - sphinxcontrib-youtube
16 |     - sphinx-proof
17 |     - quantecon
18 | 
19 | 


--------------------------------------------------------------------------------
/intro.ipynb:
--------------------------------------------------------------------------------
 1 | {
 2 |  "cells": [
 3 |   {
 4 |    "cell_type": "markdown",
 5 |    "id": "2b964618",
 6 |    "metadata": {},
 7 |    "source": [
 8 |     "$$\n",
 9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
23 |     "$$"
24 |    ]
25 |   },
26 |   {
27 |    "cell_type": "markdown",
28 |    "id": "50331f05",
29 |    "metadata": {},
30 |    "source": [
31 |     "# Continuous Time Markov Chains\n",
32 |     "\n",
33 |     "**Authors**: [Thomas J. Sargent](http://www.tomsargent.com/) and [John\n",
34 |     "Stachurski](https://johnstachurski.net/)\n",
35 |     "\n",
36 |     "These lectures provides a short introduction to continuous time Markov chains.\n",
37 |     "Mathematical ideas are combined with computer code to build intuition and\n",
38 |     "bridge the gap between theory and applications.  There are many solved\n",
39 |     "exercises.\n",
40 |     "\n",
41 |     "The presentation is rigorous but aims toward applications rather than\n",
42 |     "mathematical curiosities (which are plentiful, if one starts to look).\n",
43 |     "Applications are drawn from economics, finance and operations research.  I\n",
44 |     "assume readers have some knowledge of [discrete time Markov\n",
45 |     "chains](https://python.quantecon.org/finite_markov.html).  Later lectures, use\n",
46 |     "a small amount of analysis in Banach space.\n",
47 |     "\n",
48 |     "Code is written in Python and accelerated using\n",
49 |     "JIT compilation via [Numba](http://numba.pydata.org/).  QuantEcon provides an\n",
50 |     "[introduction to these topics](https://python-programming.quantecon.org/).\n",
51 |     "\n",
52 |     "- [Memoryless Distributions](https://continuous-time-mcs.quantecon.org/memoryless.html)\n",
53 |     "- [Poisson Processes](https://continuous-time-mcs.quantecon.org/poisson.html)\n",
54 |     "- [The Markov Property](https://continuous-time-mcs.quantecon.org/markov_prop.html)\n",
55 |     "- [The Kolmogorov Backward Equation](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html)\n",
56 |     "- [The Kolmogorov Forward Equation](https://continuous-time-mcs.quantecon.org/kolmogorov_fwd.html)\n",
57 |     "- [Semigroups and Generators](https://continuous-time-mcs.quantecon.org/generators.html)\n",
58 |     "- [UC Markov Semigroups](https://continuous-time-mcs.quantecon.org/uc_mc_semigroups.html)\n",
59 |     "- [Stationarity and Ergodicity](https://continuous-time-mcs.quantecon.org/ergodicity.html)\n",
60 |     "- [Bibliography](https://continuous-time-mcs.quantecon.org/zreferences.html)"
61 |    ]
62 |   }
63 |  ],
64 |  "metadata": {
65 |   "date": 1763463649.7495725,
66 |   "filename": "intro.md",
67 |   "kernelspec": {
68 |    "display_name": "Python",
69 |    "language": "python3",
70 |    "name": "python3"
71 |   },
72 |   "title": "Continuous Time Markov Chains"
73 |  },
74 |  "nbformat": 4,
75 |  "nbformat_minor": 5
76 | }


--------------------------------------------------------------------------------
/zreferences.ipynb:
--------------------------------------------------------------------------------
 1 | {
 2 |  "cells": [
 3 |   {
 4 |    "cell_type": "markdown",
 5 |    "id": "d07ed557",
 6 |    "metadata": {},
 7 |    "source": [
 8 |     "$$\n",
 9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
23 |     "$$"
24 |    ]
25 |   },
26 |   {
27 |    "cell_type": "markdown",
28 |    "id": "13978fe9",
29 |    "metadata": {},
30 |    "source": [
31 |     "# Bibliography"
32 |    ]
33 |   },
34 |   {
35 |    "cell_type": "markdown",
36 |    "id": "b142915a",
37 |    "metadata": {},
38 |    "source": [
39 |     "## References\n",
40 |     "\n",
41 |     "<a id='id5'></a>\n",
42 |     "\\[App19\\] David Applebaum. *Semigroups of Linear Operators*. Volume 93. Cambridge University Press, 2019.\n",
43 |     "\n",
44 |     "<a id='id7'></a>\n",
45 |     "\\[Bob05\\] Adam Bobrowski. *Functional analysis for probability and stochastic processes: an introduction*. Cambridge University Press, 2005.\n",
46 |     "\n",
47 |     "<a id='id13'></a>\n",
48 |     "\\[How17\\] Douglas C Howard. *Elements of Stochastic Processes: A Computational Approach*. FE Press, 2017.\n",
49 |     "\n",
50 |     "<a id='id3'></a>\n",
51 |     "\\[LM94\\] Andrzej Lasota and Michael C Mackey. *Chaos, fractals, and noise: stochastic aspects of dynamics*. Volume 97. Springer Science & Business Media, 1994.\n",
52 |     "\n",
53 |     "<a id='id9'></a>\n",
54 |     "\\[LG16\\] Jean-François Le Gall. *Brownian motion, martingales, and stochastic calculus*. Volume 274. Springer, 2016.\n",
55 |     "\n",
56 |     "<a id='id8'></a>\n",
57 |     "\\[Lig10\\] Thomas Milton Liggett. *Continuous time Markov processes: an introduction*. Volume 113. American Mathematical Soc., 2010.\n",
58 |     "\n",
59 |     "<a id='id12'></a>\n",
60 |     "\\[Nor98\\] James R Norris. *Markov chains*. Number 2. Cambridge University Press, 1998.\n",
61 |     "\n",
62 |     "<a id='id11'></a>\n",
63 |     "\\[Par08\\] Etienne Pardoux. *Markov processes and applications: algorithms, networks, genome and finance*. Volume 796. John Wiley & Sons, 2008.\n",
64 |     "\n",
65 |     "<a id='id2'></a>\n",
66 |     "\\[PichorRTKaminska12\\] Katarzyna Pichór, Ryszard Rudnicki, and Marta Tyran-Kamińska. Stochastic semigroups and their applications to biological models. *Demonstratio Mathematica*, 45(2):463–494, 2012.\n",
67 |     "\n",
68 |     "<a id='id6'></a>\n",
69 |     "\\[SK11\\] Prasanna K Sahoo and Palaniappan Kannappan. *Introduction to functional equations*. CRC Press, 2011.\n",
70 |     "\n",
71 |     "<a id='id4'></a>\n",
72 |     "\\[Sta09\\] John Stachurski. *Economic dynamics: theory and computation*. MIT Press, 2009.\n",
73 |     "\n",
74 |     "<a id='id14'></a>\n",
75 |     "\\[Str13\\] Daniel W Stroock. *An introduction to Markov processes*. Volume 230. Springer Science & Business Media, 2013.\n",
76 |     "\n",
77 |     "<a id='id10'></a>\n",
78 |     "\\[Wal12\\] John B Walsh. *Knowing the odds: an introduction to probability*. Volume 139. American Mathematical Soc., 2012."
79 |    ]
80 |   }
81 |  ],
82 |  "metadata": {
83 |   "date": 1763463650.0775473,
84 |   "filename": "zreferences.md",
85 |   "kernelspec": {
86 |    "display_name": "Python",
87 |    "language": "python3",
88 |    "name": "python3"
89 |   },
90 |   "title": "Bibliography"
91 |  },
92 |  "nbformat": 4,
93 |  "nbformat_minor": 5
94 | }


--------------------------------------------------------------------------------
/memoryless.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "id": "97783119",
  6 |    "metadata": {},
  7 |    "source": [
  8 |     "$$\n",
  9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
 10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
 11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
 12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
 13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
 14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
 15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
 16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
 17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
 18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
 19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
 20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
 21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
 22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
 23 |     "$$"
 24 |    ]
 25 |   },
 26 |   {
 27 |    "cell_type": "markdown",
 28 |    "id": "9dab359c",
 29 |    "metadata": {},
 30 |    "source": [
 31 |     "# Memoryless Distributions\n",
 32 |     "\n",
 33 |     "In addition to what’s in Anaconda, this lecture will need the following libraries:"
 34 |    ]
 35 |   },
 36 |   {
 37 |    "cell_type": "code",
 38 |    "execution_count": null,
 39 |    "id": "8a60c7ce",
 40 |    "metadata": {
 41 |     "hide-output": false
 42 |    },
 43 |    "outputs": [],
 44 |    "source": [
 45 |     "!pip install quantecon"
 46 |    ]
 47 |   },
 48 |   {
 49 |    "cell_type": "markdown",
 50 |    "id": "7e5389ce",
 51 |    "metadata": {},
 52 |    "source": [
 53 |     "## Overview\n",
 54 |     "\n",
 55 |     "Markov processes are, by definition, forgetful.\n",
 56 |     "\n",
 57 |     "In particular, for any Markov processes, the distribution over future outcomes\n",
 58 |     "depends only on the current state, rather than the entire history.\n",
 59 |     "\n",
 60 |     "In the case of continuous time Markov chains, which jump between discrete\n",
 61 |     "states, this requires that the amount of elapsed time since the last jump is\n",
 62 |     "not helpful in predicting the timing of the next jump.\n",
 63 |     "\n",
 64 |     "In other words, the jump times are “memoryless”.\n",
 65 |     "\n",
 66 |     "It is remarkable that the only distribution on $ \\RR_+ $ with this\n",
 67 |     "property is the exponential distribution.\n",
 68 |     "\n",
 69 |     "Similarly, the only memoryless distribution on $ \\ZZ_+ $ is the geometric\n",
 70 |     "distribution.\n",
 71 |     "\n",
 72 |     "This lecture tries to clarify these ideas.\n",
 73 |     "\n",
 74 |     "We will use the following imports:"
 75 |    ]
 76 |   },
 77 |   {
 78 |    "cell_type": "code",
 79 |    "execution_count": null,
 80 |    "id": "1c634e08",
 81 |    "metadata": {
 82 |     "hide-output": false
 83 |    },
 84 |    "outputs": [],
 85 |    "source": [
 86 |     "import numpy as np\n",
 87 |     "import matplotlib.pyplot as plt\n",
 88 |     "import quantecon as qe\n",
 89 |     "from numba import njit\n",
 90 |     "from scipy.special import factorial, binom"
 91 |    ]
 92 |   },
 93 |   {
 94 |    "cell_type": "markdown",
 95 |    "id": "4003e116",
 96 |    "metadata": {},
 97 |    "source": [
 98 |     "## The Geometric Distribution\n",
 99 |     "\n",
100 |     "Consider betting on a roulette wheel and suppose that red has come up four times in a row.\n",
101 |     "\n",
102 |     "Since five reds in a row is an unlikely event, many people instinctively feel\n",
103 |     "that black is more likely on the fifth spin — “Surely black will come up this time!”\n",
104 |     "\n",
105 |     "But rational thought tells us such instincts are wrong: the four previous reds make no\n",
106 |     "difference to the outcome of the next spin.\n",
107 |     "\n",
108 |     "(Many casinos offer an unlimited supply of free alcoholic beverages in order to discourage this kind of rational analysis.)\n",
109 |     "\n",
110 |     "A mathematical restatement of this phenomenon is: the geometric distribution is memoryless."
111 |    ]
112 |   },
113 |   {
114 |    "cell_type": "markdown",
115 |    "id": "f26b858d",
116 |    "metadata": {},
117 |    "source": [
118 |     "### Memorylessness\n",
119 |     "\n",
120 |     "Let $ X $ be a random variable supported on the nonnegative integers $ \\ZZ_+ $.\n",
121 |     "\n",
122 |     "We say that $ X $ is [geometrically distributed](https://en.wikipedia.org/wiki/Geometric_distribution) if, for some $ \\theta $ satisfying $ 0 \\leq \\theta \\leq 1 $,\n",
123 |     "\n",
124 |     "\n",
125 |     "<a id='equation-geodist'></a>\n",
126 |     "$$\n",
127 |     "\\PP\\{X = k\\} = (1-\\theta)^k \\theta \n",
128 |     "    \\qquad (k = 0, 1, \\ldots) \\tag{1.1}\n",
129 |     "$$\n",
130 |     "\n",
131 |     "An example can be constructed from the discussion of the roulette wheel above.\n",
132 |     "\n",
133 |     "Suppose that,\n",
134 |     "\n",
135 |     "- the outcome of each spin is either red or black,  \n",
136 |     "- spins are labeled by $ 0, 1, 2, \\ldots $,  \n",
137 |     "- on each spin, black occurs with probability $ \\theta $ and  \n",
138 |     "- outcomes across spins are independent.  \n",
139 |     "\n",
140 |     "\n",
141 |     "Then [(1.1)](#equation-geodist) is the probability that the first occurrence of black is at spin $ k $.\n",
142 |     "\n",
143 |     "(The outcome “black” fails $ k $ times and then succeeds.)\n",
144 |     "\n",
145 |     "Consistent with our discussion in the introduction, the geometric distribution\n",
146 |     "is **memoryless**.\n",
147 |     "\n",
148 |     "In particular, given any nonnegative integer $ m $, we have\n",
149 |     "\n",
150 |     "\n",
151 |     "<a id='equation-memgeo'></a>\n",
152 |     "$$\n",
153 |     "\\PP \\{X = m + 1 \\,|\\, X > m \\} = \\theta \\tag{1.2}\n",
154 |     "$$\n",
155 |     "\n",
156 |     "In other words, regardless of how long we have seen only red outcomes, the\n",
157 |     "probability of black on the next spin is the same as the unconditional\n",
158 |     "probability of getting black on the very first spin.\n",
159 |     "\n",
160 |     "To establish [(1.2)](#equation-memgeo), we use basic properties of the geometric\n",
161 |     "distribution to obtain.\n",
162 |     "\n",
163 |     "$$\n",
164 |     "\\frac{ \\PP \\{X = m + 1 \\text{ and } X > m \\} }\n",
165 |     "    {\\PP \\{X \\geq m\\}}\n",
166 |     "    =\n",
167 |     "    \\frac{ \\PP \\{X = m + 1 \\} }\n",
168 |     "    {\\PP \\{X > m\\}}\n",
169 |     "    = \\frac{ (1-\\theta)^{m+1} \\theta }\n",
170 |     "        {(1-\\theta)^{m+1} }\n",
171 |     "    = \\theta\n",
172 |     "$$"
173 |    ]
174 |   },
175 |   {
176 |    "cell_type": "markdown",
177 |    "id": "a6426c9d",
178 |    "metadata": {},
179 |    "source": [
180 |     "## The Exponential Distribution\n",
181 |     "\n",
182 |     "Later, when we construct continuous time Markov chains, we will need to\n",
183 |     "specify the distribution of the holding times, which are the time intervals\n",
184 |     "between jumps.\n",
185 |     "\n",
186 |     "As discussed above (and again below), the holding time distribution must be\n",
187 |     "memoryless, so that the chain satisfies the Markov property.\n",
188 |     "\n",
189 |     "While the geometric distribution is memoryless, its discrete support makes it\n",
190 |     "a poor fit for the continuous time case.\n",
191 |     "\n",
192 |     "Hence we turn to the [exponential distribution](https://en.wikipedia.org/wiki/Exponential_distribution), which is supported on $ \\RR_+ $.\n",
193 |     "\n",
194 |     "A random variable $ Y $ on $ \\RR_+ $ is called **exponential with rate $ \\lambda $**, denoted by $ Y \\sim \\Exp(\\lambda) $, if\n",
195 |     "\n",
196 |     "$$\n",
197 |     "\\PP\\{Y > y\\} = e^{-\\lambda y}\n",
198 |     "    \\qquad (y \\geq 0)\n",
199 |     "$$\n",
200 |     "\n",
201 |     "\n",
202 |     "<a id='geomtoexp'></a>"
203 |    ]
204 |   },
205 |   {
206 |    "cell_type": "markdown",
207 |    "id": "854d9a15",
208 |    "metadata": {},
209 |    "source": [
210 |     "### From Geometric to Exponential\n",
211 |     "\n",
212 |     "The exponential distribution can be regarded as the “limit” of the geometric\n",
213 |     "distribution.\n",
214 |     "\n",
215 |     "To illustrate, let us suppose that\n",
216 |     "\n",
217 |     "- customers enter a shop at discrete times $ t_0, t_1, \\ldots $  \n",
218 |     "- these times are evenly spaced, so that $ h = t_{i+1} - t_i $ for some $ h > 0 $ and all $ i \\in \\ZZ_+ $  \n",
219 |     "- at each $ t_i $, either zero or one customers enter (no more because $ h $ is small)  \n",
220 |     "- entry at each $ t_i $ occurs with probability $ \\lambda h $ and is independent over $ i $.  \n",
221 |     "\n",
222 |     "\n",
223 |     "The fact that the entry probability is proportional to $ h $ is important in\n",
224 |     "what follows.\n",
225 |     "\n",
226 |     "You can imagine many customers passing by the shop, each entering\n",
227 |     "independently.\n",
228 |     "\n",
229 |     "If we halve the time interval, then we also halve the probability that a\n",
230 |     "customer enters.\n",
231 |     "\n",
232 |     "Let\n",
233 |     "\n",
234 |     "- $ Y $ be the time of the first arrival at the shop,  \n",
235 |     "- $ t $ be a given positive number and  \n",
236 |     "- $ i(h) $ be the largest integer such that $ t_{i(h)} \\leq t $.  \n",
237 |     "\n",
238 |     "\n",
239 |     "Note that, as $ h \\to 0 $, the grid becomes finer and $ t_{i(h)} = i(h) h  \\to t $.\n",
240 |     "\n",
241 |     "Writing $ i(h) $ as $ i $ and using the geometric distribution, the probability that\n",
242 |     "the first arrival occurs after $ t_{i} $ is $ (1-\\lambda h)^{i} $.\n",
243 |     "\n",
244 |     "Hence\n",
245 |     "\n",
246 |     "$$\n",
247 |     "\\PP\\{Y > t_{i} \\}\n",
248 |     "    = (1-\\lambda h)^i\n",
249 |     "    = \\left( 1- \\frac{\\lambda i h}{i} \\right)^i\n",
250 |     "$$\n",
251 |     "\n",
252 |     "Using the fact that $ e^x = \\lim_{i \\to \\infty}(1 + x/i)^i $ for all $ x $ and $ i\n",
253 |     "h = t_i \\to t $, we obtain, for large $ i $,\n",
254 |     "\n",
255 |     "$$\n",
256 |     "\\PP\\{Y > t\\}\n",
257 |     "    \\approx\n",
258 |     "    e^{- \\lambda t}\n",
259 |     "$$\n",
260 |     "\n",
261 |     "In this sense, the exponential is the limit of the geometric distribution."
262 |    ]
263 |   },
264 |   {
265 |    "cell_type": "markdown",
266 |    "id": "f651d91a",
267 |    "metadata": {},
268 |    "source": [
269 |     "### Memoryless Property of the Exponential Distribution\n",
270 |     "\n",
271 |     "The exponential distribution is the only memoryless distribution supported on $ \\RR_+ $, as the next theorem attests."
272 |    ]
273 |   },
274 |   {
275 |    "cell_type": "markdown",
276 |    "id": "498fa320",
277 |    "metadata": {},
278 |    "source": [
279 |     "**Theorem 1.1** (Characterization of the Exponential Distribution)\n",
280 |     "\n",
281 |     "If $ X $ is a random variable supported on $ \\RR_+ $, then there exists a\n",
282 |     "$ \\lambda > 0 $ such that $ X \\sim \\Exp(\\lambda) $ if and only if, for all\n",
283 |     "positive $ s, t $,\n",
284 |     "\n",
285 |     "\n",
286 |     "<a id='equation-memexpo'></a>\n",
287 |     "$$\n",
288 |     "\\PP \\{X > s + t \\,|\\, X > s \\} = \\PP \\{X > t\\} \\tag{1.3}\n",
289 |     "$$"
290 |    ]
291 |   },
292 |   {
293 |    "cell_type": "markdown",
294 |    "id": "76cc26f8",
295 |    "metadata": {},
296 |    "source": [
297 |     "**Proof.**\n",
298 |     "\n",
299 |     "Proof. To see that [(1.3)](#equation-memexpo) holds when $ X $ is exponential with rate $ \\lambda $,\n",
300 |     "fix $ s, t > 0 $ and observe that\n",
301 |     "\n",
302 |     "$$\n",
303 |     "\\frac{ \\PP \\{X > s + t \\text{ and } X > s \\} }\n",
304 |     "    {\\PP \\{X > s\\}}\n",
305 |     "    =\n",
306 |     "    \\frac{ \\PP \\{X > s + t \\} }\n",
307 |     "    {\\PP \\{X > s\\}}\n",
308 |     "    = \\frac{e^{-\\lambda s - \\lambda t}}{e^{-\\lambda s}}\n",
309 |     "    = e^{-\\lambda t}\n",
310 |     "$$\n",
311 |     "\n",
312 |     "To see that the converse holds, let $ X $ be a random variable supported on $ \\RR_+ $\n",
313 |     "such that [(1.3)](#equation-memexpo) holds.\n",
314 |     "\n",
315 |     "The “exceedance” function $ f(s) := \\PP\\{X > s\\} $ then has three properties:\n",
316 |     "\n",
317 |     "1. $ f $ is decreasing on $ \\RR_+ $,  \n",
318 |     "1. $ 0 < f(t) < 1 $ for all $ t > 0 $,  \n",
319 |     "1. $ f(s + t) = f(s) f(t) $ for all $ s, t > 0 $.  \n",
320 |     "\n",
321 |     "\n",
322 |     "The first property is common to all exceedance functions, the second is due to\n",
323 |     "the fact that $ X $ is supported on all of $ \\RR_+ $, and the\n",
324 |     "third is [(1.3)](#equation-memexpo).\n",
325 |     "\n",
326 |     "From these three properties we will show that\n",
327 |     "\n",
328 |     "\n",
329 |     "<a id='equation-implex'></a>\n",
330 |     "$$\n",
331 |     "f(t) = f(1)^t  \\;\\; \\forall \\, t \\geq 0 \\tag{1.4}\n",
332 |     "$$\n",
333 |     "\n",
334 |     "This is sufficient to prove the claim because then $ \\lambda := - \\ln f(1) $ is a positive real number (by property 2) and, moreover,\n",
335 |     "\n",
336 |     "$$\n",
337 |     "f(t) \n",
338 |     "    = \\exp( \\ln ( f(1) ) t) \n",
339 |     "    = \\exp( - \\lambda t)\n",
340 |     "$$\n",
341 |     "\n",
342 |     "To see that [(1.4)](#equation-implex) holds, fix positive integers $ m,n $.\n",
343 |     "\n",
344 |     "We can use property 3 to obtain both\n",
345 |     "\n",
346 |     "$$\n",
347 |     "f(m/n) = f(1/n)^m\n",
348 |     "    \\quad \\text{and} \\quad\n",
349 |     "    f(1) = f(1/n)^n\n",
350 |     "$$\n",
351 |     "\n",
352 |     "It follows that $ f(m/n)^n = f(1/n)^{m n} = f(1)^m $ and, raising to the power\n",
353 |     "of $ 1/n $, we get [(1.4)](#equation-implex) when $ t=m/n $.\n",
354 |     "\n",
355 |     "The discussion so far confirms that [(1.4)](#equation-implex) holds when $ t $ is rational.\n",
356 |     "\n",
357 |     "So now take any $ t \\geq 0 $ and rational sequences $ (a_n) $ and $ (b_n) $\n",
358 |     "converging to $ t $ with $ a_n \\leq t \\leq b_n $ for all $ n $.\n",
359 |     "\n",
360 |     "By property 1 we have $ f(b_n) \\leq f(t) \\leq f(a_n) $ for all $ n $, so\n",
361 |     "\n",
362 |     "$$\n",
363 |     "f(1)^{b_n} \\leq f(t) \\leq f(1)^{a_n}\n",
364 |     "    \\quad \\forall \\, n \\in \\NN\n",
365 |     "$$\n",
366 |     "\n",
367 |     "Taking the limit in $ n $ completes the proof."
368 |    ]
369 |   },
370 |   {
371 |    "cell_type": "markdown",
372 |    "id": "ac4813c2",
373 |    "metadata": {},
374 |    "source": [
375 |     "\n",
376 |     "<a id='fail-mem'></a>"
377 |    ]
378 |   },
379 |   {
380 |    "cell_type": "markdown",
381 |    "id": "10e03b18",
382 |    "metadata": {},
383 |    "source": [
384 |     "### Failure of Memorylessness\n",
385 |     "\n",
386 |     "We know from the proceeding section that any distribution on $ \\RR_+ $ other\n",
387 |     "than the exponential distribution fails to be memoryless.\n",
388 |     "\n",
389 |     "Here’s an example that helps to clarify (although the support of the distribution is a proper subset of $ \\RR_+ $).\n",
390 |     "\n",
391 |     "A random variable $ Y $ has the Pareto distribution with positive parameters $ t_0, \\alpha $ if\n",
392 |     "\n",
393 |     "$$\n",
394 |     "f(t) \n",
395 |     "    := \\PP\\{Y > t\\} \n",
396 |     "    = \n",
397 |     "    \\begin{cases}\n",
398 |     "    1 & \\text{ if } t \\leq t_0\n",
399 |     "    \\\\\n",
400 |     "    (t_0 / t)^\\alpha & \\text{ if } t > t_0\n",
401 |     "    \\end{cases}\n",
402 |     "$$\n",
403 |     "\n",
404 |     "As a result,  with $ s > t_0 $,\n",
405 |     "\n",
406 |     "$$\n",
407 |     "\\PP \\{Y > s + t \\,|\\, Y > s \\}\n",
408 |     "    =\n",
409 |     "    \\frac{ \\PP \\{Y > s + t \\} }\n",
410 |     "    {\\PP \\{Y > s\\}}\n",
411 |     "    = \\left( \\frac{t}{t + s} \\right)^\\alpha\n",
412 |     "$$\n",
413 |     "\n",
414 |     "Since this probability falls with $ s $, the distribution is not memoryless.\n",
415 |     "\n",
416 |     "If we have waited many hours for an event (i.e., $ s $ is large), then the\n",
417 |     "probability of waiting another hour is relatively small."
418 |    ]
419 |   },
420 |   {
421 |    "cell_type": "markdown",
422 |    "id": "f839917e",
423 |    "metadata": {},
424 |    "source": [
425 |     "## Sums of Exponentials\n",
426 |     "\n",
427 |     "A random variable $ W $ on $ \\RR_+ $ is said to have the [Erlang\n",
428 |     "distribution](https://en.wikipedia.org/wiki/Erlang_distribution) if its\n",
429 |     "density has the form\n",
430 |     "\n",
431 |     "$$\n",
432 |     "f(t) = \\frac{\\lambda^n  t^{n-1}}{(n-1)!} e^{-\\lambda t}\n",
433 |     "    \\qquad (t \\geq 0)\n",
434 |     "$$\n",
435 |     "\n",
436 |     "for some $ n \\in \\NN $ and $ \\lambda > 0 $.\n",
437 |     "\n",
438 |     "The parameters $ n $ and $ \\lambda $ are called the **shape** and **rate**\n",
439 |     "parameters respectively.\n",
440 |     "\n",
441 |     "The next figure shows the shape for two parameterizations."
442 |    ]
443 |   },
444 |   {
445 |    "cell_type": "code",
446 |    "execution_count": null,
447 |    "id": "84b752a0",
448 |    "metadata": {
449 |     "hide-output": false
450 |    },
451 |    "outputs": [],
452 |    "source": [
453 |     "t_grid = np.linspace(0, 50, 100)\n",
454 |     "\n",
455 |     "class Erlang:\n",
456 |     "\n",
457 |     "    def __init__(self, λ=0.5, n=10):\n",
458 |     "        self.λ, self.n = λ, n\n",
459 |     "\n",
460 |     "    def __call__(self, t):\n",
461 |     "        n, λ = self.n, self.λ\n",
462 |     "        return (λ**n * t**(n-1) * np.exp(-λ * t)) / factorial(n-1)\n",
463 |     "\n",
464 |     "e1 = Erlang(n=10, λ=0.5)\n",
465 |     "e2 = Erlang(n=10, λ=0.75)\n",
466 |     "\n",
467 |     "fig, ax = plt.subplots()\n",
468 |     "for e in e1, e2:\n",
469 |     "    ax.plot(t_grid, e(t_grid), label=f'$n={e.n}, \\lambda={e.λ}$')\n",
470 |     "\n",
471 |     "ax.legend()\n",
472 |     "plt.show()"
473 |    ]
474 |   },
475 |   {
476 |    "cell_type": "markdown",
477 |    "id": "cec63169",
478 |    "metadata": {},
479 |    "source": [
480 |     "The CDF of the Erlang distribution is\n",
481 |     "\n",
482 |     "\n",
483 |     "<a id='equation-erlcdf'></a>\n",
484 |     "$$\n",
485 |     "F(t) \n",
486 |     "    = \\PP\\{W \\leq t\\}\n",
487 |     "    = 1 - \\sum_{k=0}^{n-1} \\frac{(\\lambda t)^k}{k!} e^{-\\lambda t} \\tag{1.5}\n",
488 |     "$$\n",
489 |     "\n",
490 |     "The Erlang distribution is of interest to us because of the following fact."
491 |    ]
492 |   },
493 |   {
494 |    "cell_type": "markdown",
495 |    "id": "d968681b",
496 |    "metadata": {},
497 |    "source": [
498 |     "**Lemma 1.1** (Distribution of Sum of Exponentials)\n",
499 |     "\n",
500 |     "If, for some $ \\lambda > 0 $, the sequence $ (W_i) $ is IID and exponentially\n",
501 |     "distributed with rate $ \\lambda $, then $ J_n := \\sum_{i=1}^n W_i $ has the Erlang\n",
502 |     "distribution with shape $ n $ and rate $ \\lambda $."
503 |    ]
504 |   },
505 |   {
506 |    "cell_type": "markdown",
507 |    "id": "9e576e01",
508 |    "metadata": {},
509 |    "source": [
510 |     "This connects to Poisson process theory, as we shall soon see."
511 |    ]
512 |   },
513 |   {
514 |    "cell_type": "markdown",
515 |    "id": "e431349a",
516 |    "metadata": {},
517 |    "source": [
518 |     "## Exercises"
519 |    ]
520 |   },
521 |   {
522 |    "cell_type": "markdown",
523 |    "id": "f2fd4c76",
524 |    "metadata": {},
525 |    "source": [
526 |     "## Exercise 1.1\n",
527 |     "\n",
528 |     "Due to its memoryless property, we can “stop” and “restart” an exponential\n",
529 |     "draw without changing its distribution.\n",
530 |     "\n",
531 |     "To illustrate this, we can think of fixing $ \\lambda > 0 $, drawing from\n",
532 |     "$ \\Exp(\\lambda) $, and stopping and restarting whenever a threshold $ s $ is crossed.\n",
533 |     "\n",
534 |     "In particular, consider the random variable $ X $ defined as follows:\n",
535 |     "\n",
536 |     "- Draw $ Y $ from $ \\Exp(\\lambda) $.  \n",
537 |     "- If $ Y \\leq s $, set $ X = Y $.  \n",
538 |     "- If not, draw $ Z $ independently from $ \\Exp(\\lambda) $ and set $ X = s + Z $.  \n",
539 |     "\n",
540 |     "\n",
541 |     "Show that $ X \\sim \\Exp(\\lambda) $."
542 |    ]
543 |   },
544 |   {
545 |    "cell_type": "markdown",
546 |    "id": "a05d82a1",
547 |    "metadata": {},
548 |    "source": [
549 |     "## Solution to[ Exercise 1.1](https://continuous-time-mcs.quantecon.org/#memoryless-ex-1)\n",
550 |     "\n",
551 |     "Let $ X $ be constructed as in the statement of the exercise and fix $ t > 0 $.\n",
552 |     "\n",
553 |     "Notice that $ X > s + t $ if and only if $ Y > s $ and $ Z > t $.\n",
554 |     "\n",
555 |     "As a result of this fact and independence,\n",
556 |     "\n",
557 |     "$$\n",
558 |     "\\PP\\{X > s + t\\}\n",
559 |     "= \\PP\\{Y > s \\} \\PP\\{Z > t\\}\n",
560 |     "= e^{-\\lambda(s + t)}\n",
561 |     "$$\n",
562 |     "\n",
563 |     "At the same time, $ X > s-t $ if and only if $ Y > s-t $, so\n",
564 |     "\n",
565 |     "$$\n",
566 |     "\\PP\\{X > s - t\\}\n",
567 |     "= \\PP\\{Y > s - t \\} \n",
568 |     "= e^{-\\lambda(s - t)}\n",
569 |     "$$\n",
570 |     "\n",
571 |     "Either way, we have $ X \\sim \\Exp(\\lambda) $, as was to be shown."
572 |    ]
573 |   },
574 |   {
575 |    "cell_type": "markdown",
576 |    "id": "c265061f",
577 |    "metadata": {},
578 |    "source": [
579 |     "## Exercise 1.2\n",
580 |     "\n",
581 |     "Fix $ \\lambda = 0.5 $ and $ s=1.0 $.\n",
582 |     "\n",
583 |     "Simulate 1,000 draws of $ X $ using the algorithm above.\n",
584 |     "\n",
585 |     "Plot the fraction of the sample exceeding $ t $ for each $ t \\geq 0 $ (on a grid)\n",
586 |     "and compare to $ t \\mapsto e^{-\\lambda t} $.\n",
587 |     "\n",
588 |     "Is the fit good?  How about if the number of draws is increased?\n",
589 |     "\n",
590 |     "Are the results in line with those of the previous exercise?"
591 |    ]
592 |   },
593 |   {
594 |    "cell_type": "markdown",
595 |    "id": "70948eef",
596 |    "metadata": {},
597 |    "source": [
598 |     "## Solution to[ Exercise 1.2](https://continuous-time-mcs.quantecon.org/#memoryless-ex-2)\n",
599 |     "\n",
600 |     "Here’s one solution, starting with 1,000 draws."
601 |    ]
602 |   },
603 |   {
604 |    "cell_type": "code",
605 |    "execution_count": null,
606 |    "id": "4083c52d",
607 |    "metadata": {
608 |     "hide-output": false
609 |    },
610 |    "outputs": [],
611 |    "source": [
612 |     "λ = 0.5 \n",
613 |     "np.random.seed(1234)\n",
614 |     "t_grid = np.linspace(0, 10, 200)\n",
615 |     "\n",
616 |     "@njit\n",
617 |     "def draw_X(s=1.0, n=1_000):\n",
618 |     "    draws = np.empty(n)\n",
619 |     "    for i in range(n):\n",
620 |     "        Y = np.random.exponential(scale=1/λ)\n",
621 |     "        if Y <= s:\n",
622 |     "            X = Y\n",
623 |     "        else:\n",
624 |     "            Z = np.random.exponential(scale=1/λ)\n",
625 |     "            X = s + Z\n",
626 |     "        draws[i] = X\n",
627 |     "    return draws\n",
628 |     "\n",
629 |     "fig, ax = plt.subplots()\n",
630 |     "draws = draw_X()\n",
631 |     "empirical_exceedance = [np.mean(draws > t) for t in t_grid]\n",
632 |     "ax.plot(t_grid, np.exp(- λ * t_grid), label='exponential exceedance')\n",
633 |     "ax.plot(t_grid, empirical_exceedance, label='empirical exceedance')\n",
634 |     "ax.legend()\n",
635 |     "\n",
636 |     "plt.show()"
637 |    ]
638 |   },
639 |   {
640 |    "cell_type": "markdown",
641 |    "id": "7d50ea0a",
642 |    "metadata": {},
643 |    "source": [
644 |     "The fit is already very close, which matches with the theory in ``.\n",
645 |     "\n",
646 |     "The two lines become indistinguishable as $ n $ is increased further."
647 |    ]
648 |   },
649 |   {
650 |    "cell_type": "code",
651 |    "execution_count": null,
652 |    "id": "a68a808b",
653 |    "metadata": {
654 |     "hide-output": false
655 |    },
656 |    "outputs": [],
657 |    "source": [
658 |     "fig, ax = plt.subplots()\n",
659 |     "draws = draw_X(n=10_000)\n",
660 |     "empirical_exceedance = [np.mean(draws > t) for t in t_grid]\n",
661 |     "ax.plot(t_grid, np.exp(- λ * t_grid), label='exponential exceedance')\n",
662 |     "ax.plot(t_grid, empirical_exceedance, label='empirical exceedance')\n",
663 |     "ax.legend()\n",
664 |     "plt.show()"
665 |    ]
666 |   }
667 |  ],
668 |  "metadata": {
669 |   "date": 1763463649.9419599,
670 |   "filename": "memoryless.md",
671 |   "kernelspec": {
672 |    "display_name": "Python",
673 |    "language": "python3",
674 |    "name": "python3"
675 |   },
676 |   "title": "Memoryless Distributions"
677 |  },
678 |  "nbformat": 4,
679 |  "nbformat_minor": 5
680 | }


--------------------------------------------------------------------------------
/poisson.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "id": "37f995dd",
  6 |    "metadata": {},
  7 |    "source": [
  8 |     "$$\n",
  9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
 10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
 11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
 12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
 13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
 14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
 15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
 16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
 17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
 18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
 19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
 20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
 21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
 22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
 23 |     "$$"
 24 |    ]
 25 |   },
 26 |   {
 27 |    "cell_type": "markdown",
 28 |    "id": "e927b300",
 29 |    "metadata": {},
 30 |    "source": [
 31 |     "# Poisson Processes\n",
 32 |     "\n",
 33 |     "In addition to what’s in Anaconda, this lecture will need the following libraries:"
 34 |    ]
 35 |   },
 36 |   {
 37 |    "cell_type": "code",
 38 |    "execution_count": null,
 39 |    "id": "70414813",
 40 |    "metadata": {
 41 |     "hide-output": false
 42 |    },
 43 |    "outputs": [],
 44 |    "source": [
 45 |     "!pip install quantecon"
 46 |    ]
 47 |   },
 48 |   {
 49 |    "cell_type": "markdown",
 50 |    "id": "60e1c8a2",
 51 |    "metadata": {},
 52 |    "source": [
 53 |     "## Overview\n",
 54 |     "\n",
 55 |     "Counting processes count the number of “arrivals” occurring by a given time\n",
 56 |     "(e.g., the number of visitors to a website, the number of customers arriving at a restaurant, etc.)\n",
 57 |     "\n",
 58 |     "Counting processes become Poisson processes when the time interval between\n",
 59 |     "arrivals is IID and exponentially distributed.\n",
 60 |     "\n",
 61 |     "Exponential distributions and Poisson processes have deep connections to\n",
 62 |     "continuous time Markov chains.\n",
 63 |     "\n",
 64 |     "For example, Poisson processes are one of the simplest nontrivial examples of\n",
 65 |     "a continuous time Markov chain.\n",
 66 |     "\n",
 67 |     "In addition, when continuous time Markov chains jump between states, the time\n",
 68 |     "between jumps is *necessarily* exponentially distributed.\n",
 69 |     "\n",
 70 |     "In discussing Poisson processes, we will use the following imports:"
 71 |    ]
 72 |   },
 73 |   {
 74 |    "cell_type": "code",
 75 |    "execution_count": null,
 76 |    "id": "2209b0ca",
 77 |    "metadata": {
 78 |     "hide-output": false
 79 |    },
 80 |    "outputs": [],
 81 |    "source": [
 82 |     "import numpy as np\n",
 83 |     "import matplotlib.pyplot as plt\n",
 84 |     "import quantecon as qe\n",
 85 |     "from numba import njit\n",
 86 |     "from scipy.special import factorial, binom"
 87 |    ]
 88 |   },
 89 |   {
 90 |    "cell_type": "markdown",
 91 |    "id": "5629090a",
 92 |    "metadata": {},
 93 |    "source": [
 94 |     "## Counting Processes\n",
 95 |     "\n",
 96 |     "Let’s start with the general case of an arbitrary counting process."
 97 |    ]
 98 |   },
 99 |   {
100 |    "cell_type": "markdown",
101 |    "id": "312b1e54",
102 |    "metadata": {},
103 |    "source": [
104 |     "### Jumps and Counts\n",
105 |     "\n",
106 |     "Let $ (J_k) $ be an increasing sequence of nonnegative random variables\n",
107 |     "satisfying  $ J_k \\to \\infty $ with probability one.\n",
108 |     "\n",
109 |     "For example, $ J_k $ might be the time the $ k $-th customer arrives at a shop.\n",
110 |     "\n",
111 |     "Then\n",
112 |     "\n",
113 |     "\n",
114 |     "<a id='equation-defcount'></a>\n",
115 |     "$$\n",
116 |     "N_t := \\sum_{k \\geq 0} k \\mathbb{1} \\{ J_k \\leq t < J_{k+1} \\} \\tag{2.1}\n",
117 |     "$$\n",
118 |     "\n",
119 |     "is the number of customers that have visited by time $ t $.\n",
120 |     "\n",
121 |     "The next figure illustrate the definition of $ N_t $ for a given jump sequence $ \\{J_k\\} $."
122 |    ]
123 |   },
124 |   {
125 |    "cell_type": "code",
126 |    "execution_count": null,
127 |    "id": "3b4f9b85",
128 |    "metadata": {
129 |     "hide-output": false
130 |    },
131 |    "outputs": [],
132 |    "source": [
133 |     "Ks = 0, 1, 2, 3\n",
134 |     "Js = 0, 0.8, 1.8, 2.1, 3\n",
135 |     "n = len(Ks)\n",
136 |     "\n",
137 |     "fig, ax = plt.subplots()\n",
138 |     "\n",
139 |     "ax.plot(Js[:-1], Ks, 'o')\n",
140 |     "ax.hlines(Ks, Js[:-1], Js[1:], label='$N_t$')\n",
141 |     "ax.vlines(Js[:-1], (0, Ks[0], Ks[1], Ks[2]), Ks, alpha=0.25)\n",
142 |     "\n",
143 |     "ax.set(xticks=Js[:-1],\n",
144 |     "       xticklabels=[f'$J_{k}$' for k in range(n)],\n",
145 |     "       yticks=(0, 1, 2, 3),\n",
146 |     "       xlabel='$t$')\n",
147 |     "\n",
148 |     "ax.legend(loc='lower right')\n",
149 |     "plt.show()"
150 |    ]
151 |   },
152 |   {
153 |    "cell_type": "markdown",
154 |    "id": "794866e0",
155 |    "metadata": {},
156 |    "source": [
157 |     "An alternative but equivalent definition is\n",
158 |     "\n",
159 |     "$$\n",
160 |     "N_t := \\max \\{k \\geq 0 \\,|\\, J_k \\leq t \\}\n",
161 |     "$$\n",
162 |     "\n",
163 |     "As a function of $ t $, the process $ N_t $ is called a **counting process**.\n",
164 |     "\n",
165 |     "The jump times $ (J_k) $ are sometimes called **arrival times** and the\n",
166 |     "intervals $ J_k - J_{k-1} $ are called **wait times** or **holding times**."
167 |    ]
168 |   },
169 |   {
170 |    "cell_type": "markdown",
171 |    "id": "7de2365a",
172 |    "metadata": {},
173 |    "source": [
174 |     "### Exponential Holding Times\n",
175 |     "\n",
176 |     "A Poisson process is a counting process with independent exponential holding times.\n",
177 |     "\n",
178 |     "In particular, suppose that the arrival times are given by $ J_0 = 0 $ and\n",
179 |     "\n",
180 |     "$$\n",
181 |     "J_k := W_1 + \\cdots W_k\n",
182 |     "$$\n",
183 |     "\n",
184 |     "where $ (W_i) $ are IID exponential with some fixed rate $ \\lambda $.\n",
185 |     "\n",
186 |     "Then the associated counting process $ (N_t) $ is called a **Poisson process** with rate $ \\lambda $.\n",
187 |     "\n",
188 |     "The rationale behind the name is that, for each $ t > 0 $, the random variable\n",
189 |     "$ N_t $ has the Poisson distribution with parameter $ t \\lambda $.\n",
190 |     "\n",
191 |     "In other words,\n",
192 |     "\n",
193 |     "\n",
194 |     "<a id='equation-poissondist'></a>\n",
195 |     "$$\n",
196 |     "\\PP\\{N_t = k\\} \n",
197 |     "    = e^{-t \\lambda} \\frac{(t \\lambda)^k }{k!}\n",
198 |     "    \\qquad (k = 0, 1, \\ldots) \\tag{2.2}\n",
199 |     "$$\n",
200 |     "\n",
201 |     "For example, since $ N_t = 0 $ if and only if $ W_1 > t $, we have\n",
202 |     "\n",
203 |     "$$\n",
204 |     "\\PP\\{N_t =0\\} \n",
205 |     "    = \\PP\\{W_1 > t\\}\n",
206 |     "    = e^{-t \\lambda}\n",
207 |     "$$\n",
208 |     "\n",
209 |     "and the right hand side agrees with [(2.2)](#equation-poissondist) when $ k=0 $.\n",
210 |     "\n",
211 |     "This sets up a proof by induction, which is time consuming but not difficult\n",
212 |     "— the details can be found in $ \\S29 $ of [[Howard, 2017](https://continuous-time-mcs.quantecon.org/zreferences.html#id13)].\n",
213 |     "\n",
214 |     "Another way to show that $ N_t $ is Poisson with rate $ \\lambda $ is to appeal to\n",
215 |     "[Lemma 1.1](https://continuous-time-mcs.quantecon.org/memoryless.html#erlexp).\n",
216 |     "\n",
217 |     "We observe that\n",
218 |     "\n",
219 |     "$$\n",
220 |     "\\PP\\{N_t \\leq n\\} \n",
221 |     "    = \\PP\\{J_{n+1} > t\\} \n",
222 |     "    = 1 - \\PP\\{J_{n+1} \\leq t\\}\n",
223 |     "$$\n",
224 |     "\n",
225 |     "Inserting the expression for the Erlang CDF in [(1.5)](https://continuous-time-mcs.quantecon.org/memoryless.html#equation-erlcdf) with shape $ n+1 $ and\n",
226 |     "rate $ \\lambda $, we obtain\n",
227 |     "\n",
228 |     "$$\n",
229 |     "\\PP\\{N_t \\leq n\\} \n",
230 |     "    = \\sum_{k=0}^{n} \\frac{(t \\lambda )^k}{k!} e^{-t \\lambda}\n",
231 |     "$$\n",
232 |     "\n",
233 |     "This is the (integer valued) CDF for the Poisson distribution with parameter\n",
234 |     "$ t \\lambda $.\n",
235 |     "\n",
236 |     "An exercise at the end of the lecture asks you to verify that $ N_t $ is Poisson-$ (t \\lambda ) $ informally via simulation.\n",
237 |     "\n",
238 |     "The next figure shows one realization of a Poisson process $ (N_t) $, with jumps\n",
239 |     "at each new arrival."
240 |    ]
241 |   },
242 |   {
243 |    "cell_type": "code",
244 |    "execution_count": null,
245 |    "id": "acde4c6a",
246 |    "metadata": {
247 |     "hide-output": false
248 |    },
249 |    "outputs": [],
250 |    "source": [
251 |     "np.random.seed(1234)\n",
252 |     "T = 5\n",
253 |     "Ws = np.random.exponential(size=T)\n",
254 |     "Js = np.cumsum(Ws)\n",
255 |     "Ys = np.arange(T)\n",
256 |     "\n",
257 |     "fig, ax = plt.subplots()\n",
258 |     "\n",
259 |     "ax.plot(np.insert(Js, 0, 0)[:-1], Ys, 'o')\n",
260 |     "ax.hlines(Ys, np.insert(Js, 0, 0)[:-1], Js, label='$N_t$')\n",
261 |     "ax.vlines(Js[:-1], Ys[:-1], Ys[1:], alpha=0.25)\n",
262 |     "\n",
263 |     "ax.set(xticks=[],\n",
264 |     "       yticks=range(Ys.max()+1),\n",
265 |     "       xlabel='time')\n",
266 |     "\n",
267 |     "ax.grid(lw=0.2)\n",
268 |     "ax.legend(loc='lower right')\n",
269 |     "plt.show()"
270 |    ]
271 |   },
272 |   {
273 |    "cell_type": "markdown",
274 |    "id": "5295a7a6",
275 |    "metadata": {},
276 |    "source": [
277 |     "## Stationary Independent Increments\n",
278 |     "\n",
279 |     "One of the defining features of a Poisson process is that it has stationary\n",
280 |     "and independent increments.\n",
281 |     "\n",
282 |     "This is due to the memoryless property of exponentials.\n",
283 |     "\n",
284 |     "It means that\n",
285 |     "\n",
286 |     "1. the variables $ \\{N_{t_{i+1}} - N_{t_i}\\}_{i \\in I} $ are independent for any\n",
287 |     "  strictly increasing finite sequence $ (t_i)_{i \\in I} $ and  \n",
288 |     "1. the distribution of $ N_{t+h} - N_t $ depends on $ h $ but not $ t $.  \n",
289 |     "\n",
290 |     "\n",
291 |     "A detailed proof can be found in Theorem 2.4.3 of [[Norris, 1998](https://continuous-time-mcs.quantecon.org/zreferences.html#id12)].\n",
292 |     "\n",
293 |     "Instead of repeating this, we provide some intuition from a discrete\n",
294 |     "approximation.\n",
295 |     "\n",
296 |     "In the discussion below, we use the following well known fact:  If\n",
297 |     "$ (\\theta_n) $ is a sequence such that $ n \\theta_n $ converges, then\n",
298 |     "\n",
299 |     "\n",
300 |     "<a id='equation-binpois'></a>\n",
301 |     "$$\n",
302 |     "\\text{Binomial}(n, \\theta_n) \n",
303 |     "    \\approx\n",
304 |     "    \\text{Poisson}(n \\theta_n)\n",
305 |     "    \\quad \\text{for large } n \\tag{2.3}\n",
306 |     "$$\n",
307 |     "\n",
308 |     "(The exercises ask you to examine this claim visually.)\n",
309 |     "\n",
310 |     "We now return to [the environment](https://continuous-time-mcs.quantecon.org/memoryless.html#geomtoexp) where we linked the\n",
311 |     "geometric distribution to the exponential.\n",
312 |     "\n",
313 |     "That is, we fix small $ h > 0 $ and let $ t_i := ih $ for all $ i \\in \\ZZ_+ $.\n",
314 |     "\n",
315 |     "Let $ (V_i) $ be IID binary random variables with $ \\PP\\{V_i = 1\\} = h \\lambda $ for some $ \\lambda > 0 $.\n",
316 |     "\n",
317 |     "Linking to our previous discussion,\n",
318 |     "\n",
319 |     "- either one or zero customers visits a shop at each $ t_i $.  \n",
320 |     "- $ V_i = 1 $ means that a customer visits at time $ t_i $.  \n",
321 |     "- Visits occur with probability $ h \\lambda $, which is proportional to the\n",
322 |     "  length of the interval between grid points.  \n",
323 |     "\n",
324 |     "\n",
325 |     "We learned that the wait time until the first visit is\n",
326 |     "approximately exponential with rate $ t \\lambda $.\n",
327 |     "\n",
328 |     "Since $ (V_i) $ is IID, the same is true for the second wait time and so on.\n",
329 |     "\n",
330 |     "Moreover, these wait times are independent, since they depend on separate\n",
331 |     "subsets of $ (V_i) $.\n",
332 |     "\n",
333 |     "Let $ \\hat N_t $ count the number of visits by time $ t $, as shown in the next figure.\n",
334 |     "\n",
335 |     "($ V_i = 1 $ is indicated by a vertical line at $ t_i = i h $.)"
336 |    ]
337 |   },
338 |   {
339 |    "cell_type": "code",
340 |    "execution_count": null,
341 |    "id": "7bef1565",
342 |    "metadata": {
343 |     "hide-output": false
344 |    },
345 |    "outputs": [],
346 |    "source": [
347 |     "fig, ax = plt.subplots()\n",
348 |     "np.random.seed(1)\n",
349 |     "T = 10\n",
350 |     "p = 0.25\n",
351 |     "B = np.random.uniform(size=T) < p\n",
352 |     "N = np.cumsum(B)\n",
353 |     "m = N[-1]  # max of N\n",
354 |     "\n",
355 |     "t_grid = np.arange(T)\n",
356 |     "t_ticks = [f'$t_{i}$' for i in t_grid]\n",
357 |     "ax.set_yticks(range(m+1))\n",
358 |     "ax.set_xticks(t_grid)\n",
359 |     "ax.set_xticklabels(t_ticks, fontsize=12)\n",
360 |     "\n",
361 |     "ax.step(t_grid, np.insert(N, 0, 0)[:-1], label='$\\hat N_t$')\n",
362 |     "\n",
363 |     "for i in t_grid:\n",
364 |     "    if B[i]:\n",
365 |     "        ax.vlines((i,), (0,), (m,), ls='--', lw=0.5)\n",
366 |     "\n",
367 |     "ax.legend(loc='center right')\n",
368 |     "plt.show()"
369 |    ]
370 |   },
371 |   {
372 |    "cell_type": "markdown",
373 |    "id": "517de576",
374 |    "metadata": {},
375 |    "source": [
376 |     "We expect from the discussion above that $ (\\hat N_t) $ approximates a Poisson process.\n",
377 |     "\n",
378 |     "This intuition is correct because, fixing $ t $, letting $ k := \\max\\{i \\in\n",
379 |     "\\ZZ_+ \\,:\\, t_i \\leq t\\} $ and applying [(2.3)](#equation-binpois), we have\n",
380 |     "\n",
381 |     "$$\n",
382 |     "\\hat N_t \n",
383 |     "    = \\sum_{i=1}^k V_i\n",
384 |     "    \\sim \\text{Binomial}(k, h \\lambda)\n",
385 |     "    \\approx\n",
386 |     "    \\text{Poisson}(k h \\lambda )\n",
387 |     "$$\n",
388 |     "\n",
389 |     "Using the fact that $ kh = t_k \\approx t $ as $ h \\to 0 $, we see\n",
390 |     "that $ \\hat N_t $ is approximately Poisson with rate $ t \\lambda $, just as we\n",
391 |     "expected.\n",
392 |     "\n",
393 |     "This approximate construction of a Poisson process helps illustrate the\n",
394 |     "property of stationary independent increments.\n",
395 |     "\n",
396 |     "For example, if we fix $ s, t $, then $ \\hat N_{s + t} - \\hat N_s $ is the number of visits\n",
397 |     "between $ s $ and $ s+t $, so that\n",
398 |     "\n",
399 |     "$$\n",
400 |     "\\hat N_{s+t} - \\hat N_s\n",
401 |     "    = \\sum_i V_i \\mathbb 1\\{ s \\leq t_i < s + t \\}\n",
402 |     "$$\n",
403 |     "\n",
404 |     "Suppose there are $ k $ grid points between $ s $ and $ s+t $, so that $ t \\approx\n",
405 |     "kh $.\n",
406 |     "\n",
407 |     "Then\n",
408 |     "\n",
409 |     "$$\n",
410 |     "\\hat N_{s+t} - \\hat N_s\n",
411 |     "    \\sim \\text{Binomial}(k, h \\lambda )\n",
412 |     "    \\approx \n",
413 |     "    \\text{Poisson}(k h \\lambda )\n",
414 |     "    \\approx \n",
415 |     "    \\text{Poisson}(t\\lambda)\n",
416 |     "$$\n",
417 |     "\n",
418 |     "This illustrates the idea that, for a Poisson process $ (N_t) $, we have\n",
419 |     "\n",
420 |     "$$\n",
421 |     "N_{s+t} - N_s \n",
422 |     "   \\sim  \\text{Poisson}(t\\lambda)\n",
423 |     "$$\n",
424 |     "\n",
425 |     "In particular, increments are stationary (the distribution depends on $ t $ but not $ s $).\n",
426 |     "\n",
427 |     "The approximation also illustrates independence of increments, since, in the\n",
428 |     "approximation, increments depend on separate subsets of $ (V_i) $."
429 |    ]
430 |   },
431 |   {
432 |    "cell_type": "markdown",
433 |    "id": "00b8deff",
434 |    "metadata": {},
435 |    "source": [
436 |     "## Uniqueness\n",
437 |     "\n",
438 |     "What other counting processes have stationary independent increments?\n",
439 |     "\n",
440 |     "Remarkably, the answer is none:"
441 |    ]
442 |   },
443 |   {
444 |    "cell_type": "markdown",
445 |    "id": "79106fe3",
446 |    "metadata": {},
447 |    "source": [
448 |     "**Theorem 2.1** (Characterization of Poisson Processes)\n",
449 |     "\n",
450 |     "If $ (M_t) $ is a stochastic process supported on $ \\ZZ_+ $ and starting at 0 with\n",
451 |     "the property that its increments are stationary and independent, then $ (M_t) $ is a Poisson process."
452 |    ]
453 |   },
454 |   {
455 |    "cell_type": "markdown",
456 |    "id": "1a7363cc",
457 |    "metadata": {},
458 |    "source": [
459 |     "In particular, there exists a $ \\lambda > 0 $ such that\n",
460 |     "\n",
461 |     "$$\n",
462 |     "M_{s + t} - M_s\n",
463 |     "   \\sim  \\text{Poisson}(t\\lambda)\n",
464 |     "$$\n",
465 |     "\n",
466 |     "for any $ s, t $.\n",
467 |     "\n",
468 |     "The proof is similar to our earlier proof that the exponential distribution is\n",
469 |     "the only memoryless distribution.\n",
470 |     "\n",
471 |     "Details can be found in Section 6.2 of [[Pardoux, 2008](https://continuous-time-mcs.quantecon.org/zreferences.html#id11)] or\n",
472 |     "Theorem 2.4.3 of [[Norris, 1998](https://continuous-time-mcs.quantecon.org/zreferences.html#id12)].\n",
473 |     "\n",
474 |     "\n",
475 |     "<a id='restart-prop'></a>"
476 |    ]
477 |   },
478 |   {
479 |    "cell_type": "markdown",
480 |    "id": "9d31ebe8",
481 |    "metadata": {},
482 |    "source": [
483 |     "### The Restarting Property\n",
484 |     "\n",
485 |     "An important consequence of stationary independent increments is the\n",
486 |     "restarting property, which means that, when simulating, we can freely stop and\n",
487 |     "restart a Poisson process at any time:"
488 |    ]
489 |   },
490 |   {
491 |    "cell_type": "markdown",
492 |    "id": "13c0dc5a",
493 |    "metadata": {},
494 |    "source": [
495 |     "**Theorem 2.2** (Poisson Processes can be Paused and Restarted)\n",
496 |     "\n",
497 |     "If $ (N_t) $ is a Poisson process, $ s > 0 $ and\n",
498 |     "$ (M_t) $ is defined by $ M_t = N_{s+t} - N_s $ for $ t \\geq 0 $, then $ (M_t) $ is a\n",
499 |     "Poisson process independent of $ (N_r)_{r \\leq s} $."
500 |    ]
501 |   },
502 |   {
503 |    "cell_type": "markdown",
504 |    "id": "7be03c94",
505 |    "metadata": {},
506 |    "source": [
507 |     "**Proof.**\n",
508 |     "\n",
509 |     "Proof. Independence of $ (M_t) $ and $ (N_r)_{r \\leq s} $ follows from indepenence of the\n",
510 |     "increments of $ (N_t) $.\n",
511 |     "\n",
512 |     "In view of the uniqueness statement above, we can verify that $ (M_t) $ is a\n",
513 |     "Poisson process by showing that $ (M_t) $ starts at zero, takes values in\n",
514 |     "$ \\ZZ_+ $ and has stationary independent increments.\n",
515 |     "\n",
516 |     "It is clear that $ (M_t) $ starts at zero and takes values in $ \\ZZ_+ $.\n",
517 |     "\n",
518 |     "In addition, if we take any $ t < t' $, then\n",
519 |     "\n",
520 |     "$$\n",
521 |     "M_{t'} - M_t = N_{s+t'} - N_{s + t}\n",
522 |     "   \\sim  \\text{Poisson}((t' - t) \\lambda)\n",
523 |     "$$\n",
524 |     "\n",
525 |     "Hence $ (M_t) $ has stationary increments and,\n",
526 |     "using the relation $ M_{t'} - M_t = N_{s+t'} - N_{s + t} $ again,\n",
527 |     "the increments are independent as well.\n",
528 |     "\n",
529 |     "We conclude that $ (N_{s+t} - N_s)_{t \\geq 0} $ is indeed a\n",
530 |     "Poisson process independent of $ (N_r)_{r \\leq s} $."
531 |    ]
532 |   },
533 |   {
534 |    "cell_type": "markdown",
535 |    "id": "513c4977",
536 |    "metadata": {},
537 |    "source": [
538 |     "## Exercises"
539 |    ]
540 |   },
541 |   {
542 |    "cell_type": "markdown",
543 |    "id": "ea5d55b8",
544 |    "metadata": {},
545 |    "source": [
546 |     "## Exercise 2.1\n",
547 |     "\n",
548 |     "Fix $ \\lambda > 0 $ and draw $ \\{W_i\\} $ as IID exponentials with rate $ \\lambda $.\n",
549 |     "\n",
550 |     "Set $ J_n := W_1 + \\cdots W_n $ with $ J_0 = 0 $ and\n",
551 |     "$ N_t := \\sum_{n \\geq 0} n \\mathbb 1\\{ J_n \\leq t < J_{n+1} \\} $.\n",
552 |     "\n",
553 |     "Provide a visual test of the claim that $ N_t $ is Poisson with parameter $ t\n",
554 |     "\\lambda $.\n",
555 |     "\n",
556 |     "Do this by fixing $ t = T $, generating many independent draws of $ N_T $ and\n",
557 |     "comparing the empirical distribution of the sample with a Poisson\n",
558 |     "distribution with rate $ T \\lambda $.\n",
559 |     "\n",
560 |     "Try first with $ \\lambda = 0.5 $ and $ T=10 $."
561 |    ]
562 |   },
563 |   {
564 |    "cell_type": "markdown",
565 |    "id": "cf0f1c2b",
566 |    "metadata": {},
567 |    "source": [
568 |     "## Solution to[ Exercise 2.1](https://continuous-time-mcs.quantecon.org/#poisson-ex-1)\n",
569 |     "\n",
570 |     "Here is one solution.\n",
571 |     "\n",
572 |     "The figure shows that the fit is already good with a modest sample size.\n",
573 |     "\n",
574 |     "Increasing the sample size will further improve the fit."
575 |    ]
576 |   },
577 |   {
578 |    "cell_type": "code",
579 |    "execution_count": null,
580 |    "id": "5edbd274",
581 |    "metadata": {
582 |     "hide-output": false
583 |    },
584 |    "outputs": [],
585 |    "source": [
586 |     "λ = 0.5\n",
587 |     "T = 10\n",
588 |     "\n",
589 |     "def poisson(k, r):\n",
590 |     "    \"Poisson pmf with rate r.\"\n",
591 |     "    return np.exp(-r) * (r**k) / factorial(k)\n",
592 |     "\n",
593 |     "@njit\n",
594 |     "def draw_Nt(max_iter=1e5):\n",
595 |     "    J = 0\n",
596 |     "    n = 0\n",
597 |     "    while n < max_iter:\n",
598 |     "        W = np.random.exponential(scale=1/λ)\n",
599 |     "        J += W\n",
600 |     "        if J > T:\n",
601 |     "            return n\n",
602 |     "        n += 1\n",
603 |     "\n",
604 |     "@njit\n",
605 |     "def draw_Nt_sample(num_draws):\n",
606 |     "    draws = np.empty(num_draws)\n",
607 |     "    for i in range(num_draws):\n",
608 |     "        draws[i] = draw_Nt()\n",
609 |     "    return draws\n",
610 |     "\n",
611 |     "\n",
612 |     "sample_size = 10_000\n",
613 |     "sample = draw_Nt_sample(sample_size)\n",
614 |     "max_val = sample.max()\n",
615 |     "vals = np.arange(0, max_val+1)\n",
616 |     "\n",
617 |     "fig, ax = plt.subplots()\n",
618 |     "\n",
619 |     "ax.plot(vals, [poisson(v, T * λ) for v in vals],\n",
620 |     "    marker='o', label='poisson')\n",
621 |     "ax.plot(vals, [np.mean(sample==v) for v in vals],\n",
622 |     "    marker='o', label='empirical')\n",
623 |     "\n",
624 |     "ax.legend(fontsize=12)\n",
625 |     "plt.show()"
626 |    ]
627 |   },
628 |   {
629 |    "cell_type": "markdown",
630 |    "id": "fdb9095d",
631 |    "metadata": {},
632 |    "source": [
633 |     "## Exercise 2.2\n",
634 |     "\n",
635 |     "In the lecture we used the fact that $ \\Binomial(n, \\theta) \\approx \\Poisson(n \\theta) $ when $ n $ is large and $ \\theta $ is small.\n",
636 |     "\n",
637 |     "Investigate this relationship by plotting the distributions side by side.\n",
638 |     "\n",
639 |     "Experiment with different values of $ n $ and $ \\theta $."
640 |    ]
641 |   },
642 |   {
643 |    "cell_type": "markdown",
644 |    "id": "e54df243",
645 |    "metadata": {},
646 |    "source": [
647 |     "## Solution to[ Exercise 2.2](https://continuous-time-mcs.quantecon.org/#poisson-ex-2)\n",
648 |     "\n",
649 |     "Here is one solution.  It shows that the approximation is good when $ n $ is\n",
650 |     "large and $ \\theta $ is small."
651 |    ]
652 |   },
653 |   {
654 |    "cell_type": "code",
655 |    "execution_count": null,
656 |    "id": "903d9990",
657 |    "metadata": {
658 |     "hide-output": false
659 |    },
660 |    "outputs": [],
661 |    "source": [
662 |     "def binomial(k, n, p):\n",
663 |     "    # Binomial(n, p) pmf evaluated at k\n",
664 |     "    return binom(n, k) * p**k * (1-p)**(n-k)\n",
665 |     "\n",
666 |     "θ_vals = 0.5, 0.2, 0.1\n",
667 |     "\n",
668 |     "n_vals = 50, 75, 100\n",
669 |     "\n",
670 |     "fig, axes = plt.subplots(len(n_vals), 1, figsize=(6, 12))\n",
671 |     "\n",
672 |     "for n, θ, ax in zip(n_vals, θ_vals, axes.flatten()):\n",
673 |     "\n",
674 |     "    k_grid = np.arange(n)\n",
675 |     "    binom_vals = [binomial(k, n, θ) for k in k_grid]\n",
676 |     "    poisson_vals = [poisson(k, n * θ) for k in k_grid]\n",
677 |     "    ax.plot(k_grid, binom_vals, 'o-', alpha=0.5, label='binomial')\n",
678 |     "    ax.plot(k_grid, poisson_vals, 'o-', alpha=0.5, label='Poisson')\n",
679 |     "    ax.set_title(f'$n={n}$ and $\\\\theta = {θ}$')\n",
680 |     "    ax.legend(fontsize=12)\n",
681 |     "\n",
682 |     "fig.tight_layout()\n",
683 |     "plt.show()"
684 |    ]
685 |   }
686 |  ],
687 |  "metadata": {
688 |   "date": 1763463650.0208964,
689 |   "filename": "poisson.md",
690 |   "kernelspec": {
691 |    "display_name": "Python",
692 |    "language": "python3",
693 |    "name": "python3"
694 |   },
695 |   "title": "Poisson Processes"
696 |  },
697 |  "nbformat": 4,
698 |  "nbformat_minor": 5
699 | }


--------------------------------------------------------------------------------
/generators.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "id": "03e608a3",
  6 |    "metadata": {},
  7 |    "source": [
  8 |     "$$\n",
  9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
 10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
 11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
 12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
 13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
 14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
 15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
 16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
 17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
 18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
 19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
 20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
 21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
 22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
 23 |     "$$"
 24 |    ]
 25 |   },
 26 |   {
 27 |    "cell_type": "markdown",
 28 |    "id": "a6658ec9",
 29 |    "metadata": {},
 30 |    "source": [
 31 |     "# Semigroups and Generators"
 32 |    ]
 33 |   },
 34 |   {
 35 |    "cell_type": "markdown",
 36 |    "id": "6e9fbb20",
 37 |    "metadata": {},
 38 |    "source": [
 39 |     "## Overview\n",
 40 |     "\n",
 41 |     "We have seen in previous lectures that every intensity matrix generates a\n",
 42 |     "Markov semigroup.\n",
 43 |     "\n",
 44 |     "We have also hinted that the pairing is one-to-one, in a sense to be made precise.\n",
 45 |     "\n",
 46 |     "To clarify these ideas, we start in an\n",
 47 |     "abstract setting, with an arbitrary initial value problem.\n",
 48 |     "\n",
 49 |     "In this setting we introduce general operator semigroups and their generators.\n",
 50 |     "\n",
 51 |     "Once this is done, we will be able to return to the Markov case and fully\n",
 52 |     "clarify the connection between intensity matrices and Markov semigroups.\n",
 53 |     "\n",
 54 |     "The material below is relatively technical, with most of the\n",
 55 |     "complications driven by the fact that the state space can be infinite.\n",
 56 |     "\n",
 57 |     "Such technicalities are hard to avoid, since so many interesting Markov chains\n",
 58 |     "do have infinite state spaces.\n",
 59 |     "\n",
 60 |     "- Our very first example – the Poisson process – has an infinite state space.  \n",
 61 |     "- Another example is the study of queues, which often have no natural upper\n",
 62 |     "  bound.<sup><a href=#footnotepp id=footnotepp-link>[1]</a></sup>  \n",
 63 |     "\n",
 64 |     "\n",
 65 |     "Readers are assumed to have some basic familiarity with [Banach spaces](https://en.wikipedia.org/wiki/Banach_space)."
 66 |    ]
 67 |   },
 68 |   {
 69 |    "cell_type": "markdown",
 70 |    "id": "113b2f64",
 71 |    "metadata": {},
 72 |    "source": [
 73 |     "## Motivation\n",
 74 |     "\n",
 75 |     "The general theory of continuous semigroups of operators is motivated by\n",
 76 |     "the problem of solving linear ODEs in infinite dimensional spaces.<sup><a href=#fnpde id=fnpde-link>[2]</a></sup>\n",
 77 |     "\n",
 78 |     "More specifically, the challenge is to solve initial value problems such as\n",
 79 |     "\n",
 80 |     "\n",
 81 |     "<a id='equation-abscp'></a>\n",
 82 |     "$$\n",
 83 |     "x'_t = A x_t,\n",
 84 |     "    \\quad x_0 \\text{ given} \\tag{6.1}\n",
 85 |     "$$\n",
 86 |     "\n",
 87 |     "where\n",
 88 |     "\n",
 89 |     "- $ x_t $ takes value in a Banach space at each time $ t $,  \n",
 90 |     "- $ A $ is a linear operator and  \n",
 91 |     "- the time derivative $ x'_t $ uses a definition appropriate for a Banach\n",
 92 |     "  space.  \n",
 93 |     "\n",
 94 |     "\n",
 95 |     "This problem is also called the “abstract Cauchy problem”.\n",
 96 |     "\n",
 97 |     "Why do we need to solve such problems?\n",
 98 |     "\n",
 99 |     "One example comes from PDEs.\n",
100 |     "\n",
101 |     "PDEs tell us how functions change over time, starting from an infinitesimal\n",
102 |     "description.\n",
103 |     "\n",
104 |     "When $ x_t $ is a point in a function space, this fits into the framework of\n",
105 |     "[(6.1)](#equation-abscp).\n",
106 |     "\n",
107 |     "Another example comes from Markov processes, where, as we have seen, the flow\n",
108 |     "of distributions over time can be represented as a linear ODE in distribution\n",
109 |     "space.\n",
110 |     "\n",
111 |     "If the number of state is infinite, then the space of distributions is\n",
112 |     "infinite dimensional.\n",
113 |     "\n",
114 |     "This is another version of [(6.1)](#equation-abscp), and we return to it after a discussion\n",
115 |     "of the general theory.\n",
116 |     "\n",
117 |     "To give a high level view of the results below, the solution to the Cauchy\n",
118 |     "problem is represented as a trajectory $ t \\mapsto U_t x_0 $ from the initial\n",
119 |     "value $ x_0 $ under a semigroup of maps $ (U_t) $.\n",
120 |     "\n",
121 |     "The operator $ A $ in [(6.1)](#equation-abscp) is called the “generator” of $ (U_t) $ and is\n",
122 |     "its infinitesimal description."
123 |    ]
124 |   },
125 |   {
126 |    "cell_type": "markdown",
127 |    "id": "2fae30ac",
128 |    "metadata": {},
129 |    "source": [
130 |     "## Preliminaries\n",
131 |     "\n",
132 |     "Throughout this lecture, $ (\\BB, \\| \\cdot \\|) $ is a Banach space."
133 |    ]
134 |   },
135 |   {
136 |    "cell_type": "markdown",
137 |    "id": "94ee6ab1",
138 |    "metadata": {},
139 |    "source": [
140 |     "### The Space of Linear Operators\n",
141 |     "\n",
142 |     "You will recall that a **linear operator** on $ \\BB $ is a map $ A $ from $ \\BB $ to\n",
143 |     "itself satisfying\n",
144 |     "\n",
145 |     "$$\n",
146 |     "A(\\alpha g + \\beta h) = \\alpha A g + \\beta A h,\n",
147 |     "    \\quad\n",
148 |     "    \\forall \\, g, h \\in \\BB, \\;\\; \\alpha, \\beta \\in \\RR\n",
149 |     "$$\n",
150 |     "\n",
151 |     "The operator $ A $ is called **bounded** if\n",
152 |     "\n",
153 |     "\n",
154 |     "<a id='equation-norml'></a>\n",
155 |     "$$\n",
156 |     "\\| A \\| := \\sup_{g \\in \\BB, \\, \\| g \\| \\leq 1} \\| A g \\| < \\infty \\tag{6.2}\n",
157 |     "$$\n",
158 |     "\n",
159 |     "This is the usual definition of a [bounded linear operator](https://en.wikipedia.org/wiki/Bounded_operator) on a normed linear space.\n",
160 |     "\n",
161 |     "The set of all bounded linear operators on $ \\BB $ is denoted by $ \\linop $ and is\n",
162 |     "itself a Banach space.\n",
163 |     "\n",
164 |     "Sums and scalar products of elements of $ \\linop $ are defined in the usual way,\n",
165 |     "so that, for $ \\alpha \\in \\RR $, $ A, B \\in \\linop $ and $ g \\in \\BB $, we have\n",
166 |     "\n",
167 |     "$$\n",
168 |     "(A + B) g = Ag + Bg, \n",
169 |     "    \\quad (\\alpha A) g = \\alpha (A g)\n",
170 |     "$$\n",
171 |     "\n",
172 |     "and so on.\n",
173 |     "\n",
174 |     "We write $ A B $ to indicate composition of the operators $ A, B \\in \\linop $.\n",
175 |     "\n",
176 |     "The value defined in [(6.2)](#equation-norml) is called the [**operator norm**](https://en.wikipedia.org/wiki/Operator_norm) of $ A $ and,\n",
177 |     "as suggested by the notation, is a norm on $ \\linop $.\n",
178 |     "\n",
179 |     "In addition to being a norm, it enjoys the submultiplicative property $ \\| AB\n",
180 |     "\\| \\leq \\| A \\| \\| B\\| $ for all $ A, B \\in \\linop $.\n",
181 |     "\n",
182 |     "Let $ I $ be the identity in $ \\linop $, satisfying $ Ig = g $ for all $ g \\in \\BB $.\n",
183 |     "\n",
184 |     "(In fact $ \\linop $ is a [unital Banach algebra](https://en.wikipedia.org/wiki/Banach_algebra) when multiplication is identified with operator composition and $ I $ is adopted as the unit.)"
185 |    ]
186 |   },
187 |   {
188 |    "cell_type": "markdown",
189 |    "id": "25083cb7",
190 |    "metadata": {},
191 |    "source": [
192 |     "### The Exponential Function\n",
193 |     "\n",
194 |     "Given $ A \\in \\linop $, the exponential of $ A $ is the element of\n",
195 |     "$ \\linop $ defined as\n",
196 |     "\n",
197 |     "\n",
198 |     "<a id='equation-opexpo'></a>\n",
199 |     "$$\n",
200 |     "e^A  \n",
201 |     "    = \\sum_{k \\geq 0} \\frac{A^k}{k!} \n",
202 |     "    = I + A + \\frac{A^2}{2!} + \\cdots \\tag{6.3}\n",
203 |     "$$\n",
204 |     "\n",
205 |     "This is the same as the definition for the matrix exponential. The exponential function arises naturally as the solution to ODEs in Banach\n",
206 |     "space, one example of which (as we shall see) is distribution flows\n",
207 |     "associated with continuous time Markov chains.\n",
208 |     "\n",
209 |     "The exponential map has the following properties:\n",
210 |     "\n",
211 |     "- For each $ A \\in \\linop $, the operator $ e^A $ is a well defined element of $ \\linop $ with $ \\| e^A \\| \\leq e^{\\| A \\|} $.<sup><a href=#fncoex id=fncoex-link>[3]</a></sup>  \n",
212 |     "- $ e^0 = I $, where $ 0 $ is the zero element of $ \\linop $.  \n",
213 |     "- If $ A, B \\in \\linop $ and $ AB = BA $, then $ e^{A + B} = e^A e^B $  \n",
214 |     "- If $ A \\in \\linop $, then $ e^A $ is invertible and $ (e^A)^{-1} = e^{-A} $.  \n",
215 |     "\n",
216 |     "\n",
217 |     "The last fact is easily checked from the previous ones."
218 |    ]
219 |   },
220 |   {
221 |    "cell_type": "markdown",
222 |    "id": "af13eec8",
223 |    "metadata": {},
224 |    "source": [
225 |     "### Operator Calculus\n",
226 |     "\n",
227 |     "Consider a function\n",
228 |     "\n",
229 |     "$$\n",
230 |     "\\RR_+ \\ni t \\mapsto U_t \\in \\linop\n",
231 |     "$$\n",
232 |     "\n",
233 |     "which we can think of as a time path in $ \\linop $, such as a flow of Markov\n",
234 |     "operators.\n",
235 |     "\n",
236 |     "We say that this function is **differentiable at $ \\tau \\in \\RR_+ $** if there exists\n",
237 |     "an element $ T $ of $ \\linop $ such that\n",
238 |     "\n",
239 |     "\n",
240 |     "<a id='equation-devlim'></a>\n",
241 |     "$$\n",
242 |     "\\frac{U_{\\tau+h} - U_\\tau}{h} \\to T \n",
243 |     "    \\; \\text{ as } h \\to 0 \\tag{6.4}\n",
244 |     "$$\n",
245 |     "\n",
246 |     "In this case, $ T $ is called the **derivative** of the function $ t \\mapsto U_t $ at $ \\tau $ and we write\n",
247 |     "\n",
248 |     "$$\n",
249 |     "T = U'_\\tau \n",
250 |     "    \\; \\text{ or } \\;\n",
251 |     "    T = \\frac{d}{dt} U_t \\, \\Big|_{t=\\tau}\n",
252 |     "$$\n",
253 |     "\n",
254 |     "(Convergence of operators is in operator norm.  If $ \\tau = 0 $, then the limit\n",
255 |     "$ h \\to 0 $ in [(6.4)](#equation-devlim) is the right limit.)"
256 |    ]
257 |   },
258 |   {
259 |    "cell_type": "markdown",
260 |    "id": "4e869351",
261 |    "metadata": {},
262 |    "source": [
263 |     "**Example 6.1**\n",
264 |     "\n",
265 |     "If $ U_t = t V $ for some fixed $ V \\in \\linop $, then it is easy to\n",
266 |     "see that $ V $ is the derivative of $ t \\mapsto U_t $ at every $ t \\in \\RR_+ $."
267 |    ]
268 |   },
269 |   {
270 |    "cell_type": "markdown",
271 |    "id": "3009cfa3",
272 |    "metadata": {},
273 |    "source": [
274 |     "**Example 6.2**\n",
275 |     "\n",
276 |     "In [our discussion](https://continuous-time-mcs.quantecon.org/kolmogorov_fwd.html) of the Kolmogorov forward equation\n",
277 |     "when $ S $ is finite, we introduced the derivative of a map $ t\n",
278 |     "\\mapsto P_t $, where each $ P_t $ is a matrix on $ S $.\n",
279 |     "\n",
280 |     "The derivative was defined by differentiating $ P_t $ element-by-element.\n",
281 |     "\n",
282 |     "This coincides with the operator-theoretic definition in [(6.4)](#equation-devlim) when $ S $\n",
283 |     "is finite, because then the space $ \\lL(\\ell_1) $, which consists of all bounded linear operators on $ \\ell_1 $, is finite dimensional, and\n",
284 |     "hence pointwise and norm convergence coincide."
285 |    ]
286 |   },
287 |   {
288 |    "cell_type": "markdown",
289 |    "id": "7c8a9517",
290 |    "metadata": {},
291 |    "source": [
292 |     "Analogous to the matrix and scalar cases, we have the following result:"
293 |    ]
294 |   },
295 |   {
296 |    "cell_type": "markdown",
297 |    "id": "83bc7130",
298 |    "metadata": {},
299 |    "source": [
300 |     "**Lemma 6.1** (Differentiability of the Exponential Curve)\n",
301 |     "\n",
302 |     "For all $ A \\in \\linop $, the exponential curve $ t \\mapsto e^{tA} $ is everywhere differentiable and\n",
303 |     "\n",
304 |     "\n",
305 |     "<a id='equation-expdiffer'></a>\n",
306 |     "$$\n",
307 |     "\\frac{d}{dt} e^{tA} = e^{tA} A = A e^{tA} \\tag{6.5}\n",
308 |     "$$"
309 |    ]
310 |   },
311 |   {
312 |    "cell_type": "markdown",
313 |    "id": "75112975",
314 |    "metadata": {},
315 |    "source": [
316 |     "The proof is a (solved) exercise (see below)."
317 |    ]
318 |   },
319 |   {
320 |    "cell_type": "markdown",
321 |    "id": "32fa7415",
322 |    "metadata": {},
323 |    "source": [
324 |     "## Semigroups and Generators\n",
325 |     "\n",
326 |     "For continuous time Markov chains where the state space $ S $ is finite, we\n",
327 |     "saw that Markov semigroups often take the form $ P_t = e^{tQ} $ for some\n",
328 |     "intensity matrix $ Q $.\n",
329 |     "\n",
330 |     "This is ideal because the entire semigroup is characterized in a simple way by\n",
331 |     "its infinitesimal description $ Q $.\n",
332 |     "\n",
333 |     "It turns out that, when $ S $ is finite, this is always true:  if $ (P_t) $ is a\n",
334 |     "Markov semigroup, then there exists an intensity matrix $ Q $ satisfying $ P_t =\n",
335 |     "e^{tQ} $ for all $ t $.\n",
336 |     "\n",
337 |     "Moreover, this statement is again true when $ S $ is infinite, provided that\n",
338 |     "some restrictions are placed on the semigroup.\n",
339 |     "\n",
340 |     "Our aim is to make these statements precise, starting in an abstract setting\n",
341 |     "and then specializing."
342 |    ]
343 |   },
344 |   {
345 |    "cell_type": "markdown",
346 |    "id": "e2a74db9",
347 |    "metadata": {},
348 |    "source": [
349 |     "### Operator Semigroups\n",
350 |     "\n",
351 |     "Let $ U_t $ be an element of $ \\linop $ for all $ t \\in \\RR_+ $\n",
352 |     "\n",
353 |     "We say that $ (U_t) $ is an **evolution semigroup** on $ \\linop $ if $ U_0 = I $ and\n",
354 |     "$ U_{s + t} = U_s U_t $ for all $ s, t \\geq 0 $.\n",
355 |     "\n",
356 |     "The idea is that $ (U_t) $ generates a path in $ \\BB $ from any starting point $ g \\in \\BB $, so that $ U_t g $ is interpreted as the location of the state after $ t $ units of time.\n",
357 |     "\n",
358 |     "An evolution semigroup $ (U_t) $ is called\n",
359 |     "\n",
360 |     "- a $ C_0 $ **semigroup** on $ \\BB $ if, for each $ g \\in \\BB $, the map $ t \\mapsto U_t g $ from $ \\RR_+ $ to $ \\BB $ is continuous, and  \n",
361 |     "- a **uniformly continuous semigroup** on $ \\BB $ if the map $ t \\mapsto U_t $ from $ \\RR_+ $ to $ \\linop $ is continuous.  \n",
362 |     "\n",
363 |     "\n",
364 |     "In what follows we abbreviate “uniformly continuous” to UC.<sup><a href=#ucnote id=ucnote-link>[4]</a></sup>"
365 |    ]
366 |   },
367 |   {
368 |    "cell_type": "markdown",
369 |    "id": "645116e7",
370 |    "metadata": {},
371 |    "source": [
372 |     "**Example 6.3** (Exponential curves are UC semigroups)\n",
373 |     "\n",
374 |     "If $ U_t = e^{tA} $ for $ t \\in \\RR_+ $ and $ A \\in \\linop $, then $ (U_t) $\n",
375 |     "is a uniformly continuous semigroup on $ \\BB $."
376 |    ]
377 |   },
378 |   {
379 |    "cell_type": "markdown",
380 |    "id": "fb66eee3",
381 |    "metadata": {},
382 |    "source": [
383 |     "The claim that $ (U_t) $ is an evolution semigroup follows directly from the\n",
384 |     "properties of the exponential function given above.\n",
385 |     "\n",
386 |     "Uniform continuity can be established using arguments similar to those in the\n",
387 |     "proof of differentiability in [Lemma 6.1](#diffexpmap).\n",
388 |     "\n",
389 |     "Since norm convergence on $ \\linop $ implies pointwise convergence, every\n",
390 |     "uniformly continuous semigroup is a $ C_0 $ semigroup.\n",
391 |     "\n",
392 |     "The reverse is certainly not true — there are many important $ C_0 $ semigroups that fail to be uniformly continuous.\n",
393 |     "\n",
394 |     "In fact semigroups associated with PDEs, diffusions and other Markov processes\n",
395 |     "on continuous state spaces are typically $ C_0 $ but not uniformly continuous.\n",
396 |     "\n",
397 |     "There are also important examples of Markov semigroups on infinite\n",
398 |     "discrete state spaces that fail to be uniformly continuous.\n",
399 |     "\n",
400 |     "However, we will soon see that, for most continuous time Markov chains used in\n",
401 |     "applications, the semigroups are uniformly continuous."
402 |    ]
403 |   },
404 |   {
405 |    "cell_type": "markdown",
406 |    "id": "6f0b2c8c",
407 |    "metadata": {},
408 |    "source": [
409 |     "### Generators\n",
410 |     "\n",
411 |     "Consider a continuous time Markov chain on a finite state space with intensity\n",
412 |     "matrix $ Q $.\n",
413 |     "\n",
414 |     "The Markov semigroup $ (P_t) $ is fully specified by this infinitesimal\n",
415 |     "description $ Q $, in the sense that\n",
416 |     "\n",
417 |     "- $ P_t = e^{tQ} $ for all $ t \\geq 0 $ and (equivalently)  \n",
418 |     "- the forward and backward equations hold: $ P_t' = P_t Q = Q P_t $.  \n",
419 |     "\n",
420 |     "\n",
421 |     "Since $ P_0 = I $, the matrix $ Q $ can be recovered from the semigroup via\n",
422 |     "\n",
423 |     "$$\n",
424 |     "Q = P'_0 = \\lim_{h \\downarrow 0} \\frac{P_h - I}{h}\n",
425 |     "$$\n",
426 |     "\n",
427 |     "In the more abstract setting of $ C_0 $ semigroups, we say that $ Q $ is the\n",
428 |     "“generator” of the semigroup $ (P_t) $.\n",
429 |     "\n",
430 |     "More generally, given a $ C_0 $ semigroup $ (U_t) $, we say that a linear\n",
431 |     "operator $ A $ from $ \\BB $ to itself is the **generator** of $ (U_t) $ if\n",
432 |     "\n",
433 |     "\n",
434 |     "<a id='equation-defgenr'></a>\n",
435 |     "$$\n",
436 |     "A g = \\lim_{h \\downarrow 0} \\frac{U_h g - g}{h} \\tag{6.6}\n",
437 |     "$$\n",
438 |     "\n",
439 |     "for all $ g \\in \\BB $ such that the limit exists.\n",
440 |     "\n",
441 |     "The set of points where the limit exists (the domain of the generator) is\n",
442 |     "denoted by $ D(A) $.\n",
443 |     "\n",
444 |     "At this point we would like to write [(6.6)](#equation-defgenr) as $ A = U'_0 $, or express\n",
445 |     "$ U_t $ as $ e^{tA} $, analogous to the Markov case.\n",
446 |     "\n",
447 |     "There are problems, however.\n",
448 |     "\n",
449 |     "One problem is that the limit in [(6.6)](#equation-defgenr) can fail to exist for some $ g\n",
450 |     "\\in \\BB $.\n",
451 |     "\n",
452 |     "Indeed, why should the limit exist, given that $ C_0 $ semigroups are not\n",
453 |     "required to be differentiable?\n",
454 |     "\n",
455 |     "The other problem is that, even though the limit exists, the linear operator\n",
456 |     "$ A $ might be unbounded (i.e., not an element of $ \\linop $), in which case\n",
457 |     "a statement like $ U_t = e^{tA} $ is problematic.\n",
458 |     "\n",
459 |     "It turns out that, despite these issues, the theory of $ C_0 $ semigroups is\n",
460 |     "powerful, and, with some work, the technical issues can be\n",
461 |     "circumvented.<sup><a href=#fnhy id=fnhy-link>[5]</a></sup>\n",
462 |     "\n",
463 |     "Even better, for the applications we wish to consider, we can focus on UC\n",
464 |     "semigroups, where these problems do not arise.\n",
465 |     "\n",
466 |     "The next section gives details."
467 |    ]
468 |   },
469 |   {
470 |    "cell_type": "markdown",
471 |    "id": "b97d63f9",
472 |    "metadata": {},
473 |    "source": [
474 |     "### A Characterization of Uniformly Continuous Semigroups\n",
475 |     "\n",
476 |     "We saw in [Example 6.3](#ecuc) that exponential curves are an example\n",
477 |     "of a UC semigroup.\n",
478 |     "\n",
479 |     "The next theorem tells us that there are no other examples."
480 |    ]
481 |   },
482 |   {
483 |    "cell_type": "markdown",
484 |    "id": "361f1dfb",
485 |    "metadata": {},
486 |    "source": [
487 |     "**Theorem 6.1** (UC Semigroups are Exponential Curves)\n",
488 |     "\n",
489 |     "If $ (U_t) $ is a UC semigroup on $ \\BB $, then there exists an $ A \\in \\linop $\n",
490 |     "such that $ U_t = e^{tA} $ for all $ t \\geq 0 $.  Moreover,\n",
491 |     "\n",
492 |     "- $ U_t $ is differentiable at every $ t \\geq 0 $,  \n",
493 |     "- $ A $ is the generator of $ (U_t) $ and  \n",
494 |     "- $ U_t' = A U_t = U_t A $ for all $ t \\geq 0 $.  "
495 |    ]
496 |   },
497 |   {
498 |    "cell_type": "markdown",
499 |    "id": "35b01a09",
500 |    "metadata": {},
501 |    "source": [
502 |     "The last three claims in [Theorem 6.1](#ucsgec) follow directly from the\n",
503 |     "first claim.\n",
504 |     "\n",
505 |     "The statement $ U_t' = A U_t = U_t A $ is a\n",
506 |     "generalization of the Kolmogorov forward and backward equations.\n",
507 |     "\n",
508 |     "While slightly more complicated in the Banach setting, the proof of the first\n",
509 |     "claim (existence of an exponential representation) is a direct extension of\n",
510 |     "the fact that any continuous function $ f $ from $ \\RR $ to itself\n",
511 |     "satisfying\n",
512 |     "\n",
513 |     "- $ f(s)f(t) = f(s+t) $ for all $ s,t \\geq 0 $ and  \n",
514 |     "- $ f(0) = 1 $  \n",
515 |     "\n",
516 |     "\n",
517 |     "also satisfies $ f(t) = e^{ta} $ for some $ a \\in \\RR $.\n",
518 |     "\n",
519 |     "We proved something quite similar in [Theorem 1.1](https://continuous-time-mcs.quantecon.org/memoryless.html#exp_unique), on\n",
520 |     "the memoryless property of the exponential function.\n",
521 |     "\n",
522 |     "For more discussion of the scalar case, see, for example,\n",
523 |     "[[Sahoo and Kannappan, 2011](https://continuous-time-mcs.quantecon.org/zreferences.html#id6)].\n",
524 |     "\n",
525 |     "For a full proof of the first claim in [Theorem 6.1](#ucsgec), in the setting of\n",
526 |     "a Banach algebra, see, for\n",
527 |     "example, Chapter 7 of [[Bobrowski, 2005](https://continuous-time-mcs.quantecon.org/zreferences.html#id7)]."
528 |    ]
529 |   },
530 |   {
531 |    "cell_type": "markdown",
532 |    "id": "ac6b37c2",
533 |    "metadata": {},
534 |    "source": [
535 |     "## Exercises"
536 |    ]
537 |   },
538 |   {
539 |    "cell_type": "markdown",
540 |    "id": "a0869d9c",
541 |    "metadata": {},
542 |    "source": [
543 |     "## Exercise 6.1\n",
544 |     "\n",
545 |     "Prove that [(6.5)](#equation-expdiffer) holds for all $ A \\in \\linop $."
546 |    ]
547 |   },
548 |   {
549 |    "cell_type": "markdown",
550 |    "id": "99e39ea4",
551 |    "metadata": {},
552 |    "source": [
553 |     "## Solution to[ Exercise 8.1](https://continuous-time-mcs.quantecon.org/ergodicity.html#ergodicity-ex-1)\n",
554 |     "\n",
555 |     "To show the first equality, fix $ t \\in \\RR_+ $, take $ h > 0 $ and observe that\n",
556 |     "\n",
557 |     "$$\n",
558 |     "e^{(t+h)A} - e^{tA} - e^{tA} A\n",
559 |     "= e^{tA} (e^{hA} - I - A)\n",
560 |     "$$\n",
561 |     "\n",
562 |     "Since the norm on $ \\linop $ is submultiplicative, it suffices to show that\n",
563 |     "$ \\| e^{hA} - I - A \\| = o(h) $ as $ h \\to 0 $.\n",
564 |     "\n",
565 |     "Using the definition of the exponential, this is easily verified,\n",
566 |     "completing the proof of the first equality in [(6.5)](#equation-expdiffer).\n",
567 |     "\n",
568 |     "The proof of the second equality is similar."
569 |    ]
570 |   },
571 |   {
572 |    "cell_type": "markdown",
573 |    "id": "7876446f",
574 |    "metadata": {},
575 |    "source": [
576 |     "## Exercise 6.2\n",
577 |     "\n",
578 |     "In many texts, a $ C_0 $ semigroup is defined as an evolution semigroup $ (U_t) $\n",
579 |     "such that\n",
580 |     "\n",
581 |     "\n",
582 |     "<a id='equation-czsg2'></a>\n",
583 |     "$$\n",
584 |     "U_t g \\to g \\text{ as } t \\to 0 \\text{ for any } g \\in \\BB \\tag{6.7}\n",
585 |     "$$\n",
586 |     "\n",
587 |     "Our aim is to show that [(6.7)](#equation-czsg2) implies continuity at every point $ t $, as\n",
588 |     "in the definition we used above.\n",
589 |     "\n",
590 |     "The [Banach–Steinhaus Theorem](https://en.wikipedia.org/wiki/Uniform_boundedness_principle) can be used to show that, for an evolution semigroup $ (U_t) $ satisfying [(6.7)](#equation-czsg2), there exist finite constants $ \\omega $ and $ M $ such that\n",
591 |     "\n",
592 |     "\n",
593 |     "<a id='equation-sgbound'></a>\n",
594 |     "$$\n",
595 |     "\\| U_t \\| \\leq e^{t\\omega} M\n",
596 |     "\\quad \\text{for all } \\; t \\geq 0 \\tag{6.8}\n",
597 |     "$$\n",
598 |     "\n",
599 |     "Using this and [(6.7)](#equation-czsg2), show that, for any $ g \\in \\BB $, the map $ t \\mapsto\n",
600 |     "U_t g $ is continuous at all $ t $."
601 |    ]
602 |   },
603 |   {
604 |    "cell_type": "markdown",
605 |    "id": "459b8ae3",
606 |    "metadata": {},
607 |    "source": [
608 |     "## Solution to[ Exercise 8.2](https://continuous-time-mcs.quantecon.org/ergodicity.html#ergodicity-ex-2)\n",
609 |     "\n",
610 |     "Let $ (U_t) $ be an evolution semigroup satisfying [(6.7)](#equation-czsg2) and let\n",
611 |     "$ \\omega $ and $ M $ be as in [(6.8)](#equation-sgbound).\n",
612 |     "\n",
613 |     "Pick any $ g \\in \\BB $,  $ t > 0 $ and $ h_n \\downarrow 0 $ as $ n \\to \\infty $.\n",
614 |     "\n",
615 |     "On one hand, $ U_{t+ h_n} g = U_{h_n} U_t g \\to U_t g $ by [(6.7)](#equation-czsg2).\n",
616 |     "\n",
617 |     "On the other hand, from [(6.8)](#equation-sgbound) and the definition of the operator norm,\n",
618 |     "\n",
619 |     "$$\n",
620 |     "\\| U_{t-h_n} g - U_t g\\|\n",
621 |     "= \\|  U_{t-h_n} ( g - U_{h_n} g) \\|\n",
622 |     "\\leq e^{(t-h_n)\\omega} M \\| g - U_{h_n} g\\|\n",
623 |     "\\to 0\n",
624 |     "$$\n",
625 |     "\n",
626 |     "as $ n \\to \\infty $.  This completes the proof."
627 |    ]
628 |   },
629 |   {
630 |    "cell_type": "markdown",
631 |    "id": "28d9711b",
632 |    "metadata": {},
633 |    "source": [
634 |     "## Exercise 6.3\n",
635 |     "\n",
636 |     "Following on from the previous exercise,\n",
637 |     "a UC semigroup is often defined as an evolution semigroup $ (U_t) $\n",
638 |     "such that\n",
639 |     "\n",
640 |     "\n",
641 |     "<a id='equation-czsg3'></a>\n",
642 |     "$$\n",
643 |     "\\| U_t - I \\| \\to 0 \\text{ as } t \\to 0 \\tag{6.9}\n",
644 |     "$$\n",
645 |     "\n",
646 |     "Show that [(6.9)](#equation-czsg3) implies norm continuity at every point $ t $, as\n",
647 |     "in the definition we used above.\n",
648 |     "\n",
649 |     "In particular, show that, for any $ t_n \\to t $, we have\n",
650 |     "$ \\| U_{t_n} - U_t \\| \\to 0 $  as $ n \\to \\infty $."
651 |    ]
652 |   },
653 |   {
654 |    "cell_type": "markdown",
655 |    "id": "8fab772a",
656 |    "metadata": {},
657 |    "source": [
658 |     "## Solution to[ Exercise 8.3](https://continuous-time-mcs.quantecon.org/ergodicity.html#ergodicity-ex-3)\n",
659 |     "\n",
660 |     "The solution is similar to that of the previous exercise.\n",
661 |     "\n",
662 |     "Let $ (U_t) $ be an evolution semigroup satisfying [(6.9)](#equation-czsg3),\n",
663 |     "fix $ t > 0 $ and take $ (h_n) $ to be a scalar sequence satisfying $ h_n \\downarrow 0 $ as $ n \\to \\infty $.\n",
664 |     "\n",
665 |     "On one hand, $ U_{t+ h_n} = U_{h_n} U_t \\to U_t $ by [(6.9)](#equation-czsg3).\n",
666 |     "\n",
667 |     "On the other hand, from the submultiplicative property of the operator norm\n",
668 |     "and [(6.8)](#equation-sgbound),\n",
669 |     "\n",
670 |     "$$\n",
671 |     "\\| U_{t-h_n} - U_t \\|\n",
672 |     "= \\|  U_{t-h_n} ( I - U_{h_n}) \\|\n",
673 |     "\\leq e^{(t-h_n)\\omega} M  \\| I - U_{h_n} \\|\n",
674 |     "$$\n",
675 |     "\n",
676 |     "This converges to 0 as $ n \\to \\infty $, completing our proof.\n",
677 |     "\n",
678 |     "<p><a id=footnotepp href=#footnotepp-link><strong>[1]</strong></a> In fact a major concern with queues is that their length does not explode. This issue cannot be properly explored unless the state space is allowed to be infinite.\n",
679 |     "\n",
680 |     "<p><a id=fnpde href=#fnpde-link><strong>[2]</strong></a> An excellent introduction to operator semigroups, combined with\n",
681 |     "applications to PDEs and Markov processes, can be found in [[Applebaum, 2019](https://continuous-time-mcs.quantecon.org/zreferences.html#id5)].\n",
682 |     "\n",
683 |     "<p><a id=fncoex href=#fncoex-link><strong>[3]</strong></a> Convergence of the sum in [(6.3)](#equation-opexpo) follows from boundedness of $ A $ and the fact that $ \\linop $ is a Banach space.\n",
684 |     "\n",
685 |     "<p><a id=ucnote href=#ucnote-link><strong>[4]</strong></a> Be careful: the definition of a UC semigroup requires that\n",
686 |     "$ t \\mapsto U_t $ is continuous as a map into $ \\linop $, rather than uniformly\n",
687 |     "continuous.  The UC terminology comes about because, for a UC semigroup, we\n",
688 |     "have, by definition of the operator norm,\n",
689 |     "$ \\sup_{\\| g \\| \\leq 1} \\| U_s g - U_t g \\| \\to 0 $ when $ s \\to t $.\n",
690 |     "\n",
691 |     "<p><a id=fnhy href=#fnhy-link><strong>[5]</strong></a> An excellent treatment of the general theory of $ C_0 $ semigroups, can be found in [[Bobrowski, 2005](https://continuous-time-mcs.quantecon.org/zreferences.html#id7)]."
692 |    ]
693 |   }
694 |  ],
695 |  "metadata": {
696 |   "date": 1763463649.7420359,
697 |   "filename": "generators.md",
698 |   "kernelspec": {
699 |    "display_name": "Python",
700 |    "language": "python3",
701 |    "name": "python3"
702 |   },
703 |   "title": "Semigroups and Generators"
704 |  },
705 |  "nbformat": 4,
706 |  "nbformat_minor": 5
707 | }


--------------------------------------------------------------------------------
/kolmogorov_fwd.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "id": "2525714d",
  6 |    "metadata": {},
  7 |    "source": [
  8 |     "$$\n",
  9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
 10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
 11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
 12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
 13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
 14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
 15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
 16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
 17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
 18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
 19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
 20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
 21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
 22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
 23 |     "$$"
 24 |    ]
 25 |   },
 26 |   {
 27 |    "cell_type": "markdown",
 28 |    "id": "26f9287e",
 29 |    "metadata": {},
 30 |    "source": [
 31 |     "# The Kolmogorov Forward Equation\n",
 32 |     "\n",
 33 |     "In addition to what’s in Anaconda, this lecture will need the following libraries:"
 34 |    ]
 35 |   },
 36 |   {
 37 |    "cell_type": "code",
 38 |    "execution_count": null,
 39 |    "id": "338cbb3f",
 40 |    "metadata": {
 41 |     "hide-output": false
 42 |    },
 43 |    "outputs": [],
 44 |    "source": [
 45 |     "!pip install quantecon"
 46 |    ]
 47 |   },
 48 |   {
 49 |    "cell_type": "markdown",
 50 |    "id": "d00499f1",
 51 |    "metadata": {},
 52 |    "source": [
 53 |     "## Overview\n",
 54 |     "\n",
 55 |     "In this lecture we approach continuous time Markov chains from a more\n",
 56 |     "analytical perspective.\n",
 57 |     "\n",
 58 |     "The emphasis will be on describing distribution flows through vector-valued\n",
 59 |     "differential equations and their solutions.\n",
 60 |     "\n",
 61 |     "These distribution flows show how the time $ t $ distribution associated with a\n",
 62 |     "given Markov chain $ (X_t) $ changes over time.\n",
 63 |     "\n",
 64 |     "Distribution flows will be identified with initial value problems generated by autonomous linear ordinary differential equations (ODEs) in vector space.\n",
 65 |     "\n",
 66 |     "We will see that the solutions of these flows are described by Markov semigroups.\n",
 67 |     "\n",
 68 |     "This leads us back to the theory we have already constructed – some care will\n",
 69 |     "be taken to clarify all the connections.\n",
 70 |     "\n",
 71 |     "In order to avoid being distracted by technicalities, we continue to defer our\n",
 72 |     "treatment of infinite state spaces, assuming throughout this lecture that $ |S|\n",
 73 |     "= n $.\n",
 74 |     "\n",
 75 |     "As before, $ \\dD $ is the set of all distributions on $ S $.\n",
 76 |     "\n",
 77 |     "We will use the following imports"
 78 |    ]
 79 |   },
 80 |   {
 81 |    "cell_type": "code",
 82 |    "execution_count": null,
 83 |    "id": "55173da0",
 84 |    "metadata": {
 85 |     "hide-output": false
 86 |    },
 87 |    "outputs": [],
 88 |    "source": [
 89 |     "import numpy as np\n",
 90 |     "import scipy as sp\n",
 91 |     "import matplotlib.pyplot as plt\n",
 92 |     "import quantecon as qe\n",
 93 |     "from numba import njit\n",
 94 |     "from scipy.linalg import expm\n",
 95 |     "\n",
 96 |     "from matplotlib import cm\n",
 97 |     "from mpl_toolkits.mplot3d import Axes3D\n",
 98 |     "from mpl_toolkits.mplot3d.art3d import Poly3DCollection"
 99 |    ]
100 |   },
101 |   {
102 |    "cell_type": "markdown",
103 |    "id": "7bbbad6e",
104 |    "metadata": {},
105 |    "source": [
106 |     "## From Difference Equations to ODEs\n",
107 |     "\n",
108 |     "[Previously](https://continuous-time-mcs.quantecon.org/markov_prop.html#invdistflows) we generated this figure, which shows how distributions evolve over time for the inventory model under a certain parameterization:\n",
109 |     "\n",
110 |     "![https://continuous-time-mcs.quantecon.org/_static/lecture_specific/markov_prop/flow_fig.png](https://continuous-time-mcs.quantecon.org/_static/lecture_specific/markov_prop/flow_fig.png)\n",
111 |     "\n",
112 |     "Probability flows for the inventory model.  \n",
113 |     "(Hot colors indicate early dates and cool colors denote later dates.)\n",
114 |     "\n",
115 |     "We also learned how this flow is related to\n",
116 |     "the Kolmogorov backward equation, which is an ODE.\n",
117 |     "\n",
118 |     "In this section we examine distribution flows and their connection to\n",
119 |     "ODEs and continuous time Markov chains more systematically."
120 |    ]
121 |   },
122 |   {
123 |    "cell_type": "markdown",
124 |    "id": "8a3b175d",
125 |    "metadata": {},
126 |    "source": [
127 |     "### Review of the Discrete Time Case\n",
128 |     "\n",
129 |     "Let $ (X_t) $ be a discrete time Markov chain with Markov matrix $ P $.\n",
130 |     "\n",
131 |     "[Recall that](https://continuous-time-mcs.quantecon.org/markov_prop.html#finstatediscretemc), in the discrete time case, the distribution $ \\psi_t $ of $ X_t $ updates according to\n",
132 |     "\n",
133 |     "$$\n",
134 |     "\\psi_{t+1} = \\psi_t P, \n",
135 |     "    \\qquad \\psi_0 \\text{ a given element of } \\dD,\n",
136 |     "$$\n",
137 |     "\n",
138 |     "where distributions are understood as row vectors.\n",
139 |     "\n",
140 |     "Here’s a visualization for the case $ S = \\{0, 1, 2\\} $, so that $ \\dD $ is the [standard\n",
141 |     "simplex](https://en.wikipedia.org/wiki/Simplex) in $ \\RR^3 $.\n",
142 |     "\n",
143 |     "The initial condition is ` (0, 0, 1)` and the Markov matrix is"
144 |    ]
145 |   },
146 |   {
147 |    "cell_type": "code",
148 |    "execution_count": null,
149 |    "id": "503e86aa",
150 |    "metadata": {
151 |     "hide-output": false
152 |    },
153 |    "outputs": [],
154 |    "source": [
155 |     "P = ((0.9, 0.1, 0.0),\n",
156 |     "     (0.4, 0.4, 0.2),\n",
157 |     "     (0.1, 0.1, 0.8))"
158 |    ]
159 |   },
160 |   {
161 |    "cell_type": "code",
162 |    "execution_count": null,
163 |    "id": "7cf5ed90",
164 |    "metadata": {
165 |     "hide-output": false
166 |    },
167 |    "outputs": [],
168 |    "source": [
169 |     "def unit_simplex(angle):\n",
170 |     "    \n",
171 |     "    fig = plt.figure(figsize=(8, 6))\n",
172 |     "    ax = fig.add_subplot(111, projection='3d')\n",
173 |     "\n",
174 |     "    vtx = [[0, 0, 1],\n",
175 |     "           [0, 1, 0], \n",
176 |     "           [1, 0, 0]]\n",
177 |     "    \n",
178 |     "    tri = Poly3DCollection([vtx], color='darkblue', alpha=0.3)\n",
179 |     "    tri.set_facecolor([0.5, 0.5, 1])\n",
180 |     "    ax.add_collection3d(tri)\n",
181 |     "\n",
182 |     "    ax.set(xlim=(0, 1), ylim=(0, 1), zlim=(0, 1), \n",
183 |     "           xticks=(1,), yticks=(1,), zticks=(1,))\n",
184 |     "\n",
185 |     "    ax.set_xticklabels(['$(1, 0, 0)$'], fontsize=12)\n",
186 |     "    ax.set_yticklabels(['$(0, 1, 0)$'], fontsize=12)\n",
187 |     "    ax.set_zticklabels(['$(0, 0, 1)$'], fontsize=12)\n",
188 |     "\n",
189 |     "    ax.xaxis.majorTicks[0].set_pad(15)\n",
190 |     "    ax.yaxis.majorTicks[0].set_pad(15)\n",
191 |     "    ax.zaxis.majorTicks[0].set_pad(35)\n",
192 |     "\n",
193 |     "    ax.view_init(30, angle)\n",
194 |     "\n",
195 |     "    # Move axis to origin\n",
196 |     "    ax.xaxis._axinfo['juggled'] = (0, 0, 0)\n",
197 |     "    ax.yaxis._axinfo['juggled'] = (1, 1, 1)\n",
198 |     "    ax.zaxis._axinfo['juggled'] = (2, 2, 0)\n",
199 |     "    \n",
200 |     "    ax.grid(False)\n",
201 |     "    \n",
202 |     "    return ax\n",
203 |     "\n",
204 |     "\n",
205 |     "def convergence_plot(ψ, n=14, angle=50):\n",
206 |     "\n",
207 |     "    ax = unit_simplex(angle)\n",
208 |     "\n",
209 |     "    P = ((0.9, 0.1, 0.0),\n",
210 |     "         (0.4, 0.4, 0.2),\n",
211 |     "         (0.1, 0.1, 0.8))\n",
212 |     "    \n",
213 |     "    P = np.array(P)\n",
214 |     "    colors = cm.jet_r(np.linspace(0.0, 1, n))\n",
215 |     "\n",
216 |     "    x_vals, y_vals, z_vals = [], [], []\n",
217 |     "    for t in range(n):\n",
218 |     "        x_vals.append(ψ[0])\n",
219 |     "        y_vals.append(ψ[1])\n",
220 |     "        z_vals.append(ψ[2])\n",
221 |     "        ψ = ψ @ P\n",
222 |     "\n",
223 |     "    ax.scatter(x_vals, y_vals, z_vals, c=colors, s=50, alpha=0.7, depthshade=False)\n",
224 |     "\n",
225 |     "    return ψ\n",
226 |     "\n",
227 |     "ψ = convergence_plot((0, 0, 1))\n",
228 |     "\n",
229 |     "plt.show()"
230 |    ]
231 |   },
232 |   {
233 |    "cell_type": "markdown",
234 |    "id": "8aa54409",
235 |    "metadata": {},
236 |    "source": [
237 |     "There’s a sense in which a discrete time Markov chain “is” a homogeneous\n",
238 |     "linear difference equation in distribution space.\n",
239 |     "\n",
240 |     "To clarify this, suppose we\n",
241 |     "take $ G $ to be a linear map from $ \\dD $ to itself and\n",
242 |     "write down the difference equation\n",
243 |     "\n",
244 |     "\n",
245 |     "<a id='equation-gdiff2'></a>\n",
246 |     "$$\n",
247 |     "\\psi_{t+1} = G(\\psi_t)\n",
248 |     "    \\quad \\text{with } \\psi_0 \\in \\dD \\text{ given}. \\tag{5.1}\n",
249 |     "$$\n",
250 |     "\n",
251 |     "Because $ G $ is a linear map from a finite dimensional space to itself, it can\n",
252 |     "be represented by a matrix.\n",
253 |     "\n",
254 |     "Moreover, a matrix $ P $ is a Markov matrix if and only if $ \\psi \\mapsto\n",
255 |     "\\psi P $ sends $ \\dD $ into itself (check it if you haven’t already).\n",
256 |     "\n",
257 |     "So, under the stated conditions, our difference equation [(5.1)](#equation-gdiff2) uniquely\n",
258 |     "identifies a Markov matrix, along with an initial condition $ \\psi_0 $.\n",
259 |     "\n",
260 |     "Together, these objects identify the joint distribution of a discrete time Markov chain, as [previously described](https://continuous-time-mcs.quantecon.org/markov_prop.html#jdfin)."
261 |    ]
262 |   },
263 |   {
264 |    "cell_type": "markdown",
265 |    "id": "1a7d9df2",
266 |    "metadata": {},
267 |    "source": [
268 |     "### Shifting to Continuous Time\n",
269 |     "\n",
270 |     "We have just argued that a discrete time Markov chain can be identified with a\n",
271 |     "linear difference equation evolving in $ \\dD $.\n",
272 |     "\n",
273 |     "This strongly suggests that a continuous time Markov chain can be identified\n",
274 |     "with a linear ODE evolving in $ \\dD $.\n",
275 |     "\n",
276 |     "This intuition is correct and important.\n",
277 |     "\n",
278 |     "The rest of the lecture maps out the main ideas."
279 |    ]
280 |   },
281 |   {
282 |    "cell_type": "markdown",
283 |    "id": "28684c05",
284 |    "metadata": {},
285 |    "source": [
286 |     "## ODEs in Distribution Space\n",
287 |     "\n",
288 |     "Consider linear differential equation given by\n",
289 |     "\n",
290 |     "\n",
291 |     "<a id='equation-ode-mc'></a>\n",
292 |     "$$\n",
293 |     "\\psi_t' = \\psi_t Q, \n",
294 |     "    \\qquad \\psi_0 \\text{ a given element of } \\dD, \\tag{5.2}\n",
295 |     "$$\n",
296 |     "\n",
297 |     "where\n",
298 |     "\n",
299 |     "- $ Q $ is an $ n \\times n $ matrix,  \n",
300 |     "- distributions are again understood as row vectors, and  \n",
301 |     "- derivatives are taken element by element, so that  \n",
302 |     "\n",
303 |     "\n",
304 |     "$$\n",
305 |     "\\psi_t' =\n",
306 |     "    \\begin{pmatrix}\n",
307 |     "        \\frac{d}{dt} \\psi_t(x_1) &\n",
308 |     "        \\cdots &\n",
309 |     "        \\frac{d}{dt} \\psi_t(x_n)\n",
310 |     "    \\end{pmatrix}\n",
311 |     "$$\n",
312 |     "\n",
313 |     "\n",
314 |     "<a id='solvode'></a>"
315 |    ]
316 |   },
317 |   {
318 |    "cell_type": "markdown",
319 |    "id": "e7be044e",
320 |    "metadata": {},
321 |    "source": [
322 |     "### Solutions to Linear Vector ODEs\n",
323 |     "\n",
324 |     "Using the matrix exponential, the unique solution to the initial value problem\n",
325 |     "[(5.2)](#equation-ode-mc) is\n",
326 |     "\n",
327 |     "\n",
328 |     "<a id='equation-cmc-sol'></a>\n",
329 |     "$$\n",
330 |     "\\psi_t = \\psi_0 P_t \n",
331 |     "    \\quad \\text{where } P_t := e^{tQ} \\tag{5.3}\n",
332 |     "$$\n",
333 |     "\n",
334 |     "To check that [(5.3)](#equation-cmc-sol) is a solution, we use [(4.7)](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#equation-expoderiv) again to get\n",
335 |     "\n",
336 |     "$$\n",
337 |     "\\frac{d}{d t} P_t =  Q e^{tQ} = e^{tQ} Q\n",
338 |     "$$\n",
339 |     "\n",
340 |     "The first equality can be written as $ P_t' =  Q P_t $ and this is just\n",
341 |     "the [Kolmogorov backward equation](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html).\n",
342 |     "\n",
343 |     "The second equality can be written as\n",
344 |     "\n",
345 |     "$$\n",
346 |     "P_t' = P_t Q\n",
347 |     "$$\n",
348 |     "\n",
349 |     "and is called the **Kolmogorov forward equation**.\n",
350 |     "\n",
351 |     "Applying the Kolmogorov forward equation, we obtain\n",
352 |     "\n",
353 |     "$$\n",
354 |     "\\frac{d}{d t} \\psi_t \n",
355 |     "    = \\frac{d}{d t} \\psi_0 P_t \n",
356 |     "    = \\psi_0 \\frac{d}{d t} P_t \n",
357 |     "    = \\psi_0 P_t Q\n",
358 |     "    = \\psi_t Q\n",
359 |     "$$\n",
360 |     "\n",
361 |     "This confirms that [(5.3)](#equation-cmc-sol) solves [(5.2)](#equation-ode-mc).\n",
362 |     "\n",
363 |     "Here’s an example of three distribution flows with dynamics generated  by [(5.2)](#equation-ode-mc),  one starting from each vertex.\n",
364 |     "\n",
365 |     "The code uses [(5.3)](#equation-cmc-sol) with matrix $ Q $ given by"
366 |    ]
367 |   },
368 |   {
369 |    "cell_type": "code",
370 |    "execution_count": null,
371 |    "id": "c8917856",
372 |    "metadata": {
373 |     "hide-output": false
374 |    },
375 |    "outputs": [],
376 |    "source": [
377 |     "Q = ((-3, 2, 1),\n",
378 |     "     (3, -5, 2),\n",
379 |     "     (4, 6, -10))"
380 |    ]
381 |   },
382 |   {
383 |    "cell_type": "code",
384 |    "execution_count": null,
385 |    "id": "04440d83",
386 |    "metadata": {
387 |     "hide-output": false
388 |    },
389 |    "outputs": [],
390 |    "source": [
391 |     "Q = np.array(Q)\n",
392 |     "ψ_00 = np.array((0.01, 0.01, 0.99))\n",
393 |     "ψ_01 = np.array((0.01, 0.99, 0.01))\n",
394 |     "ψ_02 = np.array((0.99, 0.01, 0.01))\n",
395 |     "\n",
396 |     "ax = unit_simplex(angle=50)    \n",
397 |     "\n",
398 |     "def flow_plot(ψ, h=0.001, n=400, angle=50):\n",
399 |     "    colors = cm.jet_r(np.linspace(0.0, 1, n))\n",
400 |     "\n",
401 |     "    x_vals, y_vals, z_vals = [], [], []\n",
402 |     "    for t in range(n):\n",
403 |     "        x_vals.append(ψ[0])\n",
404 |     "        y_vals.append(ψ[1])\n",
405 |     "        z_vals.append(ψ[2])\n",
406 |     "        ψ = ψ @ expm(h * Q)\n",
407 |     "\n",
408 |     "    ax.scatter(x_vals, y_vals, z_vals, c=colors, s=20, alpha=0.2, depthshade=False)\n",
409 |     "\n",
410 |     "flow_plot(ψ_00)\n",
411 |     "flow_plot(ψ_01)\n",
412 |     "flow_plot(ψ_02)\n",
413 |     "\n",
414 |     "plt.show()"
415 |    ]
416 |   },
417 |   {
418 |    "cell_type": "markdown",
419 |    "id": "fc8e8083",
420 |    "metadata": {},
421 |    "source": [
422 |     "(Distributions cool over time, so initial conditions are hot colors.)"
423 |    ]
424 |   },
425 |   {
426 |    "cell_type": "markdown",
427 |    "id": "1100ffe2",
428 |    "metadata": {},
429 |    "source": [
430 |     "### Forwards vs Backwards Equations\n",
431 |     "\n",
432 |     "As the above discussion shows, we can take the Kolmogorov forward equation\n",
433 |     "$ P_t' = P_t Q $ and premultiply by any distribution $ \\psi_0 $ to get the\n",
434 |     "distribution ODE $ \\psi'_t = \\psi_t Q $.\n",
435 |     "\n",
436 |     "In this sense, we can understand the Kolmogorov forward equation as pushing\n",
437 |     "distributions forward in time.\n",
438 |     "\n",
439 |     "Analogously, we can take the Kolmogorov backward equation\n",
440 |     "$ P_t' = Q P_t $ and postmultiply by any vector $ h $ to get\n",
441 |     "\n",
442 |     "$$\n",
443 |     "(P_t h)' = Q P_t h\n",
444 |     "$$\n",
445 |     "\n",
446 |     "Recalling that $ (P_t h)(x) = \\EE [ h(X_t) \\,|\\, X_0 = x] $, this vector\n",
447 |     "ODE tells us how expectations evolve, conditioning backward to time zero.\n",
448 |     "\n",
449 |     "Both the forward and the backward equations uniquely pin down the same solution $ P_t = e^{tQ} $ when combined with the initial condition $ P_0 = I $."
450 |    ]
451 |   },
452 |   {
453 |    "cell_type": "markdown",
454 |    "id": "ee40ca06",
455 |    "metadata": {},
456 |    "source": [
457 |     "### Matrix- vs Vector-Valued ODEs\n",
458 |     "\n",
459 |     "The ODE $ \\psi'_t = \\psi_t Q $ is sometimes called the\n",
460 |     "**Fokker–Planck equation** (although this terminology is most commonly used\n",
461 |     "in the context of diffusions).\n",
462 |     "\n",
463 |     "It is a vector-valued ODE that describes the evolution of a particular\n",
464 |     "distribution path.\n",
465 |     "\n",
466 |     "By comparison, the Kolmogorov forward equation is (like the backward equation)\n",
467 |     "a differential equation in matrices.\n",
468 |     "\n",
469 |     "(And matrices are really maps, which send vectors into vectors.)\n",
470 |     "\n",
471 |     "Operating at this level is less intuitive and more abstract than working with the\n",
472 |     "Fokker–Planck equation.\n",
473 |     "\n",
474 |     "But, in the end, the object that we want to describe is a Markov\n",
475 |     "semigroup.\n",
476 |     "\n",
477 |     "The Kolmogorov forward and backward equations are the ODEs that define\n",
478 |     "this fundamental object."
479 |    ]
480 |   },
481 |   {
482 |    "cell_type": "markdown",
483 |    "id": "0fc5d538",
484 |    "metadata": {},
485 |    "source": [
486 |     "### Preserving Distributions\n",
487 |     "\n",
488 |     "In the simulation above, $ Q $ was chosen with some care, so that the flow\n",
489 |     "remains in $ \\dD $.\n",
490 |     "\n",
491 |     "What are the exact properties we require on $ Q $ such that $ \\psi_t $ is always\n",
492 |     "in $ \\dD $?\n",
493 |     "\n",
494 |     "This is an important question, because we are setting up an exact\n",
495 |     "correspondence between linear ODEs that evolve in $ \\dD $ and continuous\n",
496 |     "time Markov chains.\n",
497 |     "\n",
498 |     "Recall that the linear update rule $ \\psi \\mapsto \\psi P $ is invariant on\n",
499 |     "$ \\dD $ if and only if $ P $ is a Markov matrix.\n",
500 |     "\n",
501 |     "So now we can rephrase our key question regarding invariance on $ \\dD $:\n",
502 |     "\n",
503 |     "What properties do we need to impose on $ Q $ so that $ P_t = e^{tQ} $ is a Markov matrix\n",
504 |     "for all $ t $?\n",
505 |     "\n",
506 |     "A square matrix $ Q $ is called an **intensity matrix** if $ Q $ has zero row\n",
507 |     "sums and $ Q(x, y) \\geq 0 $ whenever $ x \\not= y $."
508 |    ]
509 |   },
510 |   {
511 |    "cell_type": "markdown",
512 |    "id": "22457c27",
513 |    "metadata": {},
514 |    "source": [
515 |     "**Theorem 5.1**\n",
516 |     "\n",
517 |     "If $ Q $ is a matrix on $ S $ and $ P_t := e^{tQ} $, then the following statements\n",
518 |     "are equivalent:\n",
519 |     "\n",
520 |     "1. $ P_t $ is a Markov matrix for all $ t $.  \n",
521 |     "1. $ Q $ is an intensity matrix.  "
522 |    ]
523 |   },
524 |   {
525 |    "cell_type": "markdown",
526 |    "id": "226303ab",
527 |    "metadata": {},
528 |    "source": [
529 |     "The proof is related to that of [Lemma 4.2](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#jctosg) and is found as\n",
530 |     "a solved exercise below."
531 |    ]
532 |   },
533 |   {
534 |    "cell_type": "markdown",
535 |    "id": "4f637064",
536 |    "metadata": {},
537 |    "source": [
538 |     "**Corollary 5.1**\n",
539 |     "\n",
540 |     "If $ Q $ is an intensity matrix on finite $ S $ and $ P_t = e^{tQ} $ for all $ t \\geq 0 $,\n",
541 |     "then $ (P_t) $ is a Markov semigroup."
542 |    ]
543 |   },
544 |   {
545 |    "cell_type": "markdown",
546 |    "id": "9ed5f0c4",
547 |    "metadata": {},
548 |    "source": [
549 |     "We call $ (P_t) $ the Markov semigroup **generated** by $ Q $.\n",
550 |     "\n",
551 |     "Later we will see that this result extends to the case $ |S| = \\infty $ under\n",
552 |     "some mild restrictions on $ Q $."
553 |    ]
554 |   },
555 |   {
556 |    "cell_type": "markdown",
557 |    "id": "98c16dc7",
558 |    "metadata": {},
559 |    "source": [
560 |     "## Jump Chains\n",
561 |     "\n",
562 |     "Let’s return to the chain $ (X_t) $ created from jump chain pair $ (\\lambda, K) $ in\n",
563 |     "[Algorithm 4.1](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#ejc_algo).\n",
564 |     "\n",
565 |     "We found that the semigroup is given by\n",
566 |     "\n",
567 |     "$$\n",
568 |     "P_t = e^{tQ}\n",
569 |     "    \\quad \\text{where} \\quad\n",
570 |     "    Q(x, y) := \\lambda(x) (K(x, y) - I(x, y))\n",
571 |     "$$\n",
572 |     "\n",
573 |     "Using the fact that $ K $ is a Markov matrix and the jump rate function\n",
574 |     "$ \\lambda $ is nonnegative, you can easily check that $ Q $ satisfies the\n",
575 |     "definition of an intensity matrix.\n",
576 |     "\n",
577 |     "Hence $ (P_t) $, the Markov semigroup for the jump chain $ (X_t) $, is the\n",
578 |     "semigroup generated by the intensity matrix $ Q(x, y) = \\lambda(x) (K(x, y) - I(x, y)) $.\n",
579 |     "\n",
580 |     "We can differentiate $ P_t = e^{tQ} $ to obtain the Kolmogorov forward equation\n",
581 |     "$ P_t' = P_t Q $.\n",
582 |     "\n",
583 |     "We can then premultiply by $ \\psi_0 \\in \\dD $ to get $ \\psi_t' = \\psi_t\n",
584 |     "Q $, which is the Fokker–Planck equation.\n",
585 |     "\n",
586 |     "More explicitly, for given $ y \\in S $,\n",
587 |     "\n",
588 |     "$$\n",
589 |     "\\psi_t'(y)\n",
590 |     "    = \\sum_{x \\not= y} \\psi_t(x) \\lambda(x) K(x, y) - \\psi_t(y) \\lambda(y)\n",
591 |     "$$\n",
592 |     "\n",
593 |     "The rate of probability flow into $ y $ is equal to the inflow from other states\n",
594 |     "minus the outflow."
595 |    ]
596 |   },
597 |   {
598 |    "cell_type": "markdown",
599 |    "id": "4f431a49",
600 |    "metadata": {},
601 |    "source": [
602 |     "## Summary\n",
603 |     "\n",
604 |     "We have seen that any intensity matrix $ Q $ on $ S $ defines a Markov semigroup via $ P_t = e^{tQ} $.\n",
605 |     "\n",
606 |     "Henceforth, we will say that $ (X_t) $ is **a Markov chain with intensity matrix** $ Q $ if\n",
607 |     "$ (X_t) $ is a Markov chain with Markov semigroup $ (e^{tQ}) $.\n",
608 |     "\n",
609 |     "While our discussion has been in the context of a finite state space, later we\n",
610 |     "will see that these ideas carry over to an infinite state setting under mild\n",
611 |     "restrictions.\n",
612 |     "\n",
613 |     "We have also hinted at the fact that *every* continuous time Markov chain\n",
614 |     "is a Markov chain with intensity matrix $ Q $ for some suitably chosen $ Q $.\n",
615 |     "\n",
616 |     "Later we will prove this to be universally true when $ S $ is finite and true\n",
617 |     "under mild conditions when $ S $ is countably infinite.\n",
618 |     "\n",
619 |     "Intensity matrices are important because\n",
620 |     "\n",
621 |     "1. they are the natural infinitesimal descriptions of Markov semigroups,  \n",
622 |     "1. they are often easy to write down in applications and  \n",
623 |     "1. they provide an intuitive description of dynamics.  \n",
624 |     "\n",
625 |     "\n",
626 |     "Later, we will see that, for a given intensity matrix $ Q $, the elements are\n",
627 |     "understood as follows:\n",
628 |     "\n",
629 |     "- when $ x \\not= y $, the value $ Q(x, y) $ is the “rate of leaving $ x $ for $ y $” and  \n",
630 |     "- $ -Q(x, x) \\geq 0 $ is the “rate of leaving $ x $” .  "
631 |    ]
632 |   },
633 |   {
634 |    "cell_type": "markdown",
635 |    "id": "4c605399",
636 |    "metadata": {},
637 |    "source": [
638 |     "## Exercises"
639 |    ]
640 |   },
641 |   {
642 |    "cell_type": "markdown",
643 |    "id": "eac307ce",
644 |    "metadata": {},
645 |    "source": [
646 |     "## Exercise 5.1\n",
647 |     "\n",
648 |     "Let $ (P_t) $ be a Markov semigroup such that $ t \\mapsto P_t(x, y) $ is\n",
649 |     "differentiable at all $ t \\geq 0 $ and $ (x, y) \\in S \\times S $.\n",
650 |     "\n",
651 |     "(The derivative at $ t=0 $ is the usual right derivative.)\n",
652 |     "\n",
653 |     "Define (pointwise, at each $ (x,y) $),\n",
654 |     "\n",
655 |     "\n",
656 |     "<a id='equation-genfl'></a>\n",
657 |     "$$\n",
658 |     "Q := P'_0 = \\lim_{h \\downarrow 0} \\frac{P_h - I}{h} \\tag{5.4}\n",
659 |     "$$\n",
660 |     "\n",
661 |     "Assuming that this limit exists, and hence $ Q $ is well-defined, show that\n",
662 |     "\n",
663 |     "$$\n",
664 |     "P'_t = P_t Q\n",
665 |     "\\quad \\text{and} \\quad\n",
666 |     "P'_t = Q P_t\n",
667 |     "$$\n",
668 |     "\n",
669 |     "both hold. (These are the Kolmogorov forward and backward equations.)"
670 |    ]
671 |   },
672 |   {
673 |    "cell_type": "markdown",
674 |    "id": "ebc9dd1e",
675 |    "metadata": {},
676 |    "source": [
677 |     "## Solution to[ Exercise 5.1](https://continuous-time-mcs.quantecon.org/#kolmogorov-fwd-1)\n",
678 |     "\n",
679 |     "Let $ (P_t) $ be a Markov semigroup and let $ Q $ be as defined in the statement\n",
680 |     "of the exercise.\n",
681 |     "\n",
682 |     "Fix $ t \\geq 0 $ and $ h > 0 $.\n",
683 |     "\n",
684 |     "Combining the semigroup property and linearity with the restriction $ P_0 = I $, we get\n",
685 |     "\n",
686 |     "$$\n",
687 |     "\\frac{P_{t+h} - P_t}{h}\n",
688 |     "= \\frac{P_t P_h - P_t}{h}\n",
689 |     "= \\frac{P_t (P_h - I)}{h}\n",
690 |     "$$\n",
691 |     "\n",
692 |     "Taking $ h \\downarrow 0 $ and using the definition of $ Q $ give $ P_t' = P_t Q $,\n",
693 |     "which is the Kolmogorov forward equation.\n",
694 |     "\n",
695 |     "For the backward equation we observe that\n",
696 |     "\n",
697 |     "$$\n",
698 |     "\\frac{P_{t+h} - P_t}{h}\n",
699 |     "= \\frac{P_h P_t - P_t}{h}\n",
700 |     "= \\frac{(P_h - I) P_t}{h}\n",
701 |     "$$\n",
702 |     "\n",
703 |     "also holds.  Taking $ h \\downarrow 0 $ gives the Kolmogorov backward equation."
704 |    ]
705 |   },
706 |   {
707 |    "cell_type": "markdown",
708 |    "id": "371d696c",
709 |    "metadata": {},
710 |    "source": [
711 |     "## Exercise 5.2\n",
712 |     "\n",
713 |     "Recall [our model](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#sdji) of jump chains with state-dependent jump\n",
714 |     "intensities given by rate function $ x \\mapsto \\lambda(x) $.\n",
715 |     "\n",
716 |     "After a wait time with exponential rate $ \\lambda(x) \\in (0, \\infty) $, the\n",
717 |     "state transitions from $ x $ to $ y $ with probability $ K(x,y) $.\n",
718 |     "\n",
719 |     "We found that the associated semigroup $ (P_t) $ satisfies the Kolmogorov\n",
720 |     "backward equation $ P'_t = Q P_t $ with\n",
721 |     "\n",
722 |     "\n",
723 |     "<a id='equation-qeqagain'></a>\n",
724 |     "$$\n",
725 |     "Q(x, y) := \\lambda(x) (K(x, y) - I(x, y)) \\tag{5.5}\n",
726 |     "$$\n",
727 |     "\n",
728 |     "Show that $ Q $ is an intensity matrix and that [(5.4)](#equation-genfl) holds."
729 |    ]
730 |   },
731 |   {
732 |    "cell_type": "markdown",
733 |    "id": "9cda6dc9",
734 |    "metadata": {},
735 |    "source": [
736 |     "## Solution to[ Exercise 5.2](https://continuous-time-mcs.quantecon.org/#kolmogorov-fwd-2)\n",
737 |     "\n",
738 |     "Let $ Q $ be as defined in [(5.5)](#equation-qeqagain).\n",
739 |     "\n",
740 |     "We need to show that $ Q $ is nonnegative off the diagonal and has zero row\n",
741 |     "sums.\n",
742 |     "\n",
743 |     "The first assertion is immediate from nonnegativity of $ K $ and $ \\lambda $.\n",
744 |     "\n",
745 |     "For the second, we use the fact that $ K $ is a Markov matrix, so that, with $ 1 $\n",
746 |     "as a column vector of ones,\n",
747 |     "\n",
748 |     "$$\n",
749 |     "Q 1\n",
750 |     "= \\lambda (K 1 - 1)\n",
751 |     "= \\lambda (1 - 1)\n",
752 |     "= 0\n",
753 |     "$$"
754 |    ]
755 |   },
756 |   {
757 |    "cell_type": "markdown",
758 |    "id": "21a94fb5",
759 |    "metadata": {},
760 |    "source": [
761 |     "## Exercise 5.3\n",
762 |     "\n",
763 |     "Prove [Theorem 5.1](#intvsmk) by adapting the arguments in [Lemma 4.2](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#jctosg).\n",
764 |     "(This is nontrivial but worth at least trying.)\n",
765 |     "\n",
766 |     "Hint: The constant $ m $ in the proof can be set to $ \\max_x |Q(x, x)| $."
767 |    ]
768 |   },
769 |   {
770 |    "cell_type": "markdown",
771 |    "id": "fe6ba2f8",
772 |    "metadata": {},
773 |    "source": [
774 |     "## Solution to[ Exercise 5.3](https://continuous-time-mcs.quantecon.org/#kolmogorov-fwd-3)\n",
775 |     "\n",
776 |     "Suppose that $ Q $ is an intensity matrix, fix $ t \\geq 0 $ and set $ P_t = e^{tQ} $.\n",
777 |     "\n",
778 |     "The proof from [Lemma 4.2](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#jctosg) that $ P_t $ has unit row sums applies\n",
779 |     "directly to the current case.\n",
780 |     "\n",
781 |     "The proof of nonnegativity of $ P_t $ can be applied after some\n",
782 |     "modifications.\n",
783 |     "\n",
784 |     "To this end, set $ m := \\max_x |Q(x,x)| $ and $ \\hat P := I + Q / m $.\n",
785 |     "\n",
786 |     "You can check that $ \\hat P $ is a Markov matrix and that $ Q = m( \\hat P - I) $.\n",
787 |     "\n",
788 |     "The rest of the proof of nonnegativity of $ P_t $ is unchanged and we will not repeat it.\n",
789 |     "\n",
790 |     "We conclude that $ P_t $ is a Markov matrix.\n",
791 |     "\n",
792 |     "Regarding the converse implication, suppose that $ P_t = e^{tQ} $ is a Markov\n",
793 |     "matrix for all $ t $ and let $ 1 $ be a column vector of ones.\n",
794 |     "\n",
795 |     "Because $ P_t $ has unit row sums and differentiation is linear,\n",
796 |     "we can employ the Kolmogorov backward equation to obtain\n",
797 |     "\n",
798 |     "$$\n",
799 |     "Q 1\n",
800 |     "    = Q P_t 1\n",
801 |     "    = \\left( \\frac{d}{d t} P_t \\right) 1\n",
802 |     "    = \\frac{d}{d t} (P_t 1)\n",
803 |     "    = \\frac{d}{d t} 1\n",
804 |     "    = 0\n",
805 |     "$$\n",
806 |     "\n",
807 |     "Hence $ Q $ has zero row sums.\n",
808 |     "\n",
809 |     "We can use the definition of the matrix exponential to obtain,\n",
810 |     "for any $ x, y $ and $ t \\geq 0 $,\n",
811 |     "\n",
812 |     "\n",
813 |     "<a id='equation-otp'></a>\n",
814 |     "$$\n",
815 |     "P_t(x, y) = \\mathbb 1\\{x = y\\} + t Q(x, y) + o(t) \\tag{5.6}\n",
816 |     "$$\n",
817 |     "\n",
818 |     "From this equality and the assumption that $ P_t $ is a Markov matrix for all\n",
819 |     "$ t $, we see that the off diagonal elements of $ Q $ must be\n",
820 |     "nonnegative.\n",
821 |     "\n",
822 |     "Hence $ Q $ is an intensity matrix."
823 |    ]
824 |   }
825 |  ],
826 |  "metadata": {
827 |   "date": 1763463649.8178875,
828 |   "filename": "kolmogorov_fwd.md",
829 |   "kernelspec": {
830 |    "display_name": "Python",
831 |    "language": "python3",
832 |    "name": "python3"
833 |   },
834 |   "title": "The Kolmogorov Forward Equation"
835 |  },
836 |  "nbformat": 4,
837 |  "nbformat_minor": 5
838 | }


--------------------------------------------------------------------------------
/kolmogorov_bwd.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "id": "a3435ea8",
  6 |    "metadata": {},
  7 |    "source": [
  8 |     "$$\n",
  9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
 10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
 11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
 12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
 13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
 14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
 15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
 16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
 17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
 18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
 19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
 20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
 21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
 22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
 23 |     "$$"
 24 |    ]
 25 |   },
 26 |   {
 27 |    "cell_type": "markdown",
 28 |    "id": "787db2c4",
 29 |    "metadata": {},
 30 |    "source": [
 31 |     "# The Kolmogorov Backward Equation\n",
 32 |     "\n",
 33 |     "In addition to what’s in Anaconda, this lecture will need the following libraries:"
 34 |    ]
 35 |   },
 36 |   {
 37 |    "cell_type": "code",
 38 |    "execution_count": null,
 39 |    "id": "8e9abfa5",
 40 |    "metadata": {
 41 |     "hide-output": false
 42 |    },
 43 |    "outputs": [],
 44 |    "source": [
 45 |     "!pip install quantecon"
 46 |    ]
 47 |   },
 48 |   {
 49 |    "cell_type": "markdown",
 50 |    "id": "e0170ba7",
 51 |    "metadata": {},
 52 |    "source": [
 53 |     "## Overview\n",
 54 |     "\n",
 55 |     "As models become more complex, deriving analytical representations of the\n",
 56 |     "Markov semigroup $ (P_t) $ becomes harder.\n",
 57 |     "\n",
 58 |     "This is analogous to the idea that solutions to continuous time models often\n",
 59 |     "lack analytical solutions.\n",
 60 |     "\n",
 61 |     "For example, when studying deterministic paths in continuous time,\n",
 62 |     "infinitesimal descriptions ([ODEs](https://en.wikipedia.org/wiki/Ordinary_differential_equation) and [PDEs](https://en.wikipedia.org/wiki/Partial_differential_equation)) are often more intuitive and easier\n",
 63 |     "to write down than the associated solutions.\n",
 64 |     "\n",
 65 |     "(This is one of the shining insights of mathematics, beginning with the work\n",
 66 |     "of great scientists such as Isaac Newton.)\n",
 67 |     "\n",
 68 |     "We will see in this lecture that the same is true for continuous time Markov\n",
 69 |     "chains.\n",
 70 |     "\n",
 71 |     "To help us focus on intuition in this lecture, rather than technicalities, the state space is assumed to be finite, with $ |S|=n $.\n",
 72 |     "\n",
 73 |     "Later we will investigate the case where $ |S| = \\infty $.\n",
 74 |     "\n",
 75 |     "We will use the following imports"
 76 |    ]
 77 |   },
 78 |   {
 79 |    "cell_type": "code",
 80 |    "execution_count": null,
 81 |    "id": "2abf68a1",
 82 |    "metadata": {
 83 |     "hide-output": false
 84 |    },
 85 |    "outputs": [],
 86 |    "source": [
 87 |     "import numpy as np\n",
 88 |     "import scipy as sp\n",
 89 |     "import matplotlib.pyplot as plt\n",
 90 |     "import quantecon as qe\n",
 91 |     "from numba import njit\n",
 92 |     "\n",
 93 |     "from scipy.linalg import expm\n",
 94 |     "from scipy.stats import binom"
 95 |    ]
 96 |   },
 97 |   {
 98 |    "cell_type": "markdown",
 99 |    "id": "0d13c371",
100 |    "metadata": {},
101 |    "source": [
102 |     "\n",
103 |     "<a id='sdji'></a>"
104 |    ]
105 |   },
106 |   {
107 |    "cell_type": "markdown",
108 |    "id": "14ed9612",
109 |    "metadata": {},
110 |    "source": [
111 |     "## State Dependent Jump Intensities\n",
112 |     "\n",
113 |     "As we have seen, continuous time Markov chains jump between states, and hence can\n",
114 |     "have the form\n",
115 |     "\n",
116 |     "$$\n",
117 |     "X_t = \\sum_{k \\geq 0} Y_k \\mathbb 1\\{J_k \\leq t < J_{k+1}\\}\n",
118 |     "    \\qquad (t \\geq 0)\n",
119 |     "$$\n",
120 |     "\n",
121 |     "where $ (J_k) $ are jump times and $ (Y_k) $ are the states at each jump.\n",
122 |     "\n",
123 |     "(We are assuming that $ J_k \\to \\infty $ with probability one, so that $ X_t $ is well\n",
124 |     "defined for all $ t \\geq 0 $, but this is always true for when holding times are exponential and the state space is finite.)\n",
125 |     "\n",
126 |     "In the [previous lecture](https://continuous-time-mcs.quantecon.org/markov_prop.html),\n",
127 |     "\n",
128 |     "- the sequence $ (Y_k) $ was drawn from a Markov matrix $ K $ and called the embedded jump chain, while  \n",
129 |     "- the holding times $ W_k := J_k - J_{k-1} $ were IID and Exp$ (\\lambda) $ for some\n",
130 |     "  constant jump intensity $ \\lambda $.  \n",
131 |     "\n",
132 |     "\n",
133 |     "In this lecture, we will generalize by allowing the jump intensity to vary\n",
134 |     "with the state.\n",
135 |     "\n",
136 |     "This difference sounds minor but in fact it will allow us to reach full generality\n",
137 |     "in our description of continuous time Markov chains, as\n",
138 |     "clarified below."
139 |    ]
140 |   },
141 |   {
142 |    "cell_type": "markdown",
143 |    "id": "302a36d5",
144 |    "metadata": {},
145 |    "source": [
146 |     "### Motivation\n",
147 |     "\n",
148 |     "As a motivating example, recall [the inventory model](https://continuous-time-mcs.quantecon.org/markov_prop.html#inventory-dynam),\n",
149 |     "where we assumed that the wait time for the next customer was equal\n",
150 |     "to the wait time for new inventory.\n",
151 |     "\n",
152 |     "This assumption was made purely for convenience and seems unlikely to hold true.\n",
153 |     "\n",
154 |     "When we relax it, the jump intensities depend on the state.\n",
155 |     "\n",
156 |     "\n",
157 |     "<a id='jumpchainalgo'></a>"
158 |    ]
159 |   },
160 |   {
161 |    "cell_type": "markdown",
162 |    "id": "eded02ec",
163 |    "metadata": {},
164 |    "source": [
165 |     "### Jump Chain Algorithm\n",
166 |     "\n",
167 |     "We start with three primitives\n",
168 |     "\n",
169 |     "1. An initial condition $ \\psi $,  \n",
170 |     "1. a Markov matrix $ K $ on $ S $ satisfying $ K(x, x) = 0 $ for all $ x \\in S $\n",
171 |     "  and  \n",
172 |     "1. a function $ \\lambda $ mapping $ S $ to $ (0, \\infty) $.  \n",
173 |     "\n",
174 |     "\n",
175 |     "The process $ (X_t) $\n",
176 |     "\n",
177 |     "- starts at state $ x $, draw from $ \\psi $,  \n",
178 |     "- waits there for an exponential time $ W $ with rate $ \\lambda(x) $ and then  \n",
179 |     "- updates to a new state $ y $ drawn from $ K(x, \\cdot) $.  \n",
180 |     "\n",
181 |     "\n",
182 |     "Now we take $ y $ as the new state for the process and repeat.\n",
183 |     "\n",
184 |     "Here is the same algorithm written more explicitly:"
185 |    ]
186 |   },
187 |   {
188 |    "cell_type": "markdown",
189 |    "id": "838eeff6",
190 |    "metadata": {},
191 |    "source": [
192 |     "**Algorithm 4.1** (Jump Chain Algorithm)\n",
193 |     "\n",
194 |     "**Inputs** $ \\psi \\in \\dD $, rate function $ \\lambda $, Markov matrix $ K $\n",
195 |     "\n",
196 |     "**Outputs** Markov chain $ (X_t) $\n",
197 |     "\n",
198 |     "1. Draw $ Y_0 $ from $ \\psi $, set $ J_0 = 0 $ and $ k=1 $.  \n",
199 |     "1. Draw $ W_k $ independently from Exp$ (\\lambda(Y_{k-1})) $.  \n",
200 |     "1. Set $ J_k = J_{k-1} + W_k $.  \n",
201 |     "1. Set $ X_t = Y_{k-1} $ for $ t $ in $ [J_{k-1}, J_k) $.  \n",
202 |     "1. Draw $ Y_k $ from $ K(Y_{k-1}, \\cdot) $.  \n",
203 |     "1. Set $ k = k+1 $ and go to step 2.  "
204 |    ]
205 |   },
206 |   {
207 |    "cell_type": "markdown",
208 |    "id": "e960a9d9",
209 |    "metadata": {},
210 |    "source": [
211 |     "The sequence $ (W_k) $ is drawn as an IID sequence and $ (W_k) $ and $ (Y_k) $ are\n",
212 |     "drawn independently.\n",
213 |     "\n",
214 |     "The restriction $ K(x,x) = 0 $ for all $ x $ implies that $ (X_t) $ actually jumps at each jump time."
215 |    ]
216 |   },
217 |   {
218 |    "cell_type": "markdown",
219 |    "id": "7a6286e6",
220 |    "metadata": {},
221 |    "source": [
222 |     "## Computing the Semigroup\n",
223 |     "\n",
224 |     "For the jump process $ (X_t) $ with time varying intensities described in the\n",
225 |     "jump chain algorithm, calculating the Markov semigroup is not a trivial exercise.\n",
226 |     "\n",
227 |     "The approach we adopt is\n",
228 |     "\n",
229 |     "1. Use probabilistic reasoning to obtain an integral equation that the\n",
230 |     "  semigroup must satisfy.  \n",
231 |     "1. Convert the integral equation into a differential equation that is easier\n",
232 |     "  to work with.  \n",
233 |     "1. Solve this differential equation to obtain the Markov semigroup $ (P_t) $.  \n",
234 |     "\n",
235 |     "\n",
236 |     "The differential equation in question has a special name:  the Kolmogorov backward equation."
237 |    ]
238 |   },
239 |   {
240 |    "cell_type": "markdown",
241 |    "id": "4122be41",
242 |    "metadata": {},
243 |    "source": [
244 |     "### An Integral Equation\n",
245 |     "\n",
246 |     "Here is the first step in the sequence listed above."
247 |    ]
248 |   },
249 |   {
250 |    "cell_type": "markdown",
251 |    "id": "ea53c03c",
252 |    "metadata": {},
253 |    "source": [
254 |     "**Lemma 4.1** (An Integral Equation)\n",
255 |     "\n",
256 |     "The semigroup $ (P_t) $ of the jump chain with rate function $ \\lambda $ and Markov matrix $ K $ obeys the integral equation\n",
257 |     "\n",
258 |     "\n",
259 |     "<a id='equation-kbinteg'></a>\n",
260 |     "$$\n",
261 |     "P_t(x, y) = e^{-t \\lambda(x)} I(x, y)\n",
262 |     "    + \\lambda(x) \n",
263 |     "      \\int_0^t (K P_{t-\\tau})(x, y) e^{- \\tau \\lambda(x)} d \\tau \\tag{4.1}\n",
264 |     "$$\n",
265 |     "\n",
266 |     "for all $ t \\geq 0 $ and $ x, y $ in $ S $."
267 |    ]
268 |   },
269 |   {
270 |    "cell_type": "markdown",
271 |    "id": "98390204",
272 |    "metadata": {},
273 |    "source": [
274 |     "Here $ (P_t) $ is the Markov semigroup of $ (X_t) $, the process constructed via\n",
275 |     "[Algorithm 4.1](#ejc_algo), while $ K P_{t-\\tau} $ is the matrix product of $ K $ and\n",
276 |     "$ P_{t-\\tau} $."
277 |    ]
278 |   },
279 |   {
280 |    "cell_type": "markdown",
281 |    "id": "a2207c78",
282 |    "metadata": {},
283 |    "source": [
284 |     "**Proof.**\n",
285 |     "\n",
286 |     "Proof. Conditioning implicitly on $ X_0 = x $, the semigroup $ (P_t) $ must satisfy\n",
287 |     "\n",
288 |     "\n",
289 |     "<a id='equation-pt-split'></a>\n",
290 |     "$$\n",
291 |     "P_t(x, y) \n",
292 |     "    = \\PP\\{X_t = y\\}\n",
293 |     "    = \\PP\\{X_t = y, \\; J_1 > t \\}\n",
294 |     "        + \\PP\\{X_t = y, \\; J_1 \\leq t \\} \\tag{4.2}\n",
295 |     "$$\n",
296 |     "\n",
297 |     "Regarding the first term on the right hand side of [(4.2)](#equation-pt-split), we have\n",
298 |     "\n",
299 |     "\n",
300 |     "<a id='equation-pt-first'></a>\n",
301 |     "$$\n",
302 |     "\\PP\\{X_t = y, \\; J_1 > t \\}\n",
303 |     "        = I(x, y) P\\{J_1 > t \\}\n",
304 |     "        = I(x, y) e^{- t \\lambda(x)} \\tag{4.3}\n",
305 |     "$$\n",
306 |     "\n",
307 |     "where $ I(x, y) = \\mathbb 1\\{x = y\\} $.\n",
308 |     "\n",
309 |     "For the second term on the right hand side of [(4.2)](#equation-pt-split), we have\n",
310 |     "\n",
311 |     "$$\n",
312 |     "\\PP\\{X_t = y, \\; J_1 \\leq t \\}\n",
313 |     "    = \\EE \n",
314 |     "        \\left[\n",
315 |     "            \\mathbb 1\\{J_1 \\leq t\\} \\PP\\{X_t = y \\,|\\, W_1, Y_1\\}\n",
316 |     "        \\right]\n",
317 |     "    = \\EE \n",
318 |     "        \\left[\n",
319 |     "            \\mathbb 1\\{J_1 \\leq t\\} P_{t - J_1} (Y_1, y) \n",
320 |     "        \\right]\n",
321 |     "$$\n",
322 |     "\n",
323 |     "Evaluating the expectation and using the independence of $ J_1 $ and $ Y_1 $, this becomes\n",
324 |     "\n",
325 |     "$$\n",
326 |     "\\begin{aligned}\n",
327 |     "    \\PP\\{X_t = y, \\; J_1 \\leq t \\}\n",
328 |     "    & = \\int_0^\\infty\n",
329 |     "            \\mathbb 1\\{\\tau \\leq t\\}\n",
330 |     "            \\sum_z K(x, z) P_{t - \\tau} (z, y)  \\lambda(x) e^{-\\tau \\lambda(x)} \n",
331 |     "            d \\tau\n",
332 |     "        \\\\\n",
333 |     "    & = \\lambda(x)\n",
334 |     "            \\int_0^t\n",
335 |     "            \\sum_z K(x, z) P_{t - \\tau} (z, y)  e^{-\\tau \\lambda(x)} \n",
336 |     "            d \\tau\n",
337 |     "\\end{aligned}\n",
338 |     "$$\n",
339 |     "\n",
340 |     "Combining this result with [(4.2)](#equation-pt-split) and [(4.3)](#equation-pt-first) gives\n",
341 |     "[(4.1)](#equation-kbinteg)."
342 |    ]
343 |   },
344 |   {
345 |    "cell_type": "markdown",
346 |    "id": "e2b9a705",
347 |    "metadata": {},
348 |    "source": [
349 |     "### Kolmogorov’s Differential Equation\n",
350 |     "\n",
351 |     "We have now confirmed that the semigroup $ (P_t) $ associated with the jump\n",
352 |     "chain process $ (X_t) $ satisfies [(4.1)](#equation-kbinteg).\n",
353 |     "\n",
354 |     "Equation [(4.1)](#equation-kbinteg) is important but we can simplify it further without\n",
355 |     "losing information by taking the time derivative.\n",
356 |     "\n",
357 |     "This leads to our main result for the lecture"
358 |    ]
359 |   },
360 |   {
361 |    "cell_type": "markdown",
362 |    "id": "0efe2e20",
363 |    "metadata": {},
364 |    "source": [
365 |     "**Theorem 4.1** (Kolmogorov Backward Equation)\n",
366 |     "\n",
367 |     "The semigroup $ (P_t) $ of the jump chain with rate function $ \\lambda $ and Markov matrix $ K $ satisfies the **Kolmogorov backward equation**\n",
368 |     "\n",
369 |     "\n",
370 |     "<a id='equation-kolbackeq'></a>\n",
371 |     "$$\n",
372 |     "P'_t = Q P_t \n",
373 |     "    \\quad \\text{where } \\;\n",
374 |     "    Q(x, y) := \\lambda(x) (K(x, y) - I(x, y)) \\tag{4.4}\n",
375 |     "$$"
376 |    ]
377 |   },
378 |   {
379 |    "cell_type": "markdown",
380 |    "id": "152e57b9",
381 |    "metadata": {},
382 |    "source": [
383 |     "The derivative on the left hand side of [(4.4)](#equation-kolbackeq) is taken element by\n",
384 |     "element, with respect to $ t $, so that\n",
385 |     "\n",
386 |     "$$\n",
387 |     "P'_t(x, y) = \\left( \\frac{d}{dt} P_t(x, y) \\right)\n",
388 |     "    \\qquad ((x, y) \\in S \\times S)\n",
389 |     "$$\n",
390 |     "\n",
391 |     "The proof that differentiating [(4.1)](#equation-kbinteg) yields [(4.4)](#equation-kolbackeq) is an\n",
392 |     "important exercise (see below)."
393 |    ]
394 |   },
395 |   {
396 |    "cell_type": "markdown",
397 |    "id": "a5410cc7",
398 |    "metadata": {},
399 |    "source": [
400 |     "### Exponential Solution\n",
401 |     "\n",
402 |     "The Kolmogorov backward equation is a matrix-valued differential equation.\n",
403 |     "\n",
404 |     "Recall that, for a scalar differential equation $ y'_t = a y_t $ with constant\n",
405 |     "$ a $ and initial condition $ y_0 $, the solution is $ y_t = e^{ta} y_0 $.\n",
406 |     "\n",
407 |     "This, along with $ P_0 = I $, encourages us to guess that the solution to\n",
408 |     "Kolmogorov’s backward equation [(4.4)](#equation-kolbackeq) is\n",
409 |     "\n",
410 |     "\n",
411 |     "<a id='equation-expsol'></a>\n",
412 |     "$$\n",
413 |     "P_t = e^{t Q} \\tag{4.5}\n",
414 |     "$$\n",
415 |     "\n",
416 |     "where the right hand side is the [matrix exponential](https://en.wikipedia.org/wiki/Matrix_exponential), with definition\n",
417 |     "\n",
418 |     "\n",
419 |     "<a id='equation-expofun'></a>\n",
420 |     "$$\n",
421 |     "e^{tQ} \n",
422 |     "    = \\sum_{k \\geq 0} \\frac{1}{k!} (tQ)^k\n",
423 |     "    = I + tQ + \\frac{t^2}{2!} Q^2 + \\cdots \\tag{4.6}\n",
424 |     "$$\n",
425 |     "\n",
426 |     "Working element by element, it is not difficult to confirm that\n",
427 |     "the derivative of the exponential function $ t \\mapsto e^{tQ} $ is\n",
428 |     "\n",
429 |     "\n",
430 |     "<a id='equation-expoderiv'></a>\n",
431 |     "$$\n",
432 |     "\\frac{d}{dt} e^{t Q} = Q e^{t Q} = e^{t Q} Q \\tag{4.7}\n",
433 |     "$$\n",
434 |     "\n",
435 |     "Hence, differentiating [(4.5)](#equation-expsol) gives $ P'_t = Q e^{t Q} = Q P_t $, which\n",
436 |     "convinces us that the exponential solution satisfies [(4.4)](#equation-kolbackeq).\n",
437 |     "\n",
438 |     "Notice that our solution\n",
439 |     "\n",
440 |     "\n",
441 |     "<a id='equation-psolq'></a>\n",
442 |     "$$\n",
443 |     "P_t = e^{t Q} \n",
444 |     "    \\quad \\text{where } \\;\n",
445 |     "    Q(x, y) := \\lambda(x) (K(x, y) - I(x, y)) \\tag{4.8}\n",
446 |     "$$\n",
447 |     "\n",
448 |     "for the semigroup of the jump process $ (X_t) $ associated with the jump matrix\n",
449 |     "$ K $ and the jump intensity function $ \\lambda \\colon S \\to (0, \\infty) $ is\n",
450 |     "consistent with our earlier result.\n",
451 |     "\n",
452 |     "In particular, we [showed](https://continuous-time-mcs.quantecon.org/markov_prop.html#consjumptransemi) that, for the model with\n",
453 |     "constant jump intensity $ \\lambda $, we have $ P_t = e^{t \\lambda (K - I)} $.\n",
454 |     "\n",
455 |     "This is obviously a special case of [(4.8)](#equation-psolq)."
456 |    ]
457 |   },
458 |   {
459 |    "cell_type": "markdown",
460 |    "id": "08997782",
461 |    "metadata": {},
462 |    "source": [
463 |     "## Properties of the Solution\n",
464 |     "\n",
465 |     "Let’s investigate further the properties of the exponential solution."
466 |    ]
467 |   },
468 |   {
469 |    "cell_type": "markdown",
470 |    "id": "253a9a7e",
471 |    "metadata": {},
472 |    "source": [
473 |     "### Checking the Transition Semigroup Properties\n",
474 |     "\n",
475 |     "While we have confirmed that $ P_t = e^{t Q} $ solves the Kolmogorov backward\n",
476 |     "equation, we still need to check that this solution is a Markov semigroup."
477 |    ]
478 |   },
479 |   {
480 |    "cell_type": "markdown",
481 |    "id": "5390ac5a",
482 |    "metadata": {},
483 |    "source": [
484 |     "**Lemma 4.2** (From Jump Chain to Semigroup)\n",
485 |     "\n",
486 |     "Let $ \\lambda $ map $ S $ to $ \\RR_+ $ and let $ K $ be a Markov matrix on $ S $.\n",
487 |     "If $ P_t = e^{t Q} $ for all $ t \\geq 0 $, where\n",
488 |     "$ Q(x, y) = \\lambda(x) (K(x, y) - I(x, y)) $, then $ (P_t) $ is a Markov\n",
489 |     "semigroup on $ S $."
490 |    ]
491 |   },
492 |   {
493 |    "cell_type": "markdown",
494 |    "id": "755465cf",
495 |    "metadata": {},
496 |    "source": [
497 |     "**Proof.**\n",
498 |     "\n",
499 |     "Proof. Observe first that $ Q $ has zero row sums, since\n",
500 |     "\n",
501 |     "$$\n",
502 |     "\\sum_y Q(x, y) \n",
503 |     "    = \\lambda(x) \\sum_y (K(x, y) - I(x, y))\n",
504 |     "    = 0\n",
505 |     "$$\n",
506 |     "\n",
507 |     "As a small exercise, you can check that, with $ 1 $\n",
508 |     "representing a column vector of ones, the following is true\n",
509 |     "\n",
510 |     "\n",
511 |     "<a id='equation-zrsnec'></a>\n",
512 |     "$$\n",
513 |     "Q \\text{ has zero row sums }\n",
514 |     "    \\iff\n",
515 |     "    Q^k 1 = 0 \\text{ for all } k \\geq 1 \\tag{4.9}\n",
516 |     "$$\n",
517 |     "\n",
518 |     "This implies that $ Q^k 1 = 0 $ for all $ k $ and, as a result, for any $ t \\geq\n",
519 |     "0 $,\n",
520 |     "\n",
521 |     "$$\n",
522 |     "P_t 1\n",
523 |     "    = e^{tQ} 1\n",
524 |     "    = I1 + tQ1 + \\frac{t^2}{2!} Q^2 1 + \\cdots\n",
525 |     "    = I1 = 1\n",
526 |     "$$\n",
527 |     "\n",
528 |     "In other words, each $ P_t $ has unit row sums.\n",
529 |     "\n",
530 |     "Next we check nonnegativity of all elements of $ P_t $ (which can easily fail for\n",
531 |     "matrix exponentials).\n",
532 |     "\n",
533 |     "To this end, adopting an argument from [[Stroock, 2013](https://continuous-time-mcs.quantecon.org/zreferences.html#id14)], we\n",
534 |     "set $ m := \\max_x \\lambda(x) $ and $ \\hat P := I + Q / m $.\n",
535 |     "\n",
536 |     "It is not difficult to check that $ \\hat P $ is a Markov matrix and $ Q = m( \\hat\n",
537 |     "P - I) $.\n",
538 |     "\n",
539 |     "Recalling that, for matrix exponentials, $ e^{A+B} = e^A e^B $ whenever $ AB =\n",
540 |     "BA $, we have\n",
541 |     "\n",
542 |     "$$\n",
543 |     "e^{tQ} \n",
544 |     "    = e^{tm (\\hat P - I)} \n",
545 |     "    = e^{-tm I} e^{tm \\hat P}\n",
546 |     "    = e^{-tm} \n",
547 |     "        \\left( \n",
548 |     "            I + tm \\hat P + \\frac{(tm)^2}{2!} \\hat P^2 + \\cdots\n",
549 |     "        \\right)\n",
550 |     "$$\n",
551 |     "\n",
552 |     "It is clear from this representation that all entries of $ e^{tQ} $ are\n",
553 |     "nonnegative.\n",
554 |     "\n",
555 |     "Finally, we need to check the continuity condition $ P_t(x, y) \\to I(x,y) $ as\n",
556 |     "$ t \\to 0 $, which is also part of the definition of a Markov semigroup.\n",
557 |     "This is immediate, in the present case, because the exponential function is\n",
558 |     "continuous, and hence $ P_t = e^{tQ} \\to e^0 = I $."
559 |    ]
560 |   },
561 |   {
562 |    "cell_type": "markdown",
563 |    "id": "8301ff1e",
564 |    "metadata": {},
565 |    "source": [
566 |     "We can now be reassured that our solution to the Kolmogorov backward equation\n",
567 |     "is indeed a Markov semigroup."
568 |    ]
569 |   },
570 |   {
571 |    "cell_type": "markdown",
572 |    "id": "7539220c",
573 |    "metadata": {},
574 |    "source": [
575 |     "### Uniqueness\n",
576 |     "\n",
577 |     "Might there be another, entirely different Markov semigroup that also\n",
578 |     "satisfies the Kolmogorov backward equation?\n",
579 |     "\n",
580 |     "The answer is no:  linear ODEs in finite dimensional vector space with\n",
581 |     "constant coefficients and fixed initial conditions (in this case $ P_0 = I $)\n",
582 |     "have unique solutions.\n",
583 |     "\n",
584 |     "In fact it’s not hard to supply a proof — see the exercises."
585 |    ]
586 |   },
587 |   {
588 |    "cell_type": "markdown",
589 |    "id": "0c6352f1",
590 |    "metadata": {},
591 |    "source": [
592 |     "## Application: The Inventory Model\n",
593 |     "\n",
594 |     "Let us look at a modified version of the inventory model where jump\n",
595 |     "intensities depend on the state.\n",
596 |     "\n",
597 |     "In particular, the wait time for new inventory will now be exponential at rate\n",
598 |     "$ \\gamma $.\n",
599 |     "\n",
600 |     "The arrival rate for customers will still be denoted by $ \\lambda $ and allowed\n",
601 |     "to differ from $ \\gamma $.\n",
602 |     "\n",
603 |     "For parameters we take"
604 |    ]
605 |   },
606 |   {
607 |    "cell_type": "code",
608 |    "execution_count": null,
609 |    "id": "23c58667",
610 |    "metadata": {
611 |     "hide-output": false
612 |    },
613 |    "outputs": [],
614 |    "source": [
615 |     "α = 0.6\n",
616 |     "λ = 0.5\n",
617 |     "γ = 0.1\n",
618 |     "b = 10"
619 |    ]
620 |   },
621 |   {
622 |    "cell_type": "markdown",
623 |    "id": "31c0ab0e",
624 |    "metadata": {},
625 |    "source": [
626 |     "Our plan is to investigate the distribution $ \\psi_T $ of $ X_T $ at $ T=30 $.\n",
627 |     "\n",
628 |     "We will do this by simulating many independent draws of $ X_T $ and\n",
629 |     "histogramming them.\n",
630 |     "\n",
631 |     "(In the exercises you are asked to calculate $ \\psi_T $ a different way, via\n",
632 |     "[(4.8)](#equation-psolq).)"
633 |    ]
634 |   },
635 |   {
636 |    "cell_type": "code",
637 |    "execution_count": null,
638 |    "id": "c11eb51d",
639 |    "metadata": {
640 |     "hide-output": false
641 |    },
642 |    "outputs": [],
643 |    "source": [
644 |     "@njit\n",
645 |     "def draw_X(T, X_0, max_iter=5000):\n",
646 |     "    \"\"\"\n",
647 |     "    Generate one draw of X_T given X_0.\n",
648 |     "    \"\"\"\n",
649 |     "\n",
650 |     "    J, Y = 0, X_0\n",
651 |     "    m = 0\n",
652 |     "\n",
653 |     "    while m < max_iter:\n",
654 |     "        s = 1/γ if Y == 0 else 1/λ\n",
655 |     "        W = np.random.exponential(scale=s)  # W ~ E(λ)\n",
656 |     "        J += W\n",
657 |     "        if J >= T:\n",
658 |     "            return Y\n",
659 |     "        # Otherwise update Y\n",
660 |     "        if Y == 0:\n",
661 |     "            Y = b\n",
662 |     "        else:\n",
663 |     "            U = np.random.geometric(α)\n",
664 |     "            Y = Y - min(Y, U)\n",
665 |     "        m += 1\n",
666 |     "\n",
667 |     "\n",
668 |     "@njit\n",
669 |     "def independent_draws(T=10, num_draws=100):\n",
670 |     "    \"Generate a vector of independent draws of X_T.\"\n",
671 |     "\n",
672 |     "    draws = np.empty(num_draws, dtype=np.int64)\n",
673 |     "\n",
674 |     "    for i in range(num_draws):\n",
675 |     "        X_0 = np.random.binomial(b+1, 0.25)\n",
676 |     "        draws[i] = draw_X(T, X_0)\n",
677 |     "\n",
678 |     "    return draws"
679 |    ]
680 |   },
681 |   {
682 |    "cell_type": "code",
683 |    "execution_count": null,
684 |    "id": "c42d7aa5",
685 |    "metadata": {
686 |     "hide-output": false
687 |    },
688 |    "outputs": [],
689 |    "source": [
690 |     "T = 30\n",
691 |     "n = b + 1\n",
692 |     "draws = independent_draws(T, num_draws=100_000)\n",
693 |     "fig, ax = plt.subplots()\n",
694 |     "\n",
695 |     "ax.bar(range(n), [np.mean(draws == i) for i in range(n)], width=0.8, alpha=0.6)\n",
696 |     "ax.set_xlabel(\"inventory\", fontsize=14)\n",
697 |     "\n",
698 |     "plt.show()"
699 |    ]
700 |   },
701 |   {
702 |    "cell_type": "markdown",
703 |    "id": "ec45e8d6",
704 |    "metadata": {},
705 |    "source": [
706 |     "If you experiment with the code above, you will see that the large amount of\n",
707 |     "mass on zero is due to the low arrival rate $ \\gamma $ for inventory."
708 |    ]
709 |   },
710 |   {
711 |    "cell_type": "markdown",
712 |    "id": "7f64003e",
713 |    "metadata": {},
714 |    "source": [
715 |     "## Exercises"
716 |    ]
717 |   },
718 |   {
719 |    "cell_type": "markdown",
720 |    "id": "01fb9c9f",
721 |    "metadata": {},
722 |    "source": [
723 |     "## Exercise 4.1\n",
724 |     "\n",
725 |     "In the discussion above, we generated an approximation of $ \\psi_T $ when\n",
726 |     "$ T=30 $, the initial condition is Binomial$ (n, 0.25) $ and parameters\n",
727 |     "are set to"
728 |    ]
729 |   },
730 |   {
731 |    "cell_type": "code",
732 |    "execution_count": null,
733 |    "id": "2e941573",
734 |    "metadata": {
735 |     "hide-output": false
736 |    },
737 |    "outputs": [],
738 |    "source": [
739 |     "α = 0.6\n",
740 |     "λ = 0.5\n",
741 |     "γ = 0.1\n",
742 |     "b = 10"
743 |    ]
744 |   },
745 |   {
746 |    "cell_type": "markdown",
747 |    "id": "4c10866b",
748 |    "metadata": {},
749 |    "source": [
750 |     "The calculation was done by simulating independent draws and histogramming.\n",
751 |     "\n",
752 |     "Try to generate the same figure using [(4.8)](#equation-psolq) instead, modifying code from\n",
753 |     "[our lecture](https://continuous-time-mcs.quantecon.org/markov_prop.html) on the Markov property."
754 |    ]
755 |   },
756 |   {
757 |    "cell_type": "markdown",
758 |    "id": "58820260",
759 |    "metadata": {},
760 |    "source": [
761 |     "## Solution to[ Exercise 4.1](https://continuous-time-mcs.quantecon.org/#kolmogorov-bwd-1)\n",
762 |     "\n",
763 |     "Here is one solution:"
764 |    ]
765 |   },
766 |   {
767 |    "cell_type": "code",
768 |    "execution_count": null,
769 |    "id": "56d133ad",
770 |    "metadata": {
771 |     "hide-output": false
772 |    },
773 |    "outputs": [],
774 |    "source": [
775 |     "α = 0.6\n",
776 |     "λ = 0.5\n",
777 |     "γ = 0.1\n",
778 |     "b = 10\n",
779 |     "\n",
780 |     "states = np.arange(n)\n",
781 |     "I = np.identity(n)\n",
782 |     "\n",
783 |     "# Embedded jump chain matrix\n",
784 |     "K = np.zeros((n, n))\n",
785 |     "K[0, -1] = 1\n",
786 |     "for i in range(1, n):\n",
787 |     "    for j in range(0, i):\n",
788 |     "        if j == 0:\n",
789 |     "            K[i, j] = (1 - α)**(i-1)\n",
790 |     "        else:\n",
791 |     "            K[i, j] = α * (1 - α)**(i-j-1)\n",
792 |     "\n",
793 |     "# Jump intensities as a function of the state\n",
794 |     "r = np.ones(n) * λ\n",
795 |     "r[0] = γ\n",
796 |     "\n",
797 |     "# Q matrix\n",
798 |     "Q = np.empty_like(K)\n",
799 |     "for i in range(n):\n",
800 |     "    for j in range(n):\n",
801 |     "        Q[i, j] = r[i] * (K[i, j] - I[i, j])\n",
802 |     "\n",
803 |     "def P_t(ψ, t):\n",
804 |     "    return ψ @ expm(t * Q)\n",
805 |     "\n",
806 |     "ψ_0 = binom.pmf(states, n, 0.25)\n",
807 |     "ψ_T = P_t(ψ_0, T)\n",
808 |     "\n",
809 |     "fig, ax = plt.subplots()\n",
810 |     "\n",
811 |     "ax.bar(range(n), ψ_T, width=0.8, alpha=0.6)\n",
812 |     "ax.set_xlabel(\"inventory\", fontsize=14)\n",
813 |     "\n",
814 |     "plt.show()"
815 |    ]
816 |   },
817 |   {
818 |    "cell_type": "markdown",
819 |    "id": "097adf33",
820 |    "metadata": {},
821 |    "source": [
822 |     "## Exercise 4.2\n",
823 |     "\n",
824 |     "Prove that differentiating [(4.1)](#equation-kbinteg) at each $ (x, y) $ yields [(4.4)](#equation-kolbackeq)."
825 |    ]
826 |   },
827 |   {
828 |    "cell_type": "markdown",
829 |    "id": "f2e976e6",
830 |    "metadata": {},
831 |    "source": [
832 |     "## Solution to[ Exercise 4.2](https://continuous-time-mcs.quantecon.org/#kolmogorov-bwd-2)\n",
833 |     "\n",
834 |     "One can easily verify that, when $ f $ is a differentiable function and $ \\alpha >\n",
835 |     "0 $, we have\n",
836 |     "\n",
837 |     "\n",
838 |     "<a id='equation-gdiff'></a>\n",
839 |     "$$\n",
840 |     "g(t) = e^{- t \\alpha} f(t)\n",
841 |     "\\quad \\implies \\quad\n",
842 |     "g'(t) = e^{- t \\alpha} f'(t) - \\alpha g(t) \\tag{4.10}\n",
843 |     "$$\n",
844 |     "\n",
845 |     "Note also that, with the change of variable $ s = t - \\tau $, we can rewrite\n",
846 |     "[(4.1)](#equation-kbinteg) as\n",
847 |     "\n",
848 |     "\n",
849 |     "<a id='equation-kbinteg2'></a>\n",
850 |     "$$\n",
851 |     "P_t(x, y) =\n",
852 |     "e^{-t \\lambda(x)}\n",
853 |     "\\left\\{\n",
854 |     "    I(x, y)\n",
855 |     "    + \\lambda(x)\n",
856 |     "    \\int_0^t (K P_s)(x, y) e^{s \\lambda(x)} d s\n",
857 |     "\\right\\} \\tag{4.11}\n",
858 |     "$$\n",
859 |     "\n",
860 |     "Applying [(4.10)](#equation-gdiff) yields\n",
861 |     "\n",
862 |     "$$\n",
863 |     "P'_t(x, y)\n",
864 |     "= e^{-t \\lambda(x)}\n",
865 |     "    \\left\\{ \n",
866 |     "            \\lambda(x)\n",
867 |     "            (K P_t)(x, y) e^{t \\lambda(x)}\n",
868 |     "    \\right\\}\n",
869 |     "    - \\lambda(x) P_t(x, y)\n",
870 |     "$$\n",
871 |     "\n",
872 |     "After minor rearrangements this becomes\n",
873 |     "\n",
874 |     "$$\n",
875 |     "P'_t(x, y)\n",
876 |     "= \\lambda(x) [ (K - I)  P_t](x, y)\n",
877 |     "$$\n",
878 |     "\n",
879 |     "which is identical to [(4.4)](#equation-kolbackeq)."
880 |    ]
881 |   },
882 |   {
883 |    "cell_type": "markdown",
884 |    "id": "bde64911",
885 |    "metadata": {},
886 |    "source": [
887 |     "## Exercise 4.3\n",
888 |     "\n",
889 |     "We claimed above that the solution $ P_t = e^{t Q} $ is the unique\n",
890 |     "Markov semigroup satisfying the backward equation $ P'_t = Q P_t $.\n",
891 |     "\n",
892 |     "Try to supply a proof.\n",
893 |     "\n",
894 |     "(This is not an easy exercise but worth thinking about in any case.)"
895 |    ]
896 |   },
897 |   {
898 |    "cell_type": "markdown",
899 |    "id": "8ceb2c9f",
900 |    "metadata": {},
901 |    "source": [
902 |     "## Solution to[ Exercise 4.3](https://continuous-time-mcs.quantecon.org/#kolmogorov-bwd-3)\n",
903 |     "\n",
904 |     "Here is one proof of uniqueness.\n",
905 |     "\n",
906 |     "Suppose that $ (\\hat P_t) $ is another Markov semigroup satisfying\n",
907 |     "$ P'_t = Q P_t $.\n",
908 |     "\n",
909 |     "Fix $ t > 0 $ and let $ V_s $ be defined by $ V_s = P_s \\hat P_{t-s} $ for all $ s\n",
910 |     "\\geq 0 $.\n",
911 |     "\n",
912 |     "Note that $ V_0 = \\hat P_t $ and $ V_t = P_t $.\n",
913 |     "\n",
914 |     "Note also that $ s \\mapsto V_s $ is differentiable, with derivative\n",
915 |     "\n",
916 |     "$$\n",
917 |     "V'_s \n",
918 |     "= P'_s \\hat P_{t-s} - P_s \\hat P'_{t-s}\n",
919 |     "= P_s Q \\hat P_{t-s} - P_s Q \\hat P_{t-s}\n",
920 |     "= 0\n",
921 |     "$$\n",
922 |     "\n",
923 |     "where, in the second last equality, we used [(4.7)](#equation-expoderiv).\n",
924 |     "\n",
925 |     "Hence $ V_s $ is constant, so our previous observations $ V_0 = \\hat P_t $ and $ V_t = P_t $\n",
926 |     "now yield $ \\hat P_t = P_t $.\n",
927 |     "\n",
928 |     "Since $ t $ was arbitrary, the proof is now done."
929 |    ]
930 |   }
931 |  ],
932 |  "metadata": {
933 |   "date": 1763463649.7927132,
934 |   "filename": "kolmogorov_bwd.md",
935 |   "kernelspec": {
936 |    "display_name": "Python",
937 |    "language": "python3",
938 |    "name": "python3"
939 |   },
940 |   "title": "The Kolmogorov Backward Equation"
941 |  },
942 |  "nbformat": 4,
943 |  "nbformat_minor": 5
944 | }


--------------------------------------------------------------------------------
/uc_mc_semigroups.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "id": "014a76b5",
  6 |    "metadata": {},
  7 |    "source": [
  8 |     "$$\n",
  9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
 10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
 11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
 12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
 13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
 14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
 15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
 16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
 17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
 18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
 19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
 20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
 21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
 22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
 23 |     "$$"
 24 |    ]
 25 |   },
 26 |   {
 27 |    "cell_type": "markdown",
 28 |    "id": "9d00e584",
 29 |    "metadata": {},
 30 |    "source": [
 31 |     "# UC Markov Semigroups"
 32 |    ]
 33 |   },
 34 |   {
 35 |    "cell_type": "markdown",
 36 |    "id": "993e290a",
 37 |    "metadata": {},
 38 |    "source": [
 39 |     "## Overview\n",
 40 |     "\n",
 41 |     "In our previous lecture we covered some of the general theory of operator\n",
 42 |     "semigroups.\n",
 43 |     "\n",
 44 |     "Next we translate these results into the setting of Markov semigroups.\n",
 45 |     "\n",
 46 |     "The Markov semigroups are defined on a countable set $ S $.\n",
 47 |     "\n",
 48 |     "The main aim is to give an exact one-to-one correspondence between\n",
 49 |     "\n",
 50 |     "- UC Markov semigroups  \n",
 51 |     "- “conservative” intensity matrices and  \n",
 52 |     "- jump chains with state dependent jump intensities  \n",
 53 |     "\n",
 54 |     "\n",
 55 |     "Conservativeness is defined below and relates to “nonexplosiveness” of the\n",
 56 |     "associated Markov chain.\n",
 57 |     "\n",
 58 |     "We will also give a brief discussion of intensity matrices that do not have\n",
 59 |     "this property, along with the processes they generate."
 60 |    ]
 61 |   },
 62 |   {
 63 |    "cell_type": "markdown",
 64 |    "id": "ae1d8146",
 65 |    "metadata": {},
 66 |    "source": [
 67 |     "## Notation and Terminology\n",
 68 |     "\n",
 69 |     "Let $ S $ be an arbitrary countable set.\n",
 70 |     "\n",
 71 |     "Let $ \\ell_1 $ be the Banach space of **summable functions** on $ S $; that is, all $ g \\, \\colon S \\to \\RR $ with\n",
 72 |     "\n",
 73 |     "$$\n",
 74 |     "\\| g \\| := \\sum_x |g(x)| < \\infty\n",
 75 |     "$$\n",
 76 |     "\n",
 77 |     "Note that $ \\dD $, the set of all distributions on $ S $, is contained in $ \\ell_1 $.\n",
 78 |     "\n",
 79 |     "Each Markov matrix $ P $ on $ S $ can and will be identifed with a linear operator\n",
 80 |     "$ f \\mapsto fP $ on $ \\ell_1 $ via\n",
 81 |     "\n",
 82 |     "\n",
 83 |     "<a id='equation-mmismo'></a>\n",
 84 |     "$$\n",
 85 |     "(fP)(y) = \\sum_x f(x) P(x, y)\n",
 86 |     "    \\qquad (f \\in \\ell_1, \\; y \\in S) \\tag{7.1}\n",
 87 |     "$$\n",
 88 |     "\n",
 89 |     "To be consistent with earlier notation, we are writing the argument of $ P $ to\n",
 90 |     "the left and applying $ P $ to it as if premultiplying $ P $ by a row vector.\n",
 91 |     "\n",
 92 |     "In the exercises you are asked to verify that [(7.1)](#equation-mmismo) defines\n",
 93 |     "a bounded linear operator on $ \\ell_1 $ such that\n",
 94 |     "\n",
 95 |     "\n",
 96 |     "<a id='equation-propp'></a>\n",
 97 |     "$$\n",
 98 |     "\\|P\\| = 1 \\text{ and } \\phi P \\in \\dD \\text{ whenever } \\phi \\in \\dD \\tag{7.2}\n",
 99 |     "$$\n",
100 |     "\n",
101 |     "Note that composition of $ P $ with itself is equivalent to powers of the matrix\n",
102 |     "under matrix multiplication.\n",
103 |     "\n",
104 |     "For an intensity matrix $ Q $ on $ S $ we can try to introduce the associated\n",
105 |     "operator analogously, via\n",
106 |     "\n",
107 |     "\n",
108 |     "<a id='equation-imislo'></a>\n",
109 |     "$$\n",
110 |     "(fQ)(y) = \\sum_x f(x) Q(x, y)\n",
111 |     "    \\qquad (f \\in \\ell_1, \\; y \\in S) \\tag{7.3}\n",
112 |     "$$\n",
113 |     "\n",
114 |     "However, the sum in [(7.3)](#equation-imislo) is not always well defined.<sup><a href=#fnim id=fnim-link>[1]</a></sup>\n",
115 |     "\n",
116 |     "We say that an intensity matrix $ Q $ is **conservative** if the sum in\n",
117 |     "[(7.3)](#equation-imislo) is well defined at all $ y $ and, in addition, the mapping $ f\n",
118 |     "\\mapsto fQ $ in [(7.3)](#equation-imislo) is a bounded linear operator on $ \\ell_1 $.\n",
119 |     "\n",
120 |     "Below we show how this property can be checked in applications."
121 |    ]
122 |   },
123 |   {
124 |    "cell_type": "markdown",
125 |    "id": "d9c34631",
126 |    "metadata": {},
127 |    "source": [
128 |     "## UC Markov Semigroups and their Generators\n",
129 |     "\n",
130 |     "Let $ Q $ be a conservative intensity matrix on $ S $.\n",
131 |     "\n",
132 |     "Since $ Q $ is in $ \\linopell $, the operator exponential $ e^{tQ} $ is well defined\n",
133 |     "as an element of $ \\linopell $ for all $ t \\geq 0 $.\n",
134 |     "\n",
135 |     "Moreover, by [Example 6.3](https://continuous-time-mcs.quantecon.org/generators.html#ecuc), the family $ (P_t) $ in $ \\lL(\\ell_1) $ defined by\n",
136 |     "$ P_t = e^{tQ} $ defines a UC Markov semigroup on $ \\ell_1 $.\n",
137 |     "\n",
138 |     "(Here, a Markov semigroup $ (P_t) $ is both a collection of Markov matrices and a\n",
139 |     "collection of operators, as in [(7.1)](#equation-mmismo).)\n",
140 |     "\n",
141 |     "The next theorem says that this is the only way UC Markov semigroups can arise."
142 |    ]
143 |   },
144 |   {
145 |    "cell_type": "markdown",
146 |    "id": "eca81345",
147 |    "metadata": {},
148 |    "source": [
149 |     "**Theorem 7.1**\n",
150 |     "\n",
151 |     "If $ (P_t) $ is a UC Markov semigroup on $ \\ell_1 $, then there\n",
152 |     "exists a conservative intensity matrix $ Q $ such that $ P_t = e^{tQ} $ for all $ t \\geq 0 $."
153 |    ]
154 |   },
155 |   {
156 |    "cell_type": "markdown",
157 |    "id": "9d80dd1d",
158 |    "metadata": {},
159 |    "source": [
160 |     "**Proof.**\n",
161 |     "\n",
162 |     "Proof. Let $ (P_t) $ be a UC Markov semigroup on $ \\ell_1 $.\n",
163 |     "\n",
164 |     "Since $ (P_t) $ is a UC semigroup on $ \\ell_1 $, it follows from\n",
165 |     "[Theorem 6.1](https://continuous-time-mcs.quantecon.org/generators.html#ucsgec) that there exists a $ Q \\in \\lL(\\ell_1) $ such that\n",
166 |     "$ P_t = e^{tQ} $ for all $ t \\geq 0 $.\n",
167 |     "\n",
168 |     "We only need to show that $ Q $ is a conservative intensity matrix.\n",
169 |     "\n",
170 |     "Because $ (P_t) $ is a Markov semigroup, $ P_t $ is a Markov matrix for all $ t $,\n",
171 |     "and, since $ P_t = e^{tQ} $ for all $ t $, it follows that $ Q $ is an intensity\n",
172 |     "matrix.\n",
173 |     "\n",
174 |     "We proved this for the case $ |S| < \\infty $ in [Theorem 5.1](https://continuous-time-mcs.quantecon.org/kolmogorov_fwd.html#intvsmk) and\n",
175 |     "one can verify that the same arguments go through when $ |S| = \\infty $.\n",
176 |     "\n",
177 |     "As $ Q \\in \\lL(\\ell_1) $, we know that $ Q $ is a bounded operator, so $ Q $ is a\n",
178 |     "conservative intensity matrix."
179 |    ]
180 |   },
181 |   {
182 |    "cell_type": "markdown",
183 |    "id": "3e2215a7",
184 |    "metadata": {},
185 |    "source": [
186 |     "From [Theorem 7.1](#usmg) we can easily deduce that\n",
187 |     "\n",
188 |     "- $ P_t $ is differentiable at every $ t \\geq 0 $,  \n",
189 |     "- $ Q $ is the generator of $ (P_t) $ and  \n",
190 |     "- $ P_t' = Q P_t = P_t Q $ for all $ t \\geq 0 $.  \n",
191 |     "- $ P_0' = Q $  \n",
192 |     "\n",
193 |     "\n",
194 |     "In fact these results are just a special case of the claims in [Theorem 6.1](https://continuous-time-mcs.quantecon.org/generators.html#ucsgec).\n",
195 |     "\n",
196 |     "The second last of these results is the Kolmogorov forward and backward equations.\n",
197 |     "\n",
198 |     "The last of these results shows that we can obtain the intensity matrix $ Q $ by differentiating $ P_t $ at $ t=0 $."
199 |    ]
200 |   },
201 |   {
202 |    "cell_type": "markdown",
203 |    "id": "a1b56651",
204 |    "metadata": {},
205 |    "source": [
206 |     "**Example 7.1**\n",
207 |     "\n",
208 |     "Let us consider again the Poisson process $ (N_t) $ with rate $ \\lambda > 0 $\n",
209 |     "in light of the discussion above.\n",
210 |     "\n",
211 |     "The corresponding semigroup $ (P_t) $ is UC and hence there exists a\n",
212 |     "conservative intensity matrix $ Q $ with $ P_t = e^{tQ} $ for all $ t \\geq 0 $.\n",
213 |     "\n",
214 |     "This fact can be established by proving UC property and then appealing to\n",
215 |     "[Theorem 7.1](#usmg).\n",
216 |     "\n",
217 |     "Another alternative, easier in this case, is to supply the intensity matrix\n",
218 |     "$ Q $ directly and then verify that $ P_t = e^{tQ} $ holds.\n",
219 |     "\n",
220 |     "The semigroup for a Poisson process with rate $ \\lambda $ was given in\n",
221 |     "[(3.9)](https://continuous-time-mcs.quantecon.org/markov_prop.html#equation-poissemi) and is repeated here:\n",
222 |     "\n",
223 |     "\n",
224 |     "<a id='equation-poissemi2'></a>\n",
225 |     "$$\n",
226 |     "P_t(j, k) \n",
227 |     "    = \n",
228 |     "    \\begin{cases}\n",
229 |     "    e^{-\\lambda t} \\frac{ (\\lambda t)^{k-j} }{(k-j)!}  \n",
230 |     "        & \\text{ if } j \\leq k\n",
231 |     "        \\\\\n",
232 |     "    0 & \\text{ otherwise}\n",
233 |     "    \\end{cases} \\tag{7.4}\n",
234 |     "$$\n",
235 |     "\n",
236 |     "For the intensity matrix we take\n",
237 |     "\n",
238 |     "\n",
239 |     "<a id='equation-poissonq'></a>\n",
240 |     "$$\n",
241 |     "Q := \n",
242 |     "    \\begin{pmatrix}\n",
243 |     "    -\\lambda & \\lambda & 0 & 0 & 0 & \\cdots\n",
244 |     "    \\\\\n",
245 |     "    0 & -\\lambda & \\lambda & 0 & 0 & \\cdots\n",
246 |     "    \\\\\n",
247 |     "    0 & 0 & -\\lambda & \\lambda & 0 & \\cdots\n",
248 |     "    \\\\\n",
249 |     "    0 & 0 & 0 & -\\lambda & \\lambda & \\cdots\n",
250 |     "    \\\\\n",
251 |     "    \\vdots & \\vdots  & \\vdots  & \\vdots  & \\vdots \n",
252 |     "    \\end{pmatrix} \\tag{7.5}\n",
253 |     "$$\n",
254 |     "\n",
255 |     "The form of $ Q $ is intuitive: probability flows out of state $ i $ and into\n",
256 |     "state $ i+1 $ at the rate $ \\lambda $.\n",
257 |     "\n",
258 |     "It is immediate that $ Q $ is an intensity matrix, as claimed.\n",
259 |     "\n",
260 |     "The exercises ask you to confirm that $ Q $ is in $ \\lL(\\ell_1) $.\n",
261 |     "\n",
262 |     "To prove that $ P_t = e^{tQ} $ for any $ t \\geq 0 $, we first decompose $ Q $ as $ Q\n",
263 |     "= \\lambda (K - I) $, where $ K $ is defined by\n",
264 |     "\n",
265 |     "$$\n",
266 |     "K(i, j) = \\mathbb 1\\{j = i + 1\\}\n",
267 |     "$$\n",
268 |     "\n",
269 |     "For given $ t \\geq 0 $, we then have\n",
270 |     "\n",
271 |     "$$\n",
272 |     "e^{tQ}\n",
273 |     "    = e^{\\lambda t (K-I)}\n",
274 |     "    = e^{-\\lambda t} e^{\\lambda t K}\n",
275 |     "    = e^{-\\lambda t} \n",
276 |     "    \\sum_{m \\geq 0} \\frac{(\\lambda t K)^m}{m!}\n",
277 |     "$$\n",
278 |     "\n",
279 |     "The exercises ask you to verify that, for the powers of $ K $, we have $ K^m(i,\n",
280 |     "j) = \\mathbb 1\\{j = i + m\\} $.\n",
281 |     "\n",
282 |     "Inserting this expression for $ K^m $ leads to\n",
283 |     "\n",
284 |     "$$\n",
285 |     "e^{tQ}(i, j)\n",
286 |     "    = e^{-\\lambda t} \n",
287 |     "    \\sum_{m \\geq 0} \\frac{(\\lambda t )^m}{m!} \\mathbb 1\\{j = i + m\\}\n",
288 |     "    = e^{-\\lambda t} \n",
289 |     "    \\sum_{m \\geq 0} \\frac{(\\lambda t )^m}{m!} \\mathbb 1\\{m = j-i\\}\n",
290 |     "$$\n",
291 |     "\n",
292 |     "This is identical to [(7.4)](#equation-poissemi2).\n",
293 |     "\n",
294 |     "It now follows that $ t \\mapsto P_t \\in \\lL(\\ell_1) $ is differentiable at every\n",
295 |     "$ t \\geq 0 $ and $ Q $ is the generator of $ (P_t) $, with $ P_0' = Q $."
296 |    ]
297 |   },
298 |   {
299 |    "cell_type": "markdown",
300 |    "id": "5002893a",
301 |    "metadata": {},
302 |    "source": [
303 |     "### A Necessary and Sufficient Condition\n",
304 |     "\n",
305 |     "Our definition of a conservative intensity matrix works for the theory above\n",
306 |     "but can be hard to check in appliations and lacks probabilistic intuition.\n",
307 |     "\n",
308 |     "Fortunately, we have the following simple characterization."
309 |    ]
310 |   },
311 |   {
312 |    "cell_type": "markdown",
313 |    "id": "6a624d6e",
314 |    "metadata": {},
315 |    "source": [
316 |     "**Lemma 7.1**\n",
317 |     "\n",
318 |     "An intensity matrix $ Q $ on $ S $ is conservative if and only if $ \\sup_x |Q(x,\n",
319 |     "x)| $ is finite."
320 |    ]
321 |   },
322 |   {
323 |    "cell_type": "markdown",
324 |    "id": "79a2e5a8",
325 |    "metadata": {},
326 |    "source": [
327 |     "The proof is a solved exercise."
328 |    ]
329 |   },
330 |   {
331 |    "cell_type": "markdown",
332 |    "id": "92d8817a",
333 |    "metadata": {},
334 |    "source": [
335 |     "**Example 7.2**\n",
336 |     "\n",
337 |     "Recall the jump chain setting where, repeating [(4.4)](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#equation-kolbackeq), we defined $ Q $\n",
338 |     "via\n",
339 |     "\n",
340 |     "\n",
341 |     "<a id='equation-kolbackeq-inf'></a>\n",
342 |     "$$\n",
343 |     "Q(x, y) := \\lambda(x) (K(x, y) - I(x, y)) \\tag{7.6}\n",
344 |     "$$\n",
345 |     "\n",
346 |     "The function $ \\lambda \\colon S \\to \\RR_+ $ gives the jump rate at each state,\n",
347 |     "while $ K $ is the Markov matrix for the embedded discrete time jump chain.\n",
348 |     "\n",
349 |     "Previously we discussed this in the case where $ S $ is finite but there is no\n",
350 |     "need to restrict attention to that case.\n",
351 |     "\n",
352 |     "For general countable $ S $, the matrix $ Q $ defined in [(7.6)](#equation-kolbackeq-inf) is\n",
353 |     "still an intensity matrix.\n",
354 |     "\n",
355 |     "If we continue to assume that $ K(x,x) = 0 $ for all $ x $, then $ Q(x,x) = -\n",
356 |     "\\lambda(x) $.\n",
357 |     "\n",
358 |     "Hence, $ Q $ is conservative if and only if $ \\sup_x \\lambda(x) $ is finite.\n",
359 |     "\n",
360 |     "In other words, $ Q $ is conservative if the set of jump rates is bounded."
361 |    ]
362 |   },
363 |   {
364 |    "cell_type": "markdown",
365 |    "id": "357ee926",
366 |    "metadata": {},
367 |    "source": [
368 |     "This example shows that requiring $ Q $ to be conservative is a relatively mild\n",
369 |     "restriction."
370 |    ]
371 |   },
372 |   {
373 |    "cell_type": "markdown",
374 |    "id": "790c3095",
375 |    "metadata": {},
376 |    "source": [
377 |     "### The Finite State Case\n",
378 |     "\n",
379 |     "It is immediate from [Lemma 7.1](#scintcon) that every intensity matrix is\n",
380 |     "conservative when the state space $ S $ is finite.\n",
381 |     "\n",
382 |     "Hence, in this setting, every intensity matrix $ Q $ on $ S $ defines a UC Markov\n",
383 |     "semigroup $ (P_t) $ via $ P_t = e^{tQ} $.\n",
384 |     "\n",
385 |     "Conversely, if $ S $ is finite, then any Markov semigroup $ (P_t) $ is a UC Markov\n",
386 |     "semigroup.\n",
387 |     "\n",
388 |     "To see this, recall that, as a Markov semigroup, $ (P_t) $ satisfies\n",
389 |     "$ \\lim_{t \\to 0} P_t(x, y) = I(x,y) $ for all $ x,y $ in $ S $.\n",
390 |     "\n",
391 |     "In any finite dimensional space, pointwise convergence implies norm\n",
392 |     "convergence, so $ P_t \\to I $ in operator norm as $ t \\to 0 $ from above.\n",
393 |     "\n",
394 |     "As we saw previously, this is enough to ensure that $ t \\mapsto P_t $ is norm\n",
395 |     "continuous everywhere on $ \\RR_+ $.\n",
396 |     "\n",
397 |     "Hence $ (P_t) $ is a UC Markov semigroup, as claimed.\n",
398 |     "\n",
399 |     "Combining these results with [Theorem 7.1](#usmg), we conclude that, when $ S $ is\n",
400 |     "finite, there is a one-to-one correspondence between Markov semigroups and\n",
401 |     "intensity matrices."
402 |    ]
403 |   },
404 |   {
405 |    "cell_type": "markdown",
406 |    "id": "8204add9",
407 |    "metadata": {},
408 |    "source": [
409 |     "## From Intensity Matrix to Jump Chain\n",
410 |     "\n",
411 |     "We now understand that there is a one-to-one pairing between conservative\n",
412 |     "intensity matrices and UC Markov semigroups.\n",
413 |     "\n",
414 |     "These ideas are important from an analytical perspective.\n",
415 |     "\n",
416 |     "Now we provide another point of view, more connected to probability.\n",
417 |     "\n",
418 |     "This point of view is important for both theory and computation."
419 |    ]
420 |   },
421 |   {
422 |    "cell_type": "markdown",
423 |    "id": "effd8f84",
424 |    "metadata": {},
425 |    "source": [
426 |     "### Jump Chain Pairs\n",
427 |     "\n",
428 |     "Let us agree to call $ (\\lambda, K) $ a **jump chain pair** if $ \\lambda $ is a\n",
429 |     "map from $ S $ to $ \\RR_+ $ and $ K $ is a Markov matrix on $ S $.\n",
430 |     "\n",
431 |     "It is easy to verify that the matrix $ Q $ on $ S $ defined by\n",
432 |     "\n",
433 |     "\n",
434 |     "<a id='equation-jcinmat'></a>\n",
435 |     "$$\n",
436 |     "Q(x, y) := \\lambda(x) (K(x, y) - I(x, y)) \\tag{7.7}\n",
437 |     "$$\n",
438 |     "\n",
439 |     "is an intensity matrix.\n",
440 |     "\n",
441 |     "(We saw in [an earlier lecture](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html) that $ Q $ is the intensity\n",
442 |     "matrix for the jump chain $ (X_t) $ built via [Algorithm 4.1](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#ejc_algo) from jump\n",
443 |     "chain pair $ (\\lambda, K) $.)\n",
444 |     "\n",
445 |     "As we now show, every intensity matrix admits the decomposition in\n",
446 |     "[(7.7)](#equation-jcinmat) for some jump chain pair."
447 |    ]
448 |   },
449 |   {
450 |    "cell_type": "markdown",
451 |    "id": "ed569fa7",
452 |    "metadata": {},
453 |    "source": [
454 |     "### Jump Chain Decomposition\n",
455 |     "\n",
456 |     "Given an intensity matrix $ Q $, set\n",
457 |     "\n",
458 |     "\n",
459 |     "<a id='equation-lambdafromq'></a>\n",
460 |     "$$\n",
461 |     "\\lambda(x) := -Q(x, x)\n",
462 |     "    \\qquad (x \\in S) \\tag{7.8}\n",
463 |     "$$\n",
464 |     "\n",
465 |     "Next we build $ K $, first along the principle diagonal via\n",
466 |     "\n",
467 |     "\n",
468 |     "<a id='equation-kfromqxx'></a>\n",
469 |     "$$\n",
470 |     "K(x,x) = \n",
471 |     "    \\begin{cases}\n",
472 |     "        0 & \\text{ if } \\lambda(x) > 0\n",
473 |     "        \\\\\n",
474 |     "        1 & \\text{ otherwise}\n",
475 |     "    \\end{cases} \\tag{7.9}\n",
476 |     "$$\n",
477 |     "\n",
478 |     "Thus, if the rate of leaving $ x $ is positive, we set $ K(x,x) = 0 $, so that the\n",
479 |     "embedded jump chain moves away from $ x $ with probability one when the next\n",
480 |     "jump occurs.\n",
481 |     "\n",
482 |     "Otherwise, when $ Q(x,x) = 0 $, we stay at $ x $ forever, so $ x $ is an **absorbing\n",
483 |     "state**.\n",
484 |     "\n",
485 |     "Off the principle diagonal, where $ x \\not= y $, we set\n",
486 |     "\n",
487 |     "\n",
488 |     "<a id='equation-kfromqxy'></a>\n",
489 |     "$$\n",
490 |     "K(x,y) = \n",
491 |     "    \\begin{cases}\n",
492 |     "        \\frac{Q(x,y)}{\\lambda(x)} & \\text{ if } \\lambda(x) > 0\n",
493 |     "        \\\\\n",
494 |     "        0 & \\text{ otherwise }\n",
495 |     "    \\end{cases} \\tag{7.10}\n",
496 |     "$$\n",
497 |     "\n",
498 |     "The exercises below ask you to confirm that, for $ \\lambda $ and $ K $ just defined,\n",
499 |     "\n",
500 |     "1. $ (\\lambda, K) $ is a jump chain pair and  \n",
501 |     "1. the intensity matrix $ Q $ satisfies [(7.7)](#equation-jcinmat).  \n",
502 |     "\n",
503 |     "\n",
504 |     "We call $ (\\lambda, K) $ the **jump chain decomposition** of $ Q $.\n",
505 |     "\n",
506 |     "We summarize in a lemma."
507 |    ]
508 |   },
509 |   {
510 |    "cell_type": "markdown",
511 |    "id": "437fc031",
512 |    "metadata": {},
513 |    "source": [
514 |     "**Lemma 7.2**\n",
515 |     "\n",
516 |     "A matrix $ Q $ on $ S $ is an intensity matrix if and only if there exists a jump\n",
517 |     "chain pair $ (\\lambda, K) $ such that [(7.7)](#equation-jcinmat) holds."
518 |    ]
519 |   },
520 |   {
521 |    "cell_type": "markdown",
522 |    "id": "2e85a539",
523 |    "metadata": {},
524 |    "source": [
525 |     "### The Conservative Case\n",
526 |     "\n",
527 |     "We know from [Example 7.2](#jccs) that an intensity matrix $ Q $ is conservative\n",
528 |     "if and only if $ \\lambda $ is bounded.\n",
529 |     "\n",
530 |     "Moreover, we saw in [Theorem 7.1](#usmg) that the pairing between conservative\n",
531 |     "intensity matrices and UC Markov semigroups is one-to-one.\n",
532 |     "\n",
533 |     "This leads to the following result."
534 |    ]
535 |   },
536 |   {
537 |    "cell_type": "markdown",
538 |    "id": "17d81319",
539 |    "metadata": {},
540 |    "source": [
541 |     "**Theorem 7.2**\n",
542 |     "\n",
543 |     "On $ S $, there exists a one-to-one correspondence between the following sets of\n",
544 |     "objects:\n",
545 |     "\n",
546 |     "1. The set of all jump chain pairs $ (\\lambda, K) $ such that $ \\lambda $ is bounded.  \n",
547 |     "1. The set of all conservative intensity matrices.  \n",
548 |     "1. The set of all UC Markov semigroups.  "
549 |    ]
550 |   },
551 |   {
552 |    "cell_type": "markdown",
553 |    "id": "7c3ba842",
554 |    "metadata": {},
555 |    "source": [
556 |     "### Simulation\n",
557 |     "\n",
558 |     "In view of the preceding discussion, we have a simple way to simulate a Markov\n",
559 |     "chain given any conservative intensity matrix $ Q $.\n",
560 |     "\n",
561 |     "The steps are\n",
562 |     "\n",
563 |     "1. Decompose $ Q $ into a jump chain pair $ (\\lambda, K) $.  \n",
564 |     "1. Simulate via [Algorithm 4.1](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#ejc_algo).  \n",
565 |     "\n",
566 |     "\n",
567 |     "Recalling our discussion of the Kolmogorov backward equation, we know that\n",
568 |     "this produces a Markov chain with Markov semigroup\n",
569 |     "$ (P_t) $ where $ P_t = e^{tQ} $ for $ Q $ satisfying [(7.7)](#equation-jcinmat).\n",
570 |     "\n",
571 |     "(Although our argument assumed finite $ S $, the proof goes through when\n",
572 |     "$ S $ is countably infinite and $ Q $ is conservative with very minor changes.)\n",
573 |     "\n",
574 |     "In particular, $ (X_t) $ is a continuous time Markov chain with intensity matrix\n",
575 |     "$ Q $."
576 |    ]
577 |   },
578 |   {
579 |    "cell_type": "markdown",
580 |    "id": "719282b4",
581 |    "metadata": {},
582 |    "source": [
583 |     "## Beyond Bounded Intensity Matrices\n",
584 |     "\n",
585 |     "If we do run into an application where an intensity matrix $ Q $ is not\n",
586 |     "conservative, what might we expect?\n",
587 |     "\n",
588 |     "In this scenario, we can at least hope that $ Q $ is the generator of a $ C_0 $\n",
589 |     "semigroup.\n",
590 |     "\n",
591 |     "Since $ Q $ is an intensity matrix, we can be sure that this semigroup will be a\n",
592 |     "Markov semigroup.\n",
593 |     "\n",
594 |     "To know when $ Q $ will be the generator of a $ C_0 $\n",
595 |     "semigroup, we need to look to the [Hille-Yoshida\n",
596 |     "Theorem](https://en.wikipedia.org/wiki/Hille%E2%80%93Yosida_theorem) and\n",
597 |     "sufficient conditions derived from it.\n",
598 |     "\n",
599 |     "While we omit a detailed treatment, it is worth noting that the issue is\n",
600 |     "linked to explosions.\n",
601 |     "\n",
602 |     "To see the connection, recall that some initial value problems do not lead to a\n",
603 |     "valid solution defined for all $ t \\in \\RR_+ $.\n",
604 |     "\n",
605 |     "An example is the scalar problem $ x'_t = 1 + x_t^2 $, which has solution $ x_t =\n",
606 |     "\\tan (t - c) $ for some constant $ c $.\n",
607 |     "\n",
608 |     "But this solution equals $ +\\infty $ for $ t \\geq c + \\pi/2 $.\n",
609 |     "\n",
610 |     "The problem is that the time path explodes to infinity in finite time.\n",
611 |     "\n",
612 |     "The same issue can occur for Markov processes, if jump rates grow sufficiently\n",
613 |     "quickly.\n",
614 |     "\n",
615 |     "For more discussion, see, for example, Section 2.7 of [[Norris, 1998](https://continuous-time-mcs.quantecon.org/zreferences.html#id12)]."
616 |    ]
617 |   },
618 |   {
619 |    "cell_type": "markdown",
620 |    "id": "3b27dcb0",
621 |    "metadata": {},
622 |    "source": [
623 |     "## Exercises"
624 |    ]
625 |   },
626 |   {
627 |    "cell_type": "markdown",
628 |    "id": "91fabcd9",
629 |    "metadata": {},
630 |    "source": [
631 |     "## Exercise 7.1\n",
632 |     "\n",
633 |     "Let $ P $ be a Markov matrix on $ S $ and identify it with the linear operator in\n",
634 |     "[(7.1)](#equation-mmismo).  Verify the claims in [(7.2)](#equation-propp)."
635 |    ]
636 |   },
637 |   {
638 |    "cell_type": "markdown",
639 |    "id": "f40eeff4",
640 |    "metadata": {},
641 |    "source": [
642 |     "## Solution to[ Exercise 7.1](https://continuous-time-mcs.quantecon.org/#uc-mc-semigroups-ex-1)\n",
643 |     "\n",
644 |     "To determine the norm of $ P $, we use the definition in [(6.2)](https://continuous-time-mcs.quantecon.org/generators.html#equation-norml).\n",
645 |     "\n",
646 |     "If $ f \\in \\ell_1 $ and $ \\| f \\| \\leq 1 $, then\n",
647 |     "\n",
648 |     "$$\n",
649 |     "\\| f P \\| \n",
650 |     "\\leq \\sum_y \\sum_x |f(x)| P(x, y)\n",
651 |     "= \\sum_x |f(x)| \\sum_y P(x, y)\n",
652 |     "= \\sum_x |f(x)| \n",
653 |     "= \\| f \\|\n",
654 |     "$$\n",
655 |     "\n",
656 |     "Hence $ \\| P \\| \\leq 1 $.\n",
657 |     "\n",
658 |     "To see that equality holds we can repeat this argument with $ f \\geq 0 $,\n",
659 |     "obtaining $ \\| fP \\| = \\|f\\| $.\n",
660 |     "\n",
661 |     "Now pick any $ \\phi \\in \\dD $.\n",
662 |     "\n",
663 |     "Clearly $ \\phi P \\geq 0 $, and\n",
664 |     "\n",
665 |     "$$\n",
666 |     "\\sum_y (\\phi P)(y)\n",
667 |     "=\\sum_y \\sum_x \\phi (x) P(x, y)\n",
668 |     "=\\sum_x \\phi (x) \\sum_y P(x, y)\n",
669 |     "= 1\n",
670 |     "$$\n",
671 |     "\n",
672 |     "Hence $ \\phi P \\in \\dD $ as claimed."
673 |    ]
674 |   },
675 |   {
676 |    "cell_type": "markdown",
677 |    "id": "5e575c06",
678 |    "metadata": {},
679 |    "source": [
680 |     "## Exercise 7.2\n",
681 |     "\n",
682 |     "Prove the claim in [Lemma 7.1](#scintcon)."
683 |    ]
684 |   },
685 |   {
686 |    "cell_type": "markdown",
687 |    "id": "4e80ee3a",
688 |    "metadata": {},
689 |    "source": [
690 |     "## Solution to[ Exercise 7.2](https://continuous-time-mcs.quantecon.org/#uc-mc-semigroups-ex-2)\n",
691 |     "\n",
692 |     "Here is one solution.\n",
693 |     "\n",
694 |     "Let $ Q $ be an intensity matrix on $ S $.\n",
695 |     "\n",
696 |     "Suppose first that $ m := \\sup_x |Q(x,x)| $ is finite.\n",
697 |     "\n",
698 |     "Set $ \\hat P := I + Q / m $.\n",
699 |     "\n",
700 |     "It is not hard to check that $ \\hat P $ is a Markov matrix and that $ Q = m( \\hat\n",
701 |     "P - I) $.\n",
702 |     "\n",
703 |     "Since $ \\hat P $ is a Markov matrix, it induces a bounded linear operator on\n",
704 |     "$ \\ell_1 $ via [(7.1)](#equation-mmismo).\n",
705 |     "\n",
706 |     "As $ \\lL(\\ell_1) $ is a linear space, we see that $ Q $ is likewise in\n",
707 |     "$ \\lL(\\ell_1) $.\n",
708 |     "\n",
709 |     "In particular, $ Q $ is a bounded operator, and hence conservative.\n",
710 |     "\n",
711 |     "Next, suppose that $ Q $ is conservative and yet $ \\sup_x |Q(x,x)| $ is infinite.\n",
712 |     "\n",
713 |     "Choose $ x \\in S $ such that $ |Q(x, x)| > \\| Q\\| $\n",
714 |     "\n",
715 |     "Let $ f \\in \\ell_1 $ be defined by $ f(z) = \\mathbb 1\\{z = x\\} $.\n",
716 |     "\n",
717 |     "Since $ \\|f\\| = 1 $, we have\n",
718 |     "\n",
719 |     "$$\n",
720 |     "\\| Q \\| \n",
721 |     "\\geq \\| f Q \\|\n",
722 |     "= \\sum_y \\left| \\sum_z f(z) Q(z, y) \\right|\n",
723 |     "= \\sum_y | Q(x, y) |\n",
724 |     "\\geq | Q(x, x) |\n",
725 |     "$$\n",
726 |     "\n",
727 |     "Contradiction."
728 |    ]
729 |   },
730 |   {
731 |    "cell_type": "markdown",
732 |    "id": "88dfa2a6",
733 |    "metadata": {},
734 |    "source": [
735 |     "## Exercise 7.3\n",
736 |     "\n",
737 |     "Confirm that $ Q $ defined in [(7.5)](#equation-poissonq) induces a bounded linear operator on\n",
738 |     "$ \\ell_1 $ via [(7.3)](#equation-imislo)."
739 |    ]
740 |   },
741 |   {
742 |    "cell_type": "markdown",
743 |    "id": "332c0eb3",
744 |    "metadata": {},
745 |    "source": [
746 |     "## Solution to[ Exercise 7.3](https://continuous-time-mcs.quantecon.org/#uc-mc-semigroups-ex-3)\n",
747 |     "\n",
748 |     "Linearity is obvious so we focus on boundedness.\n",
749 |     "\n",
750 |     "For any $ f \\in \\ell_1 $ and this choice of $ Q $, we have\n",
751 |     "\n",
752 |     "$$\n",
753 |     "\\sum_y |(fQ)(y)|\n",
754 |     "\\leq \\sum_y \\sum_x |f(x) Q(x, y)|\n",
755 |     "\\leq \\lambda \\sum_y \\sum_x |f(y) - f(y+1)|\n",
756 |     "$$\n",
757 |     "\n",
758 |     "Applying the triangle inequality, we see that the right hand side is dominated\n",
759 |     "by $ 2 \\lambda \\| f\\| $.\n",
760 |     "\n",
761 |     "Hence $ \\| fQ \\| \\leq 2 \\lambda \\|f\\| $, which implies that $ Q \\in \\lL(\\ell_1) $\n",
762 |     "as required."
763 |    ]
764 |   },
765 |   {
766 |    "cell_type": "markdown",
767 |    "id": "683dd7b4",
768 |    "metadata": {},
769 |    "source": [
770 |     "## Exercise 7.4\n",
771 |     "\n",
772 |     "Let $ K $ be defined on $ \\ZZ_+ \\times \\ZZ_+ $ by $ K(i, j) = \\mathbb 1\\{j = i + 1\\} $.\n",
773 |     "\n",
774 |     "Show that, with $ K^m $ representing the $ m $-th matrix product of $ K $ with itself,\n",
775 |     "we have $ K^m(i, j) = \\mathbb 1\\{j = i + m\\} $ for any $ i, j \\in \\ZZ_+ $."
776 |    ]
777 |   },
778 |   {
779 |    "cell_type": "markdown",
780 |    "id": "f4f1ea9a",
781 |    "metadata": {},
782 |    "source": [
783 |     "## Solution to[ Exercise 7.4](https://continuous-time-mcs.quantecon.org/#uc-mc-semigroups-ex-4)\n",
784 |     "\n",
785 |     "The statement $ K^m(i, j) = \\mathbb 1\\{j = i + m\\} $ holds by definition when\n",
786 |     "$ m=1 $.\n",
787 |     "\n",
788 |     "Now suppose it holds at arbitrary $ m $.\n",
789 |     "\n",
790 |     "We then have, by definition of composition (matrix multiplication),\n",
791 |     "\n",
792 |     "$$\n",
793 |     "K^{m+1}(i, j)\n",
794 |     "    = \\sum_n K(i, n) K^m (n, j)\n",
795 |     "    = \\sum_n K(i, n) \\mathbb 1\\{j = n + m\\}\n",
796 |     "    = K(i, j-m)\n",
797 |     "$$\n",
798 |     "\n",
799 |     "Applying the definition $ K(i, j) = \\mathbb 1\\{j = i + 1\\} $ completes verification of the claim."
800 |    ]
801 |   },
802 |   {
803 |    "cell_type": "markdown",
804 |    "id": "b09cb1dc",
805 |    "metadata": {},
806 |    "source": [
807 |     "## Exercise 7.5\n",
808 |     "\n",
809 |     "Let $ Q $ be any intensity matrix on $ S $.\n",
810 |     "\n",
811 |     "Prove that the jump chain decomposition of $ Q $ is in fact a jump chain pair.\n",
812 |     "\n",
813 |     "Prove that, in addition, this decomposition $ (\\lambda, K) $ satisfies [(7.7)](#equation-jcinmat)."
814 |    ]
815 |   },
816 |   {
817 |    "cell_type": "markdown",
818 |    "id": "3d3ec6fe",
819 |    "metadata": {},
820 |    "source": [
821 |     "## Solution to[ Exercise 7.5](https://continuous-time-mcs.quantecon.org/#uc-mc-semigroups-ex-5)\n",
822 |     "\n",
823 |     "Let $ Q $ be an intensity matrix and let $ (\\lambda, K) $ be the jump chain\n",
824 |     "decomposition of $ Q $.\n",
825 |     "\n",
826 |     "Nonnegativity of $ \\lambda $ is immediate from the definition of an intensity\n",
827 |     "matrix.\n",
828 |     "\n",
829 |     "To see that $ K $ is a Markov matrix we fix $ x \\in S $ and suppose first that\n",
830 |     "$ \\lambda(x) > 0 $.\n",
831 |     "\n",
832 |     "Then\n",
833 |     "\n",
834 |     "$$\n",
835 |     "\\sum_y K(x, y) \n",
836 |     "= \\sum_{y \\not= x} K(x,y) \n",
837 |     "= \\sum_{y \\not= x} \\frac{Q(x,y)}{\\lambda(x)}\n",
838 |     "= 1\n",
839 |     "$$\n",
840 |     "\n",
841 |     "If, on the other hand, $ \\lambda(x) = 0 $, then\n",
842 |     "$ \\sum_y K(x, y) = 1 $, is immediate from the definition.\n",
843 |     "\n",
844 |     "As $ K $ is nonnegative, we see that $ K $ is a Markov matrix.\n",
845 |     "\n",
846 |     "Thus, $ (\\lambda, K) $ is a valid jump chain pair.\n",
847 |     "\n",
848 |     "The proof that $ Q $ and $ (\\lambda, K) $ satisfy [(7.7)](#equation-jcinmat) is mechanical and\n",
849 |     "the details are omitted.\n",
850 |     "\n",
851 |     "(Try working case-by-case, with $ \\lambda(x) = 0, x=y $, $ \\lambda(x) > 0, x=y $, etc.)\n",
852 |     "\n",
853 |     "<p><a id=fnim href=#fnim-link><strong>[1]</strong></a> Previously, we introduced the notion of an intensity matrix when $ S $\n",
854 |     "is finite and the definition is essentially unchanged in the current\n",
855 |     "setting.  In particular, $ Q \\colon S \\times S \\to \\RR $ is called an\n",
856 |     "**intensity matrix** if $ Q $ has zero row sums and $ Q(x, y) \\geq 0 $ whenever\n",
857 |     "$ x \\not= y $."
858 |    ]
859 |   }
860 |  ],
861 |  "metadata": {
862 |   "date": 1763463650.0713534,
863 |   "filename": "uc_mc_semigroups.md",
864 |   "kernelspec": {
865 |    "display_name": "Python",
866 |    "language": "python3",
867 |    "name": "python3"
868 |   },
869 |   "title": "UC Markov Semigroups"
870 |  },
871 |  "nbformat": 4,
872 |  "nbformat_minor": 5
873 | }


--------------------------------------------------------------------------------
/ergodicity.ipynb:
--------------------------------------------------------------------------------
  1 | {
  2 |  "cells": [
  3 |   {
  4 |    "cell_type": "markdown",
  5 |    "id": "b7cd6954",
  6 |    "metadata": {},
  7 |    "source": [
  8 |     "$$\n",
  9 |     "\\newcommand{\\Exp}{\\operatorname{Exp}}\n",
 10 |     "\\newcommand{\\Binomial}{\\operatorname{Binomial}}\n",
 11 |     "\\newcommand{\\Poisson}{\\operatorname{Poisson}}\n",
 12 |     "\\newcommand{\\BB}{\\mathbb{B}}\n",
 13 |     "\\newcommand{\\EE}{\\mathbb{E}}\n",
 14 |     "\\newcommand{\\PP}{\\mathbb{P}}\n",
 15 |     "\\newcommand{\\RR}{\\mathbb{R}}\n",
 16 |     "\\newcommand{\\NN}{\\mathbb{N}}\n",
 17 |     "\\newcommand{\\ZZ}{\\mathbb{Z}}\n",
 18 |     "\\newcommand{\\dD}{\\mathcal{D}}\n",
 19 |     "\\newcommand{\\fF}{\\mathcal{F}}\n",
 20 |     "\\newcommand{\\lL}{\\mathcal{L}}\n",
 21 |     "\\newcommand{\\linop}{\\mathcal{L}(\\mathbb{B})}\n",
 22 |     "\\newcommand{\\linopell}{\\mathcal{L}(\\ell_1)}\n",
 23 |     "$$"
 24 |    ]
 25 |   },
 26 |   {
 27 |    "cell_type": "markdown",
 28 |    "id": "31de71ab",
 29 |    "metadata": {},
 30 |    "source": [
 31 |     "# Stationarity and Ergodicity\n",
 32 |     "\n",
 33 |     "In addition to what’s in Anaconda, this lecture will need the following libraries:"
 34 |    ]
 35 |   },
 36 |   {
 37 |    "cell_type": "code",
 38 |    "execution_count": null,
 39 |    "id": "79373485",
 40 |    "metadata": {
 41 |     "hide-output": false
 42 |    },
 43 |    "outputs": [],
 44 |    "source": [
 45 |     "!pip install quantecon"
 46 |    ]
 47 |   },
 48 |   {
 49 |    "cell_type": "markdown",
 50 |    "id": "1b29116c",
 51 |    "metadata": {},
 52 |    "source": [
 53 |     "## Overview\n",
 54 |     "\n",
 55 |     "In this lecture we discuss stability and equilibrium behavior for continuous\n",
 56 |     "time Markov chains.\n",
 57 |     "\n",
 58 |     "To give one example of why this theory matters, consider queues, which are\n",
 59 |     "often modeled as continuous time Markov chains.\n",
 60 |     "\n",
 61 |     "Queueing theory is used in applications such as\n",
 62 |     "\n",
 63 |     "- treatment of patients arriving at a hospital  \n",
 64 |     "- optimal design of manufacturing processes  \n",
 65 |     "- requests to a file server  \n",
 66 |     "- air traffic  \n",
 67 |     "- customers waiting on a helpline  \n",
 68 |     "\n",
 69 |     "\n",
 70 |     "A key topic in queueing theory is average behavior over the long run.\n",
 71 |     "\n",
 72 |     "- Will the length of the queue grow without bounds?  \n",
 73 |     "- If not, is there some kind of long run equilibrium?  \n",
 74 |     "- If so, what is the average waiting time in this equilibrium?  \n",
 75 |     "- What is the average length of the queue over a week, or a month?  \n",
 76 |     "\n",
 77 |     "\n",
 78 |     "We will use the following imports"
 79 |    ]
 80 |   },
 81 |   {
 82 |    "cell_type": "code",
 83 |    "execution_count": null,
 84 |    "id": "7de6aa0d",
 85 |    "metadata": {
 86 |     "hide-output": false
 87 |    },
 88 |    "outputs": [],
 89 |    "source": [
 90 |     "import numpy as np\n",
 91 |     "import scipy as sp\n",
 92 |     "import matplotlib.pyplot as plt\n",
 93 |     "import quantecon as qe\n",
 94 |     "from numba import njit\n",
 95 |     "from scipy.linalg import expm\n",
 96 |     "\n",
 97 |     "from matplotlib import cm\n",
 98 |     "from mpl_toolkits.mplot3d import Axes3D\n",
 99 |     "from mpl_toolkits.mplot3d.art3d import Poly3DCollection"
100 |    ]
101 |   },
102 |   {
103 |    "cell_type": "markdown",
104 |    "id": "27b93ca9",
105 |    "metadata": {},
106 |    "source": [
107 |     "## Stationary Distributions"
108 |    ]
109 |   },
110 |   {
111 |    "cell_type": "markdown",
112 |    "id": "96a7c81b",
113 |    "metadata": {},
114 |    "source": [
115 |     "### Definition\n",
116 |     "\n",
117 |     "Let $ S $ be countable.\n",
118 |     "\n",
119 |     "Recall that, for a discrete time Markov chain with Markov matrix $ P $ on $ S $, a\n",
120 |     "distribution $ \\psi $ is called stationary for $ P $ if $ \\psi P = \\psi $.\n",
121 |     "\n",
122 |     "This means that if $ X_t $ has distribution $ \\psi $, then so does $ X_{t+1} $.\n",
123 |     "\n",
124 |     "For continuous time Markov chains, the definition is analogous.\n",
125 |     "\n",
126 |     "Given a Markov semigroup $ (P_t) $ on $ S $, a distribution $ \\psi^* \\in \\dD $ is called\n",
127 |     "**stationary** for $ (P_t) $ if\n",
128 |     "\n",
129 |     "$$\n",
130 |     "\\psi^* P_t = \\psi^* \n",
131 |     "    \\text{ for all } t \\geq 0\n",
132 |     "$$\n",
133 |     "\n",
134 |     "As one example, we look [again](https://continuous-time-mcs.quantecon.org/kolmogorov_fwd.html#solvode) at the chain on $ S = \\{0, 1,\n",
135 |     "2\\} $ with intensity matrix"
136 |    ]
137 |   },
138 |   {
139 |    "cell_type": "code",
140 |    "execution_count": null,
141 |    "id": "f30bf8b0",
142 |    "metadata": {
143 |     "hide-output": false
144 |    },
145 |    "outputs": [],
146 |    "source": [
147 |     "Q = ((-3, 2, 1),\n",
148 |     "     (3, -5, 2),\n",
149 |     "     (4, 6, -10))"
150 |    ]
151 |   },
152 |   {
153 |    "cell_type": "markdown",
154 |    "id": "1fdc89e2",
155 |    "metadata": {},
156 |    "source": [
157 |     "The following figure was shown before, except that now there is a black dot\n",
158 |     "that the three trajectories seem to be converging to.\n",
159 |     "\n",
160 |     "(Recall that, in the color scheme, trajectories cool as time evolves.)"
161 |    ]
162 |   },
163 |   {
164 |    "cell_type": "code",
165 |    "execution_count": null,
166 |    "id": "193085a1",
167 |    "metadata": {
168 |     "hide-output": false
169 |    },
170 |    "outputs": [],
171 |    "source": [
172 |     "def unit_simplex(angle):\n",
173 |     "    \n",
174 |     "    fig = plt.figure(figsize=(8, 6))\n",
175 |     "    ax = fig.add_subplot(111, projection='3d')\n",
176 |     "\n",
177 |     "    vtx = [[0, 0, 1],\n",
178 |     "           [0, 1, 0], \n",
179 |     "           [1, 0, 0]]\n",
180 |     "    \n",
181 |     "    tri = Poly3DCollection([vtx], color='darkblue', alpha=0.3)\n",
182 |     "    tri.set_facecolor([0.5, 0.5, 1])\n",
183 |     "    ax.add_collection3d(tri)\n",
184 |     "\n",
185 |     "    ax.set(xlim=(0, 1), ylim=(0, 1), zlim=(0, 1), \n",
186 |     "           xticks=(1,), yticks=(1,), zticks=(1,))\n",
187 |     "\n",
188 |     "    ax.set_xticklabels(['$(1, 0, 0)$'], fontsize=12)\n",
189 |     "    ax.set_yticklabels(['$(0, 1, 0)$'], fontsize=12)\n",
190 |     "    ax.set_zticklabels(['$(0, 0, 1)$'], fontsize=12)\n",
191 |     "\n",
192 |     "    ax.xaxis.majorTicks[0].set_pad(15)\n",
193 |     "    ax.yaxis.majorTicks[0].set_pad(15)\n",
194 |     "    ax.zaxis.majorTicks[0].set_pad(35)\n",
195 |     "\n",
196 |     "    ax.view_init(30, angle)\n",
197 |     "\n",
198 |     "    # Move axis to origin\n",
199 |     "    ax.xaxis._axinfo['juggled'] = (0, 0, 0)\n",
200 |     "    ax.yaxis._axinfo['juggled'] = (1, 1, 1)\n",
201 |     "    ax.zaxis._axinfo['juggled'] = (2, 2, 0)\n",
202 |     "    \n",
203 |     "    ax.grid(False)\n",
204 |     "    \n",
205 |     "    return ax\n",
206 |     "\n",
207 |     "Q = np.array(Q)\n",
208 |     "ψ_00 = np.array((0.01, 0.01, 0.99))\n",
209 |     "ψ_01 = np.array((0.01, 0.99, 0.01))\n",
210 |     "ψ_02 = np.array((0.99, 0.01, 0.01))\n",
211 |     "\n",
212 |     "ax = unit_simplex(angle=50)    \n",
213 |     "\n",
214 |     "def flow_plot(ψ, h=0.001, n=300, angle=50):\n",
215 |     "    colors = cm.jet_r(np.linspace(0.0, 1, n))\n",
216 |     "\n",
217 |     "    x_vals, y_vals, z_vals = [], [], []\n",
218 |     "    for t in range(n):\n",
219 |     "        x_vals.append(ψ[0])\n",
220 |     "        y_vals.append(ψ[1])\n",
221 |     "        z_vals.append(ψ[2])\n",
222 |     "        ψ = ψ @ expm(h * Q)\n",
223 |     "\n",
224 |     "    ax.scatter(x_vals, y_vals, z_vals, c=colors, s=20, alpha=0.2, depthshade=False)\n",
225 |     "\n",
226 |     "flow_plot(ψ_00)\n",
227 |     "flow_plot(ψ_01)\n",
228 |     "flow_plot(ψ_02)\n",
229 |     "\n",
230 |     "# Add stationary distribution\n",
231 |     "P_1 = expm(Q)\n",
232 |     "mc = qe.MarkovChain(P_1)\n",
233 |     "ψ = mc.stationary_distributions[0]\n",
234 |     "ax.scatter(ψ[0], ψ[1], ψ[2], c='k', s=30, depthshade=False)\n",
235 |     "\n",
236 |     "plt.show()"
237 |    ]
238 |   },
239 |   {
240 |    "cell_type": "markdown",
241 |    "id": "0df40267",
242 |    "metadata": {},
243 |    "source": [
244 |     "This black dot is the stationary distribution $ \\psi^* $ of the Markov semigroup $ (P_t) $ generated by $ Q $.\n",
245 |     "\n",
246 |     "It was calculated using the `stationary_distributions` attribute of QuantEcon’s\n",
247 |     "[`MarkovChain` class](https://quanteconpy.readthedocs.io/en/latest/markov.html), by arbitrarily setting $ t=1 $ and solving $ \\psi P_1 = \\psi $.\n",
248 |     "\n",
249 |     "Below we show that, for this choice of $ Q $, the stationary distribution\n",
250 |     "$ \\psi^* $ is unique in $ \\dD $, due to irreducibility.\n",
251 |     "\n",
252 |     "Moreover, $ \\psi P_t \\to \\psi^* $ as $ t \\to \\infty $ for any $ \\psi \\in \\dD $, as\n",
253 |     "suggested by the figure."
254 |    ]
255 |   },
256 |   {
257 |    "cell_type": "markdown",
258 |    "id": "c31cb4cc",
259 |    "metadata": {},
260 |    "source": [
261 |     "### Stationarity via the Generator\n",
262 |     "\n",
263 |     "In many cases, it is easier to use the generator of the semigroup to identify\n",
264 |     "stationary distributions rather than the semigroup itself.\n",
265 |     "\n",
266 |     "This is analogous to the idea that a point $ \\bar x $ in $ \\RR^d $ is stationary for\n",
267 |     "a vector ODE $ x'_t = F(x_t) $ when $ F(\\bar x) = 0 $.\n",
268 |     "\n",
269 |     "(Here $ F $ is the infinitesimal description, and hence analogous to the generator.)\n",
270 |     "\n",
271 |     "The next result holds true under weaker conditions but the version stated here\n",
272 |     "is easy to prove and sufficient for applications we consider."
273 |    ]
274 |   },
275 |   {
276 |    "cell_type": "markdown",
277 |    "id": "e72d2c9a",
278 |    "metadata": {},
279 |    "source": [
280 |     "**Theorem 8.1**\n",
281 |     "\n",
282 |     "Let $ (P_t) $ be a UC Markov semigroup with generator $ Q $.  A distribution\n",
283 |     "$ \\psi $ on $ S $ is stationary for $ (P_t) $ if and only if $ \\psi Q = 0 $."
284 |    ]
285 |   },
286 |   {
287 |    "cell_type": "markdown",
288 |    "id": "a581d899",
289 |    "metadata": {},
290 |    "source": [
291 |     "**Proof.**\n",
292 |     "\n",
293 |     "Proof. Fix $ \\psi \\in \\dD $ and suppose first that $ \\psi Q = 0 $.\n",
294 |     "\n",
295 |     "Since $ (P_t) $ is a UC Markov semigroup, we have $ P_t = e^{tQ} $ for all $ t $,\n",
296 |     "and hence, for any $ t \\geq 0 $,\n",
297 |     "\n",
298 |     "$$\n",
299 |     "\\psi e^{tQ} = \\psi + t \\psi Q + t^2 \\frac{\\psi Q^2}{2!} \n",
300 |     "    + \\cdots\n",
301 |     "$$\n",
302 |     "\n",
303 |     "From $ \\psi Q = 0 $ we get $ \\psi Q^k = 0 $ for all $ k \\in \\NN $, so the last\n",
304 |     "display yields $ \\psi P_t = \\psi $.\n",
305 |     "\n",
306 |     "Hence $ \\psi $ is stationary for $ (P_t) $.\n",
307 |     "\n",
308 |     "Now suppose that $ \\psi $ is stationary for $ (P_t) $ and set $ D_t := (1/t) (P_t -\n",
309 |     "I) $.\n",
310 |     "\n",
311 |     "From the triangle inequality and the definition of the operator norm, for any given $ t $,\n",
312 |     "\n",
313 |     "$$\n",
314 |     "\\| \\psi Q \\| \n",
315 |     "    \\leq \\| \\psi (Q - D_t) \\| + \\| \\psi D_t \\|\n",
316 |     "    \\leq \\| Q - D_t \\| + \\| \\psi D_t \\|\n",
317 |     "$$\n",
318 |     "\n",
319 |     "Since $ (P_t) $ is UC and $ Q $ is its generator, we have $ \\| D_t - Q \\| \\to 0 $ in\n",
320 |     "$ \\lL(\\ell_1) $ as $ t \\to 0^+ $.\n",
321 |     "\n",
322 |     "Hence $ \\| \\psi Q \\| \\leq \\liminf_{t \\downarrow 0} \\| \\psi D_t \\| $.\n",
323 |     "\n",
324 |     "As $ \\psi $ is stationary for $ (P_t) $, we have $ \\psi D_t = 0 $ for all $ t $.\n",
325 |     "\n",
326 |     "Hence $ \\psi Q = 0 $, as was to be shown."
327 |    ]
328 |   },
329 |   {
330 |    "cell_type": "markdown",
331 |    "id": "f7f4bbb7",
332 |    "metadata": {},
333 |    "source": [
334 |     "## Irreducibility and Uniqueness\n",
335 |     "\n",
336 |     "Let $ (P_t) $ be a Markov semigroup on $ S $ and consider arbitrary states $ x, y \\in S $.\n",
337 |     "\n",
338 |     "We say that state $ y $ is **accessible** from state $ x $ if\n",
339 |     "there exists a $ t \\geq 0 $ such that $ P_t(x, y) > 0 $.\n",
340 |     "\n",
341 |     "We say that $ x $ and $ y $ **communicate** if $ x $ is accessible from $ y $ and $ y $\n",
342 |     "is accessible from $ x $.\n",
343 |     "\n",
344 |     "A Markov semigroup $ (P_t) $ on $ S $ is called **irreducible** if\n",
345 |     "every pair $ x,y $ in $ S $ communicates.\n",
346 |     "\n",
347 |     "We seek a characterization of irreducibility of $ (P_t) $ in terms of its\n",
348 |     "generator.\n",
349 |     "\n",
350 |     "As a first step, we will say there is a **$ Q $-positive probability flow** from $ x $\n",
351 |     "to $ y $ if there exists a finite sequence $ (z_i)_{i=0}^m $ in $ S $ starting at\n",
352 |     "$ x=z_0 $ and ending at $ y=z_m $ such that $ Q(z_i, z_{i+1}) > 0 $ for all $ i $."
353 |    ]
354 |   },
355 |   {
356 |    "cell_type": "markdown",
357 |    "id": "4c015f31",
358 |    "metadata": {},
359 |    "source": [
360 |     "**Theorem 8.2**\n",
361 |     "\n",
362 |     "Let $ (P_t) $ be a UC Markov semigroup with generator $ Q $.\n",
363 |     "For distinct states $ x $ and $ y $, the following statements are equivalent:\n",
364 |     "\n",
365 |     "1. The state $ y $ is accessible from $ x $ under $ (P_t) $.  \n",
366 |     "1. There is a $ Q $-positive probability flow from $ x $ to $ y $.  \n",
367 |     "1. $ P_t(x, y) > 0 $ for all $ t > 0 $.  "
368 |    ]
369 |   },
370 |   {
371 |    "cell_type": "markdown",
372 |    "id": "5afbfd5a",
373 |    "metadata": {},
374 |    "source": [
375 |     "**Proof.**\n",
376 |     "\n",
377 |     "Proof. Pick any two distinct states $ x $ and $ y $.\n",
378 |     "\n",
379 |     "It is obvious that statement 3 implies statement 1, so we need only prove\n",
380 |     "(1 $ \\implies $ 2) and (2 $ \\implies $ 3).\n",
381 |     "\n",
382 |     "Starting with (1 $ \\implies $ 2), recall that\n",
383 |     "\n",
384 |     "\n",
385 |     "<a id='equation-ptexpan'></a>\n",
386 |     "$$\n",
387 |     "P_t(x, y) = t Q(x,y) + \\frac{t^2}{2!} Q^2(x, y) + \\cdots \\tag{8.1}\n",
388 |     "$$\n",
389 |     "\n",
390 |     "If $ x $ is accessible from $ y $, then $ P_t(x, y) > 0 $ for some $ t > 0 $, so\n",
391 |     "$ Q^k(x,y) > 0 $ for at least one $ k \\in \\NN $.\n",
392 |     "\n",
393 |     "Writing out the matrix product as a sum, we now have\n",
394 |     "\n",
395 |     "\n",
396 |     "<a id='equation-qkassum'></a>\n",
397 |     "$$\n",
398 |     "Q^k(x,y) =\n",
399 |     "    \\sum_{z_1}\n",
400 |     "    \\sum_{z_2}\n",
401 |     "    \\cdots\n",
402 |     "    \\sum_{z_{k-1}}\n",
403 |     "        Q(x, z_1) Q(z_1, z_2) \\cdots Q(z_{k-1}, y) \n",
404 |     "        > 0 \\tag{8.2}\n",
405 |     "$$\n",
406 |     "\n",
407 |     "It follows that at least one element of the sum must be strictly positive.\n",
408 |     "\n",
409 |     "Therefore, a $ Q $-positive probability flow from $ x $ to $ y $ exists.\n",
410 |     "\n",
411 |     "Turning to (2 $ \\implies $ 3), first note that, for arbitrary states $ u $ and $ v $, if $ Q(u, v) > 0 $ then $ P_t(u, v) > 0 $ for all $ t > 0 $.\n",
412 |     "\n",
413 |     "To see this, let $ (\\lambda, K) $ be the jump chain pair constructed from $ Q $ via\n",
414 |     "[(7.8)](https://continuous-time-mcs.quantecon.org/uc_mc_semigroups.html#equation-lambdafromq), [(7.9)](https://continuous-time-mcs.quantecon.org/uc_mc_semigroups.html#equation-kfromqxx) and [(7.10)](https://continuous-time-mcs.quantecon.org/uc_mc_semigroups.html#equation-kfromqxy).\n",
415 |     "\n",
416 |     "Observe that, since $ Q(u, v) > 0 $, we must have $ \\lambda(u) > 0 $.\n",
417 |     "\n",
418 |     "As a consequence, applying [(7.10)](https://continuous-time-mcs.quantecon.org/uc_mc_semigroups.html#equation-kfromqxy), we have\n",
419 |     "\n",
420 |     "$$\n",
421 |     "K(u, v) = \\frac{Q(u, v)}{\\lambda(u)} > 0\n",
422 |     "$$\n",
423 |     "\n",
424 |     "Let $ (Y_k) $ and $ (J_k) $ be the embedded jump chain and jump sequence generated\n",
425 |     "by [Algorithm 4.1](https://continuous-time-mcs.quantecon.org/kolmogorov_bwd.html#ejc_algo), with $ Y_0 = u $.\n",
426 |     "\n",
427 |     "With $ E_1 \\sim \\Exp(1) $ and $ E_2 \\sim \\Exp(1) $, we have, for any $ t > 0 $,\n",
428 |     "\n",
429 |     "$$\n",
430 |     "\\begin{aligned}\n",
431 |     "    P_t(u, v) \n",
432 |     "    & \\geq \\PP \\{J_1 \\leq t, \\, Y_1 = v, \\, J_2 > t \\}\n",
433 |     "    \\\\\n",
434 |     "    & \\geq \\PP \\{E_1 \\leq t\\lambda(u), \\, E_2 > t\\lambda(v) \\} \\PP\\{ Y_1 = v  \\}\n",
435 |     "    \\\\\n",
436 |     "    & = \\PP \\{E_1 \\leq t\\lambda(u)\\} \n",
437 |     "        \\PP \\{ E_2 > t\\lambda(v) \\} K(u, v) \n",
438 |     "    \\\\\n",
439 |     "    & > 0\n",
440 |     "\\end{aligned}\n",
441 |     "$$\n",
442 |     "\n",
443 |     "Now suppose there is a $ Q $-positive probability flow $ (z_i)_{i=0}^m $ from $ x $\n",
444 |     "to $ y $.\n",
445 |     "\n",
446 |     "If we fix $ t > 0 $ and repeatedly apply [(3.7)](https://continuous-time-mcs.quantecon.org/markov_prop.html#equation-chapkol-ct2) along with the last\n",
447 |     "result, we obtain\n",
448 |     "\n",
449 |     "$$\n",
450 |     "P_t(x, y)\n",
451 |     "    \\geq\n",
452 |     "    \\prod_{i=0}^{m-1} P_{t/m} (z_i, z_{i+1}) > 0\n",
453 |     "$$"
454 |    ]
455 |   },
456 |   {
457 |    "cell_type": "markdown",
458 |    "id": "432b4a05",
459 |    "metadata": {},
460 |    "source": [
461 |     "[Theorem 8.2](#equivirr) leads directly to the following strong result."
462 |    ]
463 |   },
464 |   {
465 |    "cell_type": "markdown",
466 |    "id": "ead5f1c7",
467 |    "metadata": {},
468 |    "source": [
469 |     "**Corollary 8.1**\n",
470 |     "\n",
471 |     "For a UC Markov semigroup $ (P_t) $, the following statements are\n",
472 |     "equivalent:\n",
473 |     "\n",
474 |     "1. $ (P_t) $ is irreducible.  \n",
475 |     "1. $ P_t(x,y) > 0 $ for all $ t > 0 $ and all $ (x,y) \\in S \\times S $.  "
476 |    ]
477 |   },
478 |   {
479 |    "cell_type": "markdown",
480 |    "id": "560a76a5",
481 |    "metadata": {},
482 |    "source": [
483 |     ">**Note**\n",
484 |     ">\n",
485 |     ">To obtain stable long run behavior in discrete time Markov chains, it is\n",
486 |     "common to assume that the chain is aperiodic.\n",
487 |     "\n",
488 |     "This needs to be assumed on top of irreducibility if one wishes to rule out\n",
489 |     "all dependence on initial conditions.\n",
490 |     "\n",
491 |     "[Corollary 8.1](#perimposs) shows that periodicity is not a concern for irreducible\n",
492 |     "continuous time Markov chains.\n",
493 |     "\n",
494 |     "Positive probability flow from $ x $ to $ y $ at some $ t > 0 $ immediately implies\n",
495 |     "positive flow for all $ t > 0 $."
496 |    ]
497 |   },
498 |   {
499 |    "cell_type": "markdown",
500 |    "id": "cb0bad76",
501 |    "metadata": {},
502 |    "source": [
503 |     "## Asymptotic Stabilitiy\n",
504 |     "\n",
505 |     "We call Markov semigroup $ (P_t) $ **asymptotically stable** if $ (P_t) $ has a\n",
506 |     "unique stationary distribution $ \\psi^* $ in $ \\dD $ and\n",
507 |     "\n",
508 |     "\n",
509 |     "<a id='equation-asyms'></a>\n",
510 |     "$$\n",
511 |     "\\| \\psi P_t - \\psi^* \\| \\to 0 \\text{ as } t \\to \\infty\n",
512 |     "    \\text{ for all } \\psi \\in \\dD \\tag{8.3}\n",
513 |     "$$\n",
514 |     "\n",
515 |     "Our aim is to establish conditions for asymptotic stability of Markov\n",
516 |     "semigroups."
517 |    ]
518 |   },
519 |   {
520 |    "cell_type": "markdown",
521 |    "id": "8d4e4c0e",
522 |    "metadata": {},
523 |    "source": [
524 |     "### Contractivity\n",
525 |     "\n",
526 |     "Let’s recall some useful facts about the discrete time case.\n",
527 |     "\n",
528 |     "First, if $ P $ is any Markov matrix, we have, in the $ \\ell_1 $ norm,\n",
529 |     "\n",
530 |     "\n",
531 |     "<a id='equation-allmocontract'></a>\n",
532 |     "$$\n",
533 |     "\\| \\psi P \\| \\leq \\| \\psi \\|\n",
534 |     "    \\text{ for all } \\psi \\in \\dD \\tag{8.4}\n",
535 |     "$$\n",
536 |     "\n",
537 |     "This is because, given $ \\psi \\in \\dD $,\n",
538 |     "\n",
539 |     "$$\n",
540 |     "\\| \\psi P \\|\n",
541 |     "    = \\sum_y \\left| \\sum_x \\psi(x) P(x, y) \\right|\n",
542 |     "    \\leq \\sum_y \\sum_x | \\psi(x) | P(x, y)\n",
543 |     "    = \\| \\psi \\|\n",
544 |     "$$\n",
545 |     "\n",
546 |     "By linearity, for $ \\psi, \\phi \\in \\dD $, we then have\n",
547 |     "\n",
548 |     "$$\n",
549 |     "\\| \\psi P - \\phi P \\| \\leq \\| \\psi - \\phi \\|\n",
550 |     "$$\n",
551 |     "\n",
552 |     "Hence every Markov operator is contracting on $ \\dD $.\n",
553 |     "\n",
554 |     "Moreover, if $ P $ is everywhere positive, then this inequality is strict:"
555 |    ]
556 |   },
557 |   {
558 |    "cell_type": "markdown",
559 |    "id": "9a72ccf4",
560 |    "metadata": {},
561 |    "source": [
562 |     "**Lemma 8.1** (Strict Contractivity)\n",
563 |     "\n",
564 |     "If $ P $ is a Markov matrix and $ P(x, y) > 0 $ for all $ x, y $, then\n",
565 |     "\n",
566 |     "$$\n",
567 |     "\\| \\psi P - \\phi P \\| < \\| \\psi - \\phi \\|\n",
568 |     "    \\text{ for all } \\psi, \\phi \\in \\dD\n",
569 |     "    \\text{ with } \\psi \\not= \\phi\n",
570 |     "$$"
571 |    ]
572 |   },
573 |   {
574 |    "cell_type": "markdown",
575 |    "id": "3c34bd3e",
576 |    "metadata": {},
577 |    "source": [
578 |     "The proof follows from the strict triangle inequality, as opposed to the weak\n",
579 |     "triangle inequality we used to obtain [(8.4)](#equation-allmocontract).\n",
580 |     "\n",
581 |     "See, for example, Proposition 3.1.2 of [[Lasota and Mackey, 1994](https://continuous-time-mcs.quantecon.org/zreferences.html#id3)] or Lemma 8.2.3 of [[Stachurski, 2009](https://continuous-time-mcs.quantecon.org/zreferences.html#id4)]."
582 |    ]
583 |   },
584 |   {
585 |    "cell_type": "markdown",
586 |    "id": "7654fafc",
587 |    "metadata": {},
588 |    "source": [
589 |     "### Uniqueness\n",
590 |     "\n",
591 |     "Irreducibility of a given Markov chain implies that there are no disjoint\n",
592 |     "[absorbing sets](https://en.wikipedia.org/wiki/Absorbing_set).\n",
593 |     "\n",
594 |     "This in turn leads to uniqueness of stationary distributions:"
595 |    ]
596 |   },
597 |   {
598 |    "cell_type": "markdown",
599 |    "id": "8b00db01",
600 |    "metadata": {},
601 |    "source": [
602 |     "**Theorem 8.3**\n",
603 |     "\n",
604 |     "Let $ (P_t) $ be a UC Markov semigroup on $ S $.  If $ (P_t) $ is irreducible, then\n",
605 |     "$ (P_t) $ has at most one stationary distribution."
606 |    ]
607 |   },
608 |   {
609 |    "cell_type": "markdown",
610 |    "id": "b8c15640",
611 |    "metadata": {},
612 |    "source": [
613 |     "**Proof.**\n",
614 |     "\n",
615 |     "Proof. Suppose to the contrary that $ \\psi $ and $ \\phi $ are both stationary for\n",
616 |     "$ (P_t) $.\n",
617 |     "\n",
618 |     "Since $ (P_t) $ is irreducible, we know that $ P_1(x,y) > 0 $ for all $ x,y \\in S $.\n",
619 |     "\n",
620 |     "If $ \\psi \\not= \\phi $, then, due to positivity of $ P_1 $, the strict inequality\n",
621 |     "in [Lemma 8.1](#strictcontract) holds.\n",
622 |     "\n",
623 |     "At the same time, by stationarity, $ \\| \\psi P - \\phi P \\| = \\| \\psi - \\phi\n",
624 |     "\\| $.  Contradiction."
625 |    ]
626 |   },
627 |   {
628 |    "cell_type": "markdown",
629 |    "id": "d2093570",
630 |    "metadata": {},
631 |    "source": [
632 |     "**Example 8.1**\n",
633 |     "\n",
634 |     "An M/M/1 queue with parameters $ \\mu, \\lambda $ is a continuous time Markov chain $ (X_t) $ on $ S = \\ZZ_+ $ with with intensity matrix\n",
635 |     "\n",
636 |     "\n",
637 |     "<a id='equation-mm1q'></a>\n",
638 |     "$$\n",
639 |     "Q=\\begin{pmatrix}\n",
640 |     "    -\\lambda & \\lambda & 0 & 0 & \\cdots \\\\\n",
641 |     "    \\mu & -(\\mu + \\lambda) & \\lambda & 0 & \\cdots \\\\\n",
642 |     "    0 & \\mu & -(\\mu + \\lambda) & \\lambda & \\cdots\\\\\n",
643 |     "    \\vdots & \\vdots & \\vdots & \\vdots & \\ddots\n",
644 |     "\\end{pmatrix} \\tag{8.5}\n",
645 |     "$$\n",
646 |     "\n",
647 |     "The chain $ (X_t) $ records the length of the queue at each moment in time.\n",
648 |     "\n",
649 |     "The intensity matrix captures the idea that customers flow into the queue at\n",
650 |     "rate $ \\lambda $ and are served (and hence leave the queue) at rate $ \\mu $.\n",
651 |     "\n",
652 |     "If $ \\lambda $ and $ \\mu $ are both positive, then there is a $ Q $-positive\n",
653 |     "probability flow between any two states, in both directions, so the\n",
654 |     "corresponding semigroup $ (P_t) $ is irreducible.\n",
655 |     "\n",
656 |     "[Theorem 8.3](#uniirr) now tells us that $ (P_t) $ has at most one stationary\n",
657 |     "distribution."
658 |    ]
659 |   },
660 |   {
661 |    "cell_type": "markdown",
662 |    "id": "0e0e7351",
663 |    "metadata": {},
664 |    "source": [
665 |     "### Stability from the Skeleton\n",
666 |     "\n",
667 |     "Recall the definition of asymptotic stability given in [(8.3)](#equation-asyms).\n",
668 |     "\n",
669 |     "Analogously, we call an individual Markov operator $ P $ asymptotically stable if\n",
670 |     "$ P $ has a unique stationary distribution $ \\psi^* $ in $ \\dD $ and\n",
671 |     "$ \\psi P^n \\to \\psi^* $ as $ n \\to \\infty $ for all $ \\psi \\in \\dD $.\n",
672 |     "\n",
673 |     "The next result gives a connection between discrete and continuous stability.\n",
674 |     "\n",
675 |     "The critical ingredient linking these two concepts is the contractivity in\n",
676 |     "[(8.4)](#equation-allmocontract)."
677 |    ]
678 |   },
679 |   {
680 |    "cell_type": "markdown",
681 |    "id": "26700070",
682 |    "metadata": {},
683 |    "source": [
684 |     "**Lemma 8.2**\n",
685 |     "\n",
686 |     "Let $ (P_t) $ be a Markov semigroup. If there exists an $ s > 0 $ such that the\n",
687 |     "Markov matrix $ P_s $ is asymptotically stable, then $ (P_t) $ is asymptotically\n",
688 |     "stable with the same stationary distribution."
689 |    ]
690 |   },
691 |   {
692 |    "cell_type": "markdown",
693 |    "id": "4866a88c",
694 |    "metadata": {},
695 |    "source": [
696 |     "**Proof.**\n",
697 |     "\n",
698 |     "Proof. Let $ (P_t) $ and $ s $ be as in the statement of [Lemma 8.2](#stabskel).\n",
699 |     "\n",
700 |     "Let $ \\psi^* $ be the stationary distribution of $ P_s $.  Fix $ \\psi \\in \\dD $ and\n",
701 |     "$ \\epsilon > 0 $.\n",
702 |     "\n",
703 |     "By stability of $ P_s $, we can take an $ n \\in \\NN $ such that\n",
704 |     "$ \\| \\psi P_s^n - \\psi^* \\| < \\epsilon $.\n",
705 |     "\n",
706 |     "Pick any $ t > sn $.  Set $ h := t - sn $.\n",
707 |     "\n",
708 |     "By the contractivity in [(8.4)](#equation-allmocontract) and $ P_{sn} = P_s^n $, we have\n",
709 |     "\n",
710 |     "$$\n",
711 |     "\\| \\psi P_t - \\psi^* \\|\n",
712 |     "    = \\| \\psi P_{sn} P_h - \\psi^* P_h \\|\n",
713 |     "    \\leq \\| \\psi P_{sn} - \\psi^* \\|\n",
714 |     "    < \\epsilon\n",
715 |     "$$\n",
716 |     "\n",
717 |     "Hence asymptotic stability holds for $ (P_t) $."
718 |    ]
719 |   },
720 |   {
721 |    "cell_type": "markdown",
722 |    "id": "9f3b98ea",
723 |    "metadata": {},
724 |    "source": [
725 |     "### Stability via Drift\n",
726 |     "\n",
727 |     "In this section we address drift conditions, which are a powerful method for\n",
728 |     "obtaining asymptotic stability when the state space can be infinite.\n",
729 |     "\n",
730 |     "The idea is to show that the state tends to drift back to a finite set over\n",
731 |     "time.\n",
732 |     "\n",
733 |     "Such drift, when combined with the contractivity in\n",
734 |     "[Lemma 8.1](#strictcontract), is enough to give global stability.\n",
735 |     "\n",
736 |     "The next theorem gives a useful version of this class of results."
737 |    ]
738 |   },
739 |   {
740 |    "cell_type": "markdown",
741 |    "id": "0e7db861",
742 |    "metadata": {},
743 |    "source": [
744 |     "**Theorem 8.4**\n",
745 |     "\n",
746 |     "Let $ (P_t) $ be a UC Markov semigroup with intensity matrix $ Q $.  If $ (P_t) $ is\n",
747 |     "irreducible and there exists a function $ v \\colon S \\to \\RR_+ $, a finite set\n",
748 |     "$ F \\subset S $ and positive constants $ \\epsilon $ and $ M $ such that\n",
749 |     "\n",
750 |     "$$\n",
751 |     "\\sum_y Q(x, y) v(y) \n",
752 |     "    \\leq \n",
753 |     "    \\begin{cases}\n",
754 |     "    M & \\text{ if } x \\in F \\text{ and }\n",
755 |     "    \\\\\n",
756 |     "    - \\epsilon & \\text{ otherwise } \n",
757 |     "    \\end{cases}\n",
758 |     "$$\n",
759 |     "\n",
760 |     "then $ (P_t) $ is asymptotically stable."
761 |    ]
762 |   },
763 |   {
764 |    "cell_type": "markdown",
765 |    "id": "2d1a205d",
766 |    "metadata": {},
767 |    "source": [
768 |     "The proof of [Theorem 8.4](#sdrift) can be found in [[Pichór *et al.*, 2012](https://continuous-time-mcs.quantecon.org/zreferences.html#id2)]."
769 |    ]
770 |   },
771 |   {
772 |    "cell_type": "markdown",
773 |    "id": "b29a27ce",
774 |    "metadata": {},
775 |    "source": [
776 |     "**Example 8.2**\n",
777 |     "\n",
778 |     "Consider again the M/M/1 queue on $ \\ZZ_+ $ with intensity matrix [(8.5)](#equation-mm1q).\n",
779 |     "\n",
780 |     "Suppose that $ \\lambda < \\mu $.\n",
781 |     "\n",
782 |     "It is intuitive that, in this case, the queue length will not\n",
783 |     "tend to infinity (since the service rate is higher than the arrival rate).\n",
784 |     "\n",
785 |     "This intuition can be confirmed via [Theorem 8.4](#sdrift), after setting $ v(j) =\n",
786 |     "j $.\n",
787 |     "\n",
788 |     "Indeed, we have, for any $ i \\geq 1 $,\n",
789 |     "\n",
790 |     "$$\n",
791 |     "\\sum_{j \\geq 0} Q(i, j) v(j) \n",
792 |     "    = (i-1) \\mu - i (\\mu + \\lambda) + (i+1) \\lambda\n",
793 |     "    = \\lambda - \\mu\n",
794 |     "$$\n",
795 |     "\n",
796 |     "Setting $ F=\\{0\\} $ and $ M = \\lambda - \\mu = - \\epsilon $, we see that the conditions\n",
797 |     "of [Theorem 8.4](#sdrift) hold.\n",
798 |     "\n",
799 |     "Hence the associated semigroup $ (P_t) $ is asymptotically stable."
800 |    ]
801 |   },
802 |   {
803 |    "cell_type": "markdown",
804 |    "id": "ef678429",
805 |    "metadata": {},
806 |    "source": [
807 |     "**Corollary 8.2**\n",
808 |     "\n",
809 |     "If $ (P_t) $ is an irreducible UC Markov semigroup and $ S $ is finite, then\n",
810 |     "$ (P_t) $ is asymptotically stable."
811 |    ]
812 |   },
813 |   {
814 |    "cell_type": "markdown",
815 |    "id": "36a9234d",
816 |    "metadata": {},
817 |    "source": [
818 |     "A solved exercise below asks you to confirm this."
819 |    ]
820 |   },
821 |   {
822 |    "cell_type": "markdown",
823 |    "id": "14c750a6",
824 |    "metadata": {},
825 |    "source": [
826 |     "## Exercises"
827 |    ]
828 |   },
829 |   {
830 |    "cell_type": "markdown",
831 |    "id": "18dbcf9b",
832 |    "metadata": {},
833 |    "source": [
834 |     "## Exercise 8.1\n",
835 |     "\n",
836 |     "Let $ (P_t) $ be a Markov semigroup.  True or false:\n",
837 |     "for this semigroup, every state $ x $ is accessible from itself."
838 |    ]
839 |   },
840 |   {
841 |    "cell_type": "markdown",
842 |    "id": "db9d344b",
843 |    "metadata": {},
844 |    "source": [
845 |     "## Solution to[ Exercise 8.1](https://continuous-time-mcs.quantecon.org/#ergodicity-ex-1)\n",
846 |     "\n",
847 |     "The statement is true.  With $ t=0 $ we have $ P_t(x,x) = I(x,x) = 1 > 0 $."
848 |    ]
849 |   },
850 |   {
851 |    "cell_type": "markdown",
852 |    "id": "fb01a811",
853 |    "metadata": {},
854 |    "source": [
855 |     "## Exercise 8.2\n",
856 |     "\n",
857 |     "Let $ (\\lambda_k) $ be a bounded non-increasing sequence in $ (0, \\infty) $.\n",
858 |     "\n",
859 |     "A **pure birth process** starting at zero is a continuous time Markov process\n",
860 |     "$ (X_t) $ on state space $ \\ZZ_+ $ with intensity matrix\n",
861 |     "\n",
862 |     "$$\n",
863 |     "Q=\\begin{pmatrix}\n",
864 |     "    -\\lambda_0 & \\lambda_0 & 0 & 0 & \\cdots \\\\\n",
865 |     "    0 & -\\lambda_1 & \\lambda_1 & 0 & \\cdots \\\\\n",
866 |     "    0 & 0 & -\\lambda_2 & \\lambda_2 & \\cdots\\\\\n",
867 |     "    \\vdots & \\vdots & \\vdots & \\vdots & \\ddots\n",
868 |     "\\end{pmatrix}\n",
869 |     "$$\n",
870 |     "\n",
871 |     "Show that $ (P_t) $, the corresponding Markov semigroup, has no stationary\n",
872 |     "distribution."
873 |    ]
874 |   },
875 |   {
876 |    "cell_type": "markdown",
877 |    "id": "8510c7e3",
878 |    "metadata": {},
879 |    "source": [
880 |     "## Solution to[ Exercise 8.2](https://continuous-time-mcs.quantecon.org/#ergodicity-ex-2)\n",
881 |     "\n",
882 |     "Suppose to the contrary that $ \\phi \\in \\dD $ and $ \\phi Q = 0 $.\n",
883 |     "\n",
884 |     "Then, for any $ j \\geq 1 $,\n",
885 |     "\n",
886 |     "$$\n",
887 |     "(\\phi Q)(j) \n",
888 |     "= \\sum_{i \\geq 0} \\phi(i) Q(i, j)\n",
889 |     "= - \\lambda_j \\phi(j) + \\lambda_{j-1} \\phi(j-1) \n",
890 |     "= 0\n",
891 |     "$$\n",
892 |     "\n",
893 |     "Since $ (\\lambda_k) $ is non-increasing, it follows that\n",
894 |     "\n",
895 |     "$$\n",
896 |     "\\frac{\\phi(j)}{\\phi(j-1)} = \\frac{\\lambda_{j-1}}{\\lambda_j} \\geq 1\n",
897 |     "$$\n",
898 |     "\n",
899 |     "Therefore, for any $ j\\geq 1 $, it must be:\n",
900 |     "\n",
901 |     "$$\n",
902 |     "\\phi(j) \\geq \\phi(j-1)\n",
903 |     "$$\n",
904 |     "\n",
905 |     "It follows that $ \\phi $ is non-decreasing on $ \\ZZ_+ $.\n",
906 |     "\n",
907 |     "But $ \\dD $ contains no non-decreasing functions when the state space is infinite.\n",
908 |     "(Why?)\n",
909 |     "\n",
910 |     "Contradiction."
911 |    ]
912 |   },
913 |   {
914 |    "cell_type": "markdown",
915 |    "id": "debc222a",
916 |    "metadata": {},
917 |    "source": [
918 |     "## Exercise 8.3\n",
919 |     "\n",
920 |     "Confirm that [Theorem 8.4](#sdrift) implies [Corollary 8.2](#sfinite)."
921 |    ]
922 |   },
923 |   {
924 |    "cell_type": "markdown",
925 |    "id": "ec5f0e2d",
926 |    "metadata": {},
927 |    "source": [
928 |     "## Solution to[ Exercise 8.3](https://continuous-time-mcs.quantecon.org/#ergodicity-ex-3)\n",
929 |     "\n",
930 |     "Let $ (P_t) $ be an irreducible UC Markov semigroup and let $ S $ be finite.\n",
931 |     "\n",
932 |     "Pick any positive constants $ M, \\epsilon $ and set $ v = M $ and $ F = S $.\n",
933 |     "\n",
934 |     "We then have\n",
935 |     "\n",
936 |     "$$\n",
937 |     "\\sum_y Q(x, y) v(y)\n",
938 |     "= M \\sum_y Q(x, y)\n",
939 |     "= 0\n",
940 |     "$$\n",
941 |     "\n",
942 |     "Hence the drift condition in [Theorem 8.4](#sdrift) holds and $ (P_t) $ is\n",
943 |     "asymptotically stable."
944 |    ]
945 |   }
946 |  ],
947 |  "metadata": {
948 |   "date": 1763463649.6541378,
949 |   "filename": "ergodicity.md",
950 |   "kernelspec": {
951 |    "display_name": "Python",
952 |    "language": "python3",
953 |    "name": "python3"
954 |   },
955 |   "title": "Stationarity and Ergodicity"
956 |  },
957 |  "nbformat": 4,
958 |  "nbformat_minor": 5
959 | }


--------------------------------------------------------------------------------