Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:11,100 --> 00:00:16,680
So in this lecture, we are going to discuss a very important topic when it comes to Financial Times
2
00:00:16,680 --> 00:00:23,190
series, this is the random walk in the corresponding random walk hypothesis to give you a very brief
3
00:00:23,190 --> 00:00:27,660
summary, a random walk is what we implemented when we did price simulations.
4
00:00:28,140 --> 00:00:33,180
This lecture will expand on what we did by taking a more theoretical look at what we've already done
5
00:00:33,180 --> 00:00:33,990
in practice.
6
00:00:34,740 --> 00:00:39,990
The practical part is useful, but the theoretical part is critical for providing you with necessary
7
00:00:39,990 --> 00:00:40,660
insights.
8
00:00:41,190 --> 00:00:46,620
In fact, we'll learn later in this course that the random walk is a special case of a Arima, a very
9
00:00:46,620 --> 00:00:48,150
important time series model.
10
00:00:49,640 --> 00:00:51,980
So what is the random hypothesis?
11
00:00:52,910 --> 00:00:56,890
Well, put simply, it says that stock prices follow a random walk.
12
00:00:57,380 --> 00:01:01,850
Now, of course, you may not know exactly what a random walk is yet, but that's what this lecture
13
00:01:01,850 --> 00:01:05,260
is about now because of the nature of random walks.
14
00:01:05,270 --> 00:01:09,890
If stock prices do, in fact, follow a random walk, then they are unpredictable.
15
00:01:10,310 --> 00:01:12,380
The rest of this lecture will show you how.
16
00:01:17,080 --> 00:01:21,390
But first, let's discuss some of the history behind the random walk hypothesis.
17
00:01:22,120 --> 00:01:27,100
Firstly, the mathematical concept of random walks has existed for a long time.
18
00:01:27,610 --> 00:01:29,770
As you'll see, it's just a mark of process.
19
00:01:29,770 --> 00:01:32,740
And so it's something you would normally learn in probability class.
20
00:01:33,220 --> 00:01:38,410
The random walk hypothesis is specific to finance and stock prices in particular.
21
00:01:39,070 --> 00:01:44,460
It was popularized in the 70s when a book called A Random Walk Down Wall Street was released.
22
00:01:45,040 --> 00:01:49,400
In fact, this was the book that also popularized the efficient market hypothesis.
23
00:01:50,230 --> 00:01:55,870
Note that both the random walk hypothesis and the efficient market hypothesis lead to the same conclusion,
24
00:01:56,110 --> 00:01:58,070
which is that you can't beat the market.
25
00:01:58,720 --> 00:02:02,200
Now, of course, there are people who don't believe in the random hypothesis.
26
00:02:02,350 --> 00:02:06,760
And so another book has come out called A Non Random Walk down Wall Street.
27
00:02:07,540 --> 00:02:12,320
Interestingly, this book came out almost 30 years later after a random walk down Wall Street.
28
00:02:12,610 --> 00:02:18,490
So it's not as if the random hypothesis and the efficient market hypothesis are ideas which are easily
29
00:02:18,490 --> 00:02:19,090
debunked.
30
00:02:19,630 --> 00:02:25,180
In this course, we're actually going to fit models to stock prices and we'll find that sometimes the
31
00:02:25,180 --> 00:02:27,940
best fitting model is, in fact, a random walk.
32
00:02:32,740 --> 00:02:34,300
So what is a random walk?
33
00:02:34,900 --> 00:02:39,550
Well, probably the simplest random walk works like this start at any price.
34
00:02:40,070 --> 00:02:46,120
Then in order to generate the next price, simply pick either plus one or minus one with equal probability.
35
00:02:46,720 --> 00:02:54,290
So P one is equal to P0 plus E one where E one, it can be either minus one or plus one, then generate
36
00:02:54,290 --> 00:02:59,920
P two from P one in the same way by picking either plus one or minus one with equal probability and
37
00:02:59,920 --> 00:03:01,150
then adding it to P1.
38
00:03:01,840 --> 00:03:05,110
Then we find P three and then we find P four and so on.
39
00:03:05,590 --> 00:03:07,150
So this is a random walk.
40
00:03:07,870 --> 00:03:14,200
Basically you can imagine yourself walking on the sidewalk in one dimension at every step you either
41
00:03:14,200 --> 00:03:19,480
decide to take one step to the left or one step to the right based on the result of a coin flip.
42
00:03:19,990 --> 00:03:22,030
Your walk is then a random walk.
43
00:03:22,780 --> 00:03:25,300
Notice one important property of the random walk.
44
00:03:25,660 --> 00:03:27,960
It's impossible to predict the next value.
45
00:03:28,360 --> 00:03:30,820
You only have a 50 percent chance of getting it right.
46
00:03:31,630 --> 00:03:36,940
In other words, your ability to predict the result of your walk is the same as your ability to predict
47
00:03:36,940 --> 00:03:38,920
the result of a series of coin flips.
48
00:03:43,870 --> 00:03:48,570
Now, we know that changes in stock price aren't just minus one and plus one, but can be real valued.
49
00:03:48,970 --> 00:03:54,250
In fact, we spent a lot of time in the previous section of this course trying to figure out what is
50
00:03:54,250 --> 00:03:56,330
the distribution that stock returns follow.
51
00:03:57,040 --> 00:04:00,260
Let's assume for now that the noise term is Gaussian.
52
00:04:01,030 --> 00:04:03,820
What would our algorithm be for generating stock prices?
53
00:04:04,360 --> 00:04:07,750
Again, we start at p0 equal to some arbitrary value.
54
00:04:08,260 --> 00:04:15,040
To find the next price, we first sample E from our Gaussian, then we add a P zero plus one to find
55
00:04:15,040 --> 00:04:16,500
P one the next price.
56
00:04:17,020 --> 00:04:20,590
We do the same thing to generate P2 and P3 and so forth.
57
00:04:21,130 --> 00:04:26,740
This should sound familiar because it's exactly what we did in our price simulation exercise from the
58
00:04:26,740 --> 00:04:27,780
previous lecture.
59
00:04:28,480 --> 00:04:31,090
In fact that was exactly a random walk.
60
00:04:31,780 --> 00:04:38,740
Notice again how we can't predict P one from P zero or equivalently we can't predict P one minus P zero,
61
00:04:38,770 --> 00:04:41,490
which is just E one, which is Gaussian noise.
62
00:04:41,980 --> 00:04:45,550
We can only predict one insofar as we know its expected value.
63
00:04:50,520 --> 00:04:55,760
Here's something interesting we can do that helps us understand why working with log prices is valuable,
64
00:04:56,460 --> 00:05:00,690
the general formula for a random walk with a drift is as follows.
65
00:05:01,680 --> 00:05:04,660
Muse called the drifter, and it's considered to be constant.
66
00:05:05,040 --> 00:05:08,880
If you're thinking of a time series, this would control the trend of the Time series.
67
00:05:09,690 --> 00:05:13,710
E of T is a Gaussian with mean zero and some variance sigma squared.
68
00:05:14,640 --> 00:05:22,170
In this case, time T and part time T minus one are the log prices at time T and time T minus one respectively.
69
00:05:23,100 --> 00:05:30,960
Note that if I take time T minus one to the left hand side, I get a time T minus time T minus one,
70
00:05:31,260 --> 00:05:32,610
which is the log return.
71
00:05:33,890 --> 00:05:38,330
If we were working with nonlawyers returns, this wouldn't be as convenient, since we would need a
72
00:05:38,330 --> 00:05:41,910
P of T minus one in the denominator to represent the return.
73
00:05:42,800 --> 00:05:48,200
What this says is that the log return is just the thing on the right hand side, which is just the Gaussian
74
00:05:48,200 --> 00:05:50,250
with Meenu and Variance Sigma squared.
75
00:05:50,840 --> 00:05:55,320
So the random walk model goes hand in hand with log prices and log returns.
76
00:05:55,840 --> 00:06:01,610
In fact, this model is the basis for the Black-Scholes formula which earned the Nobel Prize in economics.
77
00:06:06,420 --> 00:06:10,320
Now, the big question is, of course, is the random walk hypothesis correct?
78
00:06:10,890 --> 00:06:14,800
Well, let's recognize that there are some hidden assumptions in the random walk model.
79
00:06:15,570 --> 00:06:20,460
First is that the log returns are ID independent and identically distributed.
80
00:06:21,000 --> 00:06:25,340
We have seen that this may not be true because we have observed volatility clustering.
81
00:06:26,040 --> 00:06:31,620
If the volatility changes over time, then by definition it's not identically distributed.
82
00:06:32,250 --> 00:06:38,790
Furthermore, if the volatility in one period has some relationship to nearby periods, that is high.
83
00:06:38,790 --> 00:06:43,620
Volatility is clustered with other high volatility, then it's also not independent.
84
00:06:48,480 --> 00:06:52,690
At the same time, the random walk model is convenient and easy to work with.
85
00:06:53,220 --> 00:06:59,580
We will find that when we fit Arima models to stock prices, sometimes the best fitting model will be
86
00:06:59,580 --> 00:07:00,420
a random walk.
87
00:07:00,840 --> 00:07:07,230
So it wouldn't be wrong to say that sometimes for certain periods of time, stock prices do look like
88
00:07:07,230 --> 00:07:08,580
they follow a random walk.
89
00:07:09,210 --> 00:07:15,030
As with the efficient market hypothesis, it's possible to use statistical tests to determine whether
90
00:07:15,030 --> 00:07:17,340
or not stock prices follow a random walk.
91
00:07:22,030 --> 00:07:27,610
Now, since this is, of course, on Time series, we're going to do some time series analysis on random
92
00:07:27,610 --> 00:07:28,250
walks.
93
00:07:29,200 --> 00:07:33,750
Let's recognize that a random walk is just a specific instance of a Markov chain.
94
00:07:34,300 --> 00:07:39,400
If you've ever taken any of my courses on NLP or reinforcement learning, you should be familiar with
95
00:07:39,400 --> 00:07:40,260
this concept.
96
00:07:40,870 --> 00:07:42,310
The basic idea is this.
97
00:07:42,910 --> 00:07:44,110
Consider the sentence.
98
00:07:44,110 --> 00:07:46,930
The quick brown fox jumps over the lazy dog.
99
00:07:47,350 --> 00:07:52,960
If I gave you the sequence, the quick brown fox jumps over the lazy, how can you predict the next
100
00:07:52,960 --> 00:07:54,520
word of this sentence?
101
00:07:55,240 --> 00:08:00,610
Well, one solution is to build a probability distribution so you have the probability of the word a
102
00:08:00,610 --> 00:08:05,830
time t given the word a time, T minus one, given the word of times you minus two and so on.
103
00:08:06,370 --> 00:08:08,410
We call such a model a language model.
104
00:08:13,370 --> 00:08:18,800
Well, to get to the point, the mark of assumption says this, it says that instead of the word a time
105
00:08:18,800 --> 00:08:24,950
t, depending on all previous words, it only depends on the most immediate preceding word.
106
00:08:25,580 --> 00:08:31,580
That is P of word a time T given word, a time T minus one word at time, T minus two and so on is equal
107
00:08:31,580 --> 00:08:35,690
to P of word a time T given word of time, T minus one.
108
00:08:36,500 --> 00:08:40,910
Now you might think, OK, that's fine, but let's make this a little less abstract.
109
00:08:45,720 --> 00:08:50,350
Suppose I give you the word lazy and I ask you to predict the next word in my sentence.
110
00:08:50,760 --> 00:08:52,750
Of course, there are many possibilities.
111
00:08:53,190 --> 00:08:58,520
It could be lazy dog, but you'd probably be cheating because that's the sentence I gave you earlier.
112
00:08:59,160 --> 00:09:04,920
It might be lazy programmer, who is the author of this course, but again, you're going to use exogenous
113
00:09:04,920 --> 00:09:06,250
data to make your prediction.
114
00:09:07,020 --> 00:09:08,290
How about lazy student?
115
00:09:09,150 --> 00:09:14,190
In fact, it's quite difficult to know with any certainty exactly what the next word will be, given
116
00:09:14,190 --> 00:09:15,230
only a single word.
117
00:09:16,050 --> 00:09:16,830
Consider the word.
118
00:09:16,830 --> 00:09:19,590
The the next word could be practically anything.
119
00:09:20,190 --> 00:09:25,080
So the lesson here is that the mark of assumption is an extremely strong modeling assumption.
120
00:09:25,470 --> 00:09:27,180
At the same time, it's quite useful.
121
00:09:32,170 --> 00:09:37,690
So let's assume we have a Gaussian random, OK, this is excessive T equals to X of T minus one plus
122
00:09:37,690 --> 00:09:42,290
F.T. Where it is Gaussian distributed with mean zero and variance sigma squared.
123
00:09:42,910 --> 00:09:49,810
In this case, we can see that X of T is completely determined by a Gaussian distribution center, that
124
00:09:49,810 --> 00:09:52,600
X of T minus one with a variance sigma squared.
125
00:09:53,200 --> 00:09:59,320
That is, it does not depend on any previous values in the series, not X, a T minus two, not actually
126
00:09:59,350 --> 00:10:00,720
T minus three and so on.
127
00:10:01,390 --> 00:10:04,840
Therefore the Gaussian random walk forms a Markov chain.
128
00:10:09,830 --> 00:10:15,970
So if stock prices follow a Gaussian random walk, then the next obvious question is how do we forecast
129
00:10:16,610 --> 00:10:19,440
remember that because the next step is essentially random.
130
00:10:19,580 --> 00:10:22,130
The best we can do is find the expected value.
131
00:10:22,820 --> 00:10:28,850
Well, the expected value of a Gaussian with mean X of T minus one is just the mean X of T minus one.
132
00:10:29,510 --> 00:10:30,560
So what does this say?
133
00:10:31,220 --> 00:10:36,740
It's saying that if your stock price follows a random walk, then your best guess for the next stock
134
00:10:36,740 --> 00:10:39,540
price in the series is just the previous value.
135
00:10:39,860 --> 00:10:41,970
We cannot do any better than this.
136
00:10:42,680 --> 00:10:47,690
Notice that this justifies our method of filling in missing data, which is to copy the previous stock
137
00:10:47,690 --> 00:10:49,040
price forward in time.
138
00:10:53,980 --> 00:11:00,130
Now, as you know, often when we make estimates and statistics, we also want to quantify how confident
139
00:11:00,130 --> 00:11:01,620
we are in those estimates.
140
00:11:02,380 --> 00:11:08,380
Let's suppose we start at X of T and we want to forecast tall steps into the future to find X of T,
141
00:11:08,380 --> 00:11:15,370
plus how we already know the expected value of X 50 plus how it's just X of T, the same value we started
142
00:11:15,370 --> 00:11:15,740
with.
143
00:11:16,360 --> 00:11:17,680
But what does this variance?
144
00:11:18,340 --> 00:11:20,820
Well, we can use our price simulation formula.
145
00:11:21,430 --> 00:11:25,930
We know that X 50 plus one is equal to acts of T plus T plus one.
146
00:11:26,560 --> 00:11:33,040
Based on that, we also know that X 50 plus to zero to x 50 plus one plus the two plus two, which is
147
00:11:33,040 --> 00:11:34,860
added one to all the time indices.
148
00:11:35,650 --> 00:11:42,610
However, we can substitute X 50 plus one and then we would get X of T plus E plus one plus eight plus
149
00:11:42,610 --> 00:11:43,030
two.
150
00:11:43,780 --> 00:11:47,980
And then we keep following this pattern until we get to 50 plus tau.
151
00:11:48,430 --> 00:11:55,240
So X of T plus tau Ziko to X of T plus F.T. plus one policy of T plus two all the way up to 80 plus
152
00:11:55,240 --> 00:11:55,780
tão.
153
00:11:56,560 --> 00:12:00,400
Now luckily we did something exactly like this in the previous section.
154
00:12:01,150 --> 00:12:07,330
If all the E's are Gaussian with mean zero and variance sigma squared, then there's some as mean zero
155
00:12:07,330 --> 00:12:09,790
and variance tau time sigma squared.
156
00:12:10,420 --> 00:12:15,340
Therefore we can say that the variance in our estimate increases linearly with tau.
157
00:12:16,030 --> 00:12:21,580
More commonly we work with the standard deviation so we can see that the standard deviation of our forecast
158
00:12:21,790 --> 00:12:25,810
increases with the square root of the number of forecasting steps.
159
00:12:30,470 --> 00:12:34,560
Let's consider a well-known theorem from statistics, the central limit theorem.
160
00:12:35,390 --> 00:12:41,540
We know that our forecast, the Time T plus tau is the last known price of T plus the sum of a bunch
161
00:12:41,540 --> 00:12:42,640
of noise terms.
162
00:12:43,280 --> 00:12:48,030
Recall that the central limit theorem says that this sum tends to a Gaussian distribution.
163
00:12:48,740 --> 00:12:54,740
And so even if your returns do not necessarily follow a Gaussian distribution in the short term, what
164
00:12:54,740 --> 00:12:55,880
happens in the long term?
165
00:12:56,480 --> 00:12:59,720
Well, in the long term, you're just adding up a bunch of random variables.
166
00:12:59,900 --> 00:13:03,830
And due to the central limit theorem, their distribution approaches a Gaussian.
167
00:13:08,730 --> 00:13:13,920
I want to end this lecture with a tale about a famous experiment run by The Wall Street Journal in nineteen
168
00:13:13,920 --> 00:13:14,470
eighty eight.
169
00:13:15,360 --> 00:13:20,640
And this experiment called the Dart Throwing Investment Contest, professional stock traders from the
170
00:13:20,640 --> 00:13:26,550
New York Stock Exchange competed against dummy investors who simply threw darts on a board to choose
171
00:13:26,550 --> 00:13:27,390
stocks randomly.
172
00:13:28,110 --> 00:13:32,850
Now, granted, one might argue that throwing darts is not actually random and there may have been better
173
00:13:32,850 --> 00:13:34,400
ways to make random choices.
174
00:13:35,010 --> 00:13:41,190
In any case, they found that professional investors beat the dummy investors sixty one out of one hundred
175
00:13:41,190 --> 00:13:45,660
times and the dummy investors won only 39 out of 100 times.
176
00:13:46,290 --> 00:13:51,780
So you might think it's better to go with a professional investor rather than just picking stocks randomly.
177
00:13:52,500 --> 00:13:58,470
However, the professional investors only beat the market 51 out of 100 times.
178
00:13:59,100 --> 00:14:04,610
This is why it's often advised not to use active investing, although your bank will tell you otherwise.
179
00:14:04,980 --> 00:14:10,440
Just don't forget your bank is there to sell you things, not to give you good advice if you buy into
180
00:14:10,440 --> 00:14:11,910
an actively managed fund.
181
00:14:11,940 --> 00:14:17,520
First of all, you may only have a 50 percent chance of beating the market and on average you will match
182
00:14:17,520 --> 00:14:18,030
the market.
183
00:14:18,570 --> 00:14:23,860
However, the fees for actively managed funds are much higher than passively managed funds.
184
00:14:24,300 --> 00:14:29,100
Therefore, if you invest in the market itself, your fees will be much lower and you will have the
185
00:14:29,100 --> 00:14:30,780
same expected return anyway.
19836
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.