Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:03,060 --> 00:00:04,620
Instructor: Welcome back.
2
00:00:04,620 --> 00:00:06,420
In this lecture, we are going to introduce
3
00:00:06,420 --> 00:00:09,930
one of the most commonly found continuous distributions,
4
00:00:09,930 --> 00:00:11,373
the normal distribution.
5
00:00:12,360 --> 00:00:13,260
For starters,
6
00:00:13,260 --> 00:00:17,310
we define a normal distribution using a capital letter N
7
00:00:17,310 --> 00:00:20,373
followed by the mean and variance of the distribution.
8
00:00:21,240 --> 00:00:22,650
We read the following notation
9
00:00:22,650 --> 00:00:26,910
as variable X follows a normal distribution
10
00:00:26,910 --> 00:00:31,083
with mean, mu, and variance, sigma squared.
11
00:00:32,130 --> 00:00:33,900
When dealing with actual data,
12
00:00:33,900 --> 00:00:36,299
we would usually know the numerical values of mu
13
00:00:36,299 --> 00:00:37,893
and sigma squared.
14
00:00:39,780 --> 00:00:42,330
The normal distribution frequently appears in nature,
15
00:00:42,330 --> 00:00:45,513
as well as in life, in various shapes and forms.
16
00:00:46,710 --> 00:00:50,010
For example, the size of a fully grown male lion
17
00:00:50,010 --> 00:00:51,783
follows a normal distribution.
18
00:00:52,860 --> 00:00:55,470
Many records suggest that the average lion weighs
19
00:00:55,470 --> 00:01:00,470
between 150 and 250 kilograms, or 330 to 550 pounds.
20
00:01:04,050 --> 00:01:07,413
Of course, specimens exist which fall outside of this range.
21
00:01:08,280 --> 00:01:11,220
However, lions weighing less than 150
22
00:01:11,220 --> 00:01:14,698
or more than 250 kilograms tend to be the exception
23
00:01:14,698 --> 00:01:16,920
rather than the rule.
24
00:01:16,920 --> 00:01:20,520
Such individuals serve as outliers in our set,
25
00:01:20,520 --> 00:01:21,990
and the more data we gather,
26
00:01:21,990 --> 00:01:24,090
the lower part of the data they represent.
27
00:01:25,380 --> 00:01:26,520
Now that you know what types
28
00:01:26,520 --> 00:01:29,070
of events follow a normal distribution,
29
00:01:29,070 --> 00:01:32,013
let us examine some of its distinct characteristics.
30
00:01:33,240 --> 00:01:34,770
For starters, the graph
31
00:01:34,770 --> 00:01:37,770
of a normal distribution is bell shaped.
32
00:01:37,770 --> 00:01:39,480
Therefore, the majority of the data
33
00:01:39,480 --> 00:01:42,300
is centered around the mean.
34
00:01:42,300 --> 00:01:44,010
Thus, values further away
35
00:01:44,010 --> 00:01:46,740
from the mean are less likely to occur.
36
00:01:46,740 --> 00:01:47,940
Furthermore, we can see
37
00:01:47,940 --> 00:01:50,883
that the graph is symmetric with regards to the mean.
38
00:01:51,870 --> 00:01:54,189
That suggests values equally far away
39
00:01:54,189 --> 00:01:57,783
in opposing directions would still be equally likely.
40
00:01:59,460 --> 00:02:01,810
Let's go back to the lion example from earlier.
41
00:02:02,940 --> 00:02:04,980
If the mean is 400,
42
00:02:04,980 --> 00:02:07,410
symmetry suggests a lion is equally likely
43
00:02:07,410 --> 00:02:11,430
to weigh 350 pounds and 450 pounds,
44
00:02:11,430 --> 00:02:13,953
since both are 50 pounds away from the mean.
45
00:02:16,770 --> 00:02:18,930
All right, for anybody interested,
46
00:02:18,930 --> 00:02:22,784
you can find the CDF and the PDF of the normal distribution
47
00:02:22,784 --> 00:02:25,263
in the additional materials for this lecture.
48
00:02:26,520 --> 00:02:27,353
Instead of going
49
00:02:27,353 --> 00:02:29,400
through the complex algebraic simplifications
50
00:02:29,400 --> 00:02:31,590
in this lecture, we are simply going to talk
51
00:02:31,590 --> 00:02:34,023
about the expected value and the variance.
52
00:02:35,790 --> 00:02:38,100
The expected value for a normal distribution
53
00:02:38,100 --> 00:02:43,100
equals its mean, mu, whereas its variance, sigma squared,
54
00:02:43,560 --> 00:02:46,383
is usually given when we define the distribution.
55
00:02:47,280 --> 00:02:48,990
However, if it isn't,
56
00:02:48,990 --> 00:02:51,663
we can deduce it from the expected value.
57
00:02:52,860 --> 00:02:56,460
To do so, we must apply the formula we showed earlier.
58
00:02:56,460 --> 00:02:58,860
The variance of a variable is equal
59
00:02:58,860 --> 00:03:01,890
to the expected value of the squared variable
60
00:03:01,890 --> 00:03:05,133
minus the squared expected value of the variable.
61
00:03:06,750 --> 00:03:07,623
Good job.
62
00:03:08,490 --> 00:03:10,763
Another peculiarity of the normal distribution
63
00:03:10,763 --> 00:03:15,393
is the 68, 95, 99.7 law.
64
00:03:16,290 --> 00:03:19,740
This law suggests that for any normally distributed event,
65
00:03:19,740 --> 00:03:23,946
68% of all outcomes fall within one standard deviation
66
00:03:23,946 --> 00:03:25,353
away from the mean,
67
00:03:26,490 --> 00:03:30,570
95% fall within two standard deviations,
68
00:03:30,570 --> 00:03:33,453
and 99.7 within three.
69
00:03:35,070 --> 00:03:37,170
The last part really emphasizes the fact
70
00:03:37,170 --> 00:03:41,190
that outliers are extremely rare in normal distributions.
71
00:03:41,190 --> 00:03:43,980
It also suggests how much we know about a data set
72
00:03:43,980 --> 00:03:45,365
if we only have the information
73
00:03:45,365 --> 00:03:47,523
that it is normally distributed.
74
00:03:49,980 --> 00:03:51,483
Fantastic work, everyone.
75
00:03:52,530 --> 00:03:55,200
Before we move on to other types of distributions,
76
00:03:55,200 --> 00:03:56,033
you need to know
77
00:03:56,033 --> 00:03:58,173
that we can analyze any normal distribution.
78
00:03:59,340 --> 00:04:02,430
To do this, we need to standardize the distribution,
79
00:04:02,430 --> 00:04:05,103
which we will explain in detail in the next video.
80
00:04:06,060 --> 00:04:07,083
Thanks for watching.
6239
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.