Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:03,030 --> 00:00:04,230
-: Hello again.
2
00:00:04,230 --> 00:00:05,063
In this lecture,
3
00:00:05,063 --> 00:00:07,470
we are going to discuss the exponential distribution
4
00:00:07,470 --> 00:00:09,393
and its main characteristics.
5
00:00:10,500 --> 00:00:13,800
For starters, we define the exponential distribution
6
00:00:13,800 --> 00:00:15,993
with the abbreviation Exp,
7
00:00:16,920 --> 00:00:19,383
followed by a scale parameter, lambda.
8
00:00:20,400 --> 00:00:22,410
We read the following statement as
9
00:00:22,410 --> 00:00:25,860
Variable X follows an exponential distribution
10
00:00:25,860 --> 00:00:27,993
with a scale of a half.
11
00:00:30,150 --> 00:00:31,200
All right.
12
00:00:31,200 --> 00:00:33,090
Variables which most closely follow an
13
00:00:33,090 --> 00:00:35,100
exponential distribution are ones
14
00:00:35,100 --> 00:00:37,740
with a probability that initially decreases
15
00:00:37,740 --> 00:00:39,453
before eventually plateauing.
16
00:00:40,830 --> 00:00:44,583
One such example is the number of views for a YouTube vlog.
17
00:00:45,540 --> 00:00:47,940
There is a great interest upon release, so it starts
18
00:00:47,940 --> 00:00:50,140
off with many views in the first day or two.
19
00:00:51,180 --> 00:00:54,180
After most subscribers have had the chance to see the video,
20
00:00:54,180 --> 00:00:55,653
the view counter slows down.
21
00:00:56,610 --> 00:00:59,430
Even though the aggregate amount of views keeps increasing,
22
00:00:59,430 --> 00:01:02,520
the number of new ones diminishes daily.
23
00:01:02,520 --> 00:01:05,310
As time goes on, the video either becomes outdated
24
00:01:05,310 --> 00:01:07,380
or the author produces new content,
25
00:01:07,380 --> 00:01:09,483
so viewership focus shifts away.
26
00:01:10,590 --> 00:01:12,300
Therefore, it is more likely
27
00:01:12,300 --> 00:01:14,160
for a random viewing to have occurred
28
00:01:14,160 --> 00:01:16,260
close to the video's initial release
29
00:01:16,260 --> 00:01:18,153
than in any of the following periods.
30
00:01:19,380 --> 00:01:20,820
Graphically, the PDF
31
00:01:20,820 --> 00:01:23,370
of such a function would start off very high
32
00:01:23,370 --> 00:01:27,090
and sharply decrease within the first few timeframes.
33
00:01:27,090 --> 00:01:29,190
The curve somewhat resembles a boomerang
34
00:01:29,190 --> 00:01:32,553
with each handle lining up with the X and Y axis.
35
00:01:33,840 --> 00:01:34,740
All right.
36
00:01:34,740 --> 00:01:36,900
We know what the PDF would look like
37
00:01:36,900 --> 00:01:38,313
but what about the CDF?
38
00:01:39,180 --> 00:01:42,990
In a weird way, the CDF would also resemble a boomerang.
39
00:01:42,990 --> 00:01:46,233
However, this one has shifted 90 degrees to the right.
40
00:01:47,190 --> 00:01:48,023
As you know,
41
00:01:48,023 --> 00:01:50,760
the cumulative distribution eventually approaches one,
42
00:01:50,760 --> 00:01:53,163
so that would be the value where it plateaus.
43
00:01:54,810 --> 00:01:56,820
To define an exponential distribution,
44
00:01:56,820 --> 00:01:59,100
we require a rate parameter,
45
00:01:59,100 --> 00:02:01,203
denoted by the Greek letter, lambda.
46
00:02:02,070 --> 00:02:05,310
This parameter determines how fast the CDF curve
47
00:02:05,310 --> 00:02:06,987
reaches the point of plateauing,
48
00:02:06,987 --> 00:02:08,793
and how spread out the graph is.
49
00:02:10,650 --> 00:02:11,910
All right, let's talk
50
00:02:11,910 --> 00:02:14,313
about the expected value and the variance.
51
00:02:15,180 --> 00:02:17,910
The expected value for an exponential distribution
52
00:02:17,910 --> 00:02:21,483
is equal to one over the rate parameter lambda.
53
00:02:22,620 --> 00:02:25,503
Whilst the variance is one over lambda squared.
54
00:02:26,880 --> 00:02:28,110
In data analysis,
55
00:02:28,110 --> 00:02:31,023
we end up using exponential distributions quite often.
56
00:02:31,980 --> 00:02:35,820
However, unlike the normal or chi squared distributions,
57
00:02:35,820 --> 00:02:39,000
we do not have a table of known variables for it.
58
00:02:39,000 --> 00:02:41,763
That is why sometimes we prefer to transform it.
59
00:02:42,780 --> 00:02:45,000
Generally, we can take the natural logarithm
60
00:02:45,000 --> 00:02:47,760
of every set of an exponential distribution
61
00:02:47,760 --> 00:02:49,950
and get a normal distribution.
62
00:02:49,950 --> 00:02:53,070
In statistics, we can use this new transform data
63
00:02:53,070 --> 00:02:55,290
to run linear regressions.
64
00:02:55,290 --> 00:02:57,270
This is one of the most common transformations
65
00:02:57,270 --> 00:02:58,270
I've had to perform.
66
00:03:00,300 --> 00:03:01,470
Before we move on,
67
00:03:01,470 --> 00:03:03,600
we need to introduce an extremely important type
68
00:03:03,600 --> 00:03:07,410
of distribution that is often used in mathematical modeling.
69
00:03:07,410 --> 00:03:09,930
We're going to focus on the logistic distribution
70
00:03:09,930 --> 00:03:12,423
and its main characteristics in the next video.
71
00:03:13,650 --> 00:03:14,733
Thanks for watching.
5482
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.