Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,360 --> 00:00:02,910
-: After exploring the measures of central tendency,
2
00:00:02,910 --> 00:00:05,790
let's move on to the measures of asymmetry.
3
00:00:05,790 --> 00:00:08,220
The most commonly used tool to measure asymmetry
4
00:00:08,220 --> 00:00:09,488
is skewness.
5
00:00:09,488 --> 00:00:12,003
This is the formula to calculate it.
6
00:00:13,830 --> 00:00:16,320
Almost always you will use software that performs
7
00:00:16,320 --> 00:00:17,580
a calculation for you.
8
00:00:17,580 --> 00:00:20,970
So in this lesson, we will not get into the computation
9
00:00:20,970 --> 00:00:23,472
but rather the meaning of skewness.
10
00:00:23,472 --> 00:00:26,700
So skewness indicates whether the observations
11
00:00:26,700 --> 00:00:30,300
in a data set are concentrated on one side.
12
00:00:30,300 --> 00:00:32,490
Skewness can be confusing at the beginning.
13
00:00:32,490 --> 00:00:34,623
So an example is in place.
14
00:00:35,580 --> 00:00:37,530
Remember frequency distribution tables
15
00:00:37,530 --> 00:00:39,300
from previous lectures?
16
00:00:39,300 --> 00:00:41,340
Here we have three data sets,
17
00:00:41,340 --> 00:00:43,890
in the respective frequency distributions.
18
00:00:43,890 --> 00:00:48,087
We have also calculated the means, medians, and modes.
19
00:00:48,087 --> 00:00:52,764
The first data set has a mean of 2.79 and a median of two.
20
00:00:52,764 --> 00:00:56,160
Hence, the mean is bigger than the median.
21
00:00:56,160 --> 00:00:58,923
We say that this is a positive or right skew.
22
00:00:59,850 --> 00:01:01,560
From the graph, you can clearly see
23
00:01:01,560 --> 00:01:04,620
that the data points are concentrated on the left side.
24
00:01:04,620 --> 00:01:07,320
Note that the direction of the skew is counterintuitive.
25
00:01:07,320 --> 00:01:09,960
It does not depend on which side the line is leaning to
26
00:01:09,960 --> 00:01:13,110
but rather to which side it's tail is leaning to.
27
00:01:13,110 --> 00:01:17,643
So right skewness means that the outliers are to the right.
28
00:01:18,780 --> 00:01:20,370
It is interesting to see the measures
29
00:01:20,370 --> 00:01:23,460
of central tendency incorporated in the graph.
30
00:01:23,460 --> 00:01:25,440
When we have right skewness, the mean is bigger
31
00:01:25,440 --> 00:01:27,450
than the median, and the mode is the value
32
00:01:27,450 --> 00:01:29,493
with the highest visual representation.
33
00:01:31,477 --> 00:01:33,960
In the second graph, we have plotted a data set
34
00:01:33,960 --> 00:01:36,660
that has an equal mean, median and mode.
35
00:01:36,660 --> 00:01:39,360
The frequency of occurrence is completely symmetrical
36
00:01:39,360 --> 00:01:42,668
and we call this a zero or a no skew.
37
00:01:42,668 --> 00:01:44,910
Most often you'll hear people say
38
00:01:44,910 --> 00:01:46,773
that the distribution is symmetrical.
39
00:01:48,270 --> 00:01:49,380
For the third data set,
40
00:01:49,380 --> 00:01:54,123
we have a mean of 4.9, a median of five, and a mode of six.
41
00:01:54,123 --> 00:01:56,670
As the mean is lower than the median,
42
00:01:56,670 --> 00:01:59,930
we say that there is a negative or a left skew.
43
00:01:59,930 --> 00:02:03,750
Once again, the highest point is define by the mode.
44
00:02:03,750 --> 00:02:06,240
Why is it called a left skew again?
45
00:02:06,240 --> 00:02:09,062
That's right, because the outliers are to the left.
46
00:02:10,127 --> 00:02:14,043
All right, so why is skewness important?
47
00:02:14,910 --> 00:02:18,330
Skewness tells us a lot about where the data is situated.
48
00:02:18,330 --> 00:02:20,370
As we mentioned in our previous lesson,
49
00:02:20,370 --> 00:02:21,960
the mean, median and mode
50
00:02:21,960 --> 00:02:24,060
should be used together to get a good understanding
51
00:02:24,060 --> 00:02:25,410
of the data set.
52
00:02:25,410 --> 00:02:27,780
Measures of asymmetry like skewness are the link
53
00:02:27,780 --> 00:02:31,050
between central tendency measures and probability theory
54
00:02:31,050 --> 00:02:33,360
which ultimately allows us to get a more complete
55
00:02:33,360 --> 00:02:36,420
understanding of the data we are working with.
56
00:02:36,420 --> 00:02:37,420
Thanks for watching.
4534
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.