Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,390 --> 00:00:04,380
Now the next window function we are going to discuss as entail.
2
00:00:05,290 --> 00:00:10,960
And title divides the rows within the partition and two and groups.
3
00:00:11,800 --> 00:00:19,120
It is called end title because and it stands for number and tile is number of tiles.
4
00:00:20,910 --> 00:00:32,370
So suppose if I want to divide the data within the stores into two titles or two groups, I can use
5
00:00:32,370 --> 00:00:33,150
Intel.
6
00:00:34,590 --> 00:00:36,030
So, for example.
7
00:00:36,860 --> 00:00:42,140
There are three customers in a story and four customers in store.
8
00:00:42,170 --> 00:00:47,120
B if I want to divide these three customers into two groups.
9
00:00:47,950 --> 00:00:54,910
The first two customers will get group value of one and the next customer will get group value of two.
10
00:00:55,930 --> 00:00:59,410
If we have four customers as a store b.
11
00:01:00,950 --> 00:01:09,140
The first two customers will get group value of one, and the next two customers will get a group value
12
00:01:09,140 --> 00:01:10,280
of two.
13
00:01:12,210 --> 00:01:14,670
The syntax is also very similar.
14
00:01:14,700 --> 00:01:21,870
Instead of rank or row number, you have to write entail and within the bracket you have to write the
15
00:01:21,870 --> 00:01:26,340
number of tiles you want in your partitions.
16
00:01:29,140 --> 00:01:31,870
So if I wanted three tiles.
17
00:01:32,700 --> 00:01:36,330
My result will be one, two, three.
18
00:01:36,330 --> 00:01:42,120
So three customers will get equally divided into three titles or three groups.
19
00:01:43,120 --> 00:01:53,230
In case of store be the first two customer will get a value of one and third, and for customer will
20
00:01:53,230 --> 00:01:55,270
be getting a value of two and three.
21
00:01:55,510 --> 00:02:04,180
So this function will try to divide the number of rows in each partition into the number of tiles you
22
00:02:04,180 --> 00:02:06,280
have mentioned in the formula.
23
00:02:07,550 --> 00:02:09,920
So what is the use case of this?
24
00:02:10,160 --> 00:02:19,490
So suppose in our rank and roll number video, you have seen that the number of customers from California
25
00:02:19,610 --> 00:02:23,990
are much larger than the number of customers from Alabama.
26
00:02:24,620 --> 00:02:32,270
So suppose if I wanted to take top 20% of customers from each state, how can I do that?
27
00:02:33,280 --> 00:02:35,700
To select 20%.
28
00:02:35,710 --> 00:02:41,380
You can divide the number of customers in each estate into five parts.
29
00:02:41,740 --> 00:02:49,210
So if you divide 100 by five, so each part will contain a 20% of population.
30
00:02:50,610 --> 00:03:00,240
So to select 20%, I can use until five and then select only the tile value one customers.
31
00:03:01,590 --> 00:03:06,510
And similar to rule number and rank functions here also you can use order by.
32
00:03:07,800 --> 00:03:14,100
So let's select top 20% of customers from each state.
33
00:03:14,370 --> 00:03:16,950
So let's go back to our PG admin.
34
00:03:19,440 --> 00:03:21,120
So you can see that.
35
00:03:22,450 --> 00:03:26,680
There are around nine customers in Alabama.
36
00:03:28,050 --> 00:03:33,060
And there are much more customers in California.
37
00:03:34,130 --> 00:03:38,720
So we want 20% of customer from each estate.
38
00:03:38,730 --> 00:03:47,570
So we should get these two values from Alabama and a lot more people from California because there are
39
00:03:47,570 --> 00:03:49,490
much more people in California.
40
00:03:50,650 --> 00:03:53,140
So I will use the.
41
00:03:53,990 --> 00:03:55,040
Same table.
42
00:04:00,460 --> 00:04:02,170
Let's provide intel.
43
00:04:02,200 --> 00:04:08,020
Since we want 20% of customer, we will be dividing our data into five groups.
44
00:04:11,570 --> 00:04:16,580
If you want 10%, then you have to divide your data into ten groups.
45
00:04:17,270 --> 00:04:26,810
If you want just 5%, you have to divide your data into 20 groups so 100 divided by the percentage you
46
00:04:26,810 --> 00:04:27,470
want.
47
00:04:28,010 --> 00:04:29,930
So I want 20%.
48
00:04:29,930 --> 00:04:33,080
So 100 divided by 20, which is five.
49
00:04:33,530 --> 00:04:35,810
That's why I have to write and tell.
50
00:04:36,660 --> 00:04:37,380
Five.
51
00:04:38,020 --> 00:04:40,240
Divide my data into five groups.
52
00:04:43,170 --> 00:04:46,890
Again, partitioned by state and order by order.
53
00:04:47,820 --> 00:04:50,640
And I will write it.
54
00:04:52,040 --> 00:04:53,000
Biden number.
55
00:05:00,760 --> 00:05:08,290
If I run this, you can see that the first two customers are getting tail number as one.
56
00:05:08,830 --> 00:05:16,420
The next two getting title number is two, then next to three, then next to four, and then a single
57
00:05:16,420 --> 00:05:18,430
customer with group number five.
58
00:05:18,790 --> 00:05:25,360
If you can go to California here, you will find many more customers within each group.
59
00:05:25,720 --> 00:05:31,990
So you can see there are a lot more people in group one, Group three, group four and grow five for
60
00:05:31,990 --> 00:05:33,130
a small estate.
61
00:05:33,130 --> 00:05:43,180
For example, in Kentucky, we are getting just four customer, so roughly around 20% of customer from
62
00:05:43,180 --> 00:05:47,690
each state we are getting and how to filter out.
63
00:05:47,710 --> 00:05:57,280
Since our task is to get only 20% of customers from each state on the basis of their number of orders,
64
00:05:57,370 --> 00:06:01,300
we can just select very close.
65
00:06:02,300 --> 00:06:06,800
Select a start from this.
66
00:06:14,760 --> 00:06:15,390
Where.
67
00:06:16,790 --> 00:06:17,390
Dale.
68
00:06:18,460 --> 00:06:19,180
And.
69
00:06:20,250 --> 00:06:22,770
Is equal to one.
70
00:06:28,390 --> 00:06:28,710
Hmm.
71
00:06:28,930 --> 00:06:31,540
There is something wrong with our Twitter.
72
00:06:32,950 --> 00:06:33,640
Okay.
73
00:06:34,170 --> 00:06:35,080
I see.
74
00:06:35,800 --> 00:06:37,600
And we'll write a lot.
75
00:06:42,900 --> 00:06:48,380
You can see that now we are getting top 20% of customer from each set.
76
00:06:51,840 --> 00:06:54,270
Now for bottom 20%.
77
00:06:54,270 --> 00:06:58,050
You can just select the last title for data.
78
00:06:58,470 --> 00:07:02,850
So these are the people with least number of orders in each state.
79
00:07:04,030 --> 00:07:12,340
So you can easily use an intel function when you want to segregate your data on the basis of percentages
80
00:07:12,670 --> 00:07:19,540
for ranks, you can use any of the three rank function For percentages, you have to use Intel function.
7117
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.