Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,000 --> 00:00:02,000
Instructor: Next up, we've got tech specific tools.
2
00:00:02,000 --> 00:00:05,000
And looking at the Transform tab and the query editor,
3
00:00:05,000 --> 00:00:08,000
you'll notice that Power BI groups different sets
4
00:00:08,000 --> 00:00:11,000
of tools together based on their purpose or function.
5
00:00:11,000 --> 00:00:13,000
So in this case, we'll find all
6
00:00:13,000 --> 00:00:16,000
of our tech specific tools grouped together
7
00:00:16,000 --> 00:00:17,000
at the end of the ribbon.
8
00:00:17,000 --> 00:00:19,000
And within this group
9
00:00:19,000 --> 00:00:23,000
we've got some really interesting and powerful options.
10
00:00:23,000 --> 00:00:24,000
So for one, we can split
11
00:00:24,000 --> 00:00:27,000
up a column based on a specific character
12
00:00:27,000 --> 00:00:30,000
or delimiter or based on a number of characters.
13
00:00:31,000 --> 00:00:33,000
We can format any of our text columns
14
00:00:33,000 --> 00:00:35,000
using basic formatting options,
15
00:00:35,000 --> 00:00:38,000
like lowercase, uppercase, proper case,
16
00:00:38,000 --> 00:00:41,000
which is capitalizing the first letter of each word.
17
00:00:41,000 --> 00:00:44,000
Or we can use tools like Trim, which eliminate leading
18
00:00:44,000 --> 00:00:47,000
and trailing spaces, or clean, which does the same thing
19
00:00:47,000 --> 00:00:51,000
and also eliminates non-principal characters.
20
00:00:51,000 --> 00:00:53,000
Now you might be thinking that those trim
21
00:00:53,000 --> 00:00:55,000
and clean options really aren't that helpful,
22
00:00:55,000 --> 00:00:57,000
but once you run into a case where you
23
00:00:57,000 --> 00:01:01,000
have one trailing space in your data set, and trust me
24
00:01:01,000 --> 00:01:03,000
this will drive you crazy the first time you experience it.
25
00:01:03,000 --> 00:01:05,000
The problem is, as human beings
26
00:01:05,000 --> 00:01:08,000
we're incapable of seeing a trailing space.
27
00:01:08,000 --> 00:01:09,000
It's completely invisible to us
28
00:01:09,000 --> 00:01:11,000
and it looks exactly the same
29
00:01:11,000 --> 00:01:14,000
as any other data point without the trailing space.
30
00:01:14,000 --> 00:01:16,000
But to Power BI, it looks
31
00:01:16,000 --> 00:01:18,000
like a completely different and unique value.
32
00:01:18,000 --> 00:01:20,000
So in cases like that,
33
00:01:20,000 --> 00:01:23,000
trim or clean can be great tools just to standardize
34
00:01:23,000 --> 00:01:25,000
and help avoid issues like that,
35
00:01:25,000 --> 00:01:26,000
especially if you're working
36
00:01:26,000 --> 00:01:29,000
with really messy or unstructured text data.
37
00:01:30,000 --> 00:01:33,000
We also have some great extract tools.
38
00:01:33,000 --> 00:01:35,000
We can extract a certain subset of characters
39
00:01:35,000 --> 00:01:37,000
from a string based on a specific length.
40
00:01:37,000 --> 00:01:38,000
We can extract the first
41
00:01:38,000 --> 00:01:41,000
or last number of characters or a range.
42
00:01:41,000 --> 00:01:44,000
But where it gets really interesting is using delimiters,
43
00:01:44,000 --> 00:01:48,000
which we've already used as part of our first assignment.
44
00:01:48,000 --> 00:01:51,000
So what we can do here is we can actually tell Power BI
45
00:01:51,000 --> 00:01:53,000
that we want to return all of the characters
46
00:01:53,000 --> 00:01:57,000
before a specific delimiter or symbol or character
47
00:01:57,000 --> 00:02:00,000
or after it, or between two distinct delimiters.
48
00:02:00,000 --> 00:02:03,000
And there are also some advanced options as well
49
00:02:03,000 --> 00:02:05,000
that allow you to specify whether you search
50
00:02:05,000 --> 00:02:07,000
from the left side of the string or the right
51
00:02:07,000 --> 00:02:09,000
and if you want to skip a certain number
52
00:02:09,000 --> 00:02:13,000
of instances of each delimiter before returning the text.
53
00:02:13,000 --> 00:02:15,000
So just some really great flexibility there
54
00:02:15,000 --> 00:02:17,000
with those extract tools.
55
00:02:17,000 --> 00:02:18,000
So you may have noticed that some
56
00:02:18,000 --> 00:02:21,000
of the tools here are grayed out or inactive,
57
00:02:21,000 --> 00:02:24,000
like the merge columns or parse.
58
00:02:24,000 --> 00:02:26,000
And that brings up a really important point,
59
00:02:26,000 --> 00:02:29,000
which is that this toolbar is completely dynamic based
60
00:02:29,000 --> 00:02:31,000
on what you've selected.
61
00:02:31,000 --> 00:02:34,000
So if you've only selected a single column,
62
00:02:34,000 --> 00:02:36,000
the merge column options are irrelevant,
63
00:02:36,000 --> 00:02:38,000
so you can't even click them.
64
00:02:38,000 --> 00:02:39,000
You'd have to select multiple columns
65
00:02:39,000 --> 00:02:42,000
in order to activate that option.
66
00:02:42,000 --> 00:02:44,000
And then taking that even further
67
00:02:44,000 --> 00:02:46,000
if you've selected a column that's numerical instead
68
00:02:46,000 --> 00:02:50,000
of text, this entire group of tools might be replaced
69
00:02:50,000 --> 00:02:54,000
by numeric based or number specific tools instead.
70
00:02:54,000 --> 00:02:58,000
So just remember that the entire ribbon, the entire toolbar
71
00:02:58,000 --> 00:03:00,000
that you're seeing here will dynamically change based
72
00:03:00,000 --> 00:03:02,000
on your selections.
73
00:03:02,000 --> 00:03:05,000
Now, one more very important point to highlight
74
00:03:05,000 --> 00:03:07,000
before we shift gears into Power BI,
75
00:03:07,000 --> 00:03:09,000
anytime you see this yellow box that says,
76
00:03:09,000 --> 00:03:11,000
hey, this is important,
77
00:03:11,000 --> 00:03:14,000
I'm gonna talk about something really important.
78
00:03:14,000 --> 00:03:16,000
So that means focus in and pay attention
79
00:03:16,000 --> 00:03:18,000
because this is gonna be something that'll come
80
00:03:18,000 --> 00:03:20,000
up time and time again throughout the course.
81
00:03:20,000 --> 00:03:22,000
And it's really, really important
82
00:03:22,000 --> 00:03:25,000
for you to fully grasp and understand.
83
00:03:25,000 --> 00:03:27,000
So what I want to talk about now is the difference
84
00:03:27,000 --> 00:03:30,000
between transform and add columns
85
00:03:30,000 --> 00:03:32,000
because this is something that confuse me for a while
86
00:03:32,000 --> 00:03:35,000
and it's really important to understand the difference
87
00:03:35,000 --> 00:03:36,000
between the two.
88
00:03:36,000 --> 00:03:40,000
And what I was noticing was that the same tools, in fact
89
00:03:40,000 --> 00:03:42,000
the same identical sets
90
00:03:42,000 --> 00:03:44,000
of tools kept popping up in different places.
91
00:03:44,000 --> 00:03:47,000
And for the longest time, I just thought
92
00:03:47,000 --> 00:03:49,000
that this was really confusing and redundant,
93
00:03:49,000 --> 00:03:52,000
until I realized that the outcome is completely
94
00:03:52,000 --> 00:03:55,000
different depending on where you select the tool.
95
00:03:55,000 --> 00:03:58,000
So when you select a tool from within the Transform tab,
96
00:03:58,000 --> 00:04:00,000
you're essentially modifying
97
00:04:00,000 --> 00:04:03,000
or overriding the column that you've selected.
98
00:04:03,000 --> 00:04:06,000
But when you choose a tool from the Add Column tab
99
00:04:06,000 --> 00:04:09,000
you're creating a brand new column within your table.
100
00:04:09,000 --> 00:04:12,000
So that may sound really obvious when I say it now,
101
00:04:12,000 --> 00:04:15,000
but I guarantee as you're learning this tool
102
00:04:15,000 --> 00:04:18,000
and as you're playing around with the query editor, you will
103
00:04:18,000 --> 00:04:22,000
at some point in time select the tool from the wrong tab.
104
00:04:22,000 --> 00:04:23,000
And you know what?
105
00:04:23,000 --> 00:04:25,000
That's okay because obviously nothing is set in stone.
106
00:04:25,000 --> 00:04:28,000
It's as simple as just deleting the last applied step
107
00:04:28,000 --> 00:04:30,000
and you'll be back where you started.
108
00:04:30,000 --> 00:04:32,000
But it is something to keep in mind
109
00:04:32,000 --> 00:04:35,000
and hopefully it will help you at least understand
110
00:04:35,000 --> 00:04:37,000
where you've gone wrong
111
00:04:37,000 --> 00:04:40,000
and why these tools appear in multiple places.
112
00:04:40,000 --> 00:04:42,000
So with that, let's head back to Power BI
113
00:04:42,000 --> 00:04:44,000
and we're gonna get our hands dirty with some
114
00:04:44,000 --> 00:04:46,000
of these text tools.
115
00:04:46,000 --> 00:04:49,000
So continuing with the customer lookup table,
116
00:04:49,000 --> 00:04:51,000
one of the things that's been bothering me
117
00:04:51,000 --> 00:04:56,000
about this table is the prefix and the customer name
118
00:04:56,000 --> 00:04:57,000
and first name columns.
119
00:04:57,000 --> 00:05:00,000
So if I scroll back over here, right, everything's
120
00:05:00,000 --> 00:05:04,000
capitalized and it just it looks pretty jarring
121
00:05:04,000 --> 00:05:04,000
to me when I look at this.
122
00:05:04,000 --> 00:05:08,000
I would like to see this in a proper case.
123
00:05:08,000 --> 00:05:11,000
Let's first update our prefixed column, right?
124
00:05:11,000 --> 00:05:13,000
So we'll come up here to add column,
125
00:05:13,000 --> 00:05:15,000
and we're gonna come down to format.
126
00:05:15,000 --> 00:05:18,000
And within the format menu option we have see our lowercase,
127
00:05:18,000 --> 00:05:21,000
uppercase, and then capitalize each word.
128
00:05:21,000 --> 00:05:23,000
This is the proper text option, right?
129
00:05:23,000 --> 00:05:28,000
So we can click here and oh, so all right, so we've actually
130
00:05:28,000 --> 00:05:32,000
added a new column to the end of our data set here
131
00:05:32,000 --> 00:05:34,000
and it's called Capitalize Each Word,
132
00:05:34,000 --> 00:05:36,000
which is a a pretty bad title there.
133
00:05:36,000 --> 00:05:40,000
But at any rate, what we've done here is instead
134
00:05:40,000 --> 00:05:43,000
of selecting this from the Transform tab, we've actually
135
00:05:43,000 --> 00:05:47,000
selected the format option from the Add Column tab.
136
00:05:47,000 --> 00:05:49,000
So again, we're adding a brand new column to the end
137
00:05:49,000 --> 00:05:50,000
of the data set.
138
00:05:50,000 --> 00:05:53,000
And again, just to highlight like this is what I'm talking
139
00:05:53,000 --> 00:05:56,000
about, it's really easy to just select the wrong menu item
140
00:05:56,000 --> 00:05:58,000
and create a brand new column instead
141
00:05:58,000 --> 00:06:01,000
of transforming a column or vice versa.
142
00:06:01,000 --> 00:06:03,000
You could do it the other way around.
143
00:06:03,000 --> 00:06:06,000
Again, the great news here is that it's easily fixed, right?
144
00:06:06,000 --> 00:06:09,000
We can just come in here, click X to delete that,
145
00:06:09,000 --> 00:06:14,000
apply to step, we'll come back over to our prefix column
146
00:06:15,000 --> 00:06:18,000
and now we can head up to our transform menu.
147
00:06:18,000 --> 00:06:20,000
And then from here, head over to our text column tools,
148
00:06:20,000 --> 00:06:24,000
click format, capitalize each word,
149
00:06:24,000 --> 00:06:25,000
and now we're in the right spot.
150
00:06:25,000 --> 00:06:28,000
We've transformed the values in this column
151
00:06:28,000 --> 00:06:32,000
to a proper case, or actually only capitalizing each word.
152
00:06:32,000 --> 00:06:33,000
All right, so the next thing that I'd like
153
00:06:33,000 --> 00:06:37,000
to do is also apply that same transformation step
154
00:06:37,000 --> 00:06:40,000
to the first name and last name columns as well.
155
00:06:40,000 --> 00:06:42,000
And so what I could do is I could select each one
156
00:06:42,000 --> 00:06:46,000
of these one at a time, come up to format
157
00:06:46,000 --> 00:06:50,000
or I can multi-select by holding shift and click,
158
00:06:50,000 --> 00:06:54,000
come here to format, capitalize each word.
159
00:06:54,000 --> 00:06:57,000
And then the Query Editor creates those applied steps
160
00:06:57,000 --> 00:07:00,000
to change first name and last name both to proper.
161
00:07:00,000 --> 00:07:02,000
Awesome, so we're in good shape there.
162
00:07:02,000 --> 00:07:04,000
The other interesting piece here to call out is
163
00:07:04,000 --> 00:07:06,000
we didn't add a new applied step here
164
00:07:06,000 --> 00:07:08,000
for capitalize each word.
165
00:07:08,000 --> 00:07:11,000
If we actually look at the M code here, we can see
166
00:07:11,000 --> 00:07:16,000
that we're transforming the columns to proper text, right?
167
00:07:16,000 --> 00:07:17,000
Text dot proper.
168
00:07:17,000 --> 00:07:20,000
And what Power Query has done is it added two
169
00:07:20,000 --> 00:07:23,000
more conditions here for first name
170
00:07:23,000 --> 00:07:26,000
and last name after prefix.
171
00:07:26,000 --> 00:07:27,000
Right, so we're just kind of adding
172
00:07:27,000 --> 00:07:31,000
or mashing that code together into one applied step
173
00:07:31,000 --> 00:07:33,000
because the transformation is all the same there.
174
00:07:33,000 --> 00:07:35,000
All right, so now that all
175
00:07:35,000 --> 00:07:37,000
of these customer text attributes are updated
176
00:07:37,000 --> 00:07:39,000
to proper case., what I would love to do
177
00:07:39,000 --> 00:07:44,000
is create one brand new column for customer full name.
178
00:07:44,000 --> 00:07:45,000
All right, and you can see here again,
179
00:07:45,000 --> 00:07:48,000
we're in our transform tools.
180
00:07:48,000 --> 00:07:50,000
So we want to head over to add column.
181
00:07:50,000 --> 00:07:54,000
And what I want to do is I wanna select prefix, first name,
182
00:07:54,000 --> 00:07:55,000
and last name.
183
00:07:55,000 --> 00:07:58,000
So again, click on prefix hold shift
184
00:07:58,000 --> 00:08:00,000
and select last name to select all three.
185
00:08:00,000 --> 00:08:01,000
One thing to call out here is
186
00:08:01,000 --> 00:08:04,000
that selection order matters when you're kind of merging
187
00:08:04,000 --> 00:08:07,000
or combining things within Power Query, right?
188
00:08:07,000 --> 00:08:10,000
So if you had selected last name first
189
00:08:10,000 --> 00:08:13,000
and then prefix Power Query, interprets that
190
00:08:13,000 --> 00:08:15,000
as you want the last name column first
191
00:08:15,000 --> 00:08:18,000
and then first name in the middle, and then prefix last.
192
00:08:18,000 --> 00:08:21,000
So just make sure you're selecting the columns
193
00:08:21,000 --> 00:08:24,000
in the order that you want them combined.
194
00:08:24,000 --> 00:08:27,000
So from here, and we're gonna come up to merge columns.
195
00:08:27,000 --> 00:08:29,000
And again, here we're just setting our separator.
196
00:08:29,000 --> 00:08:32,000
We're gonna add a space in between each one of these
197
00:08:32,000 --> 00:08:34,000
we can create a new column name.
198
00:08:34,000 --> 00:08:35,000
This is an optional step,
199
00:08:35,000 --> 00:08:38,000
but again merged here is a pretty bad name.
200
00:08:38,000 --> 00:08:42,000
So we'll call this full name, click okay and awesome.
201
00:08:42,000 --> 00:08:44,000
You can see that we have our new insert merged column
202
00:08:44,000 --> 00:08:46,000
applied step here.
203
00:08:46,000 --> 00:08:48,000
Scroll to the end of the data set.
204
00:08:48,000 --> 00:08:52,000
We've got our full name column where we have our prefix,
205
00:08:52,000 --> 00:08:54,000
our first and last name.
206
00:08:54,000 --> 00:08:56,000
And then the last thing that I want to do here
207
00:08:56,000 --> 00:08:58,000
is just rename this applied step
208
00:08:58,000 --> 00:08:59,000
to something a little bit more readable
209
00:08:59,000 --> 00:09:00,000
so we can remember it.
210
00:09:00,000 --> 00:09:03,000
Again, we don't have a lot going on right now
211
00:09:03,000 --> 00:09:05,000
but this may change over time.
212
00:09:05,000 --> 00:09:07,000
All right, so I'm gonna update merged
213
00:09:08,000 --> 00:09:11,000
to full name and we'll apply that change.
214
00:09:11,000 --> 00:09:15,000
All right, so with those updates, I think we're pretty good
215
00:09:15,000 --> 00:09:18,000
with our customer lookup table here.
216
00:09:18,000 --> 00:09:20,000
Last thing is let's head back to the Home tab
217
00:09:20,000 --> 00:09:24,000
and I wanna click close and apply to apply these changes
218
00:09:24,000 --> 00:09:25,000
and load them into our data model.
219
00:09:27,000 --> 00:09:30,000
All right, so now that that's finished loading,
220
00:09:30,000 --> 00:09:32,000
can scroll over and you can see
221
00:09:32,000 --> 00:09:36,000
that we have our new table here for customer lookup.
222
00:09:36,000 --> 00:09:38,000
The other thing that's interesting and to take note
223
00:09:38,000 --> 00:09:40,000
of is that the tables that we have marked
224
00:09:40,000 --> 00:09:43,000
as disabled load are that we're not loading
225
00:09:43,000 --> 00:09:48,000
into Power BI data model, like our SQL Connections
226
00:09:48,000 --> 00:09:50,000
they're not showing up here, and that's expected.
227
00:09:50,000 --> 00:09:52,000
So it's just great to confirm that
228
00:09:52,000 --> 00:09:55,000
and understand exactly what that functionality does.
229
00:09:55,000 --> 00:09:57,000
All right, so I'm gonna save this
230
00:09:57,000 --> 00:09:59,000
and I will see you in the next lecture.
18822
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.