Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,000 --> 00:00:01,000
Presenter: All right, so to kick off
2
00:00:01,000 --> 00:00:04,000
the data modeling 101 part of the section,
3
00:00:04,000 --> 00:00:06,000
you may very well be asking yourself,
4
00:00:06,000 --> 00:00:09,000
well, what the heck is a data model anyway?
5
00:00:09,000 --> 00:00:12,000
So I want you to picture something like this.
6
00:00:12,000 --> 00:00:14,000
We've got three different tables shown here.
7
00:00:14,000 --> 00:00:17,000
Product lookup, sales data, and returns data.
8
00:00:17,000 --> 00:00:20,000
This is not a data model.
9
00:00:20,000 --> 00:00:22,000
What we're looking at here is a collection
10
00:00:22,000 --> 00:00:24,000
of three independent tables
11
00:00:24,000 --> 00:00:27,000
which share no connections or relationships.
12
00:00:27,000 --> 00:00:29,000
In other words, the sales data table
13
00:00:29,000 --> 00:00:31,000
has no idea that the product lookup
14
00:00:31,000 --> 00:00:34,000
or returns data tables even exist.
15
00:00:34,000 --> 00:00:35,000
And what that means is that
16
00:00:35,000 --> 00:00:38,000
if you tried to pull in or analyze data
17
00:00:38,000 --> 00:00:40,000
from those sales or returns tables,
18
00:00:40,000 --> 00:00:43,000
like the order quantity and return quantity for instance,
19
00:00:43,000 --> 00:00:46,000
and filter those values based on some field
20
00:00:46,000 --> 00:00:48,000
in the product lookup table,
21
00:00:48,000 --> 00:00:50,000
like the product name,
22
00:00:50,000 --> 00:00:51,000
this is what you'd see.
23
00:00:51,000 --> 00:00:54,000
You'd get that same repeated grand total
24
00:00:54,000 --> 00:00:57,000
in every single row of the table.
25
00:00:57,000 --> 00:00:58,000
And again, that's because we have
26
00:00:58,000 --> 00:01:00,000
no relationship, no connection
27
00:01:00,000 --> 00:01:02,000
that's allowing us to filter
28
00:01:02,000 --> 00:01:05,000
or segment those sales or returns values
29
00:01:05,000 --> 00:01:08,000
based on fields outside of those tables.
30
00:01:08,000 --> 00:01:10,000
Now, let's picture something like this.
31
00:01:10,000 --> 00:01:12,000
Looks very similar, right?
32
00:01:12,000 --> 00:01:14,000
We've got the same three tables here
33
00:01:14,000 --> 00:01:16,000
but this is a data model.
34
00:01:16,000 --> 00:01:18,000
And why is it a data model?
35
00:01:18,000 --> 00:01:21,000
Well, because now we have relationships.
36
00:01:21,000 --> 00:01:23,000
These three tables are connected
37
00:01:23,000 --> 00:01:27,000
via relationships based on common fields or keys.
38
00:01:27,000 --> 00:01:29,000
And we'll talk more about different types of keys
39
00:01:29,000 --> 00:01:31,000
and what they mean in just a minute.
40
00:01:31,000 --> 00:01:33,000
But for now, what this means
41
00:01:33,000 --> 00:01:34,000
is that we can take fields
42
00:01:34,000 --> 00:01:36,000
from our sales and returns data.
43
00:01:36,000 --> 00:01:39,000
And because those relationships exist,
44
00:01:39,000 --> 00:01:40,000
it means that we can aggregate
45
00:01:40,000 --> 00:01:42,000
or filter those values
46
00:01:42,000 --> 00:01:44,000
based on fields from related tables.
47
00:01:44,000 --> 00:01:48,000
So now, instead of seeing that same repeated grand total,
48
00:01:48,000 --> 00:01:49,000
we see accurate values
49
00:01:49,000 --> 00:01:51,000
for these order quantity
50
00:01:51,000 --> 00:01:53,000
and return quantity fields.
51
00:01:53,000 --> 00:01:56,000
This is why data modeling is so so powerful
52
00:01:56,000 --> 00:01:58,000
because it allows you to pull data
53
00:01:58,000 --> 00:02:01,000
from all different places and sources.
54
00:02:01,000 --> 00:02:03,000
And as long as you have valid keys
55
00:02:03,000 --> 00:02:06,000
or connections to form relationships between them,
56
00:02:06,000 --> 00:02:08,000
it means you can analyze that data
57
00:02:08,000 --> 00:02:10,000
from all of the tables
58
00:02:10,000 --> 00:02:12,000
in one centralized place
59
00:02:12,000 --> 00:02:14,000
as if you're merging it all together
60
00:02:14,000 --> 00:02:17,000
without having to physically join these tables
61
00:02:17,000 --> 00:02:20,000
or write a bunch of complex formulas and functions.
62
00:02:20,000 --> 00:02:21,000
So with that, let's go ahead
63
00:02:21,000 --> 00:02:24,000
and open up our AdventureWorks report file in Power BI
64
00:02:24,000 --> 00:02:26,000
and I'll show you exactly what this looks
65
00:02:26,000 --> 00:02:28,000
like inside of the Power BI environment.
66
00:02:29,000 --> 00:02:32,000
All right, so once you have Power BI pulled up,
67
00:02:32,000 --> 00:02:34,000
go ahead and jump into the model view here.
68
00:02:34,000 --> 00:02:36,000
This is where you should see all
69
00:02:36,000 --> 00:02:37,000
of those tables that you loaded up
70
00:02:37,000 --> 00:02:40,000
in Power Query in the last section of the course, right?
71
00:02:40,000 --> 00:02:43,000
We've got our territories, our products, categories,
72
00:02:43,000 --> 00:02:47,000
subcategories, customer lookup table, calendar table,
73
00:02:47,000 --> 00:02:49,000
and our sales data table.
74
00:02:49,000 --> 00:02:51,000
And remember, we haven't done anything
75
00:02:51,000 --> 00:02:54,000
to create relationships between these tables
76
00:02:54,000 --> 00:02:57,000
so we don't have a data model quite yet.
77
00:02:57,000 --> 00:02:58,000
And to really hammer that point home
78
00:02:58,000 --> 00:03:00,000
just like we did in the slides,
79
00:03:00,000 --> 00:03:03,000
let's jump into the report view here.
80
00:03:03,000 --> 00:03:04,000
We've got our blank canvas.
81
00:03:04,000 --> 00:03:05,000
I'm gonna talk about
82
00:03:05,000 --> 00:03:07,000
all these different visual types
83
00:03:07,000 --> 00:03:08,000
later in the course, but for now,
84
00:03:08,000 --> 00:03:09,000
we're just gonna use one type
85
00:03:09,000 --> 00:03:12,000
of visual called the matrix visual.
86
00:03:12,000 --> 00:03:14,000
You'll find it right here in the insert menu.
87
00:03:14,000 --> 00:03:16,000
If you're using an older version of Power BI,
88
00:03:16,000 --> 00:03:18,000
you may see it in a visualization pane
89
00:03:18,000 --> 00:03:20,000
here on the right side of the canvas,
90
00:03:20,000 --> 00:03:21,000
but it kind of looks like
91
00:03:21,000 --> 00:03:23,000
a table with some blue shading on it.
92
00:03:23,000 --> 00:03:24,000
So give it a click.
93
00:03:24,000 --> 00:03:27,000
This will insert the matrix visual here.
94
00:03:27,000 --> 00:03:30,000
And this works just like a pivot table in Excel.
95
00:03:30,000 --> 00:03:32,000
We can pull in fields into rows,
96
00:03:32,000 --> 00:03:34,000
into columns, and into values
97
00:03:34,000 --> 00:03:37,000
to give us a nice little flexible table visual
98
00:03:37,000 --> 00:03:40,000
that we can use to explore these numbers.
99
00:03:40,000 --> 00:03:42,000
So let's go ahead and add data to our values.
100
00:03:42,000 --> 00:03:43,000
We can either pull in
101
00:03:43,000 --> 00:03:46,000
from the rightmost pane called the data pane
102
00:03:46,000 --> 00:03:48,000
or we can do it right here in line.
103
00:03:48,000 --> 00:03:52,000
Let's grab order quantity from our sales data table.
104
00:03:52,000 --> 00:03:55,000
That's gonna automatically aggregate to a sum
105
00:03:55,000 --> 00:03:59,000
and we see that same familiar total 84,174.
106
00:03:59,000 --> 00:04:02,000
And remember, because we have no relationships yet,
107
00:04:02,000 --> 00:04:05,000
that means that I can break down that total
108
00:04:05,000 --> 00:04:07,000
using other fields inside
109
00:04:07,000 --> 00:04:09,000
of the sales data table, right?
110
00:04:09,000 --> 00:04:12,000
Like the product key for instance, that looks fine,
111
00:04:12,000 --> 00:04:14,000
but what I can't do is break those values
112
00:04:14,000 --> 00:04:17,000
down by any other table.
113
00:04:17,000 --> 00:04:18,000
So to use the same example,
114
00:04:18,000 --> 00:04:21,000
let's drill into our product lookup table,
115
00:04:21,000 --> 00:04:22,000
bring in product name,
116
00:04:22,000 --> 00:04:24,000
and this is where we see
117
00:04:24,000 --> 00:04:26,000
those same repeating totals
118
00:04:26,000 --> 00:04:28,000
row after row, after row.
119
00:04:28,000 --> 00:04:29,000
So let's pause there.
120
00:04:29,000 --> 00:04:31,000
That's kind of our 10,000 foot view
121
00:04:31,000 --> 00:04:34,000
of what a data model actually is.
122
00:04:34,000 --> 00:04:36,000
Next up, we're gonna talk about different types
123
00:04:36,000 --> 00:04:38,000
of tables within a model
124
00:04:38,000 --> 00:04:40,000
and how to create relationships
125
00:04:40,000 --> 00:04:43,000
between them using primary and foreign keys.
9400
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.