Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,000 --> 00:00:01,000
Instructor: Let's take a few minutes
2
00:00:01,000 --> 00:00:03,000
and talk about an extremely important topic,
3
00:00:03,000 --> 00:00:06,000
data visualization best practices.
4
00:00:06,000 --> 00:00:09,000
Now, data viz is equal parts art and science.
5
00:00:09,000 --> 00:00:12,000
And before you start just dumping data
6
00:00:12,000 --> 00:00:15,000
onto a canvas or choosing whatever chart looks pretty
7
00:00:15,000 --> 00:00:17,000
or happens to fill the page,
8
00:00:17,000 --> 00:00:19,000
you need to step back and ask yourself
9
00:00:19,000 --> 00:00:21,000
these three key questions.
10
00:00:21,000 --> 00:00:24,000
Number one, what type of data am I working with?
11
00:00:24,000 --> 00:00:25,000
Is it geospatial?
12
00:00:25,000 --> 00:00:27,000
Is it time series?
13
00:00:27,000 --> 00:00:28,000
Are there hierarchies?
14
00:00:28,000 --> 00:00:29,000
Is it financial?
15
00:00:29,000 --> 00:00:30,000
And so on.
16
00:00:30,000 --> 00:00:34,000
Number two, what exactly am I trying to communicate?
17
00:00:34,000 --> 00:00:35,000
Right?
18
00:00:35,000 --> 00:00:37,000
Is it a comparison across categories?
19
00:00:37,000 --> 00:00:38,000
Am I trying to show a composition,
20
00:00:38,000 --> 00:00:41,000
a relationship, a distribution?
21
00:00:41,000 --> 00:00:44,000
And third, who is my audience?
22
00:00:44,000 --> 00:00:44,000
Right?
23
00:00:44,000 --> 00:00:47,000
Who's the end user and what exactly do they need?
24
00:00:47,000 --> 00:00:51,000
Am I presenting to or designing for a fellow analyst,
25
00:00:51,000 --> 00:00:53,000
for a manager, for an executive,
26
00:00:53,000 --> 00:00:55,000
or even for the general public?
27
00:00:55,000 --> 00:00:58,000
That end user will really dictate how my visuals
28
00:00:58,000 --> 00:01:00,000
are designed and developed.
29
00:01:00,000 --> 00:01:02,000
So let's go ahead and unpack each
30
00:01:02,000 --> 00:01:04,000
of these questions in a bit more depth
31
00:01:04,000 --> 00:01:06,000
starting with question one,
32
00:01:06,000 --> 00:01:08,000
what type of data are you working with?
33
00:01:08,000 --> 00:01:11,000
Now, data comes in all shapes and sizes.
34
00:01:11,000 --> 00:01:13,000
It can fall in all sorts of categories.
35
00:01:13,000 --> 00:01:15,000
And some of the things you're looking for
36
00:01:15,000 --> 00:01:18,000
are things like, do I have time series data?
37
00:01:18,000 --> 00:01:19,000
Right?
38
00:01:19,000 --> 00:01:21,000
Is there a date field that lets me show trends
39
00:01:21,000 --> 00:01:23,000
or patterns over time?
40
00:01:23,000 --> 00:01:26,000
Do I have geospatial fields that let me draw comparisons
41
00:01:26,000 --> 00:01:28,000
between geographic regions
42
00:01:28,000 --> 00:01:30,000
or locations using things like maps?
43
00:01:30,000 --> 00:01:33,000
Are there interesting categorical fields
44
00:01:33,000 --> 00:01:34,000
that I can use for filtering
45
00:01:34,000 --> 00:01:36,000
or segmenting the data in my reports?
46
00:01:36,000 --> 00:01:39,000
Again, do I have hierarchies that I can drill up
47
00:01:39,000 --> 00:01:41,000
or down into as part of my analysis?
48
00:01:41,000 --> 00:01:44,000
And then there's some less common categories as well.
49
00:01:44,000 --> 00:01:45,000
You might have financial specific data.
50
00:01:45,000 --> 00:01:47,000
You might have text data
51
00:01:47,000 --> 00:01:49,000
that's a little bit less visual or numeric.
52
00:01:49,000 --> 00:01:52,000
You may have funnel stages represented in your data set
53
00:01:52,000 --> 00:01:54,000
or even things like survey responses.
54
00:01:54,000 --> 00:01:57,000
And again, there are many, many more examples
55
00:01:57,000 --> 00:01:59,000
of the types of data that you might encounter.
56
00:01:59,000 --> 00:02:01,000
But the bottom line here is that the type
57
00:02:01,000 --> 00:02:04,000
of data that you're working with will often determine
58
00:02:04,000 --> 00:02:07,000
which type of visual will best represent it.
59
00:02:07,000 --> 00:02:10,000
For example, using maps to represent geospatial data,
60
00:02:10,000 --> 00:02:13,000
using line charts for time series data,
61
00:02:13,000 --> 00:02:16,000
bar or column charts for categorical comparisons,
62
00:02:16,000 --> 00:02:19,000
tree maps for hierarchical data, and so on and so forth.
63
00:02:20,000 --> 00:02:23,000
Question two is all about what you're trying to communicate.
64
00:02:23,000 --> 00:02:24,000
So let's break this down
65
00:02:24,000 --> 00:02:27,000
into four different categories here.
66
00:02:27,000 --> 00:02:29,000
Could be a comparison, a composition,
67
00:02:29,000 --> 00:02:32,000
a distribution, or a relationship.
68
00:02:32,000 --> 00:02:35,000
Now a comparison is when you're trying to compare values
69
00:02:35,000 --> 00:02:39,000
either over time or across different categories.
70
00:02:39,000 --> 00:02:40,000
And the common visuals that you'll use
71
00:02:40,000 --> 00:02:43,000
to communicate comparisons are things
72
00:02:43,000 --> 00:02:47,000
like basic column and bar charts, clustered columns,
73
00:02:47,000 --> 00:02:50,000
data tables or heat maps, if you're using time series data,
74
00:02:50,000 --> 00:02:52,000
line charts or area charts.
75
00:02:52,000 --> 00:02:55,000
And then sometimes more specialized visuals
76
00:02:55,000 --> 00:02:58,000
like radar charts can be helpful here as well.
77
00:02:58,000 --> 00:03:00,000
Composition is all about breaking down
78
00:03:00,000 --> 00:03:03,000
the component parts of a whole.
79
00:03:03,000 --> 00:03:04,000
This is where you'll typically use visuals
80
00:03:04,000 --> 00:03:09,000
like stacked bar or column charts, pies or donut charts,
81
00:03:09,000 --> 00:03:11,000
stacked areas to show both composition
82
00:03:11,000 --> 00:03:13,000
and trending over time,
83
00:03:13,000 --> 00:03:16,000
or possibly some more specialized visuals like waterfalls,
84
00:03:16,000 --> 00:03:20,000
funnels, tree maps, or sunbursts.
85
00:03:20,000 --> 00:03:22,000
Distribution is about showing the frequency
86
00:03:22,000 --> 00:03:24,000
of values within a series.
87
00:03:24,000 --> 00:03:27,000
And histograms are really far and away the most common
88
00:03:27,000 --> 00:03:30,000
and popular type of visual to show distributions.
89
00:03:30,000 --> 00:03:33,000
If you've ever seen a bell curve or normal distribution,
90
00:03:33,000 --> 00:03:35,000
that's a histogram at work.
91
00:03:35,000 --> 00:03:37,000
You might use things like density plots
92
00:03:37,000 --> 00:03:40,000
or box and whisker charts here as well.
93
00:03:40,000 --> 00:03:42,000
And last but not least, we have relationships
94
00:03:42,000 --> 00:03:43,000
which are about showing correlation
95
00:03:43,000 --> 00:03:45,000
between multiple variables.
96
00:03:45,000 --> 00:03:47,000
Scatter plots and bubble charts
97
00:03:47,000 --> 00:03:51,000
are far and away the most common visuals in this category,
98
00:03:51,000 --> 00:03:53,000
could also potentially use data tables,
99
00:03:53,000 --> 00:03:57,000
heat maps, or a correlation matrix as well.
100
00:03:57,000 --> 00:04:00,000
So this can be a handy guide to help point you
101
00:04:00,000 --> 00:04:02,000
towards the right visuals to choose
102
00:04:02,000 --> 00:04:04,000
based on what you're trying to communicate.
103
00:04:04,000 --> 00:04:06,000
And I know this is a quick review.
104
00:04:06,000 --> 00:04:08,000
I know this can feel a little bit overwhelming
105
00:04:08,000 --> 00:04:11,000
talking about all of these different chart types.
106
00:04:11,000 --> 00:04:13,000
How could you possibly know which one
107
00:04:13,000 --> 00:04:15,000
to choose for any given situation?
108
00:04:15,000 --> 00:04:17,000
So the big takeaway here,
109
00:04:17,000 --> 00:04:20,000
the bottom line is to keep it simple.
110
00:04:20,000 --> 00:04:21,000
There are hundreds of charts,
111
00:04:21,000 --> 00:04:24,000
if not thousands of charts to choose from.
112
00:04:24,000 --> 00:04:25,000
But at the end of the day,
113
00:04:25,000 --> 00:04:28,000
your basic tried and true options like bar charts,
114
00:04:28,000 --> 00:04:32,000
column charts, line charts, histograms, scatter plots,
115
00:04:32,000 --> 00:04:36,000
those are gonna be a great fit in 90% of use cases.
116
00:04:36,000 --> 00:04:38,000
And the reason they're tried and true
117
00:04:38,000 --> 00:04:40,000
is because they often do the best job
118
00:04:40,000 --> 00:04:43,000
telling the simplest and clearest story
119
00:04:43,000 --> 00:04:46,000
which ultimately is the goal of data visualization.
120
00:04:46,000 --> 00:04:48,000
And that brings us to question three.
121
00:04:48,000 --> 00:04:51,000
Who is the end user and what do they need?
122
00:04:51,000 --> 00:04:53,000
So let's simplify things a bit and imagine
123
00:04:53,000 --> 00:04:56,000
that there are three potential audiences.
124
00:04:56,000 --> 00:04:59,000
We've got the analyst, the manager, and the executive.
125
00:04:59,000 --> 00:05:01,000
Obviously, there are different variations
126
00:05:01,000 --> 00:05:05,000
of end users and audiences that exist out there
127
00:05:05,000 --> 00:05:06,000
but this will help us start to understand
128
00:05:06,000 --> 00:05:09,000
how to tailor an analysis or visualization
129
00:05:09,000 --> 00:05:12,000
based on who's consuming it.
130
00:05:12,000 --> 00:05:15,000
So when you're designing for someone at the analyst level,
131
00:05:15,000 --> 00:05:18,000
typically these are people who like to see details,
132
00:05:18,000 --> 00:05:20,000
they want to understand what's happening
133
00:05:20,000 --> 00:05:23,000
at a granular level, they're analytically minded,
134
00:05:23,000 --> 00:05:25,000
so they might want to see things
135
00:05:25,000 --> 00:05:28,000
like tables or combo charts, a bit more complex,
136
00:05:28,000 --> 00:05:29,000
maybe a little bit more data heavy.
137
00:05:29,000 --> 00:05:31,000
And again, they'll want access
138
00:05:31,000 --> 00:05:35,000
to some granular detail to support root cause analysis.
139
00:05:35,000 --> 00:05:37,000
On the other hand, a manager level audience
140
00:05:37,000 --> 00:05:41,000
might want more summarized data with a focus on clear
141
00:05:41,000 --> 00:05:45,000
and actionable insights to help them operate the business.
142
00:05:45,000 --> 00:05:47,000
So in that case, it typically makes sense
143
00:05:47,000 --> 00:05:49,000
to skew towards more common
144
00:05:49,000 --> 00:05:52,000
or basic charts and graphs, some detail,
145
00:05:52,000 --> 00:05:53,000
but really only when it supports
146
00:05:53,000 --> 00:05:56,000
a specific insight or recommendation.
147
00:05:56,000 --> 00:05:58,000
And then finally, at the top of the food chain,
148
00:05:58,000 --> 00:06:00,000
you've got the executive audience.
149
00:06:00,000 --> 00:06:02,000
These are people who are super busy,
150
00:06:02,000 --> 00:06:04,000
typically they just want high-level,
151
00:06:04,000 --> 00:06:07,000
crystal clear KPIs that they can use
152
00:06:07,000 --> 00:06:09,000
to track business health
153
00:06:09,000 --> 00:06:12,000
and top-line performance at a glance.
154
00:06:12,000 --> 00:06:14,000
So this is where visuals like KPI cards
155
00:06:14,000 --> 00:06:17,000
or very simple charts often make the most sense,
156
00:06:17,000 --> 00:06:18,000
and you wanna keep the detail
157
00:06:18,000 --> 00:06:22,000
to a minimum unless it adds critical context to those KPIs.
158
00:06:22,000 --> 00:06:25,000
So again, takeaway here is that how you visualize
159
00:06:25,000 --> 00:06:28,000
and present your data is largely a function
160
00:06:28,000 --> 00:06:31,000
of who will be consuming it, right?
161
00:06:31,000 --> 00:06:34,000
So that fellow analyst might want the granular details,
162
00:06:34,000 --> 00:06:37,000
managers and executives might prefer top-line KPIs.
163
00:06:37,000 --> 00:06:41,000
And again, that focus on clear data-driven insights.
164
00:06:41,000 --> 00:06:42,000
So there you have it.
165
00:06:42,000 --> 00:06:43,000
That's our crash course
166
00:06:43,000 --> 00:06:46,000
in the three key questions for data visualization.
167
00:06:46,000 --> 00:06:47,000
Next up, we're gonna talk
168
00:06:47,000 --> 00:06:50,000
about our step-by-step dashboard design framework
169
00:06:50,000 --> 00:06:52,000
and then we're actually gonna put pencil to paper
170
00:06:52,000 --> 00:06:54,000
and start sketching out the layout
171
00:06:54,000 --> 00:06:57,000
for our Adventure Works report.
172
00:06:57,000 --> 00:06:58,000
Stay tuned.
13516
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.