Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,790 --> 00:00:08,230
‫So what is a database so far we've talked a lot about ask you all, we talked about queries, we talked
2
00:00:08,230 --> 00:00:14,770
‫about why we use Google, we even talked about why it's called a declarative language, but we didn't
3
00:00:14,770 --> 00:00:19,050
‫really talk much about what a database actually is.
4
00:00:19,090 --> 00:00:26,050
‫We gave it a short definition, which was it's a structured set of data that is structured in a certain
5
00:00:26,050 --> 00:00:32,600
‫way to scale with organizations that have massive quantities of data.
6
00:00:33,460 --> 00:00:40,020
‫Now, databases are our modern day interpretation of how to organize data.
7
00:00:40,570 --> 00:00:47,790
‫But before databases were around, we had something called file processing systems.
8
00:00:48,190 --> 00:00:50,940
‫This is the precursor to the modern database.
9
00:00:51,160 --> 00:00:55,960
‫And the reason I'm zooming in on this is because to know where we are, I find it important to know
10
00:00:55,960 --> 00:00:56,800
‫where we came from.
11
00:00:57,320 --> 00:01:04,420
‫So file processing systems were a massive leap ahead, especially in terms of, you know, we used to
12
00:01:04,420 --> 00:01:07,540
‫have cabinet systems and filing systems on paper.
13
00:01:07,930 --> 00:01:13,570
‫These file processing systems were software based and they were a huge leap ahead of whatever we had
14
00:01:13,570 --> 00:01:14,380
‫back in the day.
15
00:01:15,250 --> 00:01:17,230
‫But they had their flaws.
16
00:01:17,680 --> 00:01:23,050
‫First of all, you were saving data on individual files, right?
17
00:01:24,000 --> 00:01:26,650
‫And there was no correlation between the files.
18
00:01:26,700 --> 00:01:27,610
‫What do I mean by that?
19
00:01:27,870 --> 00:01:30,020
‫There were no relationships between the data.
20
00:01:30,300 --> 00:01:34,360
‫You basically were creating systems to save particular types of data.
21
00:01:34,770 --> 00:01:38,610
‫Let's say you had a real estate firm in one part of the system.
22
00:01:38,610 --> 00:01:41,580
‫You would keep all of the listings in the second part of the system.
23
00:01:41,580 --> 00:01:43,110
‫You would keep all of the customers.
24
00:01:43,350 --> 00:01:46,260
‫And in the third part of the system, you would keep all of the sales.
25
00:01:46,800 --> 00:01:50,160
‫But there wasn't really any relationship between the data.
26
00:01:50,460 --> 00:01:51,450
‫Now, what do I mean by that?
27
00:01:51,450 --> 00:01:52,680
‫Of course, what purchases?
28
00:01:52,680 --> 00:01:58,950
‫You would know which customers did the purchase, but the limitation lied in the fact that.
29
00:01:59,910 --> 00:02:06,450
‫If a customer changed over here, the change wouldn't go all the way through to the sale, you basically
30
00:02:06,450 --> 00:02:09,450
‫were copying customer information into the sale.
31
00:02:10,320 --> 00:02:16,050
‫So whenever the customer, let's say, changed their employment or changed their address or changed
32
00:02:16,050 --> 00:02:18,170
‫your phone number, the update would happen here.
33
00:02:18,630 --> 00:02:22,820
‫But the previous sale would not get the reference to the new information.
34
00:02:23,430 --> 00:02:30,360
‫So file processing systems had a lot of benefits, but a lot of negatives because you were building
35
00:02:30,360 --> 00:02:36,710
‫custom software in specific coding languages like Java that would manage your data.
36
00:02:36,720 --> 00:02:41,220
‫You were doing it all, all the programmers, the coders, the developers, whatever you want to call
37
00:02:41,220 --> 00:02:45,960
‫them, they were building their own systems and that's what we call file processing systems.
38
00:02:48,880 --> 00:02:56,350
‫In terms of file processing systems for a user, you would be talking to the computer and some specific
39
00:02:56,350 --> 00:03:00,700
‫coding language, let's say Python or Java would be talking directly to the machinery.
40
00:03:01,270 --> 00:03:07,600
‫And this, like I said, was a large disadvantage because you had all of this management to do in Python
41
00:03:07,600 --> 00:03:10,360
‫or Java or whatever other language you had.
42
00:03:10,660 --> 00:03:17,140
‫The software was so customized to the system, you even needed to know how your data was being stored,
43
00:03:17,260 --> 00:03:22,570
‫what hard drives you were using, what software was running on the server, whether it be Mac, Windows,
44
00:03:22,570 --> 00:03:26,040
‫Linux, you needed to know all of that in order to write the software.
45
00:03:26,440 --> 00:03:28,870
‫So there was a lot of overhead here.
46
00:03:29,260 --> 00:03:34,170
‫On top of that, the user didn't really care about what system was managing the data.
47
00:03:34,570 --> 00:03:42,070
‫Yet every system was specific to what type of data you could build a customer entry software and then
48
00:03:42,070 --> 00:03:43,900
‫you could build a sales software.
49
00:03:44,200 --> 00:03:48,230
‫And all of these systems did not have common ground.
50
00:03:48,310 --> 00:03:49,390
‫What do I mean by that?
51
00:03:49,690 --> 00:03:55,870
‫They were all built custom, so the user needed to know each and every particular system in order to
52
00:03:55,870 --> 00:03:59,320
‫work efficiently, not to mention all of the flaws.
53
00:04:00,010 --> 00:04:04,270
‫So back into the limitations, let's say you had a school system.
54
00:04:04,660 --> 00:04:11,560
‫What you can see here is clearly that the student system, the exam system and the library system were
55
00:04:11,560 --> 00:04:13,780
‫completely independent from each other.
56
00:04:14,410 --> 00:04:16,900
‫So let's say an exam recommended certain books.
57
00:04:17,410 --> 00:04:24,160
‫The exams database didn't necessarily know how the library was storing their data and the library data
58
00:04:24,190 --> 00:04:26,980
‫didn't necessarily know how students were storing their data.
59
00:04:26,980 --> 00:04:28,440
‫All of these systems were independent.
60
00:04:28,440 --> 00:04:31,900
‫That means all of the way that the files were stored were different.
61
00:04:32,140 --> 00:04:34,930
‫There was no common model.
62
00:04:35,560 --> 00:04:37,230
‫Now, what is a model?
63
00:04:37,660 --> 00:04:43,090
‫Well, a model is just a way of saying, hey, my data is going to follow this structure.
64
00:04:43,510 --> 00:04:45,930
‫We're going to have columns and rows and columns.
65
00:04:45,940 --> 00:04:47,080
‫I'm going to put this in Columbia.
66
00:04:47,080 --> 00:04:48,040
‫I'm going to put this.
67
00:04:48,310 --> 00:04:51,700
‫You would put together a system of how your data would be structured.
68
00:04:52,180 --> 00:04:56,710
‫And if all of the data followed a similar model, they could talk to each other.
69
00:04:56,950 --> 00:04:58,330
‫But back in the day they didn't.
70
00:04:58,630 --> 00:05:04,960
‫That means if a student were to loan a book to study for a particular exam, the library would copy
71
00:05:04,960 --> 00:05:11,140
‫the student data and then store it with the rental and the exam would copy the student data when the
72
00:05:11,140 --> 00:05:12,250
‫student took the exam.
73
00:05:12,370 --> 00:05:17,590
‫So you would have a bunch of redundant data, not to mention that all of the data was isolated and none
74
00:05:17,590 --> 00:05:19,220
‫of the systems could talk to each other.
75
00:05:19,510 --> 00:05:21,600
‫These are huge limitations.
76
00:05:22,420 --> 00:05:31,530
‫Now, in comes a data base oriented approach where we migrated to a singular system.
77
00:05:32,590 --> 00:05:33,850
‫This is fascinating.
78
00:05:34,180 --> 00:05:35,980
‫Let's dive a bit deeper into this.
8260
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.