All language subtitles for The.Secret.Rules.Of.Modern.Living.Algorithms

af Afrikaans
ak Akan
sq Albanian
am Amharic
ar Arabic
hy Armenian
az Azerbaijani
eu Basque
be Belarusian
bem Bemba
bn Bengali
bh Bihari
bs Bosnian
br Breton
bg Bulgarian
km Cambodian
ca Catalan
ceb Cebuano
chr Cherokee
ny Chichewa
zh-CN Chinese (Simplified)
zh-TW Chinese (Traditional)
co Corsican
hr Croatian
cs Czech
da Danish
en English Download
eo Esperanto
et Estonian
ee Ewe
fo Faroese
tl Filipino
fi Finnish
fr French Download
fy Frisian
gaa Ga
gl Galician
ka Georgian
de German
el Greek
gn Guarani
gu Gujarati
ht Haitian Creole
ha Hausa
haw Hawaiian
iw Hebrew
hi Hindi
hmn Hmong
hu Hungarian
is Icelandic
ig Igbo
id Indonesian
ia Interlingua
ga Irish
it Italian
ja Japanese
jw Javanese
kn Kannada
kk Kazakh
rw Kinyarwanda
rn Kirundi
kg Kongo
ko Korean
kri Krio (Sierra Leone)
ku Kurdish
ckb Kurdish (Soranî)
ky Kyrgyz
lo Laothian
la Latin
lv Latvian
ln Lingala
lt Lithuanian
loz Lozi
lg Luganda
ach Luo
lb Luxembourgish
mk Macedonian
mg Malagasy
ms Malay
ml Malayalam
mt Maltese
mi Maori
mr Marathi
mfe Mauritian Creole
mo Moldavian
mn Mongolian
my Myanmar (Burmese)
sr-ME Montenegrin
ne Nepali
pcm Nigerian Pidgin
nso Northern Sotho
no Norwegian
nn Norwegian (Nynorsk)
oc Occitan
or Oriya
om Oromo
ps Pashto
fa Persian
pl Polish
pt-BR Portuguese (Brazil)
pt Portuguese (Portugal)
pa Punjabi
qu Quechua
ro Romanian Download
rm Romansh
nyn Runyakitara
ru Russian
sm Samoan
gd Scots Gaelic
sr Serbian
sh Serbo-Croatian
st Sesotho
tn Setswana
crs Seychellois Creole
sn Shona
sd Sindhi
si Sinhalese
sk Slovak
sl Slovenian
so Somali
es Spanish Download
es-419 Spanish (Latin American)
su Sundanese
sw Swahili
sv Swedish
tg Tajik
ta Tamil
tt Tatar
te Telugu
th Thai
ti Tigrinya
to Tonga
lua Tshiluba
tum Tumbuka
tr Turkish
tk Turkmen
tw Twi
ug Uighur
uk Ukrainian
ur Urdu
uz Uzbek
vi Vietnamese
cy Welsh
wo Wolof
xh Xhosa
yi Yiddish
yo Yoruba
zu Zulu
Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated: 1 00:00:02,840 --> 00:00:03,840 Encoded by 2 00:00:04,040 --> 00:00:07,840 Without us noticing, modern life has been taken over. 3 00:00:10,080 --> 00:00:14,480 As we search for love, shop online, 4 00:00:14,480 --> 00:00:18,000 travel the world, 5 00:00:18,000 --> 00:00:20,320 even as we save lives, 6 00:00:20,320 --> 00:00:25,160 there are step-by-step instructions working quietly behind the scenes. 7 00:00:27,080 --> 00:00:30,440 More and more, they are ruling our lives. 8 00:00:30,440 --> 00:00:33,040 They're called algorithms. 9 00:00:34,400 --> 00:00:36,640 Algorithms are everywhere. 10 00:00:36,640 --> 00:00:39,680 These bite-sized chunks of maths have become central 11 00:00:39,680 --> 00:00:41,520 to our daily lives. 12 00:00:41,520 --> 00:00:44,800 But because they are invisible, we tend to take them for granted, 13 00:00:44,800 --> 00:00:46,320 even misunderstand them. 14 00:00:50,960 --> 00:00:52,080 LAUGHTER 15 00:00:53,200 --> 00:00:57,040 They are the secret to our digital world, and so much more. 16 00:00:58,720 --> 00:01:02,120 'In this programme, I'm going to show you some of my favourite 17 00:01:02,120 --> 00:01:05,880 'algorithms to reveal where they came from...' 18 00:01:05,880 --> 00:01:08,480 Algorithms are ancient. 19 00:01:08,480 --> 00:01:09,800 '..how they work...' 20 00:01:09,800 --> 00:01:11,920 The challenge is to find the shortest route... 21 00:01:11,920 --> 00:01:14,640 These are the rough instructions that you would use. 22 00:01:14,640 --> 00:01:17,800 ..for returning to your starting point. 23 00:01:17,800 --> 00:01:20,240 '..what they might be able to do in the future.' 24 00:01:20,240 --> 00:01:23,440 - The algorithm's kind of writing itself? Or...? - Absolutely. 25 00:01:23,440 --> 00:01:26,080 '..and how we can't live without them.' 26 00:01:26,080 --> 00:01:29,600 Even when we're baking a cake, we're following an algorithm. 27 00:01:29,600 --> 00:01:32,520 As a mathematician, I love algorithms. 28 00:01:32,520 --> 00:01:35,120 Not only are they impressive problem solvers, 29 00:01:35,120 --> 00:01:38,800 but also strangely beautiful, tapping into the mathematical 30 00:01:38,800 --> 00:01:42,200 order that underpins how the universe works. 31 00:01:42,200 --> 00:01:45,760 Welcome to the weird and wonderful world of algorithms. 32 00:01:54,600 --> 00:01:57,680 Most of us carry one of these around. 33 00:01:57,680 --> 00:02:00,160 Now, you might have noticed that when you take a photo 34 00:02:00,160 --> 00:02:05,800 with your phone, then it draws a box around any face, like this. 35 00:02:05,800 --> 00:02:09,480 This is the result of a special face-detection algorithm 36 00:02:09,480 --> 00:02:13,200 and it helps to keep the face in the photo in focus. 37 00:02:14,480 --> 00:02:18,280 'Like all algorithms, this one solves a problem. 38 00:02:18,280 --> 00:02:21,520 'In this case, finding a human face. 39 00:02:21,520 --> 00:02:24,760 'While it's not fooled by a face made of fruit, 40 00:02:24,760 --> 00:02:28,400 'it does detect a human face in a photo. 41 00:02:28,400 --> 00:02:31,200 'So, how does it do it? 42 00:02:31,200 --> 00:02:34,120 'At their root, algorithms are little more than 43 00:02:34,120 --> 00:02:37,280 'a series of step-by-step instructions. 44 00:02:37,280 --> 00:02:40,520 'This one works by methodically scanning the image 45 00:02:40,520 --> 00:02:43,520 'looking for four particular abstract patterns 46 00:02:43,520 --> 00:02:45,960 'associated with a face. 47 00:02:45,960 --> 00:02:48,280 'When these are detected one after another, 48 00:02:48,280 --> 00:02:52,520 'then the algorithm indicates it's found a human face.' 49 00:02:52,520 --> 00:02:56,880 The process taps into the underlying pattern behind all faces, 50 00:02:56,880 --> 00:02:59,520 no matter what shape or size. 51 00:02:59,520 --> 00:03:02,840 The end result is just one example of how algorithms have 52 00:03:02,840 --> 00:03:05,560 made our lives easier. 53 00:03:05,560 --> 00:03:09,000 - I'll do it! - I'll do it! - I was here first! - OK. 54 00:03:09,000 --> 00:03:10,600 So, off you go. 55 00:03:10,600 --> 00:03:14,240 'We tend to associate algorithms with computers, smartphones 56 00:03:14,240 --> 00:03:15,480 'and the internet. 57 00:03:15,480 --> 00:03:19,320 'But they are not exclusive to the world of technology. 58 00:03:19,320 --> 00:03:24,160 'My day job is Professor of Mathematics at Oxford University. 59 00:03:24,160 --> 00:03:26,920 'And one of the things I enjoy most is keeping 60 00:03:26,920 --> 00:03:29,120 'the students on their toes.' 61 00:03:29,120 --> 00:03:31,040 OK, I'll take one. 62 00:03:31,040 --> 00:03:33,920 Here, we're playing a mathematical game with a jar 63 00:03:33,920 --> 00:03:37,120 full of chocolates and one red hot chilli. 64 00:03:38,480 --> 00:03:42,720 'The aim is not to be left with the chilli at the end. 65 00:03:42,720 --> 00:03:44,360 'But what these students don't know, 66 00:03:44,360 --> 00:03:49,520 'is that I'm playing it with the help of an algorithm.' 67 00:03:49,520 --> 00:03:51,800 - OK. Ready? BOTH: - Yeah. 68 00:03:51,800 --> 00:03:54,560 Right, I'm going to go first, so remember, you can take one, 69 00:03:54,560 --> 00:03:57,200 two or three chocolates at a time. 70 00:03:57,200 --> 00:04:00,880 I'm not a greedy guy, so I'll just take one. Now, your turn. 71 00:04:00,880 --> 00:04:05,440 'Each player takes on their turn, between one and three chocolates.' 72 00:04:05,440 --> 00:04:09,120 You've taken two, OK. So, I'm going to take... I'll take two. 73 00:04:09,120 --> 00:04:12,280 'Whatever my opponent does, my algorithm that tells me 74 00:04:12,280 --> 00:04:14,200 'how to respond.' 75 00:04:14,200 --> 00:04:16,520 OK, I'll take two. 76 00:04:16,520 --> 00:04:19,080 And your turn again. SHE LAUGHS 77 00:04:19,080 --> 00:04:20,680 Oh, yeah. 78 00:04:20,680 --> 00:04:24,040 - So I'll take...three. - Three. And I'll take one. 79 00:04:24,040 --> 00:04:27,160 - And just a chilli left... - So, wait. Is that me? - Yeah, so you have 80 00:04:27,160 --> 00:04:29,640 - to eat the chilli. - Oh, no! - So, there you go. 81 00:04:29,640 --> 00:04:32,960 'Let me reveal how the algorithm I was using helped me win.' 82 00:04:32,960 --> 00:04:34,520 It's the only way to learn. 83 00:04:35,680 --> 00:04:40,680 So, the key is to think about grouping things in fours. 84 00:04:41,960 --> 00:04:46,720 13 chocolates divides into three groups of four, with one left over. 85 00:04:46,720 --> 00:04:50,160 So, by taking one chocolate in the first round and then four 86 00:04:50,160 --> 00:04:54,160 minus whatever the other player takes in the subsequent rounds, 87 00:04:54,160 --> 00:04:57,040 this algorithm ensures that the other player 88 00:04:57,040 --> 00:04:58,960 is always left with the chilli. 89 00:04:58,960 --> 00:05:01,080 The essence of a really good 90 00:05:01,080 --> 00:05:04,240 algorithm, its magic, if you like, is mathematics. 91 00:05:04,240 --> 00:05:07,440 The best algorithms are those that tap into the underlying 92 00:05:07,440 --> 00:05:10,400 mathematical structure hiding beneath a problem. 93 00:05:11,600 --> 00:05:13,080 OK, pop the chilli back. 94 00:05:14,480 --> 00:05:17,760 I'll be introducing you to some of the algorithms that have 95 00:05:17,760 --> 00:05:20,320 become the beating heart of modern life. 96 00:05:21,880 --> 00:05:24,920 But first, I want to show you that, for all their modern 97 00:05:24,920 --> 00:05:28,360 applications, algorithms are extremely old. 98 00:05:29,680 --> 00:05:33,600 In fact, they predate computers by thousands of years. 99 00:05:35,480 --> 00:05:38,280 The oldest algorithm we know of was devised 100 00:05:38,280 --> 00:05:40,480 to solve a mathematical problem. 101 00:05:40,480 --> 00:05:44,800 It was first written down by the Ancient Greek mathematician Euclid. 102 00:05:44,800 --> 00:05:47,520 Euclid's Algorithm, as it's known, 103 00:05:47,520 --> 00:05:50,680 is a method for finding the greatest common devisor. 104 00:05:52,360 --> 00:05:55,680 The greatest common devisor is the largest number that will 105 00:05:55,680 --> 00:06:00,280 divide into a pair of other numbers without leaving a remainder. 106 00:06:00,280 --> 00:06:03,000 So, in this case, four divides into both eight 107 00:06:03,000 --> 00:06:06,080 and 12 without a remainder. 108 00:06:06,080 --> 00:06:08,520 It's simple to find for small numbers, 109 00:06:08,520 --> 00:06:10,600 but much more tricky for large ones. 110 00:06:12,120 --> 00:06:15,480 While Euclid was the greatest mathematician of his day, 111 00:06:15,480 --> 00:06:18,640 his algorithm could have made him a fortune as a tiler. 112 00:06:19,840 --> 00:06:22,280 Let me show you why. 113 00:06:22,280 --> 00:06:25,440 Imagine you've got a rectangular-shaped floor 114 00:06:25,440 --> 00:06:26,760 and you want to find 115 00:06:26,760 --> 00:06:30,360 the most efficient way to tile it with square tiles. 116 00:06:30,360 --> 00:06:34,080 In other words, what's the largest square tile that will exactly 117 00:06:34,080 --> 00:06:38,040 divide the dimensions of the floor with nothing left over? 118 00:06:38,040 --> 00:06:40,440 This is, in fact, a geometric version 119 00:06:40,440 --> 00:06:43,080 of the greatest common devisor problem. 120 00:06:43,080 --> 00:06:46,280 The dimensions of the floor are the two numbers 121 00:06:46,280 --> 00:06:48,640 and the size of the tiles, which we're going to try 122 00:06:48,640 --> 00:06:51,960 and work out, is their greatest common devisor. 123 00:06:54,040 --> 00:06:57,480 We're going to follow Euclid's Algorithm step by step to show 124 00:06:57,480 --> 00:07:01,480 how it is able to find the perfect sized square tile for this floor. 125 00:07:02,920 --> 00:07:06,800 According to Euclid's Algorithm, we need to start filling the rectangle 126 00:07:06,800 --> 00:07:10,960 with square tiles corresponding to the smallest of the two dimensions. 127 00:07:13,760 --> 00:07:15,920 This is the first stage of the job. 128 00:07:17,160 --> 00:07:19,040 Euclid's Algorithm then tells us 129 00:07:19,040 --> 00:07:22,400 to do exactly the same thing again with this rectangle. 130 00:07:24,080 --> 00:07:28,520 At each stage, the algorithm tells us to select square tiles 131 00:07:28,520 --> 00:07:31,600 corresponding to the shortest side of the rectangle. 132 00:07:33,600 --> 00:07:38,880 So this time, our square tiles perfectly fill the leftover space. 133 00:07:38,880 --> 00:07:42,960 Now, my square tile has dimensions 15x15. 134 00:07:42,960 --> 00:07:45,720 So Euclid's Algorithm tells us 135 00:07:45,720 --> 00:07:50,440 that the greatest common devisor of 150 and 345 is 15. 136 00:07:53,240 --> 00:07:56,160 I'm not suggesting you use Euclid's Algorithm every time 137 00:07:56,160 --> 00:07:58,360 you need to order some tiles, 138 00:07:58,360 --> 00:08:02,480 but the amazing thing is that this simple step-by-step method 139 00:08:02,480 --> 00:08:06,440 finds the perfect square tile whatever the dimensions of the floor. 140 00:08:07,800 --> 00:08:11,640 Euclid's Algorithm may appear to be just a mathematical technique, 141 00:08:11,640 --> 00:08:16,400 but it very elegantly fulfils all the criteria for an algorithm. 142 00:08:16,400 --> 00:08:20,120 It's a precisely stated set of instructions, the procedure 143 00:08:20,120 --> 00:08:24,800 always finishes, and it can be proven that it works in all cases. 144 00:08:28,320 --> 00:08:31,800 The power of algorithms is that you don't have to reinvent 145 00:08:31,800 --> 00:08:36,480 the wheel each time. They're general solutions to problems. 146 00:08:36,480 --> 00:08:40,000 This holds as true for ancient algorithms as for modern ones. 147 00:08:45,320 --> 00:08:49,480 In 1998, in this garage in Menlo Park in California, 148 00:08:49,480 --> 00:08:52,440 an important piece of algorithmic history was made. 149 00:08:54,520 --> 00:08:58,560 Inside were two PHD students from Stamford University. 150 00:08:58,560 --> 00:09:00,760 Larry Page and Sergey Brin. 151 00:09:02,320 --> 00:09:05,640 Their aim was to come up with a search engine that could find 152 00:09:05,640 --> 00:09:08,480 things efficiently on the World Wide Web. 153 00:09:11,000 --> 00:09:13,880 Out of these humble beginnings, Google was born. 154 00:09:15,400 --> 00:09:18,760 But Google wouldn't be Google if it wasn't for the algorithm that 155 00:09:18,760 --> 00:09:21,600 Larry and Sergey created, called PageRank. 156 00:09:30,880 --> 00:09:34,040 PageRank was the algorithm at the heart of the first 157 00:09:34,040 --> 00:09:37,200 incarnation of the Google search engine. 158 00:09:37,200 --> 00:09:42,120 Now, technically, it's not a search algorithm, but a ranking algorithm. 159 00:09:42,120 --> 00:09:45,600 So when you type a query into a search engine, 160 00:09:45,600 --> 00:09:49,880 then there are literally millions of pages which will match that query. 161 00:09:49,880 --> 00:09:53,600 What PageRank does is to rank all of those pages so the one 162 00:09:53,600 --> 00:09:56,960 at the top is the one you're more likely to be interested in. 163 00:09:58,520 --> 00:10:01,440 Larry and Sergey came up with the idea to do PageRank 164 00:10:01,440 --> 00:10:05,880 and to use it as a ranking system to improve the quality of web search. 165 00:10:05,880 --> 00:10:07,800 I remember myself at the time, 166 00:10:07,800 --> 00:10:10,720 you used a web search engine like AltaVista. 167 00:10:10,720 --> 00:10:12,760 You would have to click the Next Page link 168 00:10:12,760 --> 00:10:14,880 a lot of times to find what you were looking for. 169 00:10:14,880 --> 00:10:17,280 PageRank was one of the reasons why Google was 170 00:10:17,280 --> 00:10:20,760 so much better than the existing search engines at the time. 171 00:10:21,800 --> 00:10:24,840 The inner workings of PageRank are hidden from view 172 00:10:24,840 --> 00:10:26,680 on the World Wide Web. 173 00:10:26,680 --> 00:10:30,360 So to reveal how it does its job, we're going to use the PageRank 174 00:10:30,360 --> 00:10:33,440 algorithm to rank the players of a football team. 175 00:10:34,600 --> 00:10:36,960 PageRank looks at two things. 176 00:10:36,960 --> 00:10:42,040 It looks at the incoming links to a web page, that is the other pages 177 00:10:42,040 --> 00:10:46,360 that link to the page, and it looks at how important those pages are. 178 00:10:51,960 --> 00:10:54,840 In our demonstration to show the cleverness of the PageRank 179 00:10:54,840 --> 00:10:59,280 algorithm, the players in the football team are the web pages 180 00:10:59,280 --> 00:11:02,880 and the passes between them are the web links. 181 00:11:02,880 --> 00:11:05,680 The input for the algorithm. 182 00:11:05,680 --> 00:11:09,240 Generally speaking, the PageRank algorithm will give a higher 183 00:11:09,240 --> 00:11:13,240 rank to a website if it's got a lot of links coming from other websites. 184 00:11:13,240 --> 00:11:16,000 So in the case of football, if a player gets more 185 00:11:16,000 --> 00:11:20,080 passes from the rest of the team, then they'll be ranked higher. 186 00:11:20,080 --> 00:11:21,680 It's not quite that simple. 187 00:11:21,680 --> 00:11:24,960 Because the PageRank algorithm actually gives more weight to 188 00:11:24,960 --> 00:11:28,880 a link from a website that itself has a high page rank. 189 00:11:28,880 --> 00:11:32,520 So actually, a pass from a popular player is worth more than 190 00:11:32,520 --> 00:11:35,960 a pass from a player who's hardly involved in the game at all. 191 00:11:37,120 --> 00:11:40,920 This is a visualisation of the algorithm at work. 192 00:11:40,920 --> 00:11:45,880 The stats are the players' current ranking. The output of the algorithm. 193 00:11:45,880 --> 00:11:50,280 And every time there's a pass, these rankings are updated. 194 00:11:50,280 --> 00:11:56,360 When Google uses this algorithm, it only changes once thing - the input. 195 00:11:56,360 --> 00:11:59,280 In place of passes, it uses web links. 196 00:12:01,280 --> 00:12:04,320 Note that the importance of a page depends on the importance 197 00:12:04,320 --> 00:12:06,480 of the pages that link to it. 198 00:12:06,480 --> 00:12:09,160 This means that you have to compute page rank for all 199 00:12:09,160 --> 00:12:11,240 the pages at the same time. 200 00:12:11,240 --> 00:12:14,200 And you actually have to repeat the computation because, each time, 201 00:12:14,200 --> 00:12:16,600 you'll update the importance of all the pages. 202 00:12:16,600 --> 00:12:19,040 And that in turn will influence 203 00:12:19,040 --> 00:12:22,120 the importance of the pages that those pages link to. 204 00:12:30,680 --> 00:12:33,840 At the end of the match, the job of the algorithm is done. 205 00:12:36,720 --> 00:12:39,880 If we wanted to search for the key player in the team, 206 00:12:39,880 --> 00:12:41,840 this is PageRank's answer. 207 00:12:43,800 --> 00:12:46,400 Player 11 has the highest PageRank score. 208 00:12:48,320 --> 00:12:50,640 I think the PageRank algorithm is probably 209 00:12:50,640 --> 00:12:52,560 my favourite algorithm of all time. 210 00:12:52,560 --> 00:12:54,960 And it's amazing that it can be applied not just to 211 00:12:54,960 --> 00:12:58,520 the World Wide Web, but analysing a football match, as well. 212 00:12:58,520 --> 00:13:01,320 But for me, it's the fact that there's a beautiful bit of 213 00:13:01,320 --> 00:13:03,880 mathematics at its heart that always seems to find 214 00:13:03,880 --> 00:13:05,960 the website I'm looking for. 215 00:13:08,120 --> 00:13:09,320 Within Google, I think 216 00:13:09,320 --> 00:13:14,320 PageRank is seen as a very important part of Google's early development. 217 00:13:15,520 --> 00:13:18,600 PageRank was the secret to why the search engine that Larry 218 00:13:18,600 --> 00:13:22,200 and Sergey built in the 1990s was so successful. 219 00:13:23,920 --> 00:13:28,640 Now, Google handles over 3.5 billion searches every day. 220 00:13:28,640 --> 00:13:31,960 It's the world's most poplar search engine. 221 00:13:31,960 --> 00:13:36,480 And the company is worth more than 450 billion. 222 00:13:37,560 --> 00:13:40,760 Not bad for two PhD students working out of a garage. 223 00:13:49,000 --> 00:13:52,600 Algorithms are simple step-by-step recipes. 224 00:13:52,600 --> 00:13:56,800 Inventing them requires incredible creativity and genius. 225 00:13:56,800 --> 00:14:01,000 But using them is just a matter of following instructions. 226 00:14:01,000 --> 00:14:04,600 And this is why algorithms are perfect for computers. 227 00:14:08,240 --> 00:14:10,200 Computers are just machines. 228 00:14:10,200 --> 00:14:14,000 They just do repetitive tasks at phenomenal speeds. 229 00:14:14,000 --> 00:14:15,560 Unbelievable speeds. 230 00:14:15,560 --> 00:14:20,080 So they're absolutely perfect for performing these repetitive tasks 231 00:14:20,080 --> 00:14:23,120 that are unambiguously defined 232 00:14:23,120 --> 00:14:27,320 and can be done in a finite amount of time. 233 00:14:29,040 --> 00:14:32,040 Computer code is basically making an algorithm specific. 234 00:14:32,040 --> 00:14:33,840 So the algorithm is the kind of idea. 235 00:14:33,840 --> 00:14:35,280 How would you solve the problem? 236 00:14:35,280 --> 00:14:37,680 These are the rough instructions that you would use. 237 00:14:37,680 --> 00:14:40,760 And then that can be translated into particular code. 238 00:14:43,920 --> 00:14:47,880 Lots of types of algorithms have been created with a computer in mind. 239 00:14:49,800 --> 00:14:53,360 And some of the most important are sorting algorithms. 240 00:14:54,880 --> 00:14:58,880 Now, the job of a sorting algorithm is to put things in order. 241 00:14:58,880 --> 00:15:00,560 And they have lots of uses. 242 00:15:00,560 --> 00:15:03,720 For example, on the internet, information gets 243 00:15:03,720 --> 00:15:08,720 broken down into packets of data which then get sent across the web. 244 00:15:08,720 --> 00:15:11,000 Now, to reassemble that data, 245 00:15:11,000 --> 00:15:15,120 sorting algorithms are absolutely crucial to putting this data 246 00:15:15,120 --> 00:15:18,720 back in the correct order so that we can view the picture, 247 00:15:18,720 --> 00:15:21,560 or read the email we've just been sent. 248 00:15:26,120 --> 00:15:30,000 This is the System Development Corporation in California. 249 00:15:30,000 --> 00:15:35,560 It's considered to be the world's first computer software company. 250 00:15:35,560 --> 00:15:40,680 And it was here in 1963 that two computer scientists first formally 251 00:15:40,680 --> 00:15:44,360 wrote down one of the most iconic sorting algorithms of all-time. 252 00:15:48,240 --> 00:15:50,280 It's called bubble sort. 253 00:15:50,280 --> 00:15:53,520 And here's an example of bubble sort in action, 254 00:15:53,520 --> 00:15:55,920 sorting blocks instead of numbers. 255 00:15:57,720 --> 00:16:01,200 It gets its name because with each round of the algorithm, 256 00:16:01,200 --> 00:16:05,240 the largest unsorted object bubbles to the top. 257 00:16:05,240 --> 00:16:09,000 Like all our algorithms so far, there's method in the madness. 258 00:16:14,760 --> 00:16:16,640 To see how this algorithm works, 259 00:16:16,640 --> 00:16:19,120 we're going to use it to sort eight objects. 260 00:16:20,760 --> 00:16:24,720 Now, the bubble sort algorithm says to consider the objects in pairs 261 00:16:24,720 --> 00:16:27,480 and swap them over if they're in the wrong order. 262 00:16:27,480 --> 00:16:31,840 So we're going to start at this end here and work our way to the top. 263 00:16:31,840 --> 00:16:35,880 So I consider these two, they're in the wrong order, so I swap them over. 264 00:16:37,560 --> 00:16:40,000 Consider the next pair, they're in the right order, 265 00:16:40,000 --> 00:16:42,280 so I leave them as they are. 266 00:16:42,280 --> 00:16:45,960 Consider this pair, they're in the wrong order, so I swap them. 267 00:16:48,920 --> 00:16:51,080 And we just continue doing this. 268 00:16:58,160 --> 00:17:01,600 Now the bubble sort algorithm says to go back to the beginning 269 00:17:01,600 --> 00:17:05,760 and repeat the process over and over again until the objects are in order. 270 00:17:19,800 --> 00:17:24,120 The algorithm stops when there are no pairs to swap round. 271 00:17:24,120 --> 00:17:27,880 So the bubble sort algorithm has successfully done its job. 272 00:17:27,880 --> 00:17:30,760 I've now got the objects perfectly ordered, 273 00:17:30,760 --> 00:17:32,640 according to ascending height. 274 00:17:34,160 --> 00:17:37,640 Bubble sort is elegantly simple and straightforward. 275 00:17:37,640 --> 00:17:41,880 But if the scale of the sorting task is huge, say, organising vast swathes 276 00:17:41,880 --> 00:17:45,720 of data, then there might be better sorting algorithms for the job. 277 00:17:50,800 --> 00:17:52,680 This is John von Neumann, 278 00:17:52,680 --> 00:17:56,560 the scientific genius who helped pioneer the modern computer, 279 00:17:56,560 --> 00:17:58,760 game theory, the atomic bomb 280 00:17:58,760 --> 00:18:02,200 and, as it turns out, invented a sorting algorithm. 281 00:18:04,760 --> 00:18:08,080 He devised it to work on this, one of the world's earliest 282 00:18:08,080 --> 00:18:11,880 electronic computers, which he'd helped design. 283 00:18:11,880 --> 00:18:14,800 The algorithm is called merge sort. 284 00:18:16,800 --> 00:18:21,200 The merge sort algorithm works on a principle of divide and conquer. 285 00:18:21,200 --> 00:18:26,280 And it consists of two parts. The first bit is the dividing part. 286 00:18:28,560 --> 00:18:31,920 This involves splitting everything into smaller groups. 287 00:18:35,240 --> 00:18:38,160 And now comes the conquering bit. 288 00:18:40,720 --> 00:18:43,640 The groups are now merged back together. 289 00:18:43,640 --> 00:18:47,480 But as I merge the two groups, I compare the sizes of the objects 290 00:18:47,480 --> 00:18:51,400 one pair at a time so that the merged group becomes sorted. 291 00:19:00,480 --> 00:19:03,240 Now, the merge sort algorithm might look rather similar to the 292 00:19:03,240 --> 00:19:07,240 bubble sort, but where it comes into its own is that with a larger 293 00:19:07,240 --> 00:19:10,280 number of objects, it's much, much faster. 294 00:19:10,280 --> 00:19:15,520 So let's see how merge sort compares in speed to bubble sort. 295 00:19:15,520 --> 00:19:18,040 It's time for a battle of the algorithms! 296 00:19:21,880 --> 00:19:26,000 Here we've got bubble sort on the bottom and merge sort on the top. 297 00:19:26,000 --> 00:19:28,760 And we've got them sorting 1,000 objects. 298 00:19:28,760 --> 00:19:31,840 Now, although they'll both produce the same end result, 299 00:19:31,840 --> 00:19:35,280 you can already see merge sort is getting there much faster. 300 00:19:35,280 --> 00:19:38,760 And this difference in performance gets more pronounced 301 00:19:38,760 --> 00:19:41,120 the more objects they're asked to sort. 302 00:19:53,040 --> 00:19:55,200 LAUGHTER 303 00:19:57,600 --> 00:19:59,560 Well, er... 304 00:19:59,560 --> 00:20:02,920 - I'm sorry, maybe... - No, no, no, no, no. 305 00:20:02,920 --> 00:20:05,000 I-I think...I think, er... 306 00:20:05,000 --> 00:20:08,400 I think the bubble sort would be the wrong way to go. 307 00:20:08,400 --> 00:20:10,160 LAUGHTER 308 00:20:10,160 --> 00:20:11,680 APPLAUSE 309 00:20:12,720 --> 00:20:15,360 Come on. Who told him this? 310 00:20:22,480 --> 00:20:24,760 Merge sort beats bubble sort hands down 311 00:20:24,760 --> 00:20:26,800 for sorting large amounts of data. 312 00:20:28,560 --> 00:20:31,200 But in the crazy world of algorithms, there are many, 313 00:20:31,200 --> 00:20:33,520 many different ways to sort. 314 00:20:36,000 --> 00:20:37,680 At the last count, 315 00:20:37,680 --> 00:20:41,160 there were over 20 different types of sorting algorithms. 316 00:20:42,920 --> 00:20:46,800 All weirdly achieving the same result, but by different means. 317 00:20:58,240 --> 00:21:02,680 - So there's bubble sort, there's merge sort. - Insertion sort. 318 00:21:02,680 --> 00:21:06,480 - There's heap sort, there's quick sort. - Timsort. 319 00:21:06,480 --> 00:21:07,840 You've got gnome sort. 320 00:21:07,840 --> 00:21:10,840 There's pigeonhole sort, which is also called radix sort. 321 00:21:10,840 --> 00:21:13,440 There's bogosort, which might never finish. 322 00:21:19,400 --> 00:21:23,320 There's no such thing as the best sorting algorithm. 323 00:21:23,320 --> 00:21:25,440 Each has its own pros and cons. 324 00:21:26,640 --> 00:21:28,080 And which one gets used 325 00:21:28,080 --> 00:21:31,080 often depends on the specifics of the problem. 326 00:21:32,760 --> 00:21:36,640 I think the beauty of studying algorithms is to try to aspire 327 00:21:36,640 --> 00:21:40,400 for solutions that are as elegant and efficient as possible. 328 00:21:40,400 --> 00:21:44,640 I actually think bubble sort's very pretty. I like it. 329 00:21:44,640 --> 00:21:46,320 Merge sort's beautiful. 330 00:21:49,520 --> 00:21:51,840 We really couldn't live without them. 331 00:21:51,840 --> 00:21:54,840 Sorting algorithms bring order to the world. 332 00:22:05,240 --> 00:22:07,920 So far, we've seen algorithms tackle the tiny 333 00:22:07,920 --> 00:22:11,280 problems of sizing our bathroom tiles and sorting our data. 334 00:22:12,920 --> 00:22:16,040 But how well do they cope with the messy world of love? 335 00:22:18,080 --> 00:22:20,880 Online dating is really popular these days. 336 00:22:20,880 --> 00:22:23,640 In fact, one survey suggests that over a third 337 00:22:23,640 --> 00:22:26,400 of recent marriages started online. 338 00:22:27,400 --> 00:22:30,800 How these dating websites work is that they use something called 339 00:22:30,800 --> 00:22:33,000 a matching algorithm. 340 00:22:33,000 --> 00:22:36,200 They search through the profiles, try to match people up according 341 00:22:36,200 --> 00:22:40,320 to their likes and dislikes, personality traits and so on. 342 00:22:40,320 --> 00:22:43,200 In fact, the algorithms seem to be better than humans. 343 00:22:43,200 --> 00:22:46,480 Because recent research has shown those who meet online 344 00:22:46,480 --> 00:22:49,160 tend to be happier and have longer marriages. 345 00:22:52,360 --> 00:22:56,640 I'll ask you to receive your prizes from His Majesty the King. 346 00:22:56,640 --> 00:23:01,080 In fact, matching algorithms have quite a lot to brag about. 347 00:23:01,080 --> 00:23:05,800 Because in 2012, for the first time, a Nobel Prize was awarded 348 00:23:05,800 --> 00:23:07,840 because of an algorithm. 349 00:23:07,840 --> 00:23:11,280 A matching algorithm created by the late David Gale 350 00:23:11,280 --> 00:23:13,480 and mathematician Lloyd Shapley, 351 00:23:13,480 --> 00:23:16,240 seen here receiving his share of the prize. 352 00:23:20,040 --> 00:23:23,720 The story begins in the 1960s when Gale and Shapley wanted to 353 00:23:23,720 --> 00:23:27,840 solve a problem concerned with college admissions. 354 00:23:27,840 --> 00:23:31,880 How to match up students to colleges so that everyone got a place. 355 00:23:32,880 --> 00:23:35,400 But, more importantly, was happy, even if 356 00:23:35,400 --> 00:23:37,480 they didn't get their first choice. 357 00:23:40,480 --> 00:23:44,160 They called it the stable marriage problem. 358 00:23:44,160 --> 00:23:46,680 The stable marriage problem goes like this. 359 00:23:46,680 --> 00:23:49,120 Suppose you've got four women and four men 360 00:23:49,120 --> 00:23:51,000 and they want to get married. 361 00:23:51,000 --> 00:23:54,000 Now, they've ranked each other according to their preferences. 362 00:23:54,000 --> 00:23:55,880 So, for example, the Queen of Hearts here, 363 00:23:55,880 --> 00:23:57,960 first choice is the King of Clubs. 364 00:23:57,960 --> 00:24:00,040 Second choice, King of Diamonds, 365 00:24:00,040 --> 00:24:02,840 and her last choice is the King of Hearts. 366 00:24:02,840 --> 00:24:06,080 So the challenge here is to play Cupid and pair up the kings 367 00:24:06,080 --> 00:24:09,920 and queens so that each one gets a partner, but, more importantly, 368 00:24:09,920 --> 00:24:12,520 so that the marriages are stable. 369 00:24:12,520 --> 00:24:15,640 A stable marriage means that the kings and queens don't 370 00:24:15,640 --> 00:24:20,640 necessarily get their first choice, but they get the best on offer. 371 00:24:20,640 --> 00:24:25,240 For example, if I paired the King of Hearts and the Queen of Hearts 372 00:24:25,240 --> 00:24:28,240 and the King of Spades and the Queen of Spades, 373 00:24:28,240 --> 00:24:31,040 this would be an unstable marriage. 374 00:24:31,040 --> 00:24:34,480 Because the King of Spades doesn't really like the Queen of Spades. 375 00:24:34,480 --> 00:24:36,640 He'd prefer the Queen of Hearts. 376 00:24:38,120 --> 00:24:40,040 The Queen of Hearts, in her turn, 377 00:24:40,040 --> 00:24:41,960 doesn't really like the King of Hearts. 378 00:24:41,960 --> 00:24:44,840 She'd prefer the King of Spades. 379 00:24:44,840 --> 00:24:48,120 So these two are going to run off together in this pairing. 380 00:24:51,960 --> 00:24:56,480 Where there's a problem, there's an algorithm not far behind. 381 00:24:56,480 --> 00:24:59,160 In 1962, Gale and Shapley came up with 382 00:24:59,160 --> 00:25:02,760 their Nobel-Prize-winning algorithm. 383 00:25:02,760 --> 00:25:09,560 A step-by-step recipe which always finds perfectly-stable marriages. 384 00:25:09,560 --> 00:25:11,240 So in the first round of the algorithm, 385 00:25:11,240 --> 00:25:14,440 the queens all proposed to their first-choice kings. 386 00:25:14,440 --> 00:25:18,720 So the Queen of Spades' first choice is the King of Spades. 387 00:25:18,720 --> 00:25:21,200 She proposes to the King of Spades. 388 00:25:21,200 --> 00:25:24,360 The Queen of Hearts' first choice is the King of Clubs, 389 00:25:24,360 --> 00:25:26,800 so she proposes to the King of Clubs. 390 00:25:26,800 --> 00:25:30,360 The Queen of Diamonds' first choice is the King of Spades. 391 00:25:30,360 --> 00:25:33,320 And the Queen of Clubs' first choice is also the King of Spades. 392 00:25:33,320 --> 00:25:36,600 So King of Spades seems to be the Darcy of this royal court. 393 00:25:37,800 --> 00:25:40,560 Now, the King of Spades has got three proposals. 394 00:25:41,720 --> 00:25:44,840 So he chooses his most popular queen, 395 00:25:44,840 --> 00:25:48,640 who is actually the Queen of Diamonds, and rejects the other two. 396 00:25:51,440 --> 00:25:55,600 So we have two provisional engagements, two rejections. 397 00:25:55,600 --> 00:25:59,280 We now remove the rejected queen's first choices. 398 00:25:59,280 --> 00:26:01,040 And it's time for round two. 399 00:26:02,480 --> 00:26:06,960 So the Queen of Spades is going to propose to the King of Diamonds. 400 00:26:06,960 --> 00:26:10,160 And the Queen of Clubs proposes to the King of Clubs. 401 00:26:11,560 --> 00:26:14,240 But now the King of Clubs has got two proposals 402 00:26:14,240 --> 00:26:17,440 and actually prefers the Queen of Clubs. 403 00:26:17,440 --> 00:26:20,280 So he rejects the Queen of Hearts, his provisional 404 00:26:20,280 --> 00:26:22,920 engagement on the first round of the algorithm, 405 00:26:22,920 --> 00:26:24,440 and we have to start again. 406 00:26:26,000 --> 00:26:28,080 In each round, the rejected queens 407 00:26:28,080 --> 00:26:31,360 propose to the next king on their list. 408 00:26:31,360 --> 00:26:34,480 And the kings always go for the best offer they get. 409 00:26:35,680 --> 00:26:40,000 In this round of the algorithm, she proposes to the King of Hearts 410 00:26:40,000 --> 00:26:44,040 and finally, everyone's paired up with a single queen and king 411 00:26:44,040 --> 00:26:45,960 and all the marriages are stable. 412 00:26:49,120 --> 00:26:53,440 The Gale-Shapley algorithm is now used all over the world. 413 00:26:53,440 --> 00:26:56,840 In Denmark, to match children to day-care places. 414 00:26:56,840 --> 00:27:00,040 In Hungary, to match students to schools. 415 00:27:00,040 --> 00:27:03,440 In New York, to allocate rabbis to synagogues. 416 00:27:03,440 --> 00:27:07,360 And in China, Germany and Spain, to match students to universities. 417 00:27:10,480 --> 00:27:13,560 Whilst in the UK, it's led to the development 418 00:27:13,560 --> 00:27:18,440 of a matching algorithm that, for some people, has saved their lives. 419 00:27:23,040 --> 00:27:26,800 At the age of 20, Seraya in south London was diagnosed 420 00:27:26,800 --> 00:27:31,120 with a chronic kidney disease and told she needed a transplant. 421 00:27:32,880 --> 00:27:37,000 I was on dialysis for 18 months and very unwell. 422 00:27:37,000 --> 00:27:40,240 I couldn't go to work. I had no social life. 423 00:27:40,240 --> 00:27:44,200 It was literally hospital three times a week for treatment and home. 424 00:27:45,440 --> 00:27:47,880 A close friend was willing to donate, 425 00:27:47,880 --> 00:27:50,880 but their tissue types were not compatible. 426 00:27:53,480 --> 00:27:55,840 In St Albans, Tamir was seriously ill 427 00:27:55,840 --> 00:27:58,840 and his wife, Lyndsey, wanted to donate. 428 00:27:58,840 --> 00:28:00,560 But they had the same problem. 429 00:28:02,000 --> 00:28:04,760 We went through all the blood tests and all the workup 430 00:28:04,760 --> 00:28:08,040 and it turned out that we were incompatible blood groups. 431 00:28:10,320 --> 00:28:13,080 Often, kidney patients who are fortunate enough 432 00:28:13,080 --> 00:28:16,080 to have a would-be donor find there's a mismatch 433 00:28:16,080 --> 00:28:18,920 between their donor's blood group or tissue type. 434 00:28:20,720 --> 00:28:26,280 But since 2007, the NHS has been using a special matching algorithm 435 00:28:26,280 --> 00:28:29,160 to find potential matches for willing donors 436 00:28:29,160 --> 00:28:31,480 to kidney patients all over the UK. 437 00:28:35,360 --> 00:28:37,640 When we first looked at this problem, 438 00:28:37,640 --> 00:28:41,320 we really underestimated the complexity. 439 00:28:41,320 --> 00:28:46,360 And originally, we just started with swaps between two pairs. 440 00:28:46,360 --> 00:28:48,120 So it was very simple, 441 00:28:48,120 --> 00:28:53,040 but it soon became obvious that we needed something much more complex. 442 00:28:56,920 --> 00:29:00,000 I became in touch with Rachel Johnson at the NHS 443 00:29:00,000 --> 00:29:02,720 and we then got involved at that stage in being able to design 444 00:29:02,720 --> 00:29:05,560 algorithms which would allow not just pair-wise exchanges, 445 00:29:05,560 --> 00:29:08,120 but also exchanges among three couples, as well. 446 00:29:10,080 --> 00:29:13,080 The algorithm considers several scenarios. 447 00:29:13,080 --> 00:29:15,400 The simplest is a two-way swap 448 00:29:15,400 --> 00:29:18,360 with two couples exchanging kidneys. 449 00:29:21,560 --> 00:29:23,840 More complicated is a three-way swap, 450 00:29:23,840 --> 00:29:26,720 where the kidneys get passed around in a cycle. 451 00:29:29,960 --> 00:29:34,960 There are 200 patients in each of our matching runs. 452 00:29:34,960 --> 00:29:38,960 We need to look for all the possible transplants. 453 00:29:40,200 --> 00:29:42,440 And it's surprising how many there are. 454 00:29:42,440 --> 00:29:44,440 There are literally, you know, hundreds, 455 00:29:44,440 --> 00:29:47,040 sometimes thousands of possibilities. 456 00:29:47,040 --> 00:29:51,400 It's something that just could not be achieved without the algorithm. 457 00:29:53,120 --> 00:29:57,120 One day, Seraya received the call that a match had been found 458 00:29:57,120 --> 00:30:02,200 400 miles away with Linda, a donor living in Bowness near Edinburgh. 459 00:30:03,720 --> 00:30:06,760 My husband's dad needed a new kidney. 460 00:30:06,760 --> 00:30:11,200 He'd been ill for a bit of time. And I wasn't a perfect match. 461 00:30:11,200 --> 00:30:17,000 And I then got a phone call and it was all go from there. 462 00:30:19,120 --> 00:30:20,920 We got the initial phone call saying 463 00:30:20,920 --> 00:30:23,520 we'd been matched up in the three-way pool. 464 00:30:23,520 --> 00:30:26,560 You're just nervous that it's not going to go ahead 465 00:30:26,560 --> 00:30:28,240 because your life depends on it. 466 00:30:29,960 --> 00:30:31,640 For the matching couples, 467 00:30:31,640 --> 00:30:35,080 all the operations had to happen simultaneously. 468 00:30:35,080 --> 00:30:38,280 It was a major logistical challenge. 469 00:30:38,280 --> 00:30:41,360 When my donor went to theatre, they called over to check 470 00:30:41,360 --> 00:30:44,600 that my donor was also in Newcastle going to theatre. 471 00:30:44,600 --> 00:30:46,960 And they both got it at the exact same time. 472 00:30:46,960 --> 00:30:49,400 And they make the call and the kidneys come out. 473 00:30:49,400 --> 00:30:51,160 I think they went by motorbike. 474 00:30:51,160 --> 00:30:53,120 We were told they might go by helicopter, 475 00:30:53,120 --> 00:30:56,680 so I thought at least one bit of me might have been in a helicopter, 476 00:30:56,680 --> 00:30:58,960 but, no, it went by motorbike. 477 00:31:02,880 --> 00:31:06,200 And it eventually went ahead, thankfully, in December. 478 00:31:06,200 --> 00:31:09,160 - The best Christmas present. - Hm! 479 00:31:09,160 --> 00:31:12,440 Personally, I just imagined it was doctors behind there 480 00:31:12,440 --> 00:31:14,880 matching people up off this list. 481 00:31:14,880 --> 00:31:17,640 So, yeah, it's a bit strange 482 00:31:17,640 --> 00:31:20,240 that it comes down to maths at the end of the day. 483 00:31:20,240 --> 00:31:23,720 It's a great scheme and it's still fairly recent. 484 00:31:23,720 --> 00:31:27,120 And many years ago, I wouldn't have had this chance. 485 00:31:27,120 --> 00:31:31,480 I feel a lot of gratitude to Linda and also to the algorithm. 486 00:31:31,480 --> 00:31:33,400 So, yeah, I'm very grateful. 487 00:31:34,680 --> 00:31:39,760 So far, more than 400 patients have benefited from the NHS scheme 488 00:31:39,760 --> 00:31:42,520 and its special matching algorithm. 489 00:31:42,520 --> 00:31:44,840 It was only when we actually seen media articles 490 00:31:44,840 --> 00:31:47,160 and we actually started to think, "Oh, hold on, 491 00:31:47,160 --> 00:31:49,480 "that person might have actually had that match 492 00:31:49,480 --> 00:31:53,080 "through the October matching run's pair-wise exchange," and so on, 493 00:31:53,080 --> 00:31:55,320 that you actually start to see the stories 494 00:31:55,320 --> 00:31:57,200 that are behind the anonymous data. 495 00:31:57,200 --> 00:32:00,560 It's quite funny because David's always really concerned 496 00:32:00,560 --> 00:32:03,400 that the algorithm will take a long time to run. 497 00:32:03,400 --> 00:32:07,280 And, you know, it's been up to 30 minutes and he gets concerned. 498 00:32:07,280 --> 00:32:10,440 But actually, 30 minutes, you know, to us, 499 00:32:10,440 --> 00:32:14,080 it's incredible that it can do all of that in 30 minutes. 500 00:32:25,000 --> 00:32:29,360 So far, we have seen how algorithms are capable of amazing feats. 501 00:32:30,440 --> 00:32:33,520 From solving abstract mathematical problems 502 00:32:33,520 --> 00:32:37,320 to helping us find stuff on the World Wide Web. 503 00:32:37,320 --> 00:32:41,240 And they key thing for all of these algorithms is their speed. 504 00:32:41,240 --> 00:32:44,480 So the important feature of a good algorithm is first 505 00:32:44,480 --> 00:32:47,440 that it'd better be correct, but once you know it's correct, 506 00:32:47,440 --> 00:32:49,400 it's also important that it runs quickly. 507 00:32:49,400 --> 00:32:52,600 There's no good having an algorithm that takes longer 508 00:32:52,600 --> 00:32:57,000 than your lifetime to run if you're wanting the result tomorrow. 509 00:32:58,320 --> 00:33:02,680 This face-detection algorithm is an example of an efficient algorithm. 510 00:33:02,680 --> 00:33:05,840 Because it's efficient, it's able to run in real time. 511 00:33:05,840 --> 00:33:07,720 And that's what makes it useful. 512 00:33:09,640 --> 00:33:14,160 But just as in real life, some problems are harder than others. 513 00:33:14,160 --> 00:33:17,480 Every now and then, algorithms meet their match. 514 00:33:19,200 --> 00:33:21,960 I think the most common misconception about algorithms 515 00:33:21,960 --> 00:33:24,280 is just that algorithms can do anything. 516 00:33:24,280 --> 00:33:27,240 I think people don't really know about the limits. 517 00:33:27,240 --> 00:33:30,760 Some problems simply cannot be solved by efficient algorithms. 518 00:33:32,640 --> 00:33:36,800 There are some places where efficient algorithms cannot go. 519 00:33:36,800 --> 00:33:40,000 Lines in the sand that can't be crossed. 520 00:33:40,000 --> 00:33:43,240 The trouble is knowing which problems they can solve 521 00:33:43,240 --> 00:33:44,680 and which they can't. 522 00:33:48,040 --> 00:33:51,320 Take this Rubik's Cube and imagine the more general challenge 523 00:33:51,320 --> 00:33:54,000 of trying to solve a cube of arbitrary dimensions. 524 00:33:54,000 --> 00:33:57,040 So, for example, with 50 squares down each side. 525 00:33:57,040 --> 00:33:58,520 Now, you might expect this 526 00:33:58,520 --> 00:34:01,600 to be one of the really fiendishly difficult problems, 527 00:34:01,600 --> 00:34:03,960 but actually, it belongs in the easy camp. 528 00:34:03,960 --> 00:34:08,000 We know an algorithm that can solve the general Rubik's Cube 529 00:34:08,000 --> 00:34:09,800 in a reasonable amount of time. 530 00:34:13,320 --> 00:34:14,680 Although it looks hard, 531 00:34:14,680 --> 00:34:17,920 this problem can be cracked by efficient algorithms. 532 00:34:22,800 --> 00:34:25,280 However, here's one that definitely can't. 533 00:34:27,400 --> 00:34:30,320 Imagine you've got a draughts board of arbitrary size 534 00:34:30,320 --> 00:34:32,800 and an arrangement of pieces on the board. 535 00:34:32,800 --> 00:34:34,360 The challenge is to work out 536 00:34:34,360 --> 00:34:38,240 whether white can force a win from this position. 537 00:34:38,240 --> 00:34:40,120 Now, draughts is a pretty easy game, 538 00:34:40,120 --> 00:34:42,400 but it's been mathematically proven 539 00:34:42,400 --> 00:34:46,640 that there's no algorithm that can solve this problem efficiently. 540 00:34:46,640 --> 00:34:49,040 It's an inherently difficult problem. 541 00:34:51,160 --> 00:34:55,600 The only way to solve this puzzle is through sheer hard slog - 542 00:34:55,600 --> 00:34:58,320 working out all the millions of possibilities. 543 00:35:00,080 --> 00:35:04,840 So this problem lies firmly beyond the reach of efficient algorithms. 544 00:35:04,840 --> 00:35:06,520 It can't be solved quickly. 545 00:35:10,240 --> 00:35:14,600 But for some problems, how hard they are is not clear cut. 546 00:35:14,600 --> 00:35:19,080 This is a large sudoku. It's got 625 squares. 547 00:35:20,320 --> 00:35:24,400 One of the nice things about sudoku is that once you've found a solution, 548 00:35:24,400 --> 00:35:28,040 it's relatively straightforward to check whether or not it's right. 549 00:35:28,040 --> 00:35:30,360 And this is true however large the puzzle. 550 00:35:32,360 --> 00:35:34,800 In this case, I've just got to check each row, 551 00:35:34,800 --> 00:35:38,280 column and block doesn't feature a number twice. 552 00:35:38,280 --> 00:35:42,240 Sudoku belongs to a very special category of problems 553 00:35:42,240 --> 00:35:44,840 that all share this characteristic. 554 00:35:44,840 --> 00:35:48,840 Once you've come up with a solution, it's always easy to check it. 555 00:35:49,880 --> 00:35:53,160 The mystery is whether there's an efficient algorithm 556 00:35:53,160 --> 00:35:55,520 to find the solution in the first place. 557 00:35:58,360 --> 00:36:02,520 And sudoku is not alone. There are lots of problems like this. 558 00:36:02,520 --> 00:36:05,040 The most intensely studied of them all 559 00:36:05,040 --> 00:36:08,480 is known as the travelling salesman problem. 560 00:36:13,360 --> 00:36:16,920 A travelling salesman travels door to door, city to city, 561 00:36:16,920 --> 00:36:20,480 selling anything from brushes and Hoovers to double-glazing. 562 00:36:22,520 --> 00:36:25,000 It sounds like a straightforward job. 563 00:36:25,000 --> 00:36:28,880 But all travelling salesmen face the same question. 564 00:36:28,880 --> 00:36:31,560 What's the shortest route to take? 565 00:36:33,520 --> 00:36:37,400 So important is this problem that the Clay Mathematics Institute 566 00:36:37,400 --> 00:36:42,120 has offered 1 million for whoever can find an efficient algorithm, 567 00:36:42,120 --> 00:36:44,520 or prove that none exists. 568 00:36:46,400 --> 00:36:49,000 The travelling salesman problem goes like this. 569 00:36:49,000 --> 00:36:50,520 Imagine you're a salesman 570 00:36:50,520 --> 00:36:55,120 and you've got to visit a list of cities represented by the red dots. 571 00:36:55,120 --> 00:36:57,640 The challenge is to find the shortest route 572 00:36:57,640 --> 00:37:02,040 so you visit each city once before returning to your starting point. 573 00:37:02,040 --> 00:37:04,520 Now, you might imagine the best thing is 574 00:37:04,520 --> 00:37:07,520 to just consider all the routes, like this. 575 00:37:13,960 --> 00:37:18,560 The method of checking all possibilities is a type of algorithm. 576 00:37:18,560 --> 00:37:20,440 And for three cities, it works fine 577 00:37:20,440 --> 00:37:23,640 because there are only three possible routes to check. 578 00:37:27,080 --> 00:37:30,200 But what if we add two more cities to the list? 579 00:37:32,920 --> 00:37:36,360 With five cities, there are 60 different possible routes. 580 00:37:39,160 --> 00:37:44,040 And if we add another city, then there are 360 possible routes. 581 00:37:44,040 --> 00:37:49,320 And for ten cities, there are over 1.8 million possible routes. 582 00:37:49,320 --> 00:37:51,600 If our algorithm chugged through them, 583 00:37:51,600 --> 00:37:54,720 checking all of these at a rate of ten per second, 584 00:37:54,720 --> 00:37:58,320 it would take two days before it found the shortest. 585 00:37:58,320 --> 00:38:01,720 So you can see a method of trying all the different possibilities, 586 00:38:01,720 --> 00:38:06,440 a kind of brute-force algorithm, if you like, is just simply impractical. 587 00:38:07,720 --> 00:38:10,880 If somebody found a fast algorithm for the travelling salesman problem, 588 00:38:10,880 --> 00:38:12,280 it would be hugely significant. 589 00:38:12,280 --> 00:38:15,240 If one of my students came up with an efficient algorithm 590 00:38:15,240 --> 00:38:17,320 for the travelling salesman problem, 591 00:38:17,320 --> 00:38:20,280 I would get him to explain it to me, 592 00:38:20,280 --> 00:38:23,200 I would kill him and then I'd go and claim 593 00:38:23,200 --> 00:38:25,720 the Clay prize, 1 million. 594 00:38:25,720 --> 00:38:28,360 But I think my students are safe. 595 00:38:29,680 --> 00:38:32,680 The problem crops up in lots of areas. 596 00:38:32,680 --> 00:38:35,000 From soldering circuit boards... 597 00:38:37,360 --> 00:38:40,680 ..to planning the routes for supermarket deliveries. 598 00:38:40,680 --> 00:38:45,320 But has the travelling salesman problem secretly already been solved? 599 00:38:49,960 --> 00:38:54,080 A team of scientists working at Rothamsted Research in Harpenden 600 00:38:54,080 --> 00:38:57,520 have turned to nature to see if it has found the answer. 601 00:39:03,200 --> 00:39:06,160 They're carrying out an elaborate experiment to study 602 00:39:06,160 --> 00:39:10,320 how the travelling salesman problem is tackled by the bumblebee. 603 00:39:13,480 --> 00:39:17,680 Bees have to forage for nectar in order to provision their hive. 604 00:39:17,680 --> 00:39:19,920 And so they have to visit 605 00:39:19,920 --> 00:39:22,520 possibly hundreds of flowers on each trip. 606 00:39:22,520 --> 00:39:25,240 What they want to do is find an efficient way 607 00:39:25,240 --> 00:39:28,040 to go between all these flowers that they visit. 608 00:39:31,360 --> 00:39:35,680 The humble bumblebee faces its own travelling salesman problem. 609 00:39:35,680 --> 00:39:38,360 The flowers are just like the cities. 610 00:39:38,360 --> 00:39:41,480 And the bee is the travelling salesman. 611 00:39:41,480 --> 00:39:45,600 One bee will go out foraging many, many times every day. 612 00:39:45,600 --> 00:39:47,360 So over the course of a day, 613 00:39:47,360 --> 00:39:51,680 it really helps to take the most efficient possible route. 614 00:39:51,680 --> 00:39:53,920 So what we're doing is trying to figure out 615 00:39:53,920 --> 00:39:58,000 exactly what rules they're using to narrow down the possibilities. 616 00:40:00,480 --> 00:40:04,160 Joe has laid out five feeders which play the role of flowers. 617 00:40:05,560 --> 00:40:10,200 Each feeder has just enough nectar to ensure the bee has to visit all five 618 00:40:10,200 --> 00:40:12,360 to give it a full honey stomach. 619 00:40:13,560 --> 00:40:16,280 And how are you actually knowing where it's going? 620 00:40:16,280 --> 00:40:18,960 For this, we're using a harmonic radar. 621 00:40:18,960 --> 00:40:22,280 So as that spins round and round, it's emitting a radar signal. 622 00:40:22,280 --> 00:40:25,200 And we've attached a small antenna to the back of the bee, 623 00:40:25,200 --> 00:40:27,880 which then reflects the signal from the radar. 624 00:40:27,880 --> 00:40:31,200 And so this allows us to see exactly where the bee has gone 625 00:40:31,200 --> 00:40:32,800 as she moves around the field. 626 00:40:34,240 --> 00:40:38,000 So, how does the bumblebee tackle the travelling salesman problem? 627 00:40:38,000 --> 00:40:40,120 OK, we're switching it on now. 628 00:40:47,080 --> 00:40:51,600 With five feeders, there are a total of 60 possible routes. 629 00:40:51,600 --> 00:40:54,480 The shortest is around the outer edge. 630 00:40:58,040 --> 00:41:02,520 This heat map shows the path taken by a single bee. 631 00:41:02,520 --> 00:41:06,240 At first, it's simply discovering the positions of the feeders. 632 00:41:07,920 --> 00:41:12,360 Then the bee appears to methodically change different parts of the route 633 00:41:12,360 --> 00:41:14,680 to see if it can make it shorter. 634 00:41:16,920 --> 00:41:20,760 Within 20 trips, it's honed in on an efficient route. 635 00:41:26,480 --> 00:41:29,840 This route is not always the absolute shortest, 636 00:41:29,840 --> 00:41:31,760 but, for the bee, it's good enough. 637 00:41:36,440 --> 00:41:40,040 That's amazing that just after a very few tries, they've got 638 00:41:40,040 --> 00:41:44,040 to something which is efficient enough for them to do their foraging. 639 00:41:44,040 --> 00:41:47,920 Yes, that's right. They can't spend days or even, you know, 640 00:41:47,920 --> 00:41:50,560 it could take months or years to try every possibility. 641 00:41:50,560 --> 00:41:52,920 So they have to very quickly find a route 642 00:41:52,920 --> 00:41:55,680 that they can do again and again and again 643 00:41:55,680 --> 00:41:59,800 - in order to efficiently provide food. - Fantastic. 644 00:41:59,800 --> 00:42:01,960 I think the bee's become my favourite insect now. 645 00:42:01,960 --> 00:42:05,520 - It's obviously a mathematician at heart. - Absolutely. 646 00:42:06,920 --> 00:42:11,640 Let's be clear. Bees are not about to be awarded 1 million. 647 00:42:11,640 --> 00:42:15,120 They've not miraculously solved the travelling salesman problem 648 00:42:15,120 --> 00:42:18,080 because they don't always find the shortest route. 649 00:42:19,400 --> 00:42:21,760 But their algorithm is a clever approach. 650 00:42:21,760 --> 00:42:25,080 In maths, it's known as heuristics. 651 00:42:25,080 --> 00:42:29,320 Algorithms that are efficient, that don't find the perfect solution, 652 00:42:29,320 --> 00:42:31,080 but get as close as they can. 653 00:42:44,520 --> 00:42:46,720 The same heuristic approach 654 00:42:46,720 --> 00:42:49,960 has been used to develop an algorithm for Heathrow airport. 655 00:42:51,400 --> 00:42:54,040 DISPATCHER: 'Clear for takeoff...' 656 00:42:54,040 --> 00:42:57,880 Heathrow handles over 1,300 flights a day. 657 00:42:57,880 --> 00:43:00,000 It's Europe's busiest airport. 658 00:43:00,000 --> 00:43:04,640 '..430 clear for takeoff. Surface wind 247 degrees at three knots.' 659 00:43:12,840 --> 00:43:15,120 The challenge for air traffic control 660 00:43:15,120 --> 00:43:18,640 is to maximise the number of aircraft departing every hour 661 00:43:18,640 --> 00:43:22,800 and ensure that the airport operates both efficiently and safely. 662 00:43:22,800 --> 00:43:29,400 '..behind the British Airways 747, line up 27 right behind.' 663 00:43:29,400 --> 00:43:33,520 One of the key decisions is the order of takeoff. 664 00:43:33,520 --> 00:43:36,680 We're currently departing a group of medium aircraft, 665 00:43:36,680 --> 00:43:39,680 which will be separated one minute apart. 666 00:43:39,680 --> 00:43:43,400 Behind that, then, you can see a 747, which is a large aircraft. 667 00:43:44,800 --> 00:43:48,200 Medium aircraft need to be separated from the turbulence 668 00:43:48,200 --> 00:43:50,360 produced by larger aircraft. 669 00:43:50,360 --> 00:43:52,720 So the ordering of sizes is crucial. 670 00:43:53,800 --> 00:43:56,120 The ideal sequence for takeoff involves 671 00:43:56,120 --> 00:43:58,840 really blocking together groups of aircraft. 672 00:43:58,840 --> 00:44:01,080 So you want large aircraft to be grouped together, 673 00:44:01,080 --> 00:44:03,440 medium aircraft to be grouped together. 674 00:44:03,440 --> 00:44:05,240 And that allows the separation 675 00:44:05,240 --> 00:44:07,640 between those aircraft to be minimised. 676 00:44:10,640 --> 00:44:14,160 The other factor that needs to be considered where planning takeoff 677 00:44:14,160 --> 00:44:16,040 is where the planes are heading. 678 00:44:19,920 --> 00:44:22,320 We want one to be going to the north, one to the south, 679 00:44:22,320 --> 00:44:24,360 the next going to the north, then the south. 680 00:44:24,360 --> 00:44:29,040 If all the aircraft were going in the same direction, the separation would be much greater 681 00:44:29,040 --> 00:44:31,560 and we wouldn't use the runways as efficiently. 682 00:44:31,560 --> 00:44:34,600 All controllers are sitting in the control towers thinking, 683 00:44:34,600 --> 00:44:37,880 "I've all these aircraft going north, all these going south. 684 00:44:37,880 --> 00:44:39,640 "I've got these that are large ones, 685 00:44:39,640 --> 00:44:42,200 "so I want to try and group all the large ones together 686 00:44:42,200 --> 00:44:44,600 "so I don't have to go from a large one to a small one." 687 00:44:44,600 --> 00:44:48,000 And it's a very complex problem to solve in their heads. 688 00:44:48,000 --> 00:44:50,440 '..906 November...' 689 00:44:50,440 --> 00:44:54,280 In 2013, an algorithm joined the team. 690 00:44:54,280 --> 00:44:58,240 Its job is to predict the most likely order for takeoff 691 00:44:58,240 --> 00:45:00,400 and advise air traffic control 692 00:45:00,400 --> 00:45:03,240 when aircraft should push back from the gates. 693 00:45:03,240 --> 00:45:06,240 To do this involves nothing less than simulating 694 00:45:06,240 --> 00:45:09,480 the entire outward-bound operation of the airport. 695 00:45:11,280 --> 00:45:14,240 Carrying out millions of calculations every second. 696 00:45:14,240 --> 00:45:17,040 FAINT DISPATCHER 697 00:45:21,720 --> 00:45:25,080 The algorithm works by trying to predict 698 00:45:25,080 --> 00:45:28,360 what order the aircraft are going to take off in. 699 00:45:28,360 --> 00:45:30,640 If it knows what order they can take off in, 700 00:45:30,640 --> 00:45:32,560 then it can work backwards and say, 701 00:45:32,560 --> 00:45:34,600 "If it needs to take off at this time, 702 00:45:34,600 --> 00:45:37,480 "then it needs to enter the runway queue at this time, 703 00:45:37,480 --> 00:45:39,840 "then it needs finishing its taxi at this time, 704 00:45:39,840 --> 00:45:42,520 "so it needs to start its taxi operation at this time. 705 00:45:42,520 --> 00:45:45,480 "In that case, it needs to finish its pushback by this time, 706 00:45:45,480 --> 00:45:47,600 "so it needs to start its pushback by this time." 707 00:45:47,600 --> 00:45:50,600 And it can work all the way back from what time it should take off 708 00:45:50,600 --> 00:45:52,640 to what time it should start pushing back. 709 00:45:55,440 --> 00:45:58,720 The output of the algorithm is given to air traffic control 710 00:45:58,720 --> 00:46:01,560 through the airport's internal computer system 711 00:46:01,560 --> 00:46:05,800 and displayed to the pilot at the gate in the form of the TSAT, 712 00:46:05,800 --> 00:46:07,800 the recommended pushback time. 713 00:46:10,000 --> 00:46:12,800 The pilot can look on the stand-entry system 714 00:46:12,800 --> 00:46:15,960 to actually see what time he is expecting to depart. 715 00:46:17,880 --> 00:46:21,200 The biggest benefit of the algorithm is that it means you can 716 00:46:21,200 --> 00:46:25,040 hold aircraft on stand for longer without them taking off any later. 717 00:46:25,040 --> 00:46:28,440 So there's no loss for any passengers in terms of delays. 718 00:46:28,440 --> 00:46:30,840 What you can do is you can start your engines later. 719 00:46:33,080 --> 00:46:35,480 In actual fact, if we save two minutes' taxi time 720 00:46:35,480 --> 00:46:37,840 on the way to the end of the runway, over a year, 721 00:46:37,840 --> 00:46:40,520 that's actually A�15 million worth of fuel savings. 722 00:46:42,280 --> 00:46:46,240 The Heathrow sequencing algorithm shows just what can be accomplished 723 00:46:46,240 --> 00:46:47,920 with the heuristic approach. 724 00:46:49,040 --> 00:46:52,320 Just like the bees, the algorithm is not finding 725 00:46:52,320 --> 00:46:55,360 the absolute perfect solution all the time, 726 00:46:55,360 --> 00:46:58,720 but nevertheless makes a tough job that bit easier. 727 00:47:00,320 --> 00:47:02,080 We're very proud of the algorithm 728 00:47:02,080 --> 00:47:05,720 because it actually now, we feel, models the real world and is of use. 729 00:47:16,120 --> 00:47:19,080 In the beginning, algorithms were created 730 00:47:19,080 --> 00:47:21,640 by mathematicians for mathematicians. 731 00:47:21,640 --> 00:47:23,800 And over the last century, 732 00:47:23,800 --> 00:47:26,400 algorithms have been created for computers. 733 00:47:29,240 --> 00:47:33,960 But perhaps our relationship is about to go through a dramatic revolution. 734 00:47:39,720 --> 00:47:41,920 At Microsoft Research in Cambridge, 735 00:47:41,920 --> 00:47:46,360 scientists are using new techniques to develop algorithms... 736 00:47:46,360 --> 00:47:50,400 blurring the boundary between inventor and the algorithm itself. 737 00:47:56,600 --> 00:47:59,920 This is the Kinect skeletal-tracking algorithm. 738 00:47:59,920 --> 00:48:02,760 The amazing thing is that it's able to identify 739 00:48:02,760 --> 00:48:04,920 the different parts of my body. 740 00:48:04,920 --> 00:48:08,360 So you can see it's coloured the top of my head in red 741 00:48:08,360 --> 00:48:11,040 and my right hand here in blue. 742 00:48:11,040 --> 00:48:13,560 You can see it's coloured my neck green. 743 00:48:13,560 --> 00:48:16,080 Now, this algorithm has never met me before, 744 00:48:16,080 --> 00:48:18,760 doesn't know how I'm going to move in space, 745 00:48:18,760 --> 00:48:22,040 but just using the data coming from this special camera here, 746 00:48:22,040 --> 00:48:25,520 measuring the distance from the camera to my body, 747 00:48:25,520 --> 00:48:28,120 it's able to produce this map. 748 00:48:30,520 --> 00:48:33,960 Whatever posture I take, using nothing more than the input 749 00:48:33,960 --> 00:48:36,360 from the special depth-sensing camera, 750 00:48:36,360 --> 00:48:39,360 the algorithm is able to accurately identify, 751 00:48:39,360 --> 00:48:42,760 pixel by pixel, the different parts of my body. 752 00:48:46,640 --> 00:48:49,720 It was developed for the Microsoft Xbox console 753 00:48:49,720 --> 00:48:53,640 to track the movement of a player's body posture in real time. 754 00:48:58,440 --> 00:49:01,600 But just as remarkable as what this algorithm can do 755 00:49:01,600 --> 00:49:04,480 is the process behind how it was created, 756 00:49:04,480 --> 00:49:07,080 as researcher Jamie Shotton explains. 757 00:49:09,640 --> 00:49:12,640 What's happening is that every pixel in the image, 758 00:49:12,640 --> 00:49:16,080 we are running an algorithm called a decision tree. 759 00:49:16,080 --> 00:49:19,520 And you can think of a decision tree as a game of 20 questions. 760 00:49:19,520 --> 00:49:22,560 So the decision tree is sort of taking a pixel, say, on my hand, 761 00:49:22,560 --> 00:49:25,320 and trying to decide, OK, I've got to colour that blue 762 00:49:25,320 --> 00:49:28,480 - because that's on the hand rather than on my body. - Yes. 763 00:49:28,480 --> 00:49:31,200 The key to a decision tree is the fact that the 20 questions 764 00:49:31,200 --> 00:49:33,880 that you ask are not the same 765 00:49:33,880 --> 00:49:37,000 for every pixel that we're trying to classify. 766 00:49:37,000 --> 00:49:39,680 And the full set of the possible questions 767 00:49:39,680 --> 00:49:43,080 that could be answered is exponential. 768 00:49:43,080 --> 00:49:46,360 - It's two to the twenty. - Right, OK. That's over a million questions, 769 00:49:46,360 --> 00:49:49,240 a lot of questions you're going to have to program in there. 770 00:49:49,240 --> 00:49:51,080 Yes. It would take far too long 771 00:49:51,080 --> 00:49:55,120 and be far too error-prone for us as humans to program that by hand. 772 00:49:55,120 --> 00:49:58,760 - So, the algorithm's kind of writing itself, or...? - Absolutely. 773 00:50:02,960 --> 00:50:05,520 The algorithm was not designed by Jamie 774 00:50:05,520 --> 00:50:08,960 but instead through a process called machine learning. 775 00:50:11,440 --> 00:50:15,720 It involved showing the algorithm millions of training images, 776 00:50:15,720 --> 00:50:19,320 of bodies in different poses and of various shapes and sizes, 777 00:50:19,320 --> 00:50:23,600 from the very fat to the very thin, the very short to the very tall. 778 00:50:24,640 --> 00:50:28,880 And from this, the algorithm essentially learned by example, 779 00:50:28,880 --> 00:50:31,040 devising its own rules. 780 00:50:34,200 --> 00:50:37,760 Where our intelligence comes in as the designers of the system 781 00:50:37,760 --> 00:50:41,240 is not in programming the algorithm, per se, 782 00:50:41,240 --> 00:50:44,200 but in designing the training data set 783 00:50:44,200 --> 00:50:48,160 to capture all of the kind of variations that we expect to see 784 00:50:48,160 --> 00:50:51,040 when we deploy this system in people's living rooms 785 00:50:51,040 --> 00:50:52,360 to play their games. 786 00:50:52,360 --> 00:50:55,600 So in the end, do you actually know what the algorithm is doing? 787 00:50:55,600 --> 00:50:57,800 We can get a sense of what it's trying to do 788 00:50:57,800 --> 00:50:59,400 and how it's roughly working, 789 00:50:59,400 --> 00:51:02,960 but we couldn't possibly really understand what exactly is going on. 790 00:51:04,960 --> 00:51:09,920 The same approach of machine learning has been used in other applications. 791 00:51:09,920 --> 00:51:14,680 For example, this algorithm is able to do something that for a long time 792 00:51:14,680 --> 00:51:19,560 was thought to be a skill exclusive to neurosurgeons and radiologists. 793 00:51:19,560 --> 00:51:22,800 From an MRI scan, the algorithm can identify 794 00:51:22,800 --> 00:51:26,480 and map a brain tumour in 3-D. 795 00:51:26,480 --> 00:51:29,280 Meaning that a job that normally takes an hour 796 00:51:29,280 --> 00:51:31,360 can be done in a matter of minutes. 797 00:51:34,640 --> 00:51:37,640 Professor Chris Bishop is interested in developing 798 00:51:37,640 --> 00:51:40,880 the concept of machine learning even further. 799 00:51:40,880 --> 00:51:44,680 To create algorithms that can learn just like we do, 800 00:51:44,680 --> 00:51:46,600 directly from experience. 801 00:51:49,160 --> 00:51:52,120 So this demonstration, I think, illustrates the direction 802 00:51:52,120 --> 00:51:54,120 that algorithms will go in the years ahead. 803 00:51:54,120 --> 00:51:57,640 OK, I can see a lot of films up here, so what is the algorithm going to do? 804 00:51:57,640 --> 00:52:00,760 We've got a couple of hundred of the most commonly watched films, 805 00:52:00,760 --> 00:52:02,240 and what it's going to do, 806 00:52:02,240 --> 00:52:06,600 it's going to learn about your personal likes and dislikes. 807 00:52:06,600 --> 00:52:08,080 It's already been trained, 808 00:52:08,080 --> 00:52:11,080 so it's a machine-learning algorithm behind the scenes, 809 00:52:11,080 --> 00:52:14,480 but it's already been trained on data from about 10,000 people. 810 00:52:14,480 --> 00:52:18,160 What it's going to do now is to learn about your preferences. 811 00:52:18,160 --> 00:52:20,200 At the moment it knows nothing about you, 812 00:52:20,200 --> 00:52:22,760 so these films are just arranged at random on the screen. 813 00:52:22,760 --> 00:52:25,440 What I need you to do is to find one of these films, 814 00:52:25,440 --> 00:52:28,120 either one that you like or one that you don't like. 815 00:52:28,120 --> 00:52:31,160 If you like it, you can drag it across to the green region, 816 00:52:31,160 --> 00:52:33,600 if you don't like it, across to the red region. 817 00:52:33,600 --> 00:52:35,600 Rushmore, I'm a big fan of Rushmore. 818 00:52:35,600 --> 00:52:37,560 You like Rushmore? OK, right. 819 00:52:37,560 --> 00:52:41,120 So what's happening now is that if a film is down the right-hand side 820 00:52:41,120 --> 00:52:44,760 - near the green region, it's very confident you'll like it. - OK. 821 00:52:44,760 --> 00:52:46,600 So down here close to the red region, 822 00:52:46,600 --> 00:52:48,560 it's very confident you won't like it. 823 00:52:48,560 --> 00:52:51,400 In the middle, it's 50-50. It doesn't really know. 824 00:52:51,400 --> 00:52:54,320 So if I choose a movie in the middle here, 825 00:52:54,320 --> 00:52:57,680 I'm not a great Austin Powers fan, so let's shoot that one... 826 00:52:57,680 --> 00:53:00,800 So you see, they're beginning to spread out sideways, 827 00:53:00,800 --> 00:53:04,480 - it's going to be a little bit more confident. - It's pretty good. 828 00:53:04,480 --> 00:53:07,480 I'm a big fan of Dr Strangelove 829 00:53:07,480 --> 00:53:11,480 and I'm a big fan of Woody Allen, 830 00:53:11,480 --> 00:53:14,520 but Spinal Tap, it thinks I'll like that. 831 00:53:14,520 --> 00:53:18,040 So that's interesting, so when it was confident you liked them 832 00:53:18,040 --> 00:53:19,800 and you said you liked them, 833 00:53:19,800 --> 00:53:22,920 not much happened because it didn't learn much. 834 00:53:22,920 --> 00:53:25,840 When it was confident you'd like it, in the case of Spinal Tap 835 00:53:25,840 --> 00:53:28,280 and you said, "I don't like it," there was a big change. 836 00:53:28,280 --> 00:53:30,200 It's learning things from me. 837 00:53:30,200 --> 00:53:33,080 I'm actually changing the algorithm as I interact with it. 838 00:53:33,080 --> 00:53:36,520 Exactly. Whereas Kinect was trained in the laboratory and then frozen, 839 00:53:36,520 --> 00:53:38,560 this algorithm continues to adapt 840 00:53:38,560 --> 00:53:41,280 and continues to evolve throughout its life. 841 00:53:41,280 --> 00:53:44,120 The more films that you rate as like and don't like, 842 00:53:44,120 --> 00:53:45,960 the more it knows about you personally 843 00:53:45,960 --> 00:53:48,760 and the better able it is to make good recommendations. 844 00:53:48,760 --> 00:53:52,320 This algorithm is beginning to feel much more human 845 00:53:52,320 --> 00:53:54,840 in the way that it interacts with the world. 846 00:53:54,840 --> 00:53:57,840 Is that your aim, to find a way to produce algorithms 847 00:53:57,840 --> 00:54:00,560 that are a bit like the way that we negotiate the world? 848 00:54:00,560 --> 00:54:03,720 Exactly. It's a step down that very long road to producing machines 849 00:54:03,720 --> 00:54:05,880 that really are as capable as the human brain. 850 00:54:05,880 --> 00:54:08,720 We've a long way to go, but this is a small step in that direction 851 00:54:08,720 --> 00:54:10,160 because it's not fixed any more. 852 00:54:10,160 --> 00:54:12,480 It's now continuing to learn just the same way 853 00:54:12,480 --> 00:54:14,800 that we continue to learn in our daily lives. 854 00:54:19,680 --> 00:54:21,680 I think we're just starting 855 00:54:21,680 --> 00:54:24,240 to realise the full potential of algorithms 856 00:54:24,240 --> 00:54:26,600 and I have one more place I want to visit, 857 00:54:26,600 --> 00:54:28,840 which I'm told will give me a glimpse 858 00:54:28,840 --> 00:54:31,760 of just how much they are able to do for us. 859 00:54:40,600 --> 00:54:43,600 It's a world where almost everything is automated. 860 00:54:46,920 --> 00:54:49,400 Where algorithms are in control. 861 00:54:49,400 --> 00:54:53,920 It's the largest automated grocery warehouse on earth. 862 00:54:53,920 --> 00:54:57,520 It belongs to the online grocery retailer Ocado 863 00:54:57,520 --> 00:55:01,000 and it's the equivalent of 45 supermarkets in one. 864 00:55:02,720 --> 00:55:06,600 Over two million items flow through this warehouse every day. 865 00:55:06,600 --> 00:55:10,360 At any one time, there are something like 7,000 crates 866 00:55:10,360 --> 00:55:12,800 going over 25 kilometres of track, 867 00:55:12,800 --> 00:55:18,360 and controlling every aspect of this astonishing spectacle are algorithms. 868 00:55:25,520 --> 00:55:29,120 Each of those red crates is part of a customer order 869 00:55:29,120 --> 00:55:32,880 and they may go on from here to find other items 870 00:55:32,880 --> 00:55:35,160 that they want across the warehouse, 871 00:55:35,160 --> 00:55:37,280 until they are eventually finished, 872 00:55:37,280 --> 00:55:41,360 loaded onto a van and then driven out by our routing system 873 00:55:41,360 --> 00:55:43,720 on a route, which in many ways, 874 00:55:43,720 --> 00:55:47,360 is solving problems like the travelling salesman problem. 875 00:55:47,360 --> 00:55:49,720 There are decisions being made all over the place 876 00:55:49,720 --> 00:55:52,240 as a red crate goes this way and then that way. 877 00:55:52,240 --> 00:55:55,600 The complexity behind all this is beyond 878 00:55:55,600 --> 00:55:58,760 what any human could control or solve, 879 00:55:58,760 --> 00:56:01,760 and that is where these algorithms, 880 00:56:01,760 --> 00:56:03,960 these problem-solving techniques come in 881 00:56:03,960 --> 00:56:05,920 to overcome those challenges. 882 00:56:11,000 --> 00:56:15,480 Everywhere you look, the invisible hand of the algorithm is at work. 883 00:56:16,560 --> 00:56:20,360 Forecasting algorithms monitor and replenish the stock 884 00:56:20,360 --> 00:56:24,720 of more than 43,000 products, anticipating customer demand. 885 00:56:26,760 --> 00:56:29,840 Control system algorithms manage the traffic 886 00:56:29,840 --> 00:56:33,320 of the more than 7,000 crates around the warehouse. 887 00:56:36,360 --> 00:56:39,800 And van routing algorithms control the movement of the fleet 888 00:56:39,800 --> 00:56:41,960 of over 1,500 vans, 889 00:56:41,960 --> 00:56:46,240 testing over four million different route combinations every second. 890 00:56:48,120 --> 00:56:51,160 You can almost see the mind of the machine at work 891 00:56:51,160 --> 00:56:54,360 and it's not a static process, so that's why there is a huge amount 892 00:56:54,360 --> 00:56:59,520 of machine learning in here, so it's like a self-adapting organism. 893 00:56:59,520 --> 00:57:02,200 It's constantly having to learn how to do it better. 894 00:57:02,200 --> 00:57:04,360 People couldn't do that. 895 00:57:04,360 --> 00:57:06,600 The machine has to tune itself. 896 00:57:10,640 --> 00:57:14,080 So who would you say was actually in control of the whole thing? 897 00:57:14,080 --> 00:57:17,400 Ultimately, it's the algorithms that are in control. 898 00:57:17,400 --> 00:57:19,960 I think I'm getting algorithmic hot flushes 899 00:57:19,960 --> 00:57:22,080 by looking at this amazing thing! 900 00:57:24,440 --> 00:57:26,560 In some sense, this warehouse is like 901 00:57:26,560 --> 00:57:28,880 a little microcosm of the modern world. 902 00:57:28,880 --> 00:57:32,640 Algorithms are running everything from search engines on the internet, 903 00:57:32,640 --> 00:57:35,680 sat nav, even keeping our credit cards secure. 904 00:57:35,680 --> 00:57:39,680 Our world wouldn't function without the power of these algorithms. 905 00:57:45,440 --> 00:57:48,920 The Open University have produced a free pack for you to learn, 906 00:57:48,920 --> 00:57:52,880 create and discover more about digital technology past and present. 907 00:57:52,880 --> 00:57:55,280 To order your copy, phone... 908 00:57:58,560 --> 00:58:00,080 ..or follow the link below 909 00:58:00,080 --> 00:58:01,680 to the Open University. 80112

Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.