All language subtitles for 06 - Solution Perform a Bayesian analysis

af Afrikaans
sq Albanian
am Amharic
ar Arabic Download
hy Armenian
az Azerbaijani
eu Basque
be Belarusian
bn Bengali
bs Bosnian
bg Bulgarian
ca Catalan
ceb Cebuano
ny Chichewa
zh-CN Chinese (Simplified)
zh-TW Chinese (Traditional)
co Corsican
hr Croatian
cs Czech
da Danish
nl Dutch
en English
eo Esperanto
et Estonian
tl Filipino
fi Finnish
fr French
fy Frisian
gl Galician
ka Georgian
de German
el Greek
gu Gujarati
ht Haitian Creole
ha Hausa
haw Hawaiian
iw Hebrew
hi Hindi
hmn Hmong
hu Hungarian
is Icelandic
ig Igbo
id Indonesian
ga Irish
it Italian
ja Japanese
jw Javanese
kn Kannada
kk Kazakh
km Khmer
ko Korean
ku Kurdish (Kurmanji)
ky Kyrgyz
lo Lao
la Latin
lv Latvian
lt Lithuanian
lb Luxembourgish
mk Macedonian
mg Malagasy
ms Malay
ml Malayalam
mt Maltese
mi Maori
mr Marathi
mn Mongolian
my Myanmar (Burmese)
ne Nepali
no Norwegian
ps Pashto
fa Persian
pl Polish
pt Portuguese
pa Punjabi
ro Romanian
ru Russian
sm Samoan
gd Scots Gaelic
sr Serbian
st Sesotho
sn Shona
sd Sindhi
si Sinhala
sk Slovak
sl Slovenian
so Somali
es Spanish
su Sundanese
sw Swahili
sv Swedish
tg Tajik
ta Tamil
te Telugu
th Thai
tr Turkish
uk Ukrainian
ur Urdu
uz Uzbek
vi Vietnamese
cy Welsh
xh Xhosa
yi Yiddish
yo Yoruba
zu Zulu
or Odia (Oriya)
rw Kinyarwanda
tk Turkmen
tt Tatar
ug Uyghur
Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated: 1 00:00:00,000 --> 00:00:05,002 (upbeat synth music) 2 00:00:05,002 --> 00:00:07,008 - [Instructor] In the previous movie, I described a problem 3 00:00:07,008 --> 00:00:10,007 where you will determine the probability, 4 00:00:10,007 --> 00:00:13,002 an item is genuine when it's reported genuine. 5 00:00:13,002 --> 00:00:15,004 And an item is counterfeit when reported 6 00:00:15,004 --> 00:00:18,007 counterfeit by museum appraisers and curators 7 00:00:18,007 --> 00:00:21,006 based on the base rate of genuine items, 8 00:00:21,006 --> 00:00:23,009 comprising 70% of the collection 9 00:00:23,009 --> 00:00:27,008 and an appraiser accuracy rate of 90%. 10 00:00:27,008 --> 00:00:29,006 So let's go ahead and fill in the worksheet 11 00:00:29,006 --> 00:00:32,003 and perform our calculations. 12 00:00:32,003 --> 00:00:34,001 I'll click in cell B6. 13 00:00:34,001 --> 00:00:37,005 The probability in item is genuine is given in B3. 14 00:00:37,005 --> 00:00:41,003 So in B6 I'll, type equal and B3. 15 00:00:41,003 --> 00:00:44,000 That means that the probability it is counterfeit 16 00:00:44,000 --> 00:00:48,000 or fake, not real is 1 minus 70%, 17 00:00:48,000 --> 00:00:50,004 so in B7 we'll type equal 18 00:00:50,004 --> 00:00:53,005 one minus and our base rates in B3, so enter. 19 00:00:53,005 --> 00:00:56,008 70 and 30 is 100, which is correct. 20 00:00:56,008 --> 00:00:59,006 Now we need to determine the probability 21 00:00:59,006 --> 00:01:03,004 the appraisers are correct or incorrect. 22 00:01:03,004 --> 00:01:06,001 So in B9, I'll type equal. 23 00:01:06,001 --> 00:01:08,002 The probability they're correct is their accuracy, 24 00:01:08,002 --> 00:01:10,003 so that's in B4. 25 00:01:10,003 --> 00:01:14,004 And as with the probability of an item being counterfeit 26 00:01:14,004 --> 00:01:16,004 the probability of them being incorrect 27 00:01:16,004 --> 00:01:19,005 is 1 minus the probability they are correct. 28 00:01:19,005 --> 00:01:20,007 So in B10 29 00:01:20,007 --> 00:01:22,002 equal 1 minus 30 00:01:22,002 --> 00:01:24,008 B4 and enter, 31 00:01:24,008 --> 00:01:26,000 and we get 10%. 32 00:01:26,000 --> 00:01:27,009 90 plus 10 equals 100, 33 00:01:27,009 --> 00:01:29,009 so we're good there. 34 00:01:29,009 --> 00:01:32,004 Now we can go over to our classification matrix 35 00:01:32,004 --> 00:01:37,001 and calculate the probability of each one of our scenarios. 36 00:01:37,001 --> 00:01:41,001 So we have an item actually being genuine 37 00:01:41,001 --> 00:01:43,000 and being reported genuine. 38 00:01:43,000 --> 00:01:45,007 So in E4 I'll type equal 39 00:01:45,007 --> 00:01:51,002 and the base rate of them being genuine is 70%. 40 00:01:51,002 --> 00:01:54,005 And I have that probability in B6. 41 00:01:54,005 --> 00:01:56,000 And then I want to multiply that 42 00:01:56,000 --> 00:01:59,005 by the probability that the curator is correct, 43 00:01:59,005 --> 00:02:02,004 that's in B9 and enter. 44 00:02:02,004 --> 00:02:04,008 So that's the probability for that scenario. 45 00:02:04,008 --> 00:02:08,006 Now, in E5, I need to give the probability 46 00:02:08,006 --> 00:02:11,003 that an item is reported as genuine 47 00:02:11,003 --> 00:02:13,005 but is actually counterfeit, 48 00:02:13,005 --> 00:02:17,003 so in other words, a mistake was made, so equal. 49 00:02:17,003 --> 00:02:20,006 And I will have an item 50 00:02:20,006 --> 00:02:23,000 is being counterfeit. 51 00:02:23,000 --> 00:02:25,000 that is in B7. 52 00:02:25,000 --> 00:02:28,000 And we'll multiply that by the probability, 53 00:02:28,000 --> 00:02:31,004 that the appraiser is incorrect, 54 00:02:31,004 --> 00:02:35,004 and that is in cell B10 and enter, 55 00:02:35,004 --> 00:02:37,007 so we have 3%. 56 00:02:37,007 --> 00:02:41,002 Now we have reported counterfeit and actually genuine, 57 00:02:41,002 --> 00:02:43,000 so again, a mistake was made. 58 00:02:43,000 --> 00:02:46,003 So we in F4 we'll type in equal sign 59 00:02:46,003 --> 00:02:48,008 and we're multiplying the probability 60 00:02:48,008 --> 00:02:50,008 that the item is actually genuine, 61 00:02:50,008 --> 00:02:53,003 that is in B6. 62 00:02:53,003 --> 00:02:56,009 By the probability that the appraiser made 63 00:02:56,009 --> 00:02:58,007 a mistake that's in B10, enter. 64 00:02:58,007 --> 00:03:00,007 So that's 7%. 65 00:03:00,007 --> 00:03:03,006 And then in F5, we need to calculate the probability 66 00:03:03,006 --> 00:03:06,009 that an appraiser correctly identified a counterfeit object. 67 00:03:06,009 --> 00:03:11,002 So in F5 equal, and we are multiplying the probability 68 00:03:11,002 --> 00:03:14,000 that the item is counterfeit that's in B7. 69 00:03:14,000 --> 00:03:17,000 By the probability that the appraiser is correct, 70 00:03:17,000 --> 00:03:18,007 that's in B9, enter. 71 00:03:18,007 --> 00:03:20,009 And we get 27%. 72 00:03:20,009 --> 00:03:22,009 And we can look at our values 73 00:03:22,009 --> 00:03:25,006 to make sure that everything lines up. 74 00:03:25,006 --> 00:03:29,003 So our correct guesses are 63% and 27% 75 00:03:29,003 --> 00:03:32,006 of the time that's 90, that is our probability, correct. 76 00:03:32,006 --> 00:03:34,006 And 7+3 is 10, 77 00:03:34,006 --> 00:03:36,001 and that is the probability 78 00:03:36,001 --> 00:03:39,002 of the appraiser being incorrect. 79 00:03:39,002 --> 00:03:41,004 Now we can calculate the probability 80 00:03:41,004 --> 00:03:43,008 that an item is genuine when reported genuine 81 00:03:43,008 --> 00:03:46,008 and counterfeit reported counterfeit. 82 00:03:46,008 --> 00:03:50,002 So I'll click in cell E8 type an equal sign. 83 00:03:50,002 --> 00:03:52,004 And we need to calculate the percentage 84 00:03:52,004 --> 00:03:54,008 of times that an item is reported as genuine 85 00:03:54,008 --> 00:03:56,006 and it actually is. 86 00:03:56,006 --> 00:03:59,009 So reported genuine is and cell E4 87 00:03:59,009 --> 00:04:02,004 divide it by the total number of times 88 00:04:02,004 --> 00:04:05,006 an item is identified as genuine, 89 00:04:05,006 --> 00:04:06,009 both right and wrong. 90 00:04:06,009 --> 00:04:10,001 And that is adding E4 and E5. 91 00:04:10,001 --> 00:04:14,007 So I'll type the left parenthesis E4+E5, 92 00:04:14,007 --> 00:04:16,003 right parenthesis and tab, 93 00:04:16,003 --> 00:04:18,007 and we get 95%. 94 00:04:18,007 --> 00:04:21,001 Now we can calculate the probability 95 00:04:21,001 --> 00:04:23,004 an item is actually counterfeit 96 00:04:23,004 --> 00:04:25,000 when it's reported counterfeit. 97 00:04:25,000 --> 00:04:27,002 So in F8 I'll type equal 98 00:04:27,002 --> 00:04:29,002 and we need to divide 99 00:04:29,002 --> 00:04:31,008 the correct guesses by total guesses. 100 00:04:31,008 --> 00:04:35,001 For this column, the correct guesses are in F5, 101 00:04:35,001 --> 00:04:38,002 then I'll type a forward slash for division, 102 00:04:38,002 --> 00:04:40,007 left parentheses and we need to add the total number 103 00:04:40,007 --> 00:04:43,001 of times that an item is reported as counterfeit, 104 00:04:43,001 --> 00:04:44,004 both right and wrong. 105 00:04:44,004 --> 00:04:48,006 So that's F4+ F5, right parentheses and enter, 106 00:04:48,006 --> 00:04:51,001 and we get 79%. 107 00:04:51,001 --> 00:04:54,003 And so it appears that our team will do a very good job 108 00:04:54,003 --> 00:04:57,009 of identifying genuine items with very little error 109 00:04:57,009 --> 00:04:59,000 and still a pretty good 110 00:04:59,000 --> 00:05:02,008 but not quite as good job identifying counterfeits. 111 00:05:02,008 --> 00:05:04,005 I hope you enjoyed this problem. 112 00:05:04,005 --> 00:05:06,008 Bayesian analysis is very different 113 00:05:06,008 --> 00:05:09,009 from other types analysis that we typically do in Excel, 114 00:05:09,009 --> 00:05:13,006 so don't worry if you didn't get it right on the first time. 115 00:05:13,006 --> 00:05:15,008 Take a few minutes, walk away from the workbook, 116 00:05:15,008 --> 00:05:18,000 come back and feel free to try again. 8638

Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.