Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
0
00:00:00,000 --> 00:00:03,370
MICHAEL HEMANN: So let's put in some numbers here.
1
00:00:03,370 --> 00:00:07,260
So we have just that formula again, 100 times
2
00:00:07,260 --> 00:00:13,830
T plus 6NPD over 2E.
3
00:00:13,830 --> 00:00:27,140
So say you have 75 PDs, you have 20 T's, and you have 5 NPDs.
4
00:00:27,140 --> 00:00:30,950
So here we have your distance in centimorgans
5
00:00:30,950 --> 00:00:38,120
equals 100 times the number of T's plus 6 times the number
6
00:00:38,120 --> 00:00:42,660
of NPDs over 2 times the total number,
7
00:00:42,660 --> 00:00:56,400
which is 100, which equals 50 over 200, or 25 centimorgans.
8
00:00:56,400 --> 00:00:57,900
So think about this.
9
00:00:57,900 --> 00:01:01,740
If you just counted random gametes,
10
00:01:01,740 --> 00:01:07,464
so if you just counted random gametes, you would have,
11
00:01:07,464 --> 00:01:10,830
obviously, T's, and NPDs, and PDs, but you're
12
00:01:10,830 --> 00:01:13,545
just going to count just single cells without regard.
13
00:01:13,545 --> 00:01:16,890
14
00:01:16,890 --> 00:01:23,460
So you'd expect for every T you have,
15
00:01:23,460 --> 00:01:33,720
so you have 20 T's, you'd have two recombinants within those
16
00:01:33,720 --> 00:01:35,040
T's.
17
00:01:35,040 --> 00:01:36,315
So that would be the T class.
18
00:01:36,315 --> 00:01:39,490
19
00:01:39,490 --> 00:01:45,308
Plus you'd have four recombinants for every NPD
20
00:01:45,308 --> 00:01:45,850
that you had.
21
00:01:45,850 --> 00:01:47,980
So you're just counting these randomly.
22
00:01:47,980 --> 00:01:52,590
So T's and NPDs.
23
00:01:52,590 --> 00:02:01,250
So if you looked at 100 times 40, so 20 times 2
24
00:02:01,250 --> 00:02:08,100
plus 5 times 4 over the total number of gametes
25
00:02:08,100 --> 00:02:14,680
that you have, so it's 100 times 4,
26
00:02:14,680 --> 00:02:17,140
it would give you 15 centimorgans.
27
00:02:17,140 --> 00:02:20,410
If, essentially, we're not taking the double recombinants
28
00:02:20,410 --> 00:02:24,460
into account, we're systematically underestimating
29
00:02:24,460 --> 00:02:25,930
genetic distances.
30
00:02:25,930 --> 00:02:30,100
And so this ability to look at tetrad types
31
00:02:30,100 --> 00:02:34,630
allows you to essentially infer the existence
32
00:02:34,630 --> 00:02:38,490
of double crossovers based on the number of NPDs,
33
00:02:38,490 --> 00:02:41,170
so the number of double crossovers that we don't see.
34
00:02:41,170 --> 00:02:43,840
The double crossovers that are tetra types,
35
00:02:43,840 --> 00:02:47,620
or the double crossovers that come from--
36
00:02:47,620 --> 00:02:50,120
that look like parental ditypes you remember here.
37
00:02:50,120 --> 00:02:53,350
So for example, in this case where
38
00:02:53,350 --> 00:02:55,870
we have a first crossover of 2, 3,
39
00:02:55,870 --> 00:02:59,020
and a second crossover of 2, 3, this
40
00:02:59,020 --> 00:03:01,540
resulted in a parental ditype.
41
00:03:01,540 --> 00:03:04,480
But in reality, there were two recombination events here.
42
00:03:04,480 --> 00:03:06,040
We're just not seeing them, because
43
00:03:06,040 --> 00:03:09,700
the second recombination shifted the initial recombination
44
00:03:09,700 --> 00:03:10,820
event back.
45
00:03:10,820 --> 00:03:15,490
So what the NPD stuff does is it allows us to look at--
46
00:03:15,490 --> 00:03:18,820
infer the existence of these double crossover events.
47
00:03:18,820 --> 00:03:21,830
Now importantly, this formula can be applied
48
00:03:21,830 --> 00:03:25,240
if there are no NPD events.
49
00:03:25,240 --> 00:03:31,640
So if there are no NPD events, it's just 100 times
50
00:03:31,640 --> 00:03:38,630
T plus 6 times 0 over 2E, which equals
51
00:03:38,630 --> 00:03:43,220
T over 2E, which is just the formula that we had before
52
00:03:43,220 --> 00:03:44,630
for a tightly linked gene.
53
00:03:44,630 --> 00:03:49,118
So you can include it, even as a general formula.
54
00:03:49,118 --> 00:03:50,660
And you can also think OK, well, what
55
00:03:50,660 --> 00:03:55,070
happens if you had totally unlinked genes?
56
00:03:55,070 --> 00:03:59,450
So we would expect this ratio of one to four to one
57
00:03:59,450 --> 00:04:00,450
if they're unlinked.
58
00:04:00,450 --> 00:04:11,030
So 10PD to 40T to 10NPD.
59
00:04:11,030 --> 00:04:15,440
So what happens if we actually put-- so unlinked numbers
60
00:04:15,440 --> 00:04:16,800
into this equation?
61
00:04:16,800 --> 00:04:23,570
So say we have 100 times the number
62
00:04:23,570 --> 00:04:35,860
of T's, so in this case, say, 40 T's, plus the number of NPDs
63
00:04:35,860 --> 00:04:43,240
times 6 over the total, or 2E, which would be 2 times 60.
64
00:04:43,240 --> 00:04:46,970
I'm just putting in these numbers here into that formula.
65
00:04:46,970 --> 00:05:02,008
We would get 100 times 100 over 120, which is something
66
00:05:02,008 --> 00:05:02,800
like, I don't know.
67
00:05:02,800 --> 00:05:06,280
What is it, 83 centimorgans?
68
00:05:06,280 --> 00:05:08,320
So if they're unlinked, it's giving us
69
00:05:08,320 --> 00:05:11,680
a number that's greater than 50 centimorgans, which suggests
70
00:05:11,680 --> 00:05:13,870
that they're in fact unlinked.
71
00:05:13,870 --> 00:05:17,620
So it allows us to generally put in numbers, regardless
72
00:05:17,620 --> 00:05:19,750
whether they are linked or unlinked,
73
00:05:19,750 --> 00:05:21,850
to get some kind of genetic distance.
74
00:05:21,850 --> 00:05:24,910
And again, if you're going to try to exclude some hypothesis
75
00:05:24,910 --> 00:05:30,118
or take another into account, your numbers are going to--
76
00:05:30,118 --> 00:05:31,660
the numbers that you have, the number
77
00:05:31,660 --> 00:05:38,470
of tetrads that you analyze are going to matter, absolutely.
78
00:05:38,470 --> 00:05:42,850
So numbers do matter here in making a statistical inference
79
00:05:42,850 --> 00:05:44,590
based on this data.
80
00:05:44,590 --> 00:05:48,910
But with the data in general, or what the approach generally
81
00:05:48,910 --> 00:05:51,580
allows you to do is remember, we had this issue
82
00:05:51,580 --> 00:06:05,570
before comparing genetic distance and physical distance
83
00:06:05,570 --> 00:06:09,290
where essentially in the other systems,
84
00:06:09,290 --> 00:06:12,620
in the Drosophila system and also in mammalian systems,
85
00:06:12,620 --> 00:06:16,220
this is really breaking down early on.
86
00:06:16,220 --> 00:06:23,700
So basically at 50 centimorgans, there's
87
00:06:23,700 --> 00:06:30,180
really no ability to discern any distance whatsoever, but really
88
00:06:30,180 --> 00:06:32,310
breaks down significantly before them.
89
00:06:32,310 --> 00:06:37,420
With yeast and tetrad analysis, we
90
00:06:37,420 --> 00:06:40,990
can actually look at accurate interactions,
91
00:06:40,990 --> 00:06:44,200
maybe up until 40 centimorgans or so.
92
00:06:44,200 --> 00:06:49,120
Once you start getting to greater genetic distances,
93
00:06:49,120 --> 00:06:51,550
the recombination rates, or double recombination rates
94
00:06:51,550 --> 00:06:54,220
really obscure this connection between genetic distance
95
00:06:54,220 --> 00:06:55,370
and physical distance.
96
00:06:55,370 --> 00:07:00,370
So this tetrad analysis again provides much more accurate
97
00:07:00,370 --> 00:07:05,230
measures of genetic distance.
98
00:07:05,230 --> 00:07:10,000
The real truth here is that that 50 centimorgan distance
99
00:07:10,000 --> 00:07:14,890
is the distance at which recombination distances are
100
00:07:14,890 --> 00:07:18,465
impossible to measure, because 50 means 50% recombination.
101
00:07:18,465 --> 00:07:19,840
This is a really good point, Sky,
102
00:07:19,840 --> 00:07:21,880
and I appreciate you bringing it up.
103
00:07:21,880 --> 00:07:24,740
50 centimorgan distance means it's just equally probable.
104
00:07:24,740 --> 00:07:25,990
This is 50% recombination.
105
00:07:25,990 --> 00:07:27,590
So it's random chance.
106
00:07:27,590 --> 00:07:32,440
So the reality is that in most of the systems that we use,
107
00:07:32,440 --> 00:07:35,440
the connections really start breaking down
108
00:07:35,440 --> 00:07:38,920
at even much lower, 30 centimorgans, where we're just
109
00:07:38,920 --> 00:07:42,070
seeing a very high rate of double recombination that
110
00:07:42,070 --> 00:07:45,160
is making us systematically underestimate
111
00:07:45,160 --> 00:07:46,120
genetic distances.
112
00:07:46,120 --> 00:07:49,910
And so even yeast recombination breaks down at 50 centimorgans.
113
00:07:49,910 --> 00:07:52,930
So regardless how robust the systems are,
114
00:07:52,930 --> 00:07:55,330
the reality is, in yeast, it's very accurate
115
00:07:55,330 --> 00:07:58,210
up until greater than 40 centimorgans.
116
00:07:58,210 --> 00:08:03,430
And in people or in flies, it's not
117
00:08:03,430 --> 00:08:05,080
accurate at 40 centimorgans.
118
00:08:05,080 --> 00:08:11,430
So this distance-- you are right in that the absolute bar,
119
00:08:11,430 --> 00:08:17,420
the absolute barrier here is 50 centimorgans.
120
00:08:17,420 --> 00:08:18,680
That is correct.
121
00:08:18,680 --> 00:08:22,250
But if we think about a point below there,
122
00:08:22,250 --> 00:08:26,290
say 40 centimorgans it's already broken down
123
00:08:26,290 --> 00:08:29,390
if you look at random gametes, and it's still
124
00:08:29,390 --> 00:08:31,780
a pretty good correlation between physical distance
125
00:08:31,780 --> 00:08:34,990
and genetic distance if you're looking at yeast tetrad
126
00:08:34,990 --> 00:08:35,679
analysis.
127
00:08:35,679 --> 00:08:38,500
128
00:08:38,500 --> 00:08:41,140
Again, I think that the reason why
129
00:08:41,140 --> 00:08:42,760
we talk about this, and the reason
130
00:08:42,760 --> 00:08:45,280
why we introduced this idea is that it's an interesting way
131
00:08:45,280 --> 00:08:50,470
to think about the way that meiosis occurs and introduces
132
00:08:50,470 --> 00:08:53,950
the possibility of what you could see if you could actually
133
00:08:53,950 --> 00:08:55,750
follow all of those recombination events
134
00:08:55,750 --> 00:08:58,330
and see all of the haploids that are generated
135
00:08:58,330 --> 00:09:00,700
from this double recombination event.
136
00:09:00,700 --> 00:09:04,000
And so it's why I think a lot of initial genetics work
137
00:09:04,000 --> 00:09:05,500
and really significant genetics work
138
00:09:05,500 --> 00:09:08,650
has been done in systems like yeast that allow us to study
139
00:09:08,650 --> 00:09:11,640
recombination really well.
10747
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.