All language subtitles for nlm040722

1 00:00:05,520 --> 00:00:10,480 ALL RIGHT, THIS MEETING OF THE 2 00:00:10,480 --> 00:00:13,560 NATIONAL LIBRARY OF MEDICINE IS 3 00:00:13,560 --> 00:00:16,320 NOW BEING CALLED TO ORDER AND 4 00:00:16,320 --> 00:00:19,400 BEING BROADCAST LIVE AND WILL BE 5 00:00:19,400 --> 00:00:21,920 RECORDED FOR ON-DEMAND VIEWING. 6 00:00:21,920 --> 00:00:24,880 I'D LIKE TO WELCOME THE 7 00:00:24,880 --> 00:00:27,720 COMMITTEE AND THANK THEM FOR 8 00:00:27,720 --> 00:00:30,400 THEIR HARD WORK AND PARTICULARLY 9 00:00:30,400 --> 00:00:33,720 FOR THOSE DOING REVIEW. IT'S NOT 10 00:00:33,720 --> 00:00:36,160 EASY TO DO WHEN IT'S NOT YOUR 11 00:00:36,160 --> 00:00:36,960 AREA. 12 00:00:36,960 --> 00:00:38,920 I'M APPRECIATIVE OF THE WORK. 13 00:00:38,920 --> 00:00:40,760 BEFORE WE BEGIN, A FEW 14 00:00:40,760 --> 00:00:42,480 HOUSEKEEPING ANNOUNCEMENTS. 15 00:00:42,480 --> 00:00:45,520 THE FIRST IS TO USE THE RAISE 16 00:00:45,520 --> 00:00:46,240 HAND FEATURE IN THE ZOOM TO BE 17 00:00:46,240 --> 00:00:49,640 RECOGNIZED BY THE CHAIR. 18 00:00:49,640 --> 00:00:51,320 IF YOU HAVE ISSUES I'LL ALSO BE 19 00:00:51,320 --> 00:00:52,640 MONITORING THE CHAT STREAM AS 20 00:00:52,640 --> 00:00:55,000 WELL AND I WILL CALL ON PEOPLE 21 00:00:55,000 --> 00:01:00,280 WITH RAISED HANDS IN THE ORDER I 22 00:01:00,280 --> 00:01:02,800 SEE THEM RAISED. 23 00:01:02,800 --> 00:01:06,280 AND IF YOU NEED ASSISTANCE, 24 00:01:06,280 --> 00:01:07,280 OPERATIONS WILL BE MONITORING 25 00:01:07,280 --> 00:01:10,880 THE CHAT WINDOW THROUGHOUT THE 26 00:01:10,880 --> 00:01:12,600 MEETING AND ABLE TO PROVIDE TECHNICAL 27 00:01:12,600 --> 00:01:14,000 ASSISTANCE AS NEEDED AND REACH 28 00:01:14,000 --> 00:01:15,960 THEM BY E-MAIL AS WELL AND I'LL 29 00:01:15,960 --> 00:01:19,960 PASTE THAT INTO THE CHAT WINDOW. 
30 00:01:19,960 --> 00:01:21,880 IN TERMS OF MICS AND CAMERAS 31 00:01:21,880 --> 00:01:23,520 BECAUSE IT CAN BE A LARGE 32 00:01:23,520 --> 00:01:25,320 AUDIENCE AND WE ARE DISTRIBUTED, 33 00:01:25,320 --> 00:01:28,600 BE MINDFUL OF YOUR CAMERA AND 34 00:01:28,600 --> 00:01:30,520 MICROPHONE AND BE SURE TO MUTE 35 00:01:30,520 --> 00:01:32,400 YOURSELF WHEN NOT PARTICIPATING 36 00:01:32,400 --> 00:01:34,120 OR PRESENTING AND MEDIA 37 00:01:34,120 --> 00:01:36,040 OPERATIONS WILL ALSO HELP BY 38 00:01:36,040 --> 00:01:38,400 SWITCHING OFF CAMERAS AND MICS 39 00:01:38,400 --> 00:01:41,600 TO MAINTAIN AUDIO IN CASE 40 00:01:41,600 --> 00:01:44,120 THERE ARE BANDWIDTH DROPOUTS AND 41 00:01:44,120 --> 00:01:45,760 KEEP THE MEETING FROM BEING 42 00:01:45,760 --> 00:01:46,480 DISTRACTED. 43 00:01:46,480 --> 00:01:49,360 THERE WILL BE CLOSED SESSIONS 44 00:01:49,360 --> 00:01:50,480 WHERE WE WILL BE DISCUSSING 45 00:01:50,480 --> 00:01:52,720 INTERNALLY AS A BOARD OF 46 00:01:52,720 --> 00:01:54,480 SCIENTIFIC COUNSELORS TO PEOPLE 47 00:01:54,480 --> 00:01:58,440 MAKING THE PRESENTATIONS. 48 00:01:58,440 --> 00:02:02,440 SO FAR -- IF THERE ARE NO 49 00:02:02,440 --> 00:02:03,400 QUESTIONS WE CAN GO AHEAD AND 50 00:02:03,400 --> 00:02:05,600 GET STARTED. I'LL GIVE A MOMENT 51 00:02:05,600 --> 00:02:07,360 FOR QUESTIONS IN THE CHAT AND SO 52 00:02:07,360 --> 00:02:21,920 FAR I DON'T SEE ANY. 53 00:02:21,920 --> 00:02:35,720 ALL RIGHT. 54 00:02:35,720 --> 00:02:40,040 >> ARE YOU READY FOR ME TO TAKE 55 00:02:40,040 --> 00:02:42,000 OVER? 56 00:02:42,000 --> 00:02:45,280 >> I SHOULD HAVE SAID, PATTI, I'M READY 57 00:02:45,280 --> 00:02:46,720 FOR YOU. 58 00:02:46,720 --> 00:02:48,840 >> THANK YOU FOR BEING HERE AND 59 00:02:48,840 --> 00:02:50,520 PARTICULARLY TO THE BOARD OF 60 00:02:50,520 --> 00:02:51,320 SCIENTIFIC COUNSELORS. 
61 00:02:51,320 --> 00:02:53,120 YOUR WORK IS REALLY IMPORTANT TO 62 00:02:53,120 --> 00:02:55,120 US AND I REALLY APPRECIATE THE 63 00:02:55,120 --> 00:02:56,920 EFFORT THAT GOES INTO THIS AS 64 00:02:56,920 --> 00:02:59,600 WE'RE PROGRESSING IN OUR 65 00:02:59,600 --> 00:03:01,640 DEVELOPMENT OF A UNIFIED 66 00:03:01,640 --> 00:03:02,320 SCIENTIFIC PROGRAM WITHIN THE 67 00:03:02,320 --> 00:03:04,200 INTRAMURAL RESEARCH PROGRAM. THE 68 00:03:04,200 --> 00:03:06,640 GUIDANCE FROM THE BOARD OF 69 00:03:06,640 --> 00:03:10,320 SCIENTIFIC COUNSELORS GIVES 70 00:03:10,320 --> 00:03:11,480 PARTICULAR ASSISTANCE TO OUR NIH 71 00:03:11,480 --> 00:03:12,800 PROGRAM LEADERSHIP SO YOUR 72 00:03:12,800 --> 00:03:14,400 EFFORTS ARE APPRECIATED. 73 00:03:14,400 --> 00:03:16,160 I WELCOME YOU TO WHAT WILL 74 00:03:16,160 --> 00:03:17,680 LOOK LIKE A BUILDING FULL OF 75 00:03:17,680 --> 00:03:20,920 PEOPLE IN A FEW WEEKS BUT STILL IS 76 00:03:20,920 --> 00:03:22,520 NOT A BUILDING FULL OF PEOPLE. 77 00:03:22,520 --> 00:03:25,120 WE'RE IN THE PROCESS OF 78 00:03:25,120 --> 00:03:26,160 RETURNING TO THE PHYSICAL WORK 79 00:03:26,160 --> 00:03:32,280 SPACE AT THE NIH. 80 00:03:32,280 --> 00:03:35,320 THE NATIONAL LIBRARY OF MEDICINE 81 00:03:35,320 --> 00:03:40,520 IS BEING RENOVATED WITH NEW HVAC 82 00:03:40,520 --> 00:03:42,520 AND RE-PURPOSING SPACES AND I'LL 83 00:03:42,520 --> 00:03:44,920 GIVE PICTURES OF HOW THAT'S COMING 84 00:03:44,920 --> 00:03:45,120 ALONG. 85 00:03:45,120 --> 00:03:47,360 MOST OF THE NATIONAL LIBRARY OF 86 00:03:47,360 --> 00:03:49,800 MEDICINE STAFF ON CAMPUS IS 87 00:03:49,800 --> 00:04:05,840 EITHER IN BUILDING 38 OR NATCHER 88 00:04:05,840 --> 00:04:06,480 BUILDING BUT NIH HAS BENEFITTED 89 00:04:06,480 --> 00:04:07,120 DURING THIS CONSTRUCTION PHASE 90 00:04:07,120 --> 00:04:09,720 BECAUSE WE'VE HAD SOMEWHAT 91 00:04:09,720 --> 00:04:12,000 UNANTICIPATED LESS DISRUPTION. 
92 00:04:12,000 --> 00:04:15,000 WHEN WE RETURN, THE NIH 93 00:04:15,000 --> 00:04:16,760 ANTICIPATES MAXIMUM FLEXIBILITIES 94 00:04:16,760 --> 00:04:19,360 FOR OUR STAFF AND OUR TEAM HAS 95 00:04:19,360 --> 00:04:21,240 EVALUATED EVERY ONE OF OUR 96 00:04:21,240 --> 00:04:22,240 POSITIONS AT THE NATIONAL 97 00:04:22,240 --> 00:04:25,240 LIBRARY OF MEDICINE TO DETERMINE 98 00:04:25,240 --> 00:04:27,320 WHICH POSITIONS ARE MISSION 99 00:04:27,320 --> 00:04:29,000 CRITICAL AND HAVE FLEXIBILITY IN 100 00:04:29,000 --> 00:04:29,720 THE WORK PROCESSES. 101 00:04:29,720 --> 00:04:35,040 WE IDENTIFIED THE MAXIMUM 102 00:04:35,040 --> 00:04:38,320 FLEXIBILITIES AND WE'LL BE 103 00:04:38,320 --> 00:04:39,760 GIVING GUIDANCE TO SUPERVISORS 104 00:04:39,760 --> 00:04:41,680 TO WORK WITH INDIVIDUALS ON HOW 105 00:04:41,680 --> 00:04:43,120 MUCH FLEXIBILITY THEY CAN 106 00:04:43,120 --> 00:04:44,960 EXPECT OR WOULD LIKE TO HAVE. 107 00:04:44,960 --> 00:04:47,320 NO ONE WILL BE REQUIRED TO TAKE 108 00:04:47,320 --> 00:04:49,520 ANY WORKPLACE FLEXIBILITY WHICH 109 00:04:49,520 --> 00:04:50,520 MEANS WE'LL BE CONTINUING TO 110 00:04:50,520 --> 00:04:51,640 PROVIDE WORKPLACE SUPPORT FOR 111 00:04:51,640 --> 00:04:57,280 STAFF AS NEEDED AND FOR STAFF 112 00:04:57,280 --> 00:04:59,200 WHOSE WORK IS MISSION CRITICAL. 
113 00:04:59,200 --> 00:05:01,640 WE ANTICIPATE MANY WILL CHOOSE 114 00:05:01,640 --> 00:05:03,560 MORE FLEXIBLE WORK SCHEDULES 115 00:05:03,560 --> 00:05:05,600 INCLUDING TELEWORK, MEANING A 116 00:05:05,600 --> 00:05:06,920 STAFF MEMBER COMES ON CAMPUS 117 00:05:06,920 --> 00:05:09,160 ABOUT ONCE A WEEK, WHAT WE REFER 118 00:05:09,160 --> 00:05:13,440 TO AS LOCAL REMOTE, ALLOWING 119 00:05:13,440 --> 00:05:15,000 PEOPLE TO LIVE IN THE AREA BUT 120 00:05:15,000 --> 00:05:16,800 ONLY COME ONTO THE CAMPUS HALF A 121 00:05:16,800 --> 00:05:20,120 DAY OR SO EVERY WEEK, AND THEN 122 00:05:20,120 --> 00:05:21,720 FULLY REMOTE, WHICH ARE 123 00:05:21,720 --> 00:05:22,280 INDIVIDUALS THAT CAN LIVE 124 00:05:22,280 --> 00:05:23,520 ANYWHERE IN THE COUNTRY. 125 00:05:23,520 --> 00:05:25,640 THIS IS OPENING A GREAT 126 00:05:25,640 --> 00:05:27,000 OPPORTUNITY TO ENGAGE STAFF FROM 127 00:05:27,000 --> 00:05:33,000 AROUND THE COUNTRY AND HIRE A 128 00:05:33,000 --> 00:05:35,720 NEW AND DIVERSE WORKFORCE AND 129 00:05:35,720 --> 00:05:41,680 HAVE FLEXIBILITIES TO ALIGN WITH 130 00:05:41,680 --> 00:05:43,120 FAMILY SITUATIONS OR PERSONAL 131 00:05:43,120 --> 00:05:43,720 LIVES. 132 00:05:43,720 --> 00:05:44,520 I HAVE 10 SLIDES TO PRESENT AND 133 00:05:44,520 --> 00:05:46,280 LEAVE TIME FOR QUESTIONS AND 134 00:05:46,280 --> 00:05:46,600 CONVERSATION. 135 00:05:46,600 --> 00:05:52,280 I WANT TO BEGIN BY TELLING -- 136 00:05:52,280 --> 00:05:53,640 REMINDING YOU THE NATIONAL 137 00:05:53,640 --> 00:05:54,920 LIBRARY OF MEDICINE HAS A BROAD 138 00:05:54,920 --> 00:05:56,120 AND STRONG COMMITMENT TO BASIC 139 00:05:56,120 --> 00:05:58,440 OPERATIONS BUT IS ALSO GUIDED BY A 140 00:05:58,440 --> 00:06:01,680 STRATEGIC PLAN DEVELOPED IN 2017 141 00:06:01,680 --> 00:06:04,000 TO ACCELERATE DISCOVERY IN 142 00:06:04,000 --> 00:06:04,840 DATA-POWERED HEALTH. 
143 00:06:04,840 --> 00:06:09,480 THE FIRST PILLAR, ACCELERATED 144 00:06:09,480 --> 00:06:10,360 DISCOVERY THROUGH DATA-DRIVEN 145 00:06:10,360 --> 00:06:12,360 RESEARCH, IS MUCH OF THE FOCUS OF 146 00:06:12,360 --> 00:06:13,320 THE PROGRAM AND WE'VE HAD MANY 147 00:06:13,320 --> 00:06:15,320 OPPORTUNITIES TO MOVE IN THIS 148 00:06:15,320 --> 00:06:15,720 AREA. 149 00:06:15,720 --> 00:06:16,920 I'LL BE TELLING YOU SOME IN THE 150 00:06:16,920 --> 00:06:18,520 NEXT FEW MINUTES. 151 00:06:18,520 --> 00:06:20,520 I DON'T WANT TO OVERLOOK 152 00:06:20,520 --> 00:06:22,520 REACHING MORE PEOPLE AND ENHANCING 153 00:06:22,520 --> 00:06:25,520 DISSEMINATION AND ENGAGEMENT AND 154 00:06:25,520 --> 00:06:27,120 BUILDING A WORKFORCE FOR DATA-DRIVEN 155 00:06:27,120 --> 00:06:28,120 RESEARCH AND HEALTH. 156 00:06:28,120 --> 00:06:30,880 ONE THING YOU'LL HEAR ABOUT IS 157 00:06:30,880 --> 00:06:33,600 NEW INITIATIVES IN BRINGING A 158 00:06:33,600 --> 00:06:35,000 NEW DIVERSE WORKFORCE INTO THE 159 00:06:35,000 --> 00:06:36,880 PROGRAM AND WE'RE USING THE 160 00:06:36,880 --> 00:06:39,280 TALENTS OF OUR RESEARCHERS, 161 00:06:39,280 --> 00:06:40,520 PARTICULARLY DR. LU AND SOME 162 00:06:40,520 --> 00:06:42,720 COLLEAGUES, TO LOOK AT NEW WAYS 163 00:06:42,720 --> 00:06:45,120 TO PRESENT THE RESOURCES THE 164 00:06:45,120 --> 00:06:45,760 NATIONAL LIBRARY OF MEDICINE HAS 165 00:06:45,760 --> 00:06:48,840 TO REACH THE PUBLIC. 166 00:06:48,840 --> 00:06:50,360 PRIMARILY, WE'RE FOCUSING IN 167 00:06:50,360 --> 00:06:51,960 THE CONVERSATION AROUND 168 00:06:51,960 --> 00:06:53,520 ACCELERATED DISCOVERY AND HEALTH 169 00:06:53,520 --> 00:06:57,320 THROUGH DATA-DRIVEN RESEARCH. 170 00:06:57,320 --> 00:07:04,520 WE HAVE WANTED TO HELP THE 171 00:07:04,520 --> 00:07:05,520 PUBLIC BE FAMILIAR WITH THE 172 00:07:05,520 --> 00:07:09,280 PROGRAM AND WE HAVE VIDEO 173 00:07:09,280 --> 00:07:10,800 VIGNETTES TO BRING THE 174 00:07:10,800 --> 00:07:11,960 OUTSTANDING WORK OF RESEARCHERS. 
175 00:07:11,960 --> 00:07:14,560 I'D LIKE YOU TO WATCH ONE OF 176 00:07:14,560 --> 00:07:16,600 THESE VIDEOS NOW TO SEE THE 177 00:07:16,600 --> 00:07:17,120 WORK. 178 00:07:17,120 --> 00:07:17,720 IF WE CAN HAVE THE VIDEO, 179 00:07:17,720 --> 00:07:28,760 PLEASE. 180 00:07:28,760 --> 00:07:30,520 [*] 181 00:07:30,520 --> 00:07:32,600 >> EARLY IN MY LIFE I WANTED TO 182 00:07:32,600 --> 00:07:43,440 BE A PALEONTOLOGY PROFESSIONAL. 183 00:07:43,440 --> 00:07:45,840 I WANTED TO INFORMATION THE 184 00:07:45,840 --> 00:07:47,640 PROTEIN UNIVERSE. 185 00:07:47,640 --> 00:07:52,680 THEY CAN BE DIVIDED INTO 186 00:07:52,680 --> 00:07:55,840 EVOLUTIONARY UNITS PRESERVED 187 00:07:55,840 --> 00:07:57,120 OVER EVOLUTION BECAUSE NATURAL 188 00:07:57,120 --> 00:07:58,000 SELECTION IS MAINTAINING THAT 189 00:07:58,000 --> 00:08:01,880 PART FOR SOME REASON. 190 00:08:01,880 --> 00:08:06,520 AND ONE REALIZATION WHICH DAWNED 191 00:08:06,520 --> 00:08:09,560 ON US AROUND THE EARLY '90s, AND 192 00:08:09,560 --> 00:08:11,720 IT WAS PROFOUND, IS THERE'S A 193 00:08:11,720 --> 00:08:13,480 RELATIVELY SMALL NUMBER OF THESE 194 00:08:13,480 --> 00:08:15,760 EVOLUTIONARY UNITS OF PROTEINS 195 00:08:15,760 --> 00:08:18,960 WHICH WE TERM DOMAINS WHICH 196 00:08:18,960 --> 00:08:21,880 CONSTITUTE THE ENTIRE PROTEIN 197 00:08:21,880 --> 00:08:23,120 UNIVERSE OF ALL ORGANISMS ACROSS 198 00:08:23,120 --> 00:08:27,680 THE TREE OF LIFE. 199 00:08:27,680 --> 00:08:28,760 IF WE CAN UNDERSTAND THE 200 00:08:28,760 --> 00:08:31,480 FUNCTIONS OF THE UNITS IT GOES A 201 00:08:31,480 --> 00:08:33,080 LONG WAY TO UNDERSTANDING WHAT 202 00:08:33,080 --> 00:08:33,920 ORGANISMS DO. 203 00:08:33,920 --> 00:08:35,120 AND GIVEN THERE ARE MANY GAPS IN 204 00:08:35,120 --> 00:08:37,640 OUR UNDERSTANDING OF WHAT 205 00:08:37,640 --> 00:08:39,840 ORGANISMS DO, ONE WAY TO GET AT 206 00:08:39,840 --> 00:08:43,160 IT IS TO FIRST FIND ALL THE 207 00:08:43,160 --> 00:08:43,400 DOMAINS. 
208 00:08:43,400 --> 00:08:45,400 THE SECOND ASPECT IS PREDICTING 209 00:08:45,400 --> 00:08:46,000 FUNCTIONS FOR THEM. 210 00:08:46,000 --> 00:08:48,760 IN THE FIRST PHASE OF MY RESEARCH 211 00:08:48,760 --> 00:08:51,200 WE CAPTURED MOST OF THE 212 00:08:51,200 --> 00:08:53,480 LOW-HANGING FRUIT WHICH WERE THE 213 00:08:53,480 --> 00:08:56,480 BIG FAMILIES CONSERVED ACROSS 214 00:08:56,480 --> 00:08:56,840 ALL ORGANISMS. 215 00:08:56,840 --> 00:08:59,720 NOW WE'RE MOVING ON TO THE MORE 216 00:08:59,720 --> 00:09:01,000 DIFFICULT TERRAIN. 217 00:09:01,000 --> 00:09:05,800 THE DIFFICULT TERRAIN ALSO HOLDS 218 00:09:05,800 --> 00:09:07,320 A LOT OF PROMISE BECAUSE MANY 219 00:09:07,320 --> 00:09:07,960 UNUNDERSTOOD FUNCTIONS ARE 220 00:09:07,960 --> 00:09:09,720 HIDING WITHIN THAT TERRAIN AND 221 00:09:09,720 --> 00:09:14,680 GIVES THE OFF-SHOOTS IN THE FORM 222 00:09:14,680 --> 00:09:17,400 OF BIOTECHNOLOGICAL REAGENTS AND 223 00:09:17,400 --> 00:09:20,240 ENZYMES AND THE CRISPR SYSTEMS 224 00:09:20,240 --> 00:09:23,080 AND MODIFICATION SYSTEMS. 225 00:09:23,080 --> 00:09:29,680 THEY'VE ALL BECOME POPULAR 226 00:09:29,680 --> 00:09:29,920 REAGENTS. 227 00:09:29,920 --> 00:09:33,080 WE HAVE PROTEIN SEQUENCES AND 228 00:09:33,080 --> 00:09:39,400 STRUCTURES AND INFERRING THIS 229 00:09:39,400 --> 00:09:41,000 INFORMATION AND IT'S BEEN A LONG 230 00:09:41,000 --> 00:09:41,920 STANDING INTEREST OF MINE SO 231 00:09:41,920 --> 00:10:03,920 IT'S BEEN THE PLACE TO BE. 232 00:10:03,920 --> 00:10:05,680 >> I SEE I'M GOING TO HAVE TO SET 233 00:10:05,680 --> 00:10:08,520 UP A VIEWING PARTY FOR ALL OF 234 00:10:08,520 --> 00:10:09,320 YOU. 235 00:10:09,320 --> 00:10:14,520 WE HAVE A WONDERFUL SET OF 236 00:10:14,520 --> 00:10:16,200 VIDEOS AND IT'S NICE TO SEE. 
237 00:10:16,200 --> 00:10:20,320 THEY'RE ABOUT A MINUTE OR TWO 238 00:10:20,320 --> 00:10:23,520 LONG AND ONE OF OUR OFFICE OF 239 00:10:23,520 --> 00:10:24,840 COMMUNICATIONS PUBLIC LIAISON 240 00:10:24,840 --> 00:10:25,680 TEAM HAS WORKED WELL WITH 241 00:10:25,680 --> 00:10:28,320 PULLING THE STORIES OUT OF THESE 242 00:10:28,320 --> 00:10:29,880 FAIRLY COMPLICATED SCIENTIFIC 243 00:10:29,880 --> 00:10:31,960 PATHWAYS AND CAPTURING THE 244 00:10:31,960 --> 00:10:33,800 EXCITEMENT OF WHY THESE 245 00:10:33,800 --> 00:10:37,360 INVESTIGATORS ARE SUCH GREAT 246 00:10:37,360 --> 00:10:38,440 ASSETS TO THE NIH AND NLM AND 247 00:10:38,440 --> 00:10:40,320 SOCIETY AS A WHOLE. 248 00:10:40,320 --> 00:10:45,040 I'LL MAKE SURE TO PUT THE 249 00:10:45,040 --> 00:10:46,120 YOUTUBE LINK IN THE CHAT AND IT'S 250 00:10:46,120 --> 00:10:47,400 GREAT TO SEE AND PEOPLE YOU'VE 251 00:10:47,400 --> 00:10:49,120 KNOWN FOR A WHILE YOU SEE IN A 252 00:10:49,120 --> 00:10:49,840 WHOLE NEW WAY. 253 00:10:49,840 --> 00:10:51,520 LET ME TAKE YOU ON TO SOMETHING 254 00:10:51,520 --> 00:10:53,600 QUITE IMPORTANT IN LOOKING AT 255 00:10:53,600 --> 00:10:55,040 THE FUTURE OF THE NATIONAL 256 00:10:55,040 --> 00:10:58,000 LIBRARY OF MEDICINE AND THE NIH 257 00:10:58,000 --> 00:10:58,280 OVERALL. 258 00:10:58,280 --> 00:11:02,160 THAT'S THE NLM'S RACIAL AND 259 00:11:02,160 --> 00:11:04,680 ETHNIC EQUITY PLAN OR THE REEP. 260 00:11:04,680 --> 00:11:08,480 IT REFLECTS AN ACTIVITY DIRECTED BY 261 00:11:08,480 --> 00:11:10,440 THE UNITE PROGRAM AND UNITE IS 262 00:11:10,440 --> 00:11:12,920 THE NIH'S PROGRAM TO ADDRESS 263 00:11:12,920 --> 00:11:17,520 STRUCTURAL RACISM AND EQUITY. 
264 00:11:17,520 --> 00:11:20,680 FOR THE REEP, EACH INSTITUTE AND 265 00:11:20,680 --> 00:11:22,640 CENTER WAS CHALLENGED TO CREATE 266 00:11:22,640 --> 00:11:25,640 AN EQUITY PLAN TO EVALUATE THEIR 267 00:11:25,640 --> 00:11:26,240 ORGANIZATIONAL CULTURES, TO 268 00:11:26,240 --> 00:11:27,840 EXAMINE OUR STRUCTURES AND TO 269 00:11:27,840 --> 00:11:30,320 MAKE CHANGES IN A WAY THAT WOULD 270 00:11:30,320 --> 00:11:33,400 PROMOTE EQUITY, DIVERSITY AND 271 00:11:33,400 --> 00:11:35,120 INCLUSION THROUGHOUT THE 272 00:11:35,120 --> 00:11:35,400 WORKFORCE. 273 00:11:35,400 --> 00:11:38,520 THE REEP GOALS ARE TO APPLY THE 274 00:11:38,520 --> 00:11:40,600 RACIAL AND ETHNIC EQUITY LENS TO 275 00:11:40,600 --> 00:11:43,000 LOOK AT OUR INSTITUTES, 276 00:11:43,000 --> 00:11:44,280 WORKFORCE, STRUCTURES AND 277 00:11:44,280 --> 00:11:47,480 SYSTEMS TO IDENTIFY AND 278 00:11:47,480 --> 00:11:50,840 DISMANTLE ANY ETHNIC DISPARITIES 279 00:11:50,840 --> 00:11:52,120 IN THE WORKFORCE AND ENHANCE THE 280 00:11:52,120 --> 00:11:53,960 WORKFORCE WITHIN THE INSTITUTE. 281 00:11:53,960 --> 00:11:55,200 WHAT'S IMPORTANT TO THE NLM 282 00:11:55,200 --> 00:11:57,520 THAT'S UNLIKE ACROSS THE CAMPUS 283 00:11:57,520 --> 00:12:00,360 IS ALMOST 60% OF OUR WORKFORCE 284 00:12:00,360 --> 00:12:04,600 ARE WORKERS WE REFER TO AS 285 00:12:04,600 --> 00:12:05,320 CONTRACTORS, INDIVIDUALS WHOSE 286 00:12:05,320 --> 00:12:06,040 EMPLOYMENT IS THROUGH ANOTHER 287 00:12:06,040 --> 00:12:07,640 AGENCY AND WHO COME TO US BECAUSE 288 00:12:07,640 --> 00:12:09,880 THEY BRING SPECIALIZED TALENT OR 289 00:12:09,880 --> 00:12:10,320 SPECIALIZED SERVICE. 290 00:12:10,320 --> 00:12:13,120 WE'RE VERY FORTUNATE TO HAVE 291 00:12:13,120 --> 00:12:14,560 ACCESS TO AN INCREDIBLE 292 00:12:14,560 --> 00:12:16,280 TECHNICAL AND SCIENTIFIC 293 00:12:16,280 --> 00:12:17,360 WORKFORCE THAT COMES THROUGH 294 00:12:17,360 --> 00:12:18,920 CONTRACTING SERVICES. 
295 00:12:18,920 --> 00:12:21,080 AT THE SAME TIME WE DO 296 00:12:21,080 --> 00:12:23,440 UNDERSTAND WHILE THEY ARE OUR 297 00:12:23,440 --> 00:12:24,680 COLLEAGUES, WE ARE NOT THEIR 298 00:12:24,680 --> 00:12:24,920 EMPLOYERS. 299 00:12:24,920 --> 00:12:28,080 AS THE NLM LOOKS AT THE REEP 300 00:12:28,080 --> 00:12:30,520 PLAN, THE RACIAL AND ETHNIC 301 00:12:30,520 --> 00:12:34,240 EQUITY PLAN, WE'RE COMMITTED TO 302 00:12:34,240 --> 00:12:35,320 ENSURING INFORMATION OF OUR 303 00:12:35,320 --> 00:12:38,560 CONTRACT AND TRAINING WORKFORCE 304 00:12:38,560 --> 00:12:39,840 AND OUR NIH STAFFING WORKFORCE. 305 00:12:39,840 --> 00:12:41,720 THIS HAS FRANKLY CHALLENGED THE 306 00:12:41,720 --> 00:12:44,280 NIH TO THINK IN NEW WAYS TO MAKE 307 00:12:44,280 --> 00:12:46,080 SURE WE WRITE OUR CONTRACTS WITH 308 00:12:46,080 --> 00:12:47,120 THE CONTRACTING COMPANIES IN A 309 00:12:47,120 --> 00:12:51,560 WAY THAT ALLOWS THEIR STAFF TO 310 00:12:51,560 --> 00:12:54,320 PARTICIPATE IN INNOVATION AND 311 00:12:54,320 --> 00:12:56,000 EXPLORATION ABOUT THE EXPERIENCE 312 00:12:56,000 --> 00:12:59,240 OF ACCEPTABILITY OR 313 00:12:59,240 --> 00:13:01,040 UNACCEPTABILITY OF WORKFORCE 314 00:13:01,040 --> 00:13:01,320 BEHAVIOR. 315 00:13:01,320 --> 00:13:02,000 WE'LL BE CONTINUING TO DO THIS 316 00:13:02,000 --> 00:13:03,240 THROUGH THE NEXT COUPLE OF YEARS. 317 00:13:03,240 --> 00:13:05,240 IT'S A DEEP COMMITMENT NOT JUST 318 00:13:05,240 --> 00:13:06,640 FOR THE NATIONAL INSTITUTES OF 319 00:13:06,640 --> 00:13:09,440 HEALTH BUT FOR ME AND MY LEADERSHIP 320 00:13:09,440 --> 00:13:10,320 TEAM. 321 00:13:10,320 --> 00:13:12,240 WE HAVE SOMETHING CALLED THE NLM 322 00:13:12,240 --> 00:13:18,040 IDEA COUNCIL, INCLUSIVE, EQUITY, 323 00:13:18,040 --> 00:13:19,720 DIVERSITY AND THE NAMES ARE IN 324 00:13:19,720 --> 00:13:21,720 FRONT OF YOU AND DR. 
LANDSMAN 325 00:13:21,720 --> 00:13:23,720 HAS BEEN A KEY ORGANIZER FROM 326 00:13:23,720 --> 00:13:25,760 THE BEGINNING IN THIS EFFORT AND 327 00:13:25,760 --> 00:13:27,920 HAS BEEN HELPFUL TO HELP THINK 328 00:13:27,920 --> 00:13:29,720 HOW TO APPLY THE REEP LENS TO 329 00:13:29,720 --> 00:13:35,440 OUR SCIENTIFIC WORKFORCE. 330 00:13:35,440 --> 00:13:37,240 MANY TIMES APPOINTMENT TO A 331 00:13:37,240 --> 00:13:38,560 FEDERAL POSITION AT THE NLM 332 00:13:38,560 --> 00:13:39,720 COMES TO SOMEONE WHO HAS WORKED 333 00:13:39,720 --> 00:13:42,120 AS A CONTRACTOR YEARS BEFORE. 334 00:13:42,120 --> 00:13:44,040 SO WE RECOGNIZE THE IMPORTANCE 335 00:13:44,040 --> 00:13:46,120 OF THE PIPELINE AND SKILLS OUR 336 00:13:46,120 --> 00:13:47,360 CONTRACT WORKFORCE BRINGS NOT 337 00:13:47,360 --> 00:13:48,760 ONLY TO THE WHOLE OF OUR 338 00:13:48,760 --> 00:13:51,440 FUNCTIONING AND OUR PRODUCTS AND 339 00:13:51,440 --> 00:13:53,360 SERVICES IN THE NATIONAL LIBRARY 340 00:13:53,360 --> 00:13:58,240 OF MEDICINE BUT TO OUR 341 00:13:58,240 --> 00:13:59,480 SCIENTIFIC EFFORTS. 342 00:13:59,480 --> 00:14:02,760 MANY WE WORK WITH ARE IN FACT 343 00:14:03,120 --> 00:14:05,560 MANY WE WORK WITH ARE THE 344 00:14:05,560 --> 00:14:06,160 CONTRACT WORKERS. 345 00:14:06,160 --> 00:14:07,800 WHEN WE BUILT THE PLAN IT WAS 346 00:14:07,800 --> 00:14:10,560 NOT JUST IN THE HANDS OF THE 347 00:14:10,560 --> 00:14:13,360 IDEA COUNCIL. THERE WAS DESIGN 348 00:14:13,360 --> 00:14:14,400 COMMITTEE ENGAGEMENT ACROSS THE 349 00:14:14,400 --> 00:14:17,720 ENTIRETY OF THE NLM. 350 00:14:17,720 --> 00:14:23,720 WE HAD LOOKED AT THE THREE GOALS 351 00:14:23,720 --> 00:14:27,520 AND STAFF AWARENESS AND 352 00:14:27,520 --> 00:14:29,720 UNDERSTANDING OF STRUCTURAL 353 00:14:29,720 --> 00:14:31,240 RACISM AND EACH TEAM GOT 354 00:14:31,240 --> 00:14:33,520 TOGETHER TO PLAN ASSESSMENT 355 00:14:33,520 --> 00:14:34,280 STRATEGIES AS WELL AS 356 00:14:34,280 --> 00:14:34,720 PRELIMINARY ACTIONS. 
357 00:14:34,720 --> 00:14:39,440 THEY WERE PRESENTED TO THE NLM 358 00:14:39,440 --> 00:14:40,720 LEADERSHIP ABOUT TWO WEEKS AGO 359 00:14:40,720 --> 00:14:41,720 FOR THEIR FINAL PRESENTATION AND 360 00:14:41,720 --> 00:14:43,920 WE REVIEWED IT AND MADE COMMENTS 361 00:14:43,920 --> 00:14:46,520 AND SUGGESTIONS AND WORKED 362 00:14:46,520 --> 00:14:48,080 CLOSELY WITH THEM AND SUBMITTED THE 363 00:14:48,080 --> 00:14:51,640 DOCUMENT TO THE NIH ON APRIL 1 364 00:14:51,640 --> 00:14:53,600 AS DID ALL THE 26 INSTITUTES AND 365 00:14:53,600 --> 00:15:00,640 CENTERS AROUND THE NIH. 366 00:15:00,640 --> 00:15:02,520 WE ARE EXPECTING A RESPONSE IN 367 00:15:02,520 --> 00:15:06,400 EARLY MAY AND OUR ACTING 368 00:15:06,400 --> 00:15:11,640 DIRECTOR IS EXPECTING THE 369 00:15:11,640 --> 00:15:12,880 INSTITUTES AND CENTERS TO KNOW IT 370 00:15:12,880 --> 00:15:14,080 WILL COST MORE MONEY AND 371 00:15:14,080 --> 00:15:15,120 UNDERSTAND AND TAKE ACTION 372 00:15:15,120 --> 00:15:16,720 LIKELY TO MAKE OUR WORKFORCE 373 00:15:16,720 --> 00:15:16,920 BETTER. 374 00:15:16,920 --> 00:15:19,360 ONCE OUR PLAN IS ENDORSED BY THE 375 00:15:19,360 --> 00:15:21,240 NIH, THE ENTIRE PLAN WILL BE 376 00:15:21,240 --> 00:15:22,600 CIRCULATED ACROSS OUR STAFF FOR 377 00:15:22,600 --> 00:15:28,280 REVIEW AND ENGAGEMENT. 378 00:15:28,280 --> 00:15:30,360 AND I'M GOING TO BE PUTTING OUT A 379 00:15:30,360 --> 00:15:31,520 CALL TO STAFF MEMBERS LISTENING 380 00:15:31,520 --> 00:15:34,240 TO JOIN THE IMPLEMENTATION 381 00:15:34,240 --> 00:15:35,120 COMMITTEES BECAUSE THE 382 00:15:35,120 --> 00:15:36,520 IMPLEMENTATION OF THE STRATEGY 383 00:15:36,520 --> 00:15:41,320 TO DISMANTLE STRUCTURAL RACISM 384 00:15:41,320 --> 00:15:44,080 AND ADDRESS INEQUITIES EVEN IN 385 00:15:44,080 --> 00:15:44,720 THE NATIONAL LIBRARY OF MEDICINE 386 00:15:44,720 --> 00:15:45,880 WILL REQUIRE THE COMMITMENT OF 387 00:15:45,880 --> 00:15:50,200 EVERY STAFF MEMBER ACROSS THE 388 00:15:50,200 --> 00:15:50,480 INSTITUTE. 
389 00:15:50,480 --> 00:15:51,440 A COUPLE MORE UPDATES. 390 00:15:51,440 --> 00:15:53,520 YOU MAY HAVE HEARD THE 391 00:15:53,520 --> 00:15:55,360 CONSOLIDATED APPROPRIATIONS ACT 392 00:15:55,360 --> 00:15:57,440 WAS FINALLY SIGNED AND WE NOW 393 00:15:57,440 --> 00:15:59,360 HAVE A BUDGET. 394 00:15:59,360 --> 00:16:01,560 THIS WILL KEEP THE GOVERNMENT 395 00:16:01,560 --> 00:16:04,200 FUNDED THROUGH SEPTEMBER 30. WE 396 00:16:04,200 --> 00:16:05,440 WERE PLEASED WE DIDN'T GET 397 00:16:05,440 --> 00:16:06,240 ANOTHER CONTINUING RESOLUTION. 398 00:16:06,240 --> 00:16:10,120 THOSE MONITORING US FOR A WHILE 399 00:16:10,120 --> 00:16:12,320 MAY REMEMBER WE GET THEM WELL 400 00:16:12,320 --> 00:16:13,320 INTO JUNE AND THIS CAUSES 401 00:16:13,320 --> 00:16:14,920 HAVOC WITH OUR FUNDING BECAUSE 402 00:16:14,920 --> 00:16:18,320 OUR FUNDS ARE COMMITTED FOR ONE 403 00:16:18,320 --> 00:16:20,640 YEAR ONLY SO WE HAVE TO BE READY TO 404 00:16:20,640 --> 00:16:21,520 SPEND DOWN THE FUNDS. 405 00:16:21,520 --> 00:16:23,720 THIS YEAR WE WERE FORTUNATE TO 406 00:16:23,720 --> 00:16:25,920 GET A $479 MILLION 407 00:16:25,920 --> 00:16:26,280 APPROPRIATION. 408 00:16:26,280 --> 00:16:27,320 THAT'S OUR BASE BUDGET NOW. 409 00:16:27,320 --> 00:16:30,880 THIS IS A $17 MILLION INCREASE 410 00:16:30,880 --> 00:16:32,960 OVER FISCAL '21. 411 00:16:32,960 --> 00:16:35,720 OUR PRIORITIES FOR FISCAL '22 412 00:16:35,720 --> 00:16:38,280 CONTINUE TO BE THE RENOVATION OF 413 00:16:38,280 --> 00:16:41,000 OUR BUILDING, A FOUR-YEAR PLAN 414 00:16:41,000 --> 00:16:41,920 COSTING IN THE NEIGHBORHOOD OF 415 00:16:41,920 --> 00:16:42,760 $35 MILLION. 416 00:16:42,760 --> 00:16:44,880 I'M COMMITTED TO THE EXPANSION 417 00:16:44,880 --> 00:16:45,760 OF THE INTRAMURAL RESEARCH 418 00:16:45,760 --> 00:16:46,000 PROGRAM. 
419 00:16:46,000 --> 00:16:49,920 WE HAVE A SEARCH UNDERWAY FOR 420 00:16:49,920 --> 00:16:54,000 A NEW SCIENTIFIC DIRECTOR AND HOPE 421 00:16:54,000 --> 00:16:57,840 TO MAKE THE APPOINTMENT BY 422 00:16:57,840 --> 00:17:00,000 MIDSUMMER AND ARE LOOKING INTO 423 00:17:00,000 --> 00:17:01,840 INVESTIGATOR TRACKS FOR THE NEW 424 00:17:01,840 --> 00:17:02,520 SCIENTIFIC DIRECTOR TO MAKE USE 425 00:17:02,520 --> 00:17:03,520 OF IN THE NEXT FEW YEARS. 426 00:17:03,520 --> 00:17:06,520 IN ADDITION, IT'S CRITICAL WE 427 00:17:06,520 --> 00:17:07,760 MODERNIZE OUR COMPUTER 428 00:17:07,760 --> 00:17:09,960 INFRASTRUCTURE. 429 00:17:09,960 --> 00:17:10,600 THE NATIONAL LIBRARY OF MEDICINE 430 00:17:10,600 --> 00:17:13,040 IS LESS A LIBRARY OF OBJECTS AND 431 00:17:13,040 --> 00:17:15,600 MORE OF ELECTRONS AND MAKING 432 00:17:15,600 --> 00:17:20,320 SURE OUR ELECTRONIC RESOURCES, 433 00:17:20,320 --> 00:17:29,720 WHETHER CLINICALTRIALS.gov OR 434 00:17:29,720 --> 00:17:30,320 PUBMED, ARE ACCESSIBLE AND SECURE 435 00:17:30,320 --> 00:17:31,280 AND HAVE SOUND INFRASTRUCTURE 436 00:17:31,280 --> 00:17:32,360 IS A CRITICAL INVESTMENT I'M 437 00:17:32,360 --> 00:17:33,760 MAKING OVER THIS PERIOD OF TIME. 438 00:17:33,760 --> 00:17:36,720 I'M HAPPY TO TALK MORE ABOUT 439 00:17:36,720 --> 00:17:37,920 THIS IN QUESTIONS. 440 00:17:37,920 --> 00:17:42,560 THIS ACT ALSO CREATED, 441 00:17:42,560 --> 00:17:47,720 FORMALIZING, THE ARPA-H PROJECTS 442 00:17:47,720 --> 00:17:49,120 AGENCY FOR HEALTH, AN INNOVATION 443 00:17:49,120 --> 00:17:49,760 ENGINE. 444 00:17:49,760 --> 00:17:52,520 THIS WAS ASSIGNED TO THE 445 00:17:52,520 --> 00:17:53,400 SECRETARY'S OFFICE IN HHS BUT 446 00:17:53,400 --> 00:17:56,680 WILL IN FACT BE TRANSFERRED TO 447 00:17:56,680 --> 00:17:57,720 THE NATIONAL INSTITUTES OF 448 00:17:57,720 --> 00:17:57,920 HEALTH. 
449 00:17:57,920 --> 00:18:02,000 THE OPERATIONS AND FUNDING WILL 450 00:18:02,000 --> 00:18:04,280 OCCUR WITHIN THE NIH ENCLAVE BUT 451 00:18:04,280 --> 00:18:08,920 THE DIRECTOR OF ARPA-H WILL BE A 452 00:18:08,920 --> 00:18:12,520 PRESIDENTIAL APPOINTEE WITH A 453 00:18:12,520 --> 00:18:13,320 DOTTED LINE RELATIONSHIP TO THE 454 00:18:13,320 --> 00:18:15,080 NIH DIRECTOR. 455 00:18:15,080 --> 00:18:16,800 WE ANTICIPATE NEW INITIATIVE 456 00:18:16,800 --> 00:18:18,800 FUNDING WILL BEGIN BY SEPTEMBER 457 00:18:18,800 --> 00:18:19,640 OF '24. 458 00:18:19,640 --> 00:18:21,080 I WANT TO CALL YOUR ATTENTION TO 459 00:18:21,080 --> 00:18:22,800 SOMETHING YOU PROBABLY HAVE 460 00:18:22,800 --> 00:18:24,120 HEARD OF BEFORE. 461 00:18:24,120 --> 00:18:25,480 WE'LL BE HEARING A LOT AND VERY 462 00:18:25,480 --> 00:18:28,400 MUCH NEED YOUR GUIDANCE TO HELP 463 00:18:28,400 --> 00:18:31,520 OUR INVESTIGATORS IN THE 464 00:18:31,520 --> 00:18:32,520 INTRAMURAL PROGRAM MAKE SURE 465 00:18:32,520 --> 00:18:33,200 THEY'RE COMPLIANT. 466 00:18:33,200 --> 00:18:34,400 THE DATA MANAGEMENT AND SHARING 467 00:18:34,400 --> 00:18:36,240 PLAN HAS BEEN UNDER DEVELOPMENT 468 00:18:36,240 --> 00:18:37,920 MANY YEARS AND WILL GO INTO 469 00:18:37,920 --> 00:18:39,320 EFFECT JANUARY '23 470 00:18:39,320 --> 00:18:41,360 FOR ALL APPLICATIONS SUBMITTED 471 00:18:41,360 --> 00:18:47,800 ON OR AFTER JANUARY 25, 2023 AS 472 00:18:47,800 --> 00:18:48,680 WELL AS FOR INTRAMURAL PROGRAMS 473 00:18:48,680 --> 00:18:50,520 INITIATED AFTER THAT TIME. 474 00:18:50,520 --> 00:18:51,720 APPLICATIONS MUST EXPLAIN WHAT 475 00:18:51,720 --> 00:18:53,520 DATA WILL BE GENERATED OR 476 00:18:53,520 --> 00:18:57,040 COLLECTED DURING THE STUDY AND 477 00:18:57,040 --> 00:18:58,520 WHETHER IT WILL BE SHARED AND 478 00:18:58,520 --> 00:19:01,280 WHERE AND WHEN MADE AVAILABLE. 
479 00:19:01,280 --> 00:19:04,360 THE SHARING PLAN DOES NOT 480 00:19:04,360 --> 00:19:06,480 REQUIRE DATA SHARING BUT 481 00:19:06,480 --> 00:19:06,760 ENCOURAGES IT. 482 00:19:06,760 --> 00:19:07,720 THE NATIONAL LIBRARY OF MEDICINE 483 00:19:07,720 --> 00:19:10,760 IS PLAYING A KEY ROLE IN HELPING 484 00:19:10,760 --> 00:19:13,240 TO STRENGTHEN THE DATA SCIENCE 485 00:19:13,240 --> 00:19:14,720 LIBRARY WORKFORCE AROUND THE 486 00:19:14,720 --> 00:19:15,640 COUNTRY. 487 00:19:15,640 --> 00:19:18,920 WE BELIEVE THIS IS AN EXCITING 488 00:19:18,920 --> 00:19:20,200 OPPORTUNITY FOR THE DATA SCIENCE 489 00:19:20,200 --> 00:19:21,680 COMMUNITY TO DEMONSTRATE 490 00:19:21,680 --> 00:19:23,120 LEADERSHIP IN DATA MANAGEMENT 491 00:19:23,120 --> 00:19:27,320 AND SHARING AND AUTOMATED 492 00:19:27,320 --> 00:19:29,520 CURATION AND ACCESSIBILITY OF 493 00:19:29,520 --> 00:19:33,320 DATA AND CREATING MODELS. 494 00:19:33,320 --> 00:19:35,160 OUR PROGRAM IS AN OPPORTUNITY 495 00:19:35,160 --> 00:19:38,200 FOR THE T-15 PROGRAMS IN THEIR 496 00:19:38,200 --> 00:19:39,480 INSTITUTIONS TO ENSURE THE DATA 497 00:19:39,480 --> 00:19:42,240 MANAGEMENT AND SHARING PRACTICES 498 00:19:42,240 --> 00:19:44,160 GET DIFFUSED THROUGHOUT THE 499 00:19:44,160 --> 00:19:44,520 INSTITUTIONS. 500 00:19:44,520 --> 00:19:46,920 WE FOUND OUR T-15 PROGRAMS TO BE 501 00:19:46,920 --> 00:19:49,720 MAJOR LEADERS IN DATA THINKERS 502 00:19:49,720 --> 00:19:51,320 SO THIS IS AN IMPORTANT 503 00:19:51,320 --> 00:19:52,440 COMMUNICATION ENCLAVE. 
504 00:19:52,440 --> 00:19:54,560 WE ALSO HOPE YOU WILL HELP US 505 00:19:54,560 --> 00:19:57,320 THINK HOW TO INFUSE THE SKILLS 506 00:19:57,320 --> 00:19:59,120 IN OUR INTRAMURAL TRAINING 507 00:19:59,120 --> 00:20:00,640 PROGRAM WE NEED TO HELP TRAINEES 508 00:20:00,640 --> 00:20:02,440 UNDERSTAND HOW TO MANAGE THEIR 509 00:20:02,440 --> 00:20:04,400 OWN DATA AND HOW TO SUPPORT THE 510 00:20:04,400 --> 00:20:05,120 DATA IN THE INSTITUTION WHERE 511 00:20:05,120 --> 00:20:12,040 THEY MAY BE WORKING. 512 00:20:12,040 --> 00:20:13,680 YOU SEE INSIDE AND OUTSIDE 513 00:20:13,680 --> 00:20:13,960 RENOVATIONS. 514 00:20:13,960 --> 00:20:16,320 ON THE LEFT YOU SEE WHAT ONCE 515 00:20:16,320 --> 00:20:18,320 WAS CALLED CONFERENCE ROOM B, A 516 00:20:18,320 --> 00:20:20,400 TERRIBLY SMALL AND DARK 517 00:20:20,400 --> 00:20:21,880 CONFERENCE ROOM ON THE SECOND 518 00:20:21,880 --> 00:20:23,360 FLOOR ON THE MEZZANINE, WILL NOW 519 00:20:23,360 --> 00:20:26,520 BE AN OFFICE SUITE. 520 00:20:26,520 --> 00:20:29,920 WE'RE BUILDING A MEZZANINE SPACE 521 00:20:29,920 --> 00:20:31,600 WITH OPPORTUNITIES FOR PEOPLE 522 00:20:31,600 --> 00:20:33,920 AND SUFFICIENT SPACE FOR PEOPLE 523 00:20:33,920 --> 00:20:35,600 TO STORE FOOD AND HEAT FOOD UP. 524 00:20:35,600 --> 00:20:37,880 ON THE LOWER RIGHT YOU SEE THE 525 00:20:37,880 --> 00:20:39,680 HANDRAILS UP THE FRONT OF STEPS 526 00:20:39,680 --> 00:20:43,920 FINALLY BEING REPAIRED AND OUR 527 00:20:43,920 --> 00:20:45,120 TERRAZZO STEP WELLS ARE BEING 528 00:20:45,120 --> 00:20:45,360 REPAIRED. 529 00:20:45,360 --> 00:20:49,040 ON THE UPPER RIGHT YOU SEE WHAT 530 00:20:49,040 --> 00:20:52,080 WAS THE READING ROOM AND THE OLD 531 00:20:52,080 --> 00:20:57,120 LARGE CABINETS ARE BEING 532 00:20:57,120 --> 00:20:58,360 COMBINED INTO OFFICE SPACES WITH 533 00:20:58,360 --> 00:21:00,760 A VARIETY OF HUDDLE ROOMS AND 534 00:21:00,760 --> 00:21:02,520 PRIVATE OFFICES FOR INDIVIDUALS 535 00:21:02,520 --> 00:21:03,720 TO USE. 
536 00:21:03,720 --> 00:21:04,360 IMPORTANT TO OUR OPERATION IN 537 00:21:04,360 --> 00:21:05,960 THE CENTER PICTURE AT THE TOP IS 538 00:21:05,960 --> 00:21:07,440 OUR DATA CENTER. 539 00:21:07,440 --> 00:21:09,200 OUR DATA CENTER HAS NOT BEEN 540 00:21:09,200 --> 00:21:10,520 MODERNIZED IN MANY YEARS. 541 00:21:10,520 --> 00:21:13,120 WE'RE NOW BRINGING IN ADDITIONAL 542 00:21:13,120 --> 00:21:18,000 POWER RESOURCES AS WELL AS 543 00:21:18,000 --> 00:21:20,320 STABILIZING THE DRAINAGE AND THE 544 00:21:20,320 --> 00:21:21,880 COLD CHILL WATER FLOW. 545 00:21:21,880 --> 00:21:25,000 THE $7 MILLION INVESTMENT IN OUR 546 00:21:25,000 --> 00:21:27,560 DATA CENTER HAS COME NOT FROM 547 00:21:27,560 --> 00:21:29,320 NLM FUNDS BUT FROM FUNDS FROM 548 00:21:29,320 --> 00:21:30,640 THE NIH RECOGNIZING THE 549 00:21:30,640 --> 00:21:31,920 IMPORTANCE OF OUR DATA CENTER. 550 00:21:31,920 --> 00:21:34,040 I'M PLEASED WITH THE WORK 551 00:21:34,040 --> 00:21:34,320 OCCURRING. 552 00:21:34,320 --> 00:21:35,720 I'LL STOP AND SEE IF YOU HAVE 553 00:21:35,720 --> 00:21:37,360 ANY QUESTIONS OR COMMENTS. 554 00:21:37,360 --> 00:21:39,280 I CLOSE BY THANKING YOU AGAIN 555 00:21:39,280 --> 00:21:41,720 FOR YOUR EFFORTS ON BEHALF OF 556 00:21:41,720 --> 00:21:43,080 THE NATIONAL LIBRARY OF MEDICINE 557 00:21:43,080 --> 00:21:44,360 AND SCIENCE AROUND THE WORLD. 558 00:21:44,360 --> 00:21:46,080 PETER, IF YOU'D LIKE TO HANDLE 559 00:21:46,080 --> 00:21:46,680 THE QUESTIONS, I'D APPRECIATE 560 00:21:46,680 --> 00:22:10,840 THAT. 561 00:22:10,840 --> 00:22:14,560 >> I HAD A QUESTION ABOUT THE DATA 562 00:22:14,560 --> 00:22:17,120 SHARING PLAN AND WHAT REPRESENTS 563 00:22:17,120 --> 00:22:19,080 A GOOD ONE OR HOW DO WE THINK 564 00:22:19,080 --> 00:22:19,920 ABOUT THAT? 565 00:22:19,920 --> 00:22:21,200 >> IT'S NOT SCORABLE. 566 00:22:21,200 --> 00:22:26,520 IT WILL BE SUBMITTED AT JUST IN 567 00:22:26,520 --> 00:22:30,120 TIME. 
568 00:22:30,120 --> 00:22:32,440 WE THINK IT WILL BE MOST 569 00:22:32,440 --> 00:22:33,920 EFFICIENT WITH THE REVIEW AND 570 00:22:33,920 --> 00:22:36,120 THE SHARING PLAN IS IN ITSELF A 571 00:22:36,120 --> 00:22:38,520 SKILL AND WHILE IT NEEDS DOMAIN 572 00:22:38,520 --> 00:22:40,480 EXPERTISE THERE ARE SPECIFIC 573 00:22:40,480 --> 00:22:44,320 STRATEGIES OR FEATURES WE'D 574 00:22:44,320 --> 00:22:45,040 EXPECT. 575 00:22:45,040 --> 00:22:46,960 ARE THERE FUNDS TO SUPPORT DATA 576 00:22:46,960 --> 00:22:48,880 MANAGEMENT OR SHARING OR 577 00:22:48,880 --> 00:22:49,840 TRAINING PROGRAMS AND TOOLS? 578 00:22:49,840 --> 00:22:51,640 I WOULD SAY YES TO ALL THREE. 579 00:22:51,640 --> 00:22:55,240 IT'S ACCEPTABLE TO PUT FUNDS 580 00:22:55,240 --> 00:22:57,600 INTO THE DIRECT COST FOR DATA 581 00:22:57,600 --> 00:22:59,360 MANAGEMENT AND SHARING AND WE'RE 582 00:22:59,360 --> 00:23:02,320 HOPING IT WILL HELP INCENTIVIZE 583 00:23:02,320 --> 00:23:03,200 RESEARCHERS FROM THE BEGINNING 584 00:23:03,200 --> 00:23:06,040 TO THINK HOW TO MANAGE THIS. 585 00:23:06,040 --> 00:23:07,880 SECONDLY, WE ARE PROVIDING -- 586 00:23:07,880 --> 00:23:09,760 SOME INSTITUTES HAVE STARTED AND 587 00:23:09,760 --> 00:23:14,160 SOME ARE PREPARING -- TEMPLATES FOR 588 00:23:14,160 --> 00:23:16,440 THE PLAN TO BE DEVELOPED IN 589 00:23:16,440 --> 00:23:16,960 EFFICIENT WAYS. 590 00:23:16,960 --> 00:23:21,160 THEY'RE NOT GOING TO BE 591 00:23:21,160 --> 00:23:23,720 ELECTRONIC WE PUSHED FOR THAT SO 592 00:23:23,720 --> 00:23:26,520 THEY CAN BE REUSED BY 593 00:23:26,520 --> 00:23:28,520 INSTITUTIONS TO HELP 594 00:23:28,520 --> 00:23:30,520 INSTITUTIONS ANTICIPATE WHAT THE 595 00:23:30,520 --> 00:23:33,040 CHALLENGES WOULD BE. 596 00:23:33,040 --> 00:23:37,960 AND FINALLY, WE'RE INVESTING IN 597 00:23:37,960 --> 00:23:39,520 DEVELOPING DATA SCIENCE 598 00:23:39,520 --> 00:23:42,360 LIBRARIANS AROUND THE COUNTRY TO 599 00:23:42,360 --> 00:23:43,880 PROVIDE INDIVIDUAL SUPPORT. 
600 00:23:43,880 --> 00:23:46,520 THERE'S A SERIES OF TRAINING 601 00:23:46,520 --> 00:23:48,480 PROGRAMS THAT WILL BE OFFERED 602 00:23:48,480 --> 00:23:49,160 THROUGH THE EXTRAMURAL RESEARCH 603 00:23:49,160 --> 00:23:51,600 PROGRAM THROUGH THE NEXT YEAR TO 604 00:23:51,600 --> 00:23:53,120 HELP PEOPLE LEARN HOW TO SET UP A 605 00:23:53,120 --> 00:23:55,320 PLAN AND HOW TO THINK ABOUT DATA 606 00:23:55,320 --> 00:23:57,160 IN A SYSTEMATIC WAY. 607 00:23:57,160 --> 00:24:00,760 THE DETERMINATION OF WHICH DATA 608 00:24:00,760 --> 00:24:01,920 IS APPROPRIATE TO SHARE IS 609 00:24:01,920 --> 00:24:04,800 ACTUALLY IN ITSELF A CHALLENGING 610 00:24:04,800 --> 00:24:07,280 THING FOR SCIENTISTS. 611 00:24:07,280 --> 00:24:09,720 EVERY SNEEZE OR BURP OR STEP 612 00:24:09,720 --> 00:24:11,400 ACROSS A FLOOR IS NOT ESSENTIAL 613 00:24:11,400 --> 00:24:13,400 TO BE SHARED AND YET IN SOME 614 00:24:13,400 --> 00:24:15,120 CASES, IF YOU'RE MANAGING RATS 615 00:24:15,120 --> 00:24:17,120 IN A SPECIFIC FOOD PRODUCTION 616 00:24:17,120 --> 00:24:18,360 WITH A SPECIFIC ENVIRONMENTAL 617 00:24:18,360 --> 00:24:20,120 CONTROL ALL THAT DATA IS 618 00:24:20,120 --> 00:24:21,800 NECESSARY TO SHARE. 619 00:24:21,800 --> 00:24:25,160 WE RECOGNIZE IT'S GOING TO 620 00:24:25,160 --> 00:24:27,120 REQUIRE SOPHISTICATION AND 621 00:24:27,120 --> 00:24:27,480 FAMILIARITY. 622 00:24:27,480 --> 00:24:29,520 WE EXPECT IT WILL BE A LEARNING 623 00:24:29,520 --> 00:24:32,880 PROCESS AS WE BEGIN TO GET MORE 624 00:24:32,880 --> 00:24:34,040 SOPHISTICATION WITH DATA 625 00:24:34,040 --> 00:24:35,720 MANAGEMENT AND SHARING PLANS WE 626 00:24:35,720 --> 00:24:37,480 CAN SHARE BACK TO COMMUNITIES TO 627 00:24:37,480 --> 00:24:42,120 BE ABLE TO HAVE THAT EFFORT. 
628 00:24:42,120 --> 00:24:45,520 AND THERE'S A PLAN TO WORK WITH 629 00:24:45,520 --> 00:24:47,640 THE OFFICE OF SCIENCE POLICY AND 630 00:24:47,640 --> 00:24:49,120 EXTRAMURAL RESEARCH TO PROVIDE 631 00:24:49,120 --> 00:24:49,880 TRAINING SESSIONS AT CONFERENCES 632 00:24:49,880 --> 00:24:52,600 AROUND THE COUNTRY IN THE NEXT 633 00:24:52,600 --> 00:24:54,240 YEAR TO GIVE PEOPLE HANDS-ON 634 00:24:54,240 --> 00:24:56,440 EXPERIENCE IN WORKING WITH THEM. 635 00:24:56,440 --> 00:24:57,040 >> THANKS. 636 00:24:57,040 --> 00:24:59,120 AND QUICKLY, THE DATA CENTER: WITH THE 637 00:24:59,120 --> 00:24:59,880 WORLD GOING TO CLOUD DID YOU 638 00:24:59,880 --> 00:25:02,880 THINK ABOUT THAT AT ALL? 639 00:25:02,880 --> 00:25:04,800 >> WE SURE DID. 640 00:25:04,800 --> 00:25:07,960 THE WORLD IS GOING TO CLOUD AND 641 00:25:07,960 --> 00:25:13,760 ON-PREM DATA RESOURCES ARE 642 00:25:13,760 --> 00:25:14,000 NEEDED. 643 00:25:14,000 --> 00:25:17,040 WE MANAGE A SIGNIFICANT AMOUNT 644 00:25:17,040 --> 00:25:17,920 OF CONTROLLED ACCESS DATA. 645 00:25:17,920 --> 00:25:20,080 AT THIS POINT IN TIME THE 646 00:25:20,080 --> 00:25:21,520 STRATEGIES FOR MOVING DATA TO 647 00:25:21,520 --> 00:25:22,720 THE CLOUD PARTICULARLY WHEN IT 648 00:25:22,720 --> 00:25:25,240 CONSISTS OF CONTROLLED ACCESS 649 00:25:25,240 --> 00:25:26,320 DATA ARE NOT COMPLETELY WORKED 650 00:25:26,320 --> 00:25:31,280 OUT AND WE DON'T ENVISION EVER 651 00:25:31,280 --> 00:25:35,200 HAVING A SOLE ON CLOUD PREMISE. 652 00:25:35,200 --> 00:25:38,920 WE WILL ALWAYS HAVE SOME AND 653 00:25:38,920 --> 00:25:40,120 WHAT THE DATA PROCESSING CENTER 654 00:25:40,120 --> 00:25:42,520 IS DOING IS ALLOWING US TO 655 00:25:42,520 --> 00:25:46,520 DETERMINE THE KIND OF LONG-TERM 656 00:25:46,520 --> 00:25:48,840 SUPPORTS WE MUST HAVE AND YET 657 00:25:48,840 --> 00:25:52,240 THOSE WOULD BE REDUNDANT AND NOT 658 00:25:52,240 --> 00:25:53,720 REDUNDANT WITH THE CLOUD. 
659 00:25:53,720 --> 00:25:55,800 WE WOULDN'T PROVIDE 100% 660 00:25:55,800 --> 00:25:56,760 REDUNDANCY. 661 00:25:56,760 --> 00:25:57,760 IT WILL ALLOW US TO REDUCE AND 662 00:25:57,760 --> 00:26:01,920 PERHAPS ELIMINATE THE REDUNDANT 663 00:26:01,920 --> 00:26:03,880 DATA CENTER IN STERLING. 664 00:26:03,880 --> 00:26:05,400 THAT'S WHERE A BIG EFFICIENCY 665 00:26:05,400 --> 00:26:06,440 WILL COME IN, PULLING BACK FROM 666 00:26:06,440 --> 00:26:06,720 STERLING. 667 00:26:06,720 --> 00:26:21,520 >> THANK YOU. 668 00:26:21,520 --> 00:26:23,000 >> OTHER QUESTIONS FROM ANYBODY? 669 00:26:23,000 --> 00:26:24,920 >> I HAVE ONE. 670 00:26:24,920 --> 00:26:26,760 GOOD MORNING, ALL. 671 00:26:26,760 --> 00:26:29,520 HI, PATTI. 672 00:26:29,520 --> 00:26:30,680 >> HOW ARE YOU, STELLA? 673 00:26:30,680 --> 00:26:31,160 >> GREAT. 674 00:26:31,160 --> 00:26:33,360 IT WAS GREAT TO SEE THE PROGRESS 675 00:26:33,360 --> 00:26:34,440 AND THE BUILDING CONSTRUCTION. 676 00:26:34,440 --> 00:26:36,280 YOU'LL HAVE AN AWESOME SPACE FOR 677 00:26:36,280 --> 00:26:38,560 EVERYBODY. 678 00:26:38,560 --> 00:26:41,720 CAN'T WAIT TO SEE IT. 679 00:26:41,720 --> 00:26:45,320 I HAVE A QUESTION FOLLOWING UP 680 00:26:45,320 --> 00:26:46,520 ON THE QUESTION ON 681 00:26:46,520 --> 00:26:47,520 INFRASTRUCTURE. 682 00:26:47,520 --> 00:26:59,160 REAL-WORLD DATA AND SOME 683 00:26:59,160 --> 00:27:03,920 ALTERNATIVE DATA AND OTHER DATA 684 00:27:03,920 --> 00:27:07,160 BEING PRODUCED BY SCIENTISTS 685 00:27:07,160 --> 00:27:14,600 AROUND THE NATION AND CAN BE 686 00:27:14,600 --> 00:27:26,000 SHARED AND PROVIDING A CATALOG. 687 00:27:26,000 --> 00:27:28,360 >> THE NIH OVERALL IS DEVELOPING 688 00:27:28,360 --> 00:27:31,920 A STRATEGIC PLAN FOR INFORMATION 689 00:27:31,920 --> 00:27:32,960 TECHNOLOGY WHICH IT NEVER HAS HAD 690 00:27:32,960 --> 00:27:33,400 BEFORE. 
691 00:27:33,400 --> 00:27:35,720 IT'S AN EXCITING ACTIVITY AND 692 00:27:35,720 --> 00:27:37,400 I'M CO-CHAIRING THIS AND ONE OF 693 00:27:37,400 --> 00:27:38,640 THE CRITICAL THINGS THAT'S COME 694 00:27:38,640 --> 00:27:44,680 UP IS WHAT DO WE DO WITH CLINICAL 695 00:27:44,680 --> 00:27:48,960 DATA AND I'M PRIVILEGED TO HAVE 696 00:27:48,960 --> 00:27:50,760 ACCESS TO MAJOR UNIVERSITIES. 697 00:27:50,760 --> 00:27:54,080 SOME INSTITUTIONS ARE 698 00:27:54,080 --> 00:27:58,520 ENVISIONING DOING REDUNDANCY 699 00:27:58,520 --> 00:28:01,720 INTO CLINICAL DATA REPOSITORIES 700 00:28:01,720 --> 00:28:04,320 AND SOME WOULD HAVE 701 00:28:04,320 --> 00:28:05,720 PATIENT-GENERATED DATA AND DATA 702 00:28:05,720 --> 00:28:07,920 BROUGHT TOGETHER IN A MORE 703 00:28:07,920 --> 00:28:08,920 SYSTEMATIC FASHION. 704 00:28:08,920 --> 00:28:09,840 WE'RE WORKING WITH THE 705 00:28:09,840 --> 00:28:12,920 INSTITUTES AND CENTERS ACROSS 706 00:28:12,920 --> 00:28:16,480 THE NIH TO BETTER UNDERSTAND THE 707 00:28:16,480 --> 00:28:21,240 NEEDS OF REAL-WORLD DATA AND 708 00:28:21,240 --> 00:28:24,840 SPOKEN SPEECH OF CHILDREN IS 709 00:28:24,840 --> 00:28:28,040 BECOMING AN ISSUE AND ASSESSING 710 00:28:28,040 --> 00:28:30,560 SWALLOWING IN PEOPLE WHO HAVE 711 00:28:30,560 --> 00:28:33,080 HAD STROKES REQUIRES 712 00:28:33,080 --> 00:28:33,720 ELECTROPHYSIOLOGY AND MOVEMENT 713 00:28:33,720 --> 00:28:34,680 OF FLUIDS. 714 00:28:34,680 --> 00:28:38,240 THERE'S NEW DATA COMING. 
715 00:28:38,240 --> 00:28:43,800 I DON'T EXPECT THE NLM WILL 716 00:28:43,800 --> 00:28:46,920 BECOME A REPOSITORY OF DATA 717 00:28:46,920 --> 00:28:50,120 GENERATED BY NIH BUT I EXPECT TO 718 00:28:50,120 --> 00:28:52,280 HAVE SOLUTIONS FOR THE NEED FOR 719 00:28:52,280 --> 00:28:55,360 THE DATA REPOSITORIES AND THE 720 00:28:55,360 --> 00:28:56,640 CONVERSATION IS BALANCING WHAT IS 721 00:28:56,640 --> 00:28:59,200 TO BE DONE BY A STEWARD AS A 722 00:28:59,200 --> 00:29:00,520 REPOSITORY ON THE CAMPUS AND AS 723 00:29:00,520 --> 00:29:06,800 AN NIH-FUNDED REPOSITORY. 724 00:29:06,800 --> 00:29:08,120 WE HAVE SEVERAL OPPORTUNITIES 725 00:29:08,120 --> 00:29:10,120 THROUGH THE OFFICE OF DATA 726 00:29:10,120 --> 00:29:14,280 SCIENCE STRATEGY AND NOTICE OF 727 00:29:14,280 --> 00:29:18,720 SPECIAL INTEREST AND STIMULATION 728 00:29:18,720 --> 00:29:22,080 OF NEW AWARDS. WE'RE TRYING TO 729 00:29:22,080 --> 00:29:25,480 MOVE THEM OUT OF THE R01 FUNDING 730 00:29:25,480 --> 00:29:27,680 LINE WHICH IS THE TRADITIONAL 731 00:29:27,680 --> 00:29:29,760 WAY MANY HAVE BEEN FUNDED AND 732 00:29:29,760 --> 00:29:39,280 HARD TO SUSTAIN. AND FINAL 733 00:29:39,280 --> 00:29:41,640 CLOSING REMARKS, FDA HAS AN 734 00:29:41,640 --> 00:29:43,680 INTEREST IN REAL-WORLD DATA AND 735 00:29:43,680 --> 00:29:44,080 WE'RE LOOKING AT 736 00:29:44,080 --> 00:29:44,720 CROSS-GOVERNMENT SOLUTIONS FOR 737 00:29:44,720 --> 00:29:53,080 THIS. 738 00:29:53,080 --> 00:29:54,480 I THINK IT'S EXCITING. 
739 00:29:54,480 --> 00:30:00,120 I WAS WONDERING IF NLM WOULDN'T 740 00:30:00,120 --> 00:30:02,960 BE THE EXACT PLACE WHERE 741 00:30:02,960 --> 00:30:04,320 EDUCATION TO THE PUBLIC IN 742 00:30:04,320 --> 00:30:10,520 GENERAL ABOUT IMPLICATIONS OF 743 00:30:10,520 --> 00:30:14,520 HAVING THEIR DATA USED FOR 744 00:30:14,520 --> 00:30:20,160 RESEARCH COULD BE CENTERED 745 00:30:20,160 --> 00:30:21,640 THERE. WHAT I BELIEVE AFTER 746 00:30:21,640 --> 00:30:23,720 HAVING BEEN A RECRUITER AND A 747 00:30:23,720 --> 00:30:26,120 DATA STEWARD IS PEOPLE DON'T 748 00:30:26,120 --> 00:30:27,200 ACTUALLY KNOW WHAT THEY'RE 749 00:30:27,200 --> 00:30:29,480 SIGNING OFF TO AND THEN AFTER 750 00:30:29,480 --> 00:30:31,200 THEY'LL FIND OUT FOR EXAMPLE 751 00:30:31,200 --> 00:30:32,760 THAT THE DATA HAD BEEN SHARED IN 752 00:30:32,760 --> 00:30:38,320 THIS WAY OR THAT WAY AND THAT 753 00:30:38,320 --> 00:30:41,720 MIGHT HAVE SOME PROBLEM WITH 754 00:30:41,720 --> 00:30:45,040 TRUST FOR FUTURE RESEARCH AS 755 00:30:45,040 --> 00:30:46,280 WELL AS LINKING. 756 00:30:46,280 --> 00:30:48,760 IT'S JUST A SUGGESTION BUT I THINK 757 00:30:48,760 --> 00:30:51,120 NLM IS THE RIGHT PLACE TO 758 00:30:51,120 --> 00:30:51,920 INITIATE SUCH EFFORTS. 759 00:30:51,920 --> 00:30:54,520 >> SO WE'RE WORKING CLOSELY WITH 760 00:30:54,520 --> 00:30:56,080 OUR NETWORK AT THE NATIONAL 761 00:30:56,080 --> 00:30:57,680 LIBRARY OF MEDICINE WHO HAVE 762 00:30:57,680 --> 00:30:58,920 POINTS OF PRESENCE IN 8,000 763 00:30:58,920 --> 00:31:00,200 COMMUNITIES AROUND THE COUNTRY 764 00:31:00,200 --> 00:31:01,200 TO BE PART OF OUR COMMUNICATION 765 00:31:01,200 --> 00:31:07,640 CHAIN. 
766 00:31:07,640 --> 00:31:08,720 AND THERE'S AN INITIATIVE AND 767 00:31:08,720 --> 00:31:10,400 YOU MAY RECALL THIS SUMMER IN 768 00:31:10,400 --> 00:31:12,440 OUR CONTROLLED DATA ACCESS 769 00:31:12,440 --> 00:31:13,800 WORKING GROUP TO BETTER 770 00:31:13,800 --> 00:31:18,520 UNDERSTAND WHAT THE ISSUES ARE 771 00:31:18,520 --> 00:31:20,800 AND WE FOUND A PARADOX 772 00:31:20,800 --> 00:31:22,280 WITH PATIENTS. 773 00:31:22,280 --> 00:31:23,640 ON ONE HAND PEOPLE ACTIVELY 774 00:31:23,640 --> 00:31:27,120 ENGAGED WITH RESEARCH AND OFTEN 775 00:31:27,120 --> 00:31:27,920 FAMILIES WITH CHILDREN WITH 776 00:31:27,920 --> 00:31:30,480 COMPLEX ILLNESSES WANT TO STOP 777 00:31:30,480 --> 00:31:33,120 GIVING THE SAME DATA AND WANT IT 778 00:31:33,120 --> 00:31:34,360 DEPOSITED ONCE AND USED MANY 779 00:31:34,360 --> 00:31:35,320 TIMES BY DIFFERENT RESEARCHERS. 780 00:31:35,320 --> 00:31:38,720 AT THE SAME TIME THE IDEA THAT YOUR 781 00:31:38,720 --> 00:31:39,680 CLINICAL RECORD COULD NOW BE 782 00:31:39,680 --> 00:31:41,720 SITTING IN A REPOSITORY AND 783 00:31:41,720 --> 00:31:44,240 COULD BE USED TO CREATE NEW 784 00:31:44,240 --> 00:31:45,480 UNDERSTANDING ON HOW TO TAKE 785 00:31:45,480 --> 00:31:48,720 CARE OF HYPERTENSION OR A NEW 786 00:31:48,720 --> 00:31:52,400 DEVICE WHICH WILL LEAD TO 787 00:31:52,400 --> 00:31:53,720 SPECIALIZATION IS SOMETHING WE 788 00:31:53,720 --> 00:31:59,720 HAVE NEED TO MAKE DIFFERENT. 789 00:31:59,720 --> 00:32:01,600 AND THERE'S CHOICE DATA AND YET 790 00:32:01,600 --> 00:32:04,480 WHAT I REALIZED IS YOU CAN'T DO 791 00:32:04,480 --> 00:32:06,400 IT AT THE POINT OF THE STUDY OR 792 00:32:06,400 --> 00:32:06,720 CARE. 
793 00:32:06,720 --> 00:32:08,320 IT MUST BE EMBEDDED IN THE LIFE 794 00:32:08,320 --> 00:32:09,920 OF THE INDIVIDUAL AND RETURNED 795 00:32:09,920 --> 00:32:14,520 TO A CONCEPT CALLED THE 796 00:32:14,520 --> 00:32:15,960 KINDERGARTEN CURRICULUM FOR DATA. 797 00:32:15,960 --> 00:32:16,920 SAYING WASH YOUR HANDS AFTER YOU 798 00:32:16,920 --> 00:32:19,320 GO TO THE POTTY, ALL KIDS KNOW 799 00:32:19,320 --> 00:32:19,520 THAT. 800 00:32:19,520 --> 00:32:21,960 IF WE CAN GET THE SAME LEVEL OF 801 00:32:21,960 --> 00:32:23,720 UNDERSTANDING OF DATA AS PART OF 802 00:32:23,720 --> 00:32:26,520 YOUR HEALTH PROCESS BEGINNING IN 803 00:32:26,520 --> 00:32:27,400 KINDERGARTEN AND MOVING THROUGH 804 00:32:27,400 --> 00:32:29,720 THE LIFE CYCLE WE'LL MAKE 805 00:32:29,720 --> 00:32:30,000 PROGRESS. 806 00:32:30,000 --> 00:32:31,960 IN TERMS OF HOW TO HELP PEOPLE 807 00:32:31,960 --> 00:32:34,280 AT THE MOMENT OF SIGNING THE 808 00:32:34,280 --> 00:32:36,680 PAPER, WE'RE EXPLORING ANIMATION 809 00:32:36,680 --> 00:32:38,040 AND DIFFERENT KINDS OF VISUAL 810 00:32:38,040 --> 00:32:40,520 TOOLS THAT WILL HELP PEOPLE 811 00:32:40,520 --> 00:32:44,280 CONVEY WHAT IS REQUIRED BY CF49 812 00:32:44,280 --> 00:32:46,040 BUT ALLOWS THEM TO INCORPORATE 813 00:32:46,040 --> 00:32:47,200 AND UNDERSTAND IN THEIR OWN 814 00:32:47,200 --> 00:32:55,600 FRAME OF REFERENCE. 815 00:32:55,600 --> 00:32:57,480 >> WE HAVE TIME FOR ONE MORE 816 00:32:57,480 --> 00:32:58,240 QUESTION BEFORE VALERIE STARTS 817 00:32:58,240 --> 00:33:16,880 HER REMARKS. 818 00:33:16,880 --> 00:33:18,960 >> I HAVE A QUESTION ABOUT DARPA FOR 819 00:33:18,960 --> 00:33:21,720 HEALTH AND WHETHER THE 820 00:33:21,720 --> 00:33:26,520 INSTITUTES OR NLM WILL HAVE 821 00:33:26,520 --> 00:33:36,520 ANYTHING TO DO WITH THAT. 822 00:33:36,520 --> 00:33:46,520 >> HERE'S THE PARADOX AND YOU 823 00:33:46,520 --> 00:33:51,120 HEARD NIH IS SLOW AND 824 00:33:51,120 --> 00:33:51,520 NON-RESPONSIVE. 
825 00:33:51,520 --> 00:33:53,720 WE STAND READY TO BE PARTNERS 826 00:33:53,720 --> 00:33:55,720 WITH ARPA-H AND THERE WILL BE A 827 00:33:55,720 --> 00:33:59,720 NEED TO HAVE FUNDING TRANSFERS 828 00:33:59,720 --> 00:34:01,520 BECAUSE FOR ANY OTHER GOVERNMENT 829 00:34:01,520 --> 00:34:04,520 AGENCY WE PROVIDE SERVICES FOR 830 00:34:04,520 --> 00:34:05,720 THERE'S A FUNDING MODEL THAT 831 00:34:05,720 --> 00:34:06,440 SUPPORTS IT. 832 00:34:06,440 --> 00:34:09,600 I'M PARTICULARLY INTERESTED IN 833 00:34:09,600 --> 00:34:11,360 THE DATA MANAGEMENT ISSUES AND 834 00:34:11,360 --> 00:34:12,840 THEY'RE ON TWO LEVELS. 835 00:34:12,840 --> 00:34:14,680 ONE IS THE POLICY LEVEL: WILL THE 836 00:34:14,680 --> 00:34:16,720 ARPA-H ACTIVITIES BE BOUND BY 837 00:34:16,720 --> 00:34:18,640 THE NIH POLICIES AROUND GENOME 838 00:34:18,640 --> 00:34:20,440 DATA SHARING AND SHARING 839 00:34:20,440 --> 00:34:21,920 POLICIES? AND SECOND, IF THEY ARE, 840 00:34:21,920 --> 00:34:23,760 AND THEY ARE GOING TO 841 00:34:23,760 --> 00:34:26,520 GENERATE HUGE AMOUNTS OF DATA, 842 00:34:26,520 --> 00:34:29,000 WHERE WILL WE PUT IT? 843 00:34:29,000 --> 00:34:33,480 WE HAVE MOVED HIGH VALUE 844 00:34:33,480 --> 00:34:35,680 MOLECULAR DATABASES TO THE CLOUD 845 00:34:35,680 --> 00:34:37,520 FOR CONTINUED ACCESSIBILITY AND 846 00:34:37,520 --> 00:34:41,360 WHEN IT GETS TO REAL-WORLD 847 00:34:41,360 --> 00:34:44,160 BEHAVIORAL DATA AND VISUAL IMAGE 848 00:34:44,160 --> 00:34:45,520 DATA, VIDEO, SOUND, THEY'RE 849 00:34:45,520 --> 00:34:47,720 DIFFERENT KINDS OF DATA 850 00:34:47,720 --> 00:34:47,960 RESOURCES. 851 00:34:47,960 --> 00:34:49,640 WE'LL HAVE TO SCALE UP AND ARE 852 00:34:49,640 --> 00:34:51,040 LOOKING FORWARD TO ARPA-H 853 00:34:51,040 --> 00:34:53,720 BRINGING INNOVATION IN THE DATA 854 00:34:53,720 --> 00:34:53,920 AREA. 855 00:34:53,920 --> 00:34:54,520 THANKS VERY MUCH. 856 00:34:54,520 --> 00:34:56,200 I'LL CLOSE BY THANKING YOU ALL. 
857 00:34:56,200 --> 00:34:57,520 HAVE A TERRIFIC DAY. 858 00:34:57,520 --> 00:35:00,120 I'LL BE BACK TO HEAR THE 859 00:35:00,120 --> 00:35:03,120 PRESENTATIONS BUT I HAVE TO STEP 860 00:35:03,120 --> 00:35:04,000 AWAY FOR A BRIEFING MEETING 861 00:35:04,000 --> 00:35:06,240 RIGHT NOW. 862 00:35:06,240 --> 00:35:08,520 >> THANK YOU, PATTI. 863 00:35:08,520 --> 00:35:09,720 VALERIE, THANK YOU FOR GIVING US 864 00:35:09,720 --> 00:35:11,040 SOME WORDS AS THE ACTING 865 00:35:11,040 --> 00:35:11,760 DIRECTOR. 866 00:35:11,760 --> 00:35:12,840 PLEASE GO AHEAD. 867 00:35:12,840 --> 00:35:22,960 >> THANK YOU, HI EVERYBODY. 868 00:35:22,960 --> 00:35:25,320 I'M GOING TO TAKE A SECOND AND 869 00:35:25,320 --> 00:35:26,880 TRY TO BE TALENTED ENOUGH TO 870 00:35:26,880 --> 00:35:41,720 SHARE MY SCREEN WITH YOU. 871 00:35:41,720 --> 00:35:43,600 JUST TO GIVE A BRIEF OVERVIEW OF 872 00:35:43,600 --> 00:35:44,800 WHAT'S BEEN GOING ON IN THE 873 00:35:44,800 --> 00:35:46,520 OFFICE OF THE SCIENTIFIC 874 00:35:46,520 --> 00:35:47,680 DIRECTOR. 875 00:35:47,680 --> 00:35:49,440 AS YOU KNOW THAT'S BEEN MY ROLE 876 00:35:49,440 --> 00:35:50,720 FOR ABOUT THE LAST YEAR, YEAR 877 00:35:50,720 --> 00:35:55,280 AND A HALF. 878 00:35:55,280 --> 00:35:57,840 AND PART IS BECAUSE OF OUR 879 00:35:57,840 --> 00:36:00,640 EFFORTS TO BRING TOGETHER WHAT 880 00:36:00,640 --> 00:36:01,440 USED TO BE TWO SEPARATE RESEARCH 881 00:36:01,440 --> 00:36:05,800 PROGRAMS INTO A SINGLE ONE UNDER 882 00:36:05,800 --> 00:36:11,800 THE CARE AND FEEDING OF A SINGLE 883 00:36:11,800 --> 00:36:13,360 SCIENTIFIC DIRECTOR. 884 00:36:13,360 --> 00:36:18,920 AND SO THAT IS OUR HOME AND I 885 00:36:18,920 --> 00:36:20,640 WANT TO SAY SOME THINGS ABOUT 886 00:36:20,640 --> 00:36:20,840 IT. 887 00:36:20,840 --> 00:36:22,320 WHAT I'LL TALK ABOUT TODAY ARE A 888 00:36:22,320 --> 00:36:22,680 COUPLE THINGS. 
889 00:36:22,680 --> 00:36:25,600 I WANT TO GIVE YOU A SUMMARY OF 890 00:36:25,600 --> 00:36:38,160 A REPORT THAT I GAVE TO THE NIH 891 00:36:38,160 --> 00:36:38,840 EQUITY COMMITTEE AND THAT'S 892 00:36:38,840 --> 00:36:40,320 BEEN AROUND A WHILE. 893 00:36:40,320 --> 00:36:42,280 IT'S THE FIRST TIME NIH EVER 894 00:36:42,280 --> 00:36:42,920 REPORTED TO IT. 895 00:36:42,920 --> 00:36:48,680 I AM -- I'LL EXPLAIN WHAT THEY 896 00:36:48,680 --> 00:36:50,280 DO AND GIVE A TRAINING UPDATE. OUR TRAINING 897 00:36:50,280 --> 00:36:51,640 COORDINATOR, VIRGINIA MEIER, IS 898 00:36:51,640 --> 00:36:53,480 ALSO IN THE ROOM SO TO SPEAK. 899 00:36:53,480 --> 00:36:55,320 WHEN WE GET TO QUESTIONS, IF YOU 900 00:36:55,320 --> 00:36:59,120 HAVE SPECIFIC QUESTIONS, SHE MAY 901 00:36:59,120 --> 00:37:02,320 BE THE ONE WHO HELPS ANSWER THEM 902 00:37:02,320 --> 00:37:03,920 AND I'LL CLOSE WITH COMMENTS 903 00:37:03,920 --> 00:37:06,120 ABOUT OUR WORK TO POPULATE THE 904 00:37:06,120 --> 00:37:10,600 NINTH FLOOR OF THE BUILDING 905 00:37:10,600 --> 00:37:18,400 BEHIND MY HEAD. 906 00:37:18,400 --> 00:37:19,840 THE NIH EQUITY COMMITTEE LOOKS 907 00:37:19,840 --> 00:37:22,720 AT METRICS FOR EACH INTRAMURAL 908 00:37:22,720 --> 00:37:25,920 PROGRAM AT NIH. 909 00:37:25,920 --> 00:37:27,720 EVERY COUPLE YEARS THEY DO A 910 00:37:27,720 --> 00:37:29,520 REVIEW FOR EVERY SINGLE 911 00:37:29,520 --> 00:37:29,760 INSTITUTE. 912 00:37:29,760 --> 00:37:34,320 SOME INSTITUTES HAVE MULTIPLE 913 00:37:34,320 --> 00:37:35,200 INTRAMURAL RESEARCH PROGRAMS. 
914 00:37:35,200 --> 00:37:37,520 WE ARE PROBABLY THE SMALLEST OR 915 00:37:37,520 --> 00:37:39,080 ONE OF THE SMALLEST. 916 00:37:39,080 --> 00:37:42,200 NEVERTHELESS, THEY LOOK AT KEY 917 00:37:42,200 --> 00:37:43,160 FACTORS WHICH INCLUDE 918 00:37:43,160 --> 00:37:46,560 DEMOGRAPHICS AND SALARIES, 919 00:37:46,560 --> 00:37:48,680 HIRING LIKE THE SIZE OF THE 920 00:37:48,680 --> 00:37:52,320 RESEARCH TEAMS THAT 921 00:37:52,320 --> 00:37:52,960 INVESTIGATORS ADVISORY COUNCIL 922 00:37:52,960 --> 00:37:55,080 THE REVIEW PRACTICES OF THE 923 00:37:55,080 --> 00:37:58,600 BOARD OF SCIENTIFIC COUNSELORS. 924 00:37:58,600 --> 00:38:02,200 DECEMBER 2021 WAS OUR FIRST NEC 925 00:38:02,200 --> 00:38:02,520 REVIEW. 926 00:38:02,520 --> 00:38:06,360 THEY PROVIDED SOME TABLES AND 927 00:38:06,360 --> 00:38:08,320 ASKED ME TO PROVIDE COMMENTARY. 928 00:38:08,320 --> 00:38:11,280 I'LL SHOW YOU A COUPLE TABLES. 929 00:38:11,280 --> 00:38:11,920 THEY HIGHLIGHT WE HAVE WORK TO 930 00:38:11,920 --> 00:38:17,760 DO. 931 00:38:17,760 --> 00:38:19,840 THESE ARE SOME TABLES I WANT TO 932 00:38:19,840 --> 00:38:21,240 SHOW YOU WHICH SHOW YOU HOW 933 00:38:21,240 --> 00:38:24,680 MANY SENIOR INVESTIGATORS WE 934 00:38:24,680 --> 00:38:25,560 HAVE AND 935 00:38:25,560 --> 00:38:26,480 HOW MANY TENURE TRACK 936 00:38:26,480 --> 00:38:29,480 INVESTIGATORS WE HAVE 937 00:38:29,480 --> 00:38:30,840 AS OF APRIL 2021. 938 00:38:30,840 --> 00:38:36,280 YOU CAN SEE WE'RE A SMALL GROUP. 939 00:38:36,280 --> 00:38:41,960 YOU CAN ALSO SEE WE DEFINITELY 940 00:38:41,960 --> 00:38:45,800 DON'T HAVE GENDER PARITY IN OUR 941 00:38:45,800 --> 00:38:46,280 INVESTIGATOR POOL. 942 00:38:46,280 --> 00:38:48,520 THIS IS WORSE, SORRY, I FEEL IT 943 00:38:48,520 --> 00:38:49,280 IS AND YOU HAVE TO BE HONEST. 944 00:38:49,280 --> 00:38:56,920 LOOK AT THIS. 
945 00:38:56,920 --> 00:38:59,720 WE HAVE VERY FEW 946 00:38:59,720 --> 00:39:05,360 UNDERREPRESENTED MINORITIES IN THE 947 00:39:05,360 --> 00:39:07,040 INVESTIGATOR POOL AND HAVE WORK 948 00:39:07,040 --> 00:39:08,680 TO DO TO GET TO THE PLACE WE 949 00:39:08,680 --> 00:39:09,280 WANT TO BE. 950 00:39:09,280 --> 00:39:21,160 WE ALSO TALKED TO THEM A BIT ABOUT 951 00:39:21,160 --> 00:39:23,320 BEING CALLED A DRY LAB AS OPPOSED 952 00:39:23,320 --> 00:39:26,000 TO A WET LAB. 953 00:39:26,000 --> 00:39:28,520 YOU GUYS ARE COMPUTATIONAL AND 954 00:39:28,520 --> 00:39:29,760 DON'T NEED TO BE TOGETHER. 955 00:39:29,760 --> 00:39:30,560 OUR INVESTIGATORS AND TRAINEES 956 00:39:30,560 --> 00:39:33,640 DON'T FEEL THAT WAY. 957 00:39:33,640 --> 00:39:36,320 IN FACT, THEY FELT THE 958 00:39:36,320 --> 00:39:39,560 IMPACT OF HAVING TO BE FULLY 959 00:39:39,560 --> 00:39:40,760 REMOTE INTERFERED WITH THEIR 960 00:39:40,760 --> 00:39:46,280 RESEARCH AND ALSO THEIR 961 00:39:46,280 --> 00:39:53,680 INTERACTIONS SO THERE'S WAVES OF 962 00:39:53,680 --> 00:39:58,520 PEOPLE COMING BACK UNTIL NEXT 963 00:39:58,520 --> 00:40:00,120 WEEK WHERE ALMOST ALL NIH 964 00:40:00,120 --> 00:40:04,400 FEDERAL EMPLOYEES WILL BE 965 00:40:04,400 --> 00:40:05,520 STARTING ON REGULAR THOUGH 966 00:40:05,520 --> 00:40:11,720 PROBABLY RESTRICT FROM FORMER 967 00:40:11,720 --> 00:40:13,680 SCHEDULES. 968 00:40:13,680 --> 00:40:15,000 SOME VOLUNTEERED TO COME BACK 969 00:40:15,000 --> 00:40:16,120 FOR THAT REASON BECAUSE THEY 970 00:40:16,120 --> 00:40:16,920 FELT THEY WEREN'T GETTING 971 00:40:16,920 --> 00:40:17,720 EVERYTHING THEY WANTED. 972 00:40:17,720 --> 00:40:23,720 WE HAVE HAD PEOPLE IN PLACE A 973 00:40:23,720 --> 00:40:24,440 FEW MONTHS. 974 00:40:24,440 --> 00:40:26,480 I MADE SOME RECOMMENDATIONS TO 975 00:40:26,480 --> 00:40:27,120 THEM TOO. 
976 00:40:27,120 --> 00:40:39,880 I WAS ALLOWED TO DO THAT AND 977 00:40:39,880 --> 00:40:41,960 LOOK AT THE STEPS TO TAKE AND 978 00:40:41,960 --> 00:40:47,240 ASKED THEM IF WE CAN INCREASE 979 00:40:47,240 --> 00:40:49,720 THE VISIBILITY FOR CLINICAL 980 00:40:49,720 --> 00:40:52,640 INFORMATICS AND BIOMEDICAL 981 00:40:52,640 --> 00:40:54,320 SCIENCE. IF YOU SEARCH NIH 982 00:40:54,320 --> 00:40:56,680 IRP AND LOOK AT THE LIST THAT 983 00:40:56,680 --> 00:41:01,880 SAYS SHOW ME THE TOPICS YOU 984 00:41:01,880 --> 00:41:04,240 COVER, THEY DON'T MENTION THESE. 985 00:41:04,240 --> 00:41:05,720 THEY MENTION LOTS OF BIOLOGY 986 00:41:05,720 --> 00:41:08,080 TOPICS AND EPIDEMIOLOGY IS THERE 987 00:41:08,080 --> 00:41:09,600 AND THERE'S AN ENGINEERING 988 00:41:09,600 --> 00:41:09,960 TOPIC. 989 00:41:09,960 --> 00:41:15,120 WHEN YOU ACTUALLY GO AND LOOK 990 00:41:15,120 --> 00:41:20,960 FOR INVESTIGATORS, FOR EXAMPLE, 991 00:41:20,960 --> 00:41:27,000 TWO OF THE PEOPLE IN OUR 992 00:41:27,000 --> 00:41:30,040 COMPUTATIONAL SEARCH BRANCH ARE 993 00:41:30,040 --> 00:41:31,120 LISTED AS COMPUTATIONAL 994 00:41:31,120 --> 00:41:34,520 BIOLOGISTS AND IF YOU LOOK FOR 995 00:41:34,520 --> 00:41:35,720 SOMEONE PROVIDING TRAINING 996 00:41:35,720 --> 00:41:38,520 OPPORTUNITIES IN AN AREA I CARE 997 00:41:38,520 --> 00:41:40,720 ABOUT IT'S HARDER TO FIND US. 998 00:41:40,720 --> 00:41:42,520 I HAVEN'T GIVEN UP TRYING TO 999 00:41:42,520 --> 00:41:43,880 CONVINCE THEM THEY NEED TO 1000 00:41:43,880 --> 00:41:46,680 EXPAND THEIR LIST OF TOPICS. 1001 00:41:46,680 --> 00:41:48,640 THE SECOND THING FOR US, WE NEED 1002 00:41:48,640 --> 00:41:51,480 BETTER STRATEGIES AND OPTIONS 1003 00:41:51,480 --> 00:41:55,960 FOR RECRUITING WOMEN AND 1004 00:41:55,960 --> 00:41:56,840 UNDERREPRESENTED MINORITIES. 
1005 00:41:56,840 --> 00:41:58,520 STATISTICS SHOW THE PERCENTAGE 1006 00:41:58,520 --> 00:42:02,320 OF WOMEN, FOR EXAMPLE, 1007 00:42:02,320 --> 00:42:03,560 OBTAINING Ph.D.s IN COMPUTER 1008 00:42:03,560 --> 00:42:06,200 SCIENCE AND ENGINEERING IS STILL 1009 00:42:06,200 --> 00:42:13,720 BELOW 35% WHERE IT'S 50% FOR 1010 00:42:13,720 --> 00:42:16,320 BIOLOGY AND THE NUMBERS FOR 1011 00:42:16,320 --> 00:42:18,520 UNDERREPRESENTED MINORITIES ARE 1012 00:42:18,520 --> 00:42:20,280 NOT AS GOOD AS THAT. 1013 00:42:20,280 --> 00:42:26,120 WE NEED TO COME UP WITH WAYS TO 1014 00:42:26,120 --> 00:42:27,000 DIVERSIFY OUR INVESTIGATOR POOL. 1015 00:42:27,000 --> 00:42:28,640 THAT TAKES ME TO THE TRAINING 1016 00:42:28,640 --> 00:42:28,880 UPDATE. 1017 00:42:28,880 --> 00:42:31,840 I'M QUITE PROUD OF THIS. 1018 00:42:31,840 --> 00:42:34,520 WE LAUNCHED THE PROGRAM AFTER 1019 00:42:34,520 --> 00:42:43,280 THIS REPORT, THE DATA SCIENCE 1020 00:42:43,280 --> 00:42:45,400 AND INFORMATICS PROGRAM, AND WE WANT 1021 00:42:45,400 --> 00:42:46,480 DIVERSITY IN OUR WORKFORCE POOL 1022 00:42:46,480 --> 00:42:47,720 AND IT WILL HELP OUR SCIENCE 1023 00:42:47,720 --> 00:42:55,320 MOVE FORWARD AND SO WE STARTED 1024 00:42:55,320 --> 00:42:56,720 ADVERTISING THE PROGRAM FOR 1025 00:42:56,720 --> 00:42:57,240 SUMMER INTERNS. 1026 00:42:57,240 --> 00:42:58,760 WE TOLD PEOPLE THIS IS WHAT YOU 1027 00:42:58,760 --> 00:42:59,520 WOULD GET. 1028 00:42:59,520 --> 00:43:01,480 YOU GET A MENTORED RESEARCH PROJECT 1029 00:43:01,480 --> 00:43:03,640 WITH ONE OF OUR SCIENTISTS. 1030 00:43:03,640 --> 00:43:05,200 YOU GET SOME COHORT ACTIVITIES 1031 00:43:05,200 --> 00:43:07,320 BECAUSE AS WE KNOW EVEN FROM OUR 1032 00:43:07,320 --> 00:43:09,720 TRAINING PROGRAMS AND THE 1033 00:43:09,720 --> 00:43:10,520 UNIVERSITIES, BUILDING A COHORT 1034 00:43:10,520 --> 00:43:14,520 OF TRAINEES IS REALLY IMPORTANT. 
1035 00:43:14,520 --> 00:43:17,760 THEY NEED TO KNOW EACH OTHER AS 1036 00:43:17,760 --> 00:43:18,520 WELL AS THEIR INVESTIGATOR 1037 00:43:18,520 --> 00:43:22,040 AND SOCIAL 1038 00:43:22,040 --> 00:43:23,760 ACTIVITIES AND JOURNAL CLUB AND 1039 00:43:23,760 --> 00:43:25,920 WE HAVE MENTORS AND ROLE MODELS 1040 00:43:25,920 --> 00:43:31,120 AND THERE ARE COURSES AND OTHER 1041 00:43:31,120 --> 00:43:32,760 ACTIVITIES THE NIH INTRAMURAL 1042 00:43:32,760 --> 00:43:34,320 PROGRAM OFFERS LIKE RESILIENCE 1043 00:43:34,320 --> 00:43:36,160 AND WELLNESS WORKSHOPS AND 1044 00:43:36,160 --> 00:43:40,920 SMALL GROUP DISCUSSIONS FROM OUR 1045 00:43:40,920 --> 00:43:41,840 NIH OFFICE OF TRAINING AND 1046 00:43:41,840 --> 00:43:52,640 EDUCATION. 1047 00:43:52,640 --> 00:43:56,160 OUR TRAINER VIRGINIA MEIER MADE 1048 00:43:56,160 --> 00:43:57,240 THE PROGRAM AS VISIBLE AS WE 1049 00:43:57,240 --> 00:43:57,480 COULD. 1050 00:43:57,480 --> 00:43:59,400 WE HAVE A NEW CYCLE OF 1051 00:43:59,400 --> 00:44:01,920 APPLICATIONS COMING IN NOVEMBER 1052 00:44:01,920 --> 00:44:04,640 BUT I WANT TO TELL YOU I'M 1053 00:44:04,640 --> 00:44:06,560 THRILLED TO SAY WE HAVE FIVE 1054 00:44:06,560 --> 00:44:09,400 INTERNS COMING THIS SUMMER IN THE 1055 00:44:09,400 --> 00:44:10,080 DIVERSITY PROGRAM. 1056 00:44:10,080 --> 00:44:13,520 VIRGINIA HAS MATCHED THEM WITH 1057 00:44:13,520 --> 00:44:17,520 INVESTIGATIVE GROUPS NOW AND WE 1058 00:44:17,520 --> 00:44:19,760 HOPE NEXT YEAR WE CAN EXPAND ON 1059 00:44:19,760 --> 00:44:23,280 THAT TO ONE TO TWO-YEAR 1060 00:44:23,280 --> 00:44:23,920 INTERNSHIPS FOR COLLEGE GRADS IN 1061 00:44:23,920 --> 00:44:28,480 THE SAME PROGRAM. 1062 00:44:28,480 --> 00:44:30,040 SO TO ME THAT'S A STEP. 1063 00:44:30,040 --> 00:44:34,200 IF WE NEED TO GROW OUR OWN, WE 1064 00:44:34,200 --> 00:44:35,400 BETTER START GROWING NOW. 
1065 00:44:35,400 --> 00:44:36,840 WHILE WE FIND OTHER STRATEGIES WE 1066 00:44:36,840 --> 00:44:39,120 CAN AT LEAST START BRINGING 1067 00:44:39,120 --> 00:44:42,040 PEOPLE IN AND LET THEM SEE HOW 1068 00:44:42,040 --> 00:44:43,320 EXCITING THE RESEARCH WE SUPPORT 1069 00:44:43,320 --> 00:44:44,160 HERE IS. 1070 00:44:44,160 --> 00:44:46,360 SPEAKING FOR THE SUMMER BROADLY 1071 00:44:46,360 --> 00:44:49,720 FOR 2022, WE HAVE FOUR INTERNS 1072 00:44:49,720 --> 00:44:53,720 THAT WILL BE SPONSORED BY THE 1073 00:44:53,720 --> 00:44:54,760 NIH INTRAMURAL PROGRAM. 1074 00:44:54,760 --> 00:45:02,520 WE HAVE FIVE DDSI, DATA SCIENCE 1075 00:45:02,520 --> 00:45:04,520 DIVERSITY INTERNS AND SUMMER 1076 00:45:04,520 --> 00:45:05,120 INTERNS AS WELL. 1077 00:45:05,120 --> 00:45:11,240 THAT'S 23 PEOPLE WE'LL HAVE HERE 1078 00:45:11,240 --> 00:45:17,880 WE HOPE TO ENGAGE IN A WAY THAT 1079 00:45:17,880 --> 00:45:20,280 KEEPS THEM COMING BACK IN THIS 1080 00:45:20,280 --> 00:45:21,720 AREA OF COMPUTATIONAL HEALTH 1081 00:45:21,720 --> 00:45:22,000 RESEARCH. 1082 00:45:22,000 --> 00:45:25,880 WE HAVE AN INCREASE IN 1083 00:45:25,880 --> 00:45:28,760 FEMALE TRAINEES THIS YEAR OVER 1084 00:45:28,760 --> 00:45:33,160 LAST YEAR. 1085 00:45:33,160 --> 00:45:34,120 THAT'S ANOTHER PLUS. 1086 00:45:34,120 --> 00:45:36,320 AND THE THIRD THING I WANTED 1087 00:45:36,320 --> 00:45:38,880 TO MENTION IS THE BUILDING BACK 1088 00:45:38,880 --> 00:45:39,080 THERE. 1089 00:45:39,080 --> 00:45:41,320 WE'RE POPULATING THE NINTH FLOOR 1090 00:45:41,320 --> 00:45:43,640 OF THE BUILDING 38A. 
1091 00:45:43,640 --> 00:45:46,520 THE TOWER BEHIND NLM OR NEXT TO 1092 00:45:46,520 --> 00:45:50,320 NLM DEPENDING ON WHERE YOU'RE 1093 00:45:50,320 --> 00:45:55,880 STANDING AND THE NINTH FLOOR 1094 00:45:55,880 --> 00:45:59,640 WILL BE THE CENTER FOR OUR 1095 00:45:59,640 --> 00:46:05,200 INTEGRATED SCIENTIFIC DIRECTOR 1096 00:46:05,200 --> 00:46:07,000 GUIDANCE AND TO BE A SHOWCASE 1097 00:46:07,000 --> 00:46:08,600 AND THE TRAINING COORDINATOR AND 1098 00:46:08,600 --> 00:46:09,880 OUR OFFICE STAFF WILL BE LOCATED 1099 00:46:09,880 --> 00:46:12,840 THERE AND IN FACT MY OFFICE IS 1100 00:46:12,840 --> 00:46:13,960 LOCATED THERE RIGHT NOW THOUGH 1101 00:46:13,960 --> 00:46:15,360 I'M THE ONLY PERSON ON THE 1102 00:46:15,360 --> 00:46:15,560 FLOOR. 1103 00:46:15,560 --> 00:46:20,520 MORE WILL COME. 1104 00:46:20,520 --> 00:46:27,400 WE INTEND TO HAVE A MIX OF 1105 00:46:27,400 --> 00:46:29,160 TRAINEES AND THEY'RE LOCATED ON 1106 00:46:29,160 --> 00:46:31,200 SEVERAL OTHER FLOORS BELOW US IN 1107 00:46:31,200 --> 00:46:35,480 THE BUILDING AND SO WE WILL 1108 00:46:35,480 --> 00:46:36,840 BRING THEM TOGETHER AND ALSO I 1109 00:46:36,840 --> 00:46:44,560 HAVE TO GO BACK FOR A SECOND. 1110 00:46:44,560 --> 00:46:46,840 I MISSED A SLIDE AND THIS SLIDE, 1111 00:46:46,840 --> 00:46:49,280 DR. MICHAEL CHIANG THE DIRECTOR 1112 00:46:49,280 --> 00:46:54,280 OF THE NATIONAL EYE INSTITUTE 1113 00:46:54,280 --> 00:46:57,720 WILL HAVE HIS LAB ON THE NINTH 1114 00:46:57,720 --> 00:47:03,040 FLOOR BECAUSE PATTI HAS A 1115 00:47:03,040 --> 00:47:04,320 RESEARCH LAB IN THE NURSING 1116 00:47:04,320 --> 00:47:05,240 INSTITUTE AND DR. CHIANG IS 1117 00:47:05,240 --> 00:47:06,600 PUTTING HIS LAB IN OUR SPACE. 
1118 00:47:06,600 --> 00:47:12,520 WE'RE VERY EXCITED ABOUT THAT 1119 00:47:12,520 --> 00:47:18,160 BECAUSE HIS AREA IS ARTIFICIAL 1120 00:47:18,160 --> 00:47:20,400 INTELLIGENCE DIAGNOSIS AND 1121 00:47:20,400 --> 00:47:22,040 ELECTRONIC HEALTH RECORDS AND 1122 00:47:22,040 --> 00:47:23,800 THINKING ABOUT REAL WORLD CARE 1123 00:47:23,800 --> 00:47:26,560 FOR MEDICALLY UNDERSERVED AREAS 1124 00:47:26,560 --> 00:47:30,120 IS DEFINITELY RIGHT IN OUR SWEET 1125 00:47:30,120 --> 00:47:32,600 SPOT, SO TO SPEAK. 1126 00:47:32,600 --> 00:47:35,080 SO ON THIS FLOOR, THIS IS OUR 1127 00:47:35,080 --> 00:47:37,320 CONCEPT MAP FOR THE FLOOR. 1128 00:47:37,320 --> 00:47:38,040 RIGHT NOW THE CORNER THING 1129 00:47:38,040 --> 00:47:42,040 CALLED SD IS THE ONE PLACE I SIT 1130 00:47:42,040 --> 00:47:45,040 THERE ONCE A WEEK AND WE ARE 1131 00:47:45,040 --> 00:47:47,720 BEGINNING TO GET ROOM NUMBERS 1132 00:47:47,720 --> 00:47:49,120 FOR THE REST. 1133 00:47:49,120 --> 00:47:52,160 WE HAVE ABOUT EIGHT OFFICES AND 1134 00:47:52,160 --> 00:47:56,120 34 OR 40 CUBICLES ON THE FLOOR. 1135 00:47:56,120 --> 00:47:57,720 WE ENVISION THIS KIND OF MIX. 1136 00:47:57,720 --> 00:48:02,080 NOT EXACTLY WHAT YOU SEE BUT 1137 00:48:02,080 --> 00:48:03,920 SOMETHING LIKE THIS. 1138 00:48:03,920 --> 00:48:06,120 WE'LL HAVE PEOPLE FROM THE 1139 00:48:06,120 --> 00:48:09,920 COMPUTATIONAL BIOLOGY BRANCH AND 1140 00:48:09,920 --> 00:48:12,000 PEOPLE FROM THE COMPUTATIONAL 1141 00:48:12,000 --> 00:48:13,440 RESEARCH BRANCH AND THEY MAY SIT IN 1142 00:48:13,440 --> 00:48:17,520 CLUMPS OR MAY MIX. 1143 00:48:17,520 --> 00:48:19,480 WE KNOW WE CAN FOSTER CROSS-TALK 1144 00:48:19,480 --> 00:48:21,920 THIS WAY EASIER THAN PEOPLE 1145 00:48:21,920 --> 00:48:27,920 SEARCHING FOR PEOPLE. 1146 00:48:27,920 --> 00:48:29,800 WE'LL HAVE A NEW INVESTIGATOR IN 1147 00:48:29,800 --> 00:48:31,880 THE COMPUTATIONAL HEALTH 1148 00:48:31,880 --> 00:48:33,440 RESEARCH BRANCH IN JUNE. 
1149 00:48:33,440 --> 00:48:42,280 HE'LL BE LOCATED UP HERE TOO. 1150 00:48:42,280 --> 00:48:44,560 WE'RE STARTING TO MOVE IN A 1151 00:48:44,560 --> 00:48:46,680 GROUP FROM THE COMPUTATIONAL 1152 00:48:46,680 --> 00:48:48,720 HEALTH RESEARCH BRANCH AS SOON 1153 00:48:48,720 --> 00:48:50,440 AS WE GET ROOM NUMBERS SO THEY 1154 00:48:50,440 --> 00:48:52,120 CAN PUT IT IN THEIR ADDRESSES. 1155 00:48:52,120 --> 00:48:55,440 THAT'S REALLY EVERYTHING I 1156 00:48:55,440 --> 00:48:56,640 WANTED TO TALK ABOUT WITH YOU. 1157 00:48:56,640 --> 00:48:58,560 I'M HAPPY TO ANSWER QUESTIONS IF 1158 00:48:58,560 --> 00:49:01,480 YOU HAVE THEM OR TAKE YOUR 1159 00:49:01,480 --> 00:49:03,640 SUGGESTIONS IF YOU WANT TO GIVE 1160 00:49:03,640 --> 00:49:07,520 THEM TO US ABOUT THE BEST WAYS. 1161 00:49:07,520 --> 00:49:12,040 THIS IS A CHALLENGING THING. 1162 00:49:12,040 --> 00:49:13,920 PATTI TOLD YOU ABOUT UNITE AND REEP. 1163 00:49:13,920 --> 00:49:17,440 IT'S A CHALLENGING THING TO TRY 1164 00:49:17,440 --> 00:49:19,560 AND DRAW PEOPLE INTO A FIELD 1165 00:49:19,560 --> 00:49:23,080 WHEN WE HAVE TO REACH HARDER FOR 1166 00:49:23,080 --> 00:49:23,800 THEM THAN IN THE PAST SO WE'RE 1167 00:49:23,800 --> 00:49:32,120 TRYING. 1168 00:49:32,120 --> 00:49:34,560 OKAY. 1169 00:49:34,560 --> 00:49:38,360 ANY QUESTIONS? 1170 00:49:38,360 --> 00:49:41,360 >> THANK YOU VERY MUCH AND GOOD 1171 00:49:41,360 --> 00:49:42,560 TO SEE YOU. 1172 00:49:42,560 --> 00:49:46,520 I AGREE WITH YOU WHOLEHEARTEDLY. 1173 00:49:46,520 --> 00:49:48,160 I'M EXCITED ABOUT WHAT YOU'RE 1174 00:49:48,160 --> 00:49:50,200 DOING TO DIVERSIFY THE PIPELINE. 1175 00:49:50,200 --> 00:49:53,760 IT'S DEFINITELY DIFFICULT TO 1176 00:49:53,760 --> 00:49:54,720 DIVERSIFY WHEN THE PIPELINE 1177 00:49:54,720 --> 00:50:01,280 ISN'T THAT DIVERSE. 1178 00:50:01,280 --> 00:50:07,720 I'M CURIOUS WHAT THE PLANS ARE 1179 00:50:07,720 --> 00:50:09,360 TO INCREASE THE VISIBILITY 1180 00:50:09,360 --> 00:50:11,000 BEYOND THE HBCUs. 
1181 00:50:11,000 --> 00:50:15,240 EVERYBODY MAY HAVE CANDIDATES. 1182 00:50:15,240 --> 00:50:18,040 >> AND WE STARTED WITH A 1183 00:50:18,040 --> 00:50:18,600 TARGETED FOCUS. 1184 00:50:18,600 --> 00:50:24,960 AFTER ALL, IT WAS OUR FIRST 1185 00:50:24,960 --> 00:50:26,520 TIME. 1186 00:50:26,520 --> 00:50:28,000 THERE ARE MANY MINORITY SERVING 1187 00:50:28,000 --> 00:50:29,120 INSTITUTIONS AND OTHER 1188 00:50:29,120 --> 00:50:29,880 INSTITUTIONS. 1189 00:50:29,880 --> 00:50:35,720 WE'LL DEFINITELY BE BROADER IN 1190 00:50:35,720 --> 00:50:37,640 OUR SCOPE. I DON'T KNOW IF 1191 00:50:37,640 --> 00:50:40,000 VIRGINIA WANTS TO COMMENT ON 1192 00:50:40,000 --> 00:50:40,320 THAT. 1193 00:50:40,320 --> 00:50:42,440 VIRGINIA IS THE PERSON WHO 1194 00:50:42,440 --> 00:50:46,520 LAUNCHED THIS FOR US. 1195 00:50:46,520 --> 00:50:48,120 DO YOU WANT TO SAY MORE ABOUT 1196 00:50:48,120 --> 00:50:56,720 HOW WE CAN REACH OUT FURTHER? 1197 00:50:56,720 --> 00:50:57,680 VALERIE MENTIONED CONFERENCES 1198 00:50:57,680 --> 00:50:58,880 BUT WE HAVE SEVERAL OTHERS NOW 1199 00:50:58,880 --> 00:51:06,600 WE CAN ATTEND THOSE IN PERSON. 1200 00:51:06,600 --> 00:51:09,440 WE'RE HOPING TO SPREAD THE WORD 1201 00:51:09,440 --> 00:51:11,760 AND THERE ARE SO MANY INSTITUTIONS 1202 00:51:11,760 --> 00:51:14,520 WITHIN A 10-MILE RADIUS OF NIH 1203 00:51:14,520 --> 00:51:19,800 AND I THINK BY BRINGING PEOPLE 1204 00:51:19,800 --> 00:51:22,360 IN WHO DON'T HAVE TO WORRY 1205 00:51:22,360 --> 00:51:24,680 ABOUT MOVING OR THINGS THAT ARE 1206 00:51:24,680 --> 00:51:27,440 STRESSFUL TO TAKE AN INTERNSHIP 1207 00:51:27,440 --> 00:51:32,680 WILL HELP GET MORE PEOPLE TO USE 1208 00:51:32,680 --> 00:51:34,520 THE RESOURCES IN THEIR OWN LOCAL 1209 00:51:34,520 --> 00:51:35,000 AREA. 1210 00:51:35,000 --> 00:51:38,520 >> OKAY. 1211 00:51:38,520 --> 00:51:40,320 THAT'S A GOOD POINT. 
1212 00:51:40,320 --> 00:51:41,720 IT WOULD BE GOOD TO COLLECT THE 1213 00:51:41,720 --> 00:51:43,000 RESOURCES AND MAKE THEIR 1214 00:51:43,000 --> 00:51:43,720 AWARENESS A BROADER THING. 1215 00:51:43,720 --> 00:51:46,240 I AGREE IT MAY BE HARDER OR 1216 00:51:46,240 --> 00:51:55,320 DISCOURAGING FOR PEOPLE. 1217 00:51:55,320 --> 00:51:57,000 WE HAVE THE WEBSITE AND 1218 00:51:57,000 --> 00:51:58,160 COMMUNICATION STAFF. 1219 00:51:58,160 --> 00:52:00,680 WE STARTED IN THE MOST FOCUSSED 1220 00:52:00,680 --> 00:52:02,800 WAY AND WE'LL BE LEARNING SOMETHING 1221 00:52:02,800 --> 00:52:04,600 THIS YEAR TOO ABOUT WHAT ELSE WE 1222 00:52:04,600 --> 00:52:06,520 NEED TO DO TO MAKE IT A GOOD 1223 00:52:06,520 --> 00:52:13,680 EXPERIENCE FOR THEM AS WELL. 1224 00:52:13,680 --> 00:52:17,560 YOU MAY BE ABLE TO LEVERAGE THE 1225 00:52:17,560 --> 00:52:17,920 VIDEO. 1226 00:52:17,920 --> 00:52:26,360 >> WE HAVE AWESOME VIDEOS. 1227 00:52:26,360 --> 00:52:27,880 THAT'S ONE OF THE GREAT THINGS 1228 00:52:27,880 --> 00:52:29,840 THE COMMUNICATION PEOPLE HAVE 1229 00:52:29,840 --> 00:52:35,920 DONE AND WE HAVE A BUNCH OF 1230 00:52:35,920 --> 00:52:41,200 SLIDES TO TRY TO BRING THE 1231 00:52:41,200 --> 00:52:44,240 LANGUAGE TO PEOPLE WHO AREN'T 1232 00:52:44,240 --> 00:52:59,160 Ph.D.s IN PHYSICS. 1233 00:52:59,160 --> 00:53:00,840 >> I'M A FIRM BELIEVER YOU NEED 1234 00:53:00,840 --> 00:53:07,640 TO REACH DOWN AND TRAIN EARLIER. 1235 00:53:07,640 --> 00:53:19,600 IDEALLY, THERE'S NOT UNLIMITED 1236 00:53:19,600 --> 00:53:19,880 RESOURCES. 1237 00:53:19,880 --> 00:53:27,720 I HAVE THE HONOR OF RUNNING THE 1238 00:53:27,720 --> 00:53:30,840 PROGRAM AS A DIVERSITY 1239 00:53:30,840 --> 00:53:31,280 SUPPLEMENT. 1240 00:53:31,280 --> 00:53:33,200 THERE'S A LOT I HAVE LEARNED I 1241 00:53:33,200 --> 00:53:35,280 CAN SHARE. 
1242 00:53:35,280 --> 00:53:38,280 WE HAVE RIGHT NOW NINE DIVERSITY 1243 00:53:38,280 --> 00:53:40,600 STUDENTS FROM ACROSS THE NATION 1244 00:53:40,600 --> 00:53:45,240 EVEN FROM HAWAI'I. 1245 00:53:45,240 --> 00:53:46,960 AND WE HAVE A DISTANCE ONE. 1246 00:53:46,960 --> 00:53:47,720 IT'S ALL REMOTE. 1247 00:53:47,720 --> 00:53:49,960 I THOUGHT IT WAS NOT GOING TO 1248 00:53:49,960 --> 00:53:51,720 WORK BUT IT'S WORKED WELL. 1249 00:53:51,720 --> 00:53:52,200 ONE IMPORTANT THING IS 1250 00:53:52,200 --> 00:53:55,680 MENTORSHIP. 1251 00:53:55,680 --> 00:53:58,720 I HAVE A STAFF SCIENTIST MENTOR 1252 00:53:58,720 --> 00:54:00,080 TWO STUDENTS AND THEY MEET WITH 1253 00:54:00,080 --> 00:54:03,880 THEM EVERY WEEK AND THEY HAVE 1254 00:54:03,880 --> 00:54:05,120 THEIR OWN RESEARCH PROJECT AND 1255 00:54:05,120 --> 00:54:05,920 SO ON. 1256 00:54:05,920 --> 00:54:07,800 THERE ARE SEVERAL THINGS I THINK 1257 00:54:07,800 --> 00:54:10,240 HAVE WORKED WELL AND THEY'LL BE 1258 00:54:10,240 --> 00:54:10,960 PUBLISHING THEIR PAPERS AT THE 1259 00:54:10,960 --> 00:54:11,920 END OF THIS EXPERIENCE WHICH IS 1260 00:54:11,920 --> 00:54:16,960 IN JUNE OR SO. 1261 00:54:16,960 --> 00:54:18,760 IT'S DEFINITELY THE WAY TO GO. 1262 00:54:18,760 --> 00:54:23,440 THEY NEED THE PUSH AND WE HAVE 1263 00:54:23,440 --> 00:54:25,880 UNDERGRADUATES AND I HAVE SOME 1264 00:54:25,880 --> 00:54:26,520 MASTER'S STUDENTS. 1265 00:54:26,520 --> 00:54:28,120 ALL CONSIDERING FURTHER STUDYING 1266 00:54:28,120 --> 00:54:32,320 AND GOING FOR Ph.D.s AND SO ON. 1267 00:54:32,320 --> 00:54:35,000 I SEE THE EFFECT AND IF THE NLM 1268 00:54:35,000 --> 00:54:38,520 WERE TO DO IT DIRECTLY IT WOULD 1269 00:54:38,520 --> 00:54:44,720 BE AN EXCELLENT PROGRAM. 
1270 00:54:44,720 --> 00:54:47,320 AND COMING FROM COMPUTER 1271 00:54:47,320 --> 00:54:50,040 SCIENCE, THERE'S A LARGE INFLUX 1272 00:54:50,040 --> 00:54:55,880 OF PEOPLE PARTICIPATING IN 1273 00:54:55,880 --> 00:54:58,440 PROGRAMMING CONTESTS OR ACM, 1274 00:54:58,440 --> 00:55:00,160 ASSOCIATION FOR COMPUTING 1275 00:55:00,160 --> 00:55:02,520 MACHINERY. EVERY YEAR THEY 1276 00:55:02,520 --> 00:55:05,120 ORGANIZE A DIFFERENT CONFERENCE 1277 00:55:05,120 --> 00:55:06,320 AND STUDENTS PARTICIPATE AND 1278 00:55:06,320 --> 00:55:07,400 THERE'S A NATIONAL WINNER. 1279 00:55:07,400 --> 00:55:14,240 THE WINNER FROM EACH OF THE 1280 00:55:14,240 --> 00:55:14,800 INDIVIDUAL CONFERENCES AND 1281 00:55:14,800 --> 00:55:15,880 STUDENTS COMPETE AND PRESENT 1282 00:55:15,880 --> 00:55:18,200 THEIR POSTERS AND PROJECTS AND 1283 00:55:18,200 --> 00:55:20,480 WIN AND THEN THEY COMPETE AT THE 1284 00:55:20,480 --> 00:55:21,680 NATIONAL LEVEL THE WINNERS OF 1285 00:55:21,680 --> 00:55:23,760 EACH OF THE CONFERENCES. 1286 00:55:23,760 --> 00:55:26,520 SO IT'S A MECHANISM THAT HAS 1287 00:55:26,520 --> 00:55:32,520 WORKED AND COULD BRING IN VERY 1288 00:55:32,520 --> 00:55:35,600 TALENTED PEOPLE AND COULD BE AN 1289 00:55:35,600 --> 00:55:37,920 EXTRA MOTIVATION FOR THEM. 1290 00:55:37,920 --> 00:55:39,560 >> THANK YOU. 1291 00:55:39,560 --> 00:55:41,360 WE TAKE COPIOUS NOTES. 1292 00:55:41,360 --> 00:55:44,720 >> I KNOW IT'S RECORDED. 1293 00:55:44,720 --> 00:55:49,120 IT'S ALL SPONSORED BY THE NLM. 1294 00:55:49,120 --> 00:55:50,960 >> VALERIE, IF YOU HAVE AN 1295 00:55:50,960 --> 00:55:55,800 OPPORTUNITY TO TALK ABOUT THIS 1296 00:55:55,800 --> 00:56:06,680 THIS AFTERNOON. 1297 00:56:06,680 --> 00:56:10,560 >> DR. JIANG, GO AHEAD AND USE 1298 00:56:10,560 --> 00:56:12,800 THE ORIGINAL 30 MINUTES YOU HAD 1299 00:56:12,800 --> 00:56:14,560 AND WE'LL SHIFT THINGS BY FIVE 1300 00:56:14,560 --> 00:56:29,760 MINUTES. 1301 00:56:29,760 --> 00:56:32,320 YOU ARE MUTED. 
1302 00:56:32,320 --> 00:56:38,960 >> I'M DR. JIANG AND I JOINED IN 1303 00:56:38,960 --> 00:56:42,280 2019 AND TODAY I'LL BE 1304 00:56:42,280 --> 00:56:46,520 PRESENTING MY RESEARCH ON 1305 00:56:46,520 --> 00:56:47,680 COMPARATIVE GENOMIC ANALYSIS OF 1306 00:56:47,680 --> 00:56:53,800 MICROBIOME AND SARS-COV-2. 1307 00:56:53,800 --> 00:56:55,320 WE HAVE A MASSIVE AMOUNT OF DATA 1308 00:56:55,320 --> 00:57:06,520 GENERATED. 1309 00:57:06,520 --> 00:57:11,720 WE'RE TAKING ACTION. 1310 00:57:11,720 --> 00:57:16,680 EVEN THOUGH WE HAVE GENOMIC 1311 00:57:16,680 --> 00:57:20,640 SEQUENCE WE DON'T KNOW MUCH ABOUT 1312 00:57:20,640 --> 00:57:21,920 THE FUNCTIONS THE BACTERIA 1313 00:57:21,920 --> 00:57:22,160 PERFORM. 1314 00:57:22,160 --> 00:57:26,800 MY LAB IS WORKING TO ADDRESS THE 1315 00:57:26,800 --> 00:57:29,400 FUNCTION IN BACTERIA BY LOOKING 1316 00:57:29,400 --> 00:57:30,840 AT COMPARATIVE GENOMICS. 1317 00:57:30,840 --> 00:57:35,760 THE GOAL IS TO IMPROVE OUR 1318 00:57:35,760 --> 00:57:40,840 UNDERSTANDING OF THE MICROBIOME 1319 00:57:40,840 --> 00:57:41,760 USING COMPUTATIONAL 1320 00:57:41,760 --> 00:57:50,520 APPROACHES. 1321 00:57:50,520 --> 00:58:03,520 I HAVE FOUR POST-DOCS IN MY LAB. 1322 00:58:03,520 --> 00:58:22,560 WE WROTE MANUSCRIPTS. 1323 00:58:22,560 --> 00:58:28,480 WE HAVE PROJECTS AND MY FOCUS IS 1324 00:58:28,480 --> 00:58:32,320 ON ANALYSIS AND LOOKING AT 1325 00:58:32,320 --> 00:58:34,560 COVID-19 PANDEMIC. 1326 00:58:34,560 --> 00:58:39,120 WE SHIFTED OUR EFFORT TO 1327 00:58:39,120 --> 00:58:39,360 COVID-19. 1328 00:58:39,360 --> 00:58:41,760 I'LL FOCUS ON THE THREE PROJECTS 1329 00:58:41,760 --> 00:58:56,120 HIGHLIGHTED HERE. 1330 00:58:56,120 --> 00:59:00,200 WE HAVE A TOOL TO CATEGORIZE THE 1331 00:59:00,200 --> 00:59:02,200 PRESENCE ACROSS THE GENOMES. 1332 00:59:02,200 --> 00:59:05,520 THE GRAPH SHOWS AN OVERVIEW OF 1333 00:59:05,520 --> 00:59:11,800 THE FUNCTION PROFILE FRAMEWORK. 
1334 00:59:11,800 --> 00:59:16,920 FIRST WE KNOW THIS CONTRIBUTES TO 1335 00:59:16,920 --> 00:59:17,560 HEALTH. 1336 00:59:17,560 --> 00:59:26,560 AND WE LOOKED AT OUR DATABASE. 1337 00:59:26,560 --> 00:59:29,800 WE TRANSMITTED THE GENE TO 1338 00:59:29,800 --> 00:59:37,200 MEASURE COMPUTATIONALLY. 1339 00:59:37,200 --> 00:59:42,560 WE RETRIEVED THEM FROM PUBLIC 1340 00:59:42,560 --> 00:59:47,600 DATABASES AND LOOKED AT GENES. 1341 00:59:47,600 --> 00:59:49,240 WE ALSO NEED TO CURATE THE 1342 00:59:49,240 --> 00:59:53,760 FUNCTION AND PATHWAYS TO ENSURE 1343 00:59:53,760 --> 01:00:01,920 THE ACCURACY. 1344 01:00:01,920 --> 01:00:08,040 WE HAVE FUNCTIONAL BACTERIA. 1345 01:00:08,040 --> 01:00:12,560 AND WE LOOK AT THE FUNCTION 1346 01:00:12,560 --> 01:00:14,520 PROFILE AND LOOKED AT THE GENE 1347 01:00:14,520 --> 01:00:22,520 AND PATHWAY IS PRESENT AND WHAT 1348 01:00:22,520 --> 01:00:27,000 IS THE ASSOCIATION TO DISEASE. 1349 01:00:27,000 --> 01:00:34,520 THIS IS OUR INITIAL PREDICTION. 1350 01:00:34,520 --> 01:00:40,040 SOME LOOKED AT THE MICROBIOME. 1351 01:00:40,040 --> 01:00:46,520 THIS LOOKS AT THE PATHWAYS 1352 01:00:46,520 --> 01:00:53,000 AND WE CAN USE SYNTHESIS. 1353 01:00:53,000 --> 01:00:57,800 THESE ARE ONES WHERE WE KNOW THE 1354 01:00:57,800 --> 01:00:58,400 GENETIC BASIS AS WELL AS THE 1355 01:00:58,400 --> 01:01:15,920 MECHANISM. 1356 01:01:15,920 --> 01:01:17,040 HISTAMINE PLAYS AN IMPORTANT 1357 01:01:17,040 --> 01:01:19,000 ROLE AS A CHEMICAL MESSENGER IN 1358 01:01:19,000 --> 01:01:23,720 THE IMMUNE SYSTEM. 1359 01:01:23,720 --> 01:01:38,520 IT PRODUCES A RESPONSE. 1360 01:01:38,520 --> 01:01:45,680 AND THIS STUDY SHOWS IT 1361 01:01:45,680 --> 01:01:46,520 INCREASES THE ASSOCIATION WITH 1362 01:01:46,520 --> 01:01:55,640 DISEASE. 1363 01:01:55,640 --> 01:01:57,120 IT CAN AFFECT DISEASES SUCH AS 1364 01:01:57,120 --> 01:01:58,520 ASTHMA. 
1365 01:01:58,520 --> 01:02:00,720 HOWEVER, WE DON'T KNOW WHICH 1366 01:02:00,720 --> 01:02:06,520 BACTERIA ARE CAPABLE OF HISTAMINE 1367 01:02:06,520 --> 01:02:06,760 SECRETION. 1368 01:02:06,760 --> 01:02:08,360 AND WE FIRST NEEDED TO 1369 01:02:08,360 --> 01:02:19,160 UNDERSTAND THE MECHANISM. 1370 01:02:19,160 --> 01:02:23,120 HIS 1371 01:02:23,120 --> 01:02:28,120 HISTIDINE IS NEEDED TO PERFORM 1372 01:02:28,120 --> 01:02:33,760 THE FUNCTION AND THEY WORK 1373 01:02:33,760 --> 01:02:34,400 TOGETHER. 1374 01:02:34,400 --> 01:02:40,760 SO TWO TYPES OF THE 1375 01:02:40,760 --> 01:02:41,720 DECARBOXYLASE TURNED IT INTO A 1376 01:02:41,720 --> 01:02:53,160 FUNCTIONAL MODEL. 1377 01:02:53,160 --> 01:02:57,000 WE LOOKED AT THE SPECIES AND 1378 01:02:57,000 --> 01:03:02,520 THEY DISTRIBUTE AND EXPAND THE 1379 01:03:02,520 --> 01:03:11,760 HISTAMINE BACTERIA. 1380 01:03:11,760 --> 01:03:15,320 THIS HETEROGENEITY WAS OBSERVED 1381 01:03:15,320 --> 01:03:20,280 AND WE LOOKED AT DIFFERENT 1382 01:03:20,280 --> 01:03:21,400 CONFIGURATIONS ADDITIONAL GENES 1383 01:03:21,400 --> 01:03:26,440 AND COPIES OF THE SAME GENES. 1384 01:03:26,440 --> 01:03:34,720 AND THIS FORMS THE COMPONENT OF 1385 01:03:34,720 --> 01:03:45,400 THE SYSTEM AND IN THIS SYSTEM, 1386 01:03:45,400 --> 01:03:50,120 THE DECARBOXYLASE AND PART OF 1387 01:03:50,120 --> 01:03:52,880 THIS AND THIS IS PASSED IN AND 1388 01:03:52,880 --> 01:03:59,200 OUT OF THE CORRESPONDING AMINO 1389 01:03:59,200 --> 01:03:59,560 ACIDS. 1390 01:03:59,560 --> 01:04:17,600 NEXT WE LOOKED AT THE DATA 1391 01:04:17,600 --> 01:04:25,440 AND WE PERFORMED THE ANALYSIS. 
1392 01:04:25,440 --> 01:04:28,120 WE LOOKED AT THE HISTAMINES 1393 01:04:28,120 --> 01:04:30,520 SECRETED IN PATIENTS WITH 1394 01:04:30,520 --> 01:04:34,160 INFLAMMATORY BOWEL DISEASE AND 1395 01:04:34,160 --> 01:04:36,200 COLORECTAL CANCER AND THERE'S AN 1396 01:04:36,200 --> 01:04:41,760 ASSOCIATION BETWEEN BACTERIA AND 1397 01:04:41,760 --> 01:04:47,520 INFLAMMATORY BOWEL DISEASE WHEN WE 1398 01:04:47,520 --> 01:04:51,920 LOOKED AT IBD THERE'S NOT ONE 1399 01:04:51,920 --> 01:04:56,480 CONTRIBUTION ON THIS. 1400 01:04:56,480 --> 01:05:02,520 IN SUMMARY, WE LOOK AT THE 1401 01:05:02,520 --> 01:05:05,720 HISTAMINE-SECRETING BACTERIA AND 1402 01:05:05,720 --> 01:05:10,120 WE FOUND A HISTAMINE BACTERIA 1403 01:05:10,120 --> 01:05:17,440 WHICH MAN TESTED INTO THE PACKET 1404 01:05:17,440 --> 01:05:18,760 FOR IMMUNOLOGICAL DISEASE. 1405 01:05:18,760 --> 01:05:20,840 THIS WRAPS UP MY FIRST PROJECT. 1406 01:05:20,840 --> 01:05:25,040 NOW I'M GOING TO TALK ABOUT THE 1407 01:05:25,040 --> 01:05:28,320 SECOND PROJECT, BACTERIA ON THE 1408 01:05:28,320 --> 01:05:33,280 GI TRACT. 1409 01:05:33,280 --> 01:05:43,320 WE KNOW THE FUNCTIONS. 1410 01:05:43,320 --> 01:05:45,760 WE STAY TUNED OF STUDIED AND 1411 01:05:45,760 --> 01:05:52,440 TRIED TO FIND THE BACTERIA FUNGI 1412 01:05:52,440 --> 01:05:55,280 FOR ADAPTATION. 1413 01:05:55,280 --> 01:05:57,400 AND LOOKING AT THE GASTROINTESTINAL TRACT 1414 01:05:57,400 --> 01:05:58,520 AND WE WANT TO ADDRESS THE 1415 01:05:58,520 --> 01:06:08,280 FOLLOWING THREE QUESTIONS. 1416 01:06:08,280 --> 01:06:13,080 WHAT FUNCTION CLADES ARE THE 1417 01:06:13,080 --> 01:06:16,120 VERTEBRATE OF THE G.I. 
TRACT 1418 01:06:16,120 --> 01:06:18,120 ADAPTED CLADE AND WHAT GENE 1419 01:06:18,120 --> 01:06:23,440 FUNCTIONS MIGHT CONFER THIS 1420 01:06:23,440 --> 01:06:29,320 ADAPTATION AND WE LOOKED AT THE 1421 01:06:29,320 --> 01:06:32,440 BACTERIA AND THIS PRESENTS THE 1422 01:06:32,440 --> 01:06:38,160 TAXONOMY AND NEXT WE PERFORMED 1423 01:06:38,160 --> 01:06:39,720 THE ANNOTATION OF THE SPECIES 1424 01:06:39,720 --> 01:06:46,440 AND LOOKED AT THE 1425 01:06:46,440 --> 01:06:52,120 MICROBIA AND WE LOOKED AT THE 1426 01:06:52,120 --> 01:06:56,080 FIVE CATEGORIES. 1427 01:06:56,080 --> 01:07:05,200 AND WE PERFORMED THE 1428 01:07:05,200 --> 01:07:09,560 CONSTRUCTION USING THIS AND THIS 1429 01:07:09,560 --> 01:07:22,560 SHOWS THE DISTRIBUTION AND 1430 01:07:22,560 --> 01:07:29,800 PREDICTOR. 1431 01:07:29,800 --> 01:07:34,600 AND THIS IS LIKELY FROM AN 1432 01:07:34,600 --> 01:07:37,560 ANCESTOR AND THEY'RE IDENTIFIED 1433 01:07:37,560 --> 01:07:38,520 IN THE ACTION. 1434 01:07:38,520 --> 01:07:46,520 THEY'RE PREDICTED TO HAVE 1435 01:07:46,520 --> 01:07:49,680 ORIGINATED IN CLADES AND THIS IS 1436 01:07:49,680 --> 01:07:58,080 PREDICTED TO HAVE ORIGINATED. 1437 01:07:58,080 --> 01:07:59,800 NOW WE WANT TO INVESTIGATE WHAT 1438 01:07:59,800 --> 01:08:01,240 IS ASSOCIATED WITH THE PATIENT 1439 01:08:01,240 --> 01:08:03,520 AND THE G.I. TRACT. 1440 01:08:03,520 --> 01:08:10,560 WE ANNOTATED GENOMES OF SPECIES 1441 01:08:10,560 --> 01:08:13,840 AND LOOKED AT GROUPS. 1442 01:08:13,840 --> 01:08:17,840 WE PERFORMED THE REGRESSION 1443 01:08:17,840 --> 01:08:19,960 ANALYSIS WITH A FUNCTION TO LOOK 1444 01:08:19,960 --> 01:08:23,720 AT THE GENETIC SIGNAL FROM THE 1445 01:08:23,720 --> 01:08:26,560 REGRESSION ANALYSIS. 1446 01:08:26,560 --> 01:08:29,840 AND THIS IS ONE EXAMPLE. 
1447 01:08:29,840 --> 01:08:35,840 SOME ARE CORRELATED WITH OUR 1448 01:08:35,840 --> 01:08:46,520 ADAPTATION AND WE LOOKED AT THE 1449 01:08:46,520 --> 01:08:48,520 ANALYSIS AND THIS IS ASSOCIATED 1450 01:08:48,520 --> 01:08:53,040 WITH A G.I. PATIENT. 1451 01:08:53,040 --> 01:08:54,520 AND THIS IS A MAJOR CONTRIBUTOR 1452 01:08:54,520 --> 01:08:58,520 TO HOST ADHERENCE. 1453 01:08:58,520 --> 01:09:02,520 THIS IS INTERESTING. 1454 01:09:02,520 --> 01:09:05,800 MANY ARE ASSOCIATED WITH THE 1455 01:09:05,800 --> 01:09:10,960 HABITAT AND THIS INCLUDES THE 1456 01:09:10,960 --> 01:09:15,520 GENE 1457 01:09:15,520 --> 01:09:26,480 GENES AND WE SHOWED WHAT IS 1458 01:09:26,480 --> 01:09:27,600 ASSOCIATED WITH THE HOST G.I. 1459 01:09:27,600 --> 01:09:32,880 TRACT. 1460 01:09:32,880 --> 01:09:38,680 THE GRAPH ON THE LEFT SHOWS IT'S 1461 01:09:38,680 --> 01:09:42,720 MAPPED TO THE PHYLOGENETIC TREE 1462 01:09:42,720 --> 01:09:44,520 AND WE LOOKED AT THE MODE. 1463 01:09:44,520 --> 01:09:46,720 THE GRAPH ON THE RIGHT SHOWED 1464 01:09:46,720 --> 01:09:52,520 THE PRESENCE OR ABSENCE OF 1465 01:09:52,520 --> 01:09:56,720 FLAGELLA GENES AND THEY'RE 1466 01:09:56,720 --> 01:09:57,040 PRESENT. 1467 01:09:57,040 --> 01:10:04,360 WE CAN SEE FLAGELLA GENES IN THE 1468 01:10:04,360 --> 01:10:04,920 FIRMICUTES. 1469 01:10:04,920 --> 01:10:10,440 WHEN WE COMPARED THE GRAPHS AND 1470 01:10:10,440 --> 01:10:19,720 IT GENERATES THE SYSTEM AND WE 1471 01:10:19,720 --> 01:10:24,440 CAN SPECULATE REASONS. 1472 01:10:24,440 --> 01:10:26,760 THE STUDY SHOWED THE BACTERIA IS 1473 01:10:26,760 --> 01:10:34,440 ANOTHER REASON. 1474 01:10:34,440 --> 01:10:37,520 IN THIS ONGOING PROJECT WE'RE 1475 01:10:37,520 --> 01:10:41,000 ABLE TO IDENTIFY THIS AND WE CAN 1476 01:10:41,000 --> 01:10:42,520 DETECT THE FUNCTION ASSOCIATED 1477 01:10:42,520 --> 01:10:47,160 WITH THE PATIENT AND THE G.I. 1478 01:10:47,160 --> 01:10:47,760 TRACT. 
1479 01:10:47,760 --> 01:10:51,640 WE HAVE FOUND IT'S FREQUENTLY 1480 01:10:51,640 --> 01:10:54,320 ASSOCIATED WITH THE FIRMICUTES. 1481 01:10:54,320 --> 01:10:58,920 IN THE NEXT STEPS WE'LL CHECK 1482 01:10:58,920 --> 01:11:05,720 FUNCTION ASSOCIATED WITH AND 1483 01:11:05,720 --> 01:11:16,520 FOCUS ON THE FORMATION AND WE 1484 01:11:16,520 --> 01:11:17,800 HAVE MORE DETAILED ANALYSIS ON 1485 01:11:17,800 --> 01:11:21,800 THE FUNCTIONAL INTEREST. 1486 01:11:21,800 --> 01:11:26,520 AND LOOKING TO UNDERSTAND THE 1487 01:11:26,520 --> 01:11:30,080 ADA 1488 01:11:30,080 --> 01:11:34,760 ADAPTATION OF THE G.I. TRACT AND 1489 01:11:34,760 --> 01:11:38,520 LOOK AT THE MICROBIOME AND THE 1490 01:11:38,520 --> 01:11:40,320 ACTION. 1491 01:11:40,320 --> 01:11:46,280 THE NEXT I WANT TO TALK ABOUT IS 1492 01:11:46,280 --> 01:11:48,840 THE RESEARCH DONE. 1493 01:11:48,840 --> 01:11:52,520 I HAD A WONDERFUL OPPORTUNITY 1494 01:11:52,520 --> 01:11:54,960 TO COLLABORATE BETWEEN INTRAMURAL 1495 01:11:54,960 --> 01:11:59,400 AND EXTRAMURAL RESEARCH AND I 1496 01:11:59,400 --> 01:12:04,280 WORKED WITH ARIZONA STATE 1497 01:12:04,280 --> 01:12:13,440 UNIVERSITY AND WE LOOK AT THIS 1498 01:12:13,440 --> 01:12:16,040 FROM APRIL 7 TO JUNE 16. 1499 01:12:16,040 --> 01:12:18,360 WE DEMONSTRATED THE POTENTIAL 1500 01:12:18,360 --> 01:12:24,200 FOR WASTEWATER EPIDEMIOLOGY AND 1501 01:12:24,200 --> 01:12:38,520 SHOWING THIS AND LOOK AT WATER 1502 01:12:38,520 --> 01:12:41,280 COLLECTION SIZE AND WASTEWATER 1503 01:12:41,280 --> 01:12:46,200 SAMPLE AND WE WANTED TO ANALYZE 1504 01:12:46,200 --> 01:12:54,520 THE DATA. 1505 01:12:54,520 --> 01:13:10,800 AND THIS IS PAIRS OF THE GENOME. 1506 01:13:10,800 --> 01:13:13,320 AND THIS MEANS THERE'S GENOMES. 1507 01:13:13,320 --> 01:13:21,080 THIS ALLOWS US TO AMPLIFY THE 1508 01:13:21,080 --> 01:13:32,120 GENOMES FROM WASTEWATER. 
1509 01:13:32,120 --> 01:13:38,520 OUR GROUP CAN THEN BEGIN TO 1510 01:13:38,520 --> 01:13:45,120 BALANCE AND PRESENT THIS FROM MY 1511 01:13:45,120 --> 01:13:46,920 LAB. 1512 01:13:46,920 --> 01:13:52,440 THERE WAS NO EXISTENCE OF THIS 1513 01:13:52,440 --> 01:13:58,520 AND TO DETECT THIS WASTEWATER 1514 01:13:58,520 --> 01:13:59,920 SAMPLE WE LOOKED AT A NEW 1515 01:13:59,920 --> 01:14:01,120 PIPELINE. 1516 01:14:01,120 --> 01:14:06,040 THE NEXT THING WE'RE TRYING TO 1517 01:14:06,040 --> 01:14:17,840 DO IS TO BUILD OUR RESEARCH 1518 01:14:17,840 --> 01:14:22,800 SAMPLE AND WE HAVE THE PROCESS 1519 01:14:22,800 --> 01:14:27,160 WHICH HAS CHALLENGES. 1520 01:14:27,160 --> 01:14:33,800 FIRST, IT'S COMPOSED OF 345 1521 01:14:33,800 --> 01:14:38,520 AMPLICONS AND WE LOOKED AT 1522 01:14:38,520 --> 01:14:42,520 GENOMES. 1523 01:14:42,520 --> 01:14:47,880 ADDITIONALLY, SOME IMAGES HAD 1524 01:14:47,880 --> 01:14:50,720 MASS RESOLUTION. 1525 01:14:50,720 --> 01:14:56,520 AND THERE'S A POSSIBILITY THE 1526 01:14:56,520 --> 01:15:01,920 IMAGES WILL COME BACK AND MAKES 1527 01:15:01,920 --> 01:15:05,760 THIS DIFFICULT. 1528 01:15:05,760 --> 01:15:19,320 WE LOOKED AT WHAT IS SHOWN HERE. 1529 01:15:19,320 --> 01:15:26,120 AND THE COLOR OF THIS BRANCH 1530 01:15:26,120 --> 01:15:34,520 PRESENTS THE FREQUENCY AND WE 1531 01:15:34,520 --> 01:15:41,320 CAN GET THE MIXTURE. 1532 01:15:41,320 --> 01:15:43,320 NOW, WE HAVE ONE SAMPLE AND HOW 1533 01:15:43,320 --> 01:15:45,600 TO COMPARE ALL THE SAMPLES SO WE 1534 01:15:45,600 --> 01:15:51,440 CAN TRACK THE CHANGES OF WHERE A 1535 01:15:51,440 --> 01:15:53,760 POPULATION BETWEEN LOCATION AND 1536 01:15:53,760 --> 01:16:00,720 THE AUTOMATE ANALYSIS I TRIED TO 1537 01:16:00,720 --> 01:16:02,960 DO THIS AND THERE'S A NEW TYPE 1538 01:16:02,960 --> 01:16:09,920 OF METAGENOME DATA. 1539 01:16:09,920 --> 01:16:18,560 AND WE LOOK DIFFERENT 1540 01:16:18,560 --> 01:16:18,880 TRANSITION. 
1541 01:16:18,880 --> 01:16:23,840 HOWEVER, WE CANNOT COMBINE AND 1542 01:16:23,840 --> 01:16:27,720 THEREFORE THE METAGENOMICS BASED 1543 01:16:27,720 --> 01:16:29,840 ON COMPONENT OF COMPOSITION 1544 01:16:29,840 --> 01:16:36,920 CANNOT BE APPLIED HERE. 1545 01:16:36,920 --> 01:16:42,520 WE FIRST STARTED CALCULATING THE 1546 01:16:42,520 --> 01:16:47,240 DIFFERENCE BY APPLYING THIS AND 1547 01:16:47,240 --> 01:16:49,200 WE USE DATA AS A METRIC. 1548 01:16:49,200 --> 01:16:56,240 IN THIS WAY, WE CAN CALCULATE 1549 01:16:56,240 --> 01:17:02,520 THE WASTEWATER SAMPLES WE HAVE 1550 01:17:02,520 --> 01:17:11,840 AND WE ALSO WANT TO LOOK AT THE 1551 01:17:11,840 --> 01:17:16,680 SAME GRAPH. 1552 01:17:16,680 --> 01:17:25,800 THESE ARE SAMPLES FROM TEMPE, 1553 01:17:25,800 --> 01:17:26,680 ARIZONA. 1554 01:17:26,680 --> 01:17:31,440 THE SAMPLES HAVE DATA AND THE 1555 01:17:31,440 --> 01:17:32,280 LATEST DATA IS BLUE. 1556 01:17:32,280 --> 01:17:35,160 WE COLLECTED SAMPLES FROM APRIL 1557 01:17:35,160 --> 01:17:40,600 2021 TO JUNE 2021. 1558 01:17:40,600 --> 01:17:54,920 HERE, WE HAVE CONSISTENCY IN THE 1559 01:17:54,920 --> 01:17:55,360 SAMP 1560 01:17:55,360 --> 01:17:55,600 SAMPLE. 1561 01:17:55,600 --> 01:17:58,520 THEY CAN ALSO BE REFLECTED 1562 01:17:58,520 --> 01:18:03,040 IN THIS PLOT. 1563 01:18:03,040 --> 01:18:06,600 THERE ARE SAMPLES HERE AND CLOSE 1564 01:18:06,600 --> 01:18:09,400 TO 21A AND 21J. 1565 01:18:09,400 --> 01:18:15,200 TO TAKE A CLOSER LOOK AT THE 1566 01:18:15,200 --> 01:18:20,680 THREE SAMPLES, FIRST WE LOOKED AT 1567 01:18:20,680 --> 01:18:25,680 THE FREQUENCY IN THE WASTEWATER 1568 01:18:25,680 --> 01:18:33,200 SAMPLE AND THERE WAS A SHIFT OF 1569 01:18:33,200 --> 01:18:40,280 THE MOVE TO 21 AND LOOKED AT THE 1570 01:18:40,280 --> 01:18:40,520 SAMPLE. 
1571 01:18:40,520 --> 01:18:52,040 THIS SHIFT HAD VIRULENCE TO 1572 01:18:52,040 --> 01:18:53,520 TRANSMISSION AND CAN HELP MANAGE 1573 01:18:53,520 --> 01:19:01,280 THE RESPONSE TO THE VIRUS. 1574 01:19:01,280 --> 01:19:11,440 OVERALL, WE LOOKED AT THE IMAGES 1575 01:19:11,440 --> 01:19:15,600 AND IN THIS PROJECT THE GROUP 1576 01:19:15,600 --> 01:19:20,480 WE'RE CONTINUING TO WORK ON 1577 01:19:20,480 --> 01:19:25,680 SAMPLES THIS YEAR AND LOOKING AT 1578 01:19:25,680 --> 01:19:30,840 THE FRAMEWORK AND THE HOPE IS WE 1579 01:19:30,840 --> 01:19:33,560 COULD EXPAND IT TO METAGENOMIC 1580 01:19:33,560 --> 01:19:38,520 INSTANCE THAT ALLOWS FOR A 1581 01:19:38,520 --> 01:19:40,400 BROADER SPECTRUM PATHOGEN 1582 01:19:40,400 --> 01:19:45,320 DETECTION AND AVOID OUTBREAKS. 1583 01:19:45,320 --> 01:19:47,120 NOW I'M GOING TO TALK ABOUT MY 1584 01:19:47,120 --> 01:19:49,320 FUTURE PLAN. 1585 01:19:49,320 --> 01:19:51,520 THE FOCUS IS ON THE FEATURES OF 1586 01:19:51,520 --> 01:19:56,560 THE MICROBES. 1587 01:19:56,560 --> 01:20:00,120 WE'LL CONTINUE TO ADD TOOLS AND 1588 01:20:00,120 --> 01:20:02,360 IMPROVE THE EFFICIENCY. 1589 01:20:02,360 --> 01:20:05,480 IN ADDITION, WE'LL CONTINUE 1590 01:20:05,480 --> 01:20:06,760 INCORPORATING MORE FEATURES IN 1591 01:20:06,760 --> 01:20:11,320 THE PIPELINE. 1592 01:20:11,320 --> 01:20:13,240 IN ADDITION WE'LL TRY TO 1593 01:20:13,240 --> 01:20:14,120 ESTABLISH A COLLABORATION WITH 1594 01:20:14,120 --> 01:20:21,120 THE PREDICTION. 1595 01:20:21,120 --> 01:20:24,840 WE'RE COLLABORATING WITH THE 1596 01:20:24,840 --> 01:20:29,800 UNIVERSITY OF MARYLAND TO LOOK AT 1597 01:20:29,800 --> 01:20:33,840 BILIRUBIN REDUCTASE. 1598 01:20:33,840 --> 01:20:42,520 IT ALLOWS DATA TO BE RE-ABSORBED 1599 01:20:42,520 --> 01:20:49,560 AND THERE'S CERTAIN BACTERIA AND 1600 01:20:49,560 --> 01:20:52,640 MOLS WHICH CAN BE SAFELY 1601 01:20:52,640 --> 01:20:54,240 EXCRETED FROM THE BODY.
1602 01:20:54,240 --> 01:20:57,360 UNDERSTANDING WHICH BACTERIA ARE 1603 01:20:57,360 --> 01:21:02,480 ABLE TO PRODUCE BILIRUBIN CAN 1604 01:21:02,480 --> 01:21:04,840 CONTRIBUTE TO THE TREATMENT. 1605 01:21:04,840 --> 01:21:17,840 AND OUR FIRST WILL LOOKED TO 1606 01:21:17,840 --> 01:21:24,920 ESTABLISH THE HMP2 ASSOCIATED 1607 01:21:24,920 --> 01:21:32,640 WITH A REDUCTION POTENTIAL. 1608 01:21:32,640 --> 01:21:40,080 WE HAVE IDENTIFIED A WAY TO 1609 01:21:40,080 --> 01:21:44,000 REDUCE BILIRUBIN WITH CHA HAS 1610 01:21:44,000 --> 01:21:44,800 BEEN DEVELOPED. 1611 01:21:44,800 --> 01:21:52,560 WE CAN LOOK AT GENOMIC ANALYSIS 1612 01:21:52,560 --> 01:21:56,520 AND THE REDUCTASE GENES 1613 01:21:56,520 --> 01:22:00,680 CAN BE USED IN THE WHOLE LAB. 1614 01:22:00,680 --> 01:22:03,880 THIS IS OUR APPROACH AND WE'RE 1615 01:22:03,880 --> 01:22:15,680 CONSTANTLY REVISING AND 1616 01:22:15,680 --> 01:22:26,040 ADJUSTING OUR PLAN. 1617 01:22:26,040 --> 01:22:34,520 AND WE EXPLORED THIS AREA. 1618 01:22:34,520 --> 01:22:39,200 AND WE HAVE LOOKED AT THE USE 1619 01:22:39,200 --> 01:22:43,160 AND ARE ABLE TO WORK WITH MORE 1620 01:22:43,160 --> 01:22:43,800 COMPLEX DATA SETS. 1621 01:22:43,800 --> 01:22:48,720 THERE'S BEEN A KEY PATTERN OF 1622 01:22:48,720 --> 01:22:50,520 THE INVESTMENT AND THERE'S 1623 01:22:50,520 --> 01:22:55,720 PATIENT INFORMATION AND IT CAN 1624 01:22:55,720 --> 01:22:59,800 BE USED TO TRACK. 1625 01:22:59,800 --> 01:23:05,360 THIS CAN BE USED IN COMPARATIVE 1626 01:23:05,360 --> 01:23:07,520 GENOMICS AND THE FEATURE AND THE 1627 01:23:07,520 --> 01:23:11,600 GRAPH CAN BE USED TO LOOK AT 1628 01:23:11,600 --> 01:23:14,920 DIVERGENCE IN THE GENOME.
1629 01:23:14,920 --> 01:23:16,160 WE DESCRIBED HOW DATA CAN COME 1630 01:23:16,160 --> 01:23:20,720 BACK TO THE GENOMIC DATA TO MAP 1631 01:23:20,720 --> 01:23:22,440 GENOMES AND IDENTIFY THE 1632 01:23:22,440 --> 01:23:26,560 PRESENCE OR ABSENCE OF 1633 01:23:26,560 --> 01:23:38,000 SEQUENCING IN THE GENOME. 1634 01:23:38,000 --> 01:23:43,080 AND LOOKED AT GENES SUCH AS 1635 01:23:43,080 --> 01:23:57,680 ANTIBIOTIC RESISTANCE GENES AND 1636 01:23:57,680 --> 01:24:03,480 THERE'S POTENTIAL TO LOOK AT THE 1637 01:24:03,480 --> 01:24:08,520 SEQUENCES. 1638 01:24:08,520 --> 01:24:14,560 AND WE CAN LOOK AT THE RESOURCE 1639 01:24:14,560 --> 01:24:16,840 SO WE'RE HOPEFUL TO EXPAND THE 1640 01:24:16,840 --> 01:24:20,240 CATEGORY OF NON-MOBILE GENOMIC 1641 01:24:20,240 --> 01:24:20,600 ATOMS. 1642 01:24:20,600 --> 01:24:23,120 THIS WRAPS UP MY FUTURE RESEARCH 1643 01:24:23,120 --> 01:24:23,320 PLAN. 1644 01:24:23,320 --> 01:24:27,800 AT THE END OF MY TALK I'D LIKE TO 1645 01:24:27,800 --> 01:24:31,480 THANK MY POST-DOCS FROM MY LAB 1646 01:24:31,480 --> 01:24:36,360 AND MY COLLABORATORS AND 1647 01:24:36,360 --> 01:24:38,720 MY MENTOR 1648 01:24:38,720 --> 01:24:39,520 COMMUNITIES. 1649 01:24:39,520 --> 01:24:42,040 I'D LIKE TO THANK THE EXTRAMURAL 1650 01:24:42,040 --> 01:24:42,920 RESEARCH PROGRAM FOR THE STRONG 1651 01:24:42,920 --> 01:24:44,960 SUPPORT. 1652 01:24:44,960 --> 01:24:45,240 THANK YOU. 1653 01:24:45,240 --> 01:24:46,520 I'LL BE HAPPY TO TAKE ANY 1654 01:24:46,520 --> 01:24:56,840 QUESTIONS. 1655 01:24:56,840 --> 01:24:59,640 >> THANK YOU. 1656 01:24:59,640 --> 01:25:01,880 I ACTUALLY HAVE A NUMBER OF 1657 01:25:01,880 --> 01:25:03,120 QUESTIONS FROM THROUGHOUT THE 1658 01:25:03,120 --> 01:25:03,880 TALK.
1659 01:25:03,880 --> 01:25:05,120 FIRST, MY LAB INVENTED 1660 01:25:05,120 --> 01:25:09,360 COMPARATIVE GENOMICS AND I'M A 1661 01:25:09,360 --> 01:25:14,560 LITTLE CONFUSED HOW THE FIRST 1662 01:25:14,560 --> 01:25:16,800 PART OF YOUR TALK USED 1663 01:25:16,800 --> 01:25:23,840 COMPARATIVE GENOMICS. 1664 01:25:23,840 --> 01:25:24,520 >> THE FIRST PART? 1665 01:25:24,520 --> 01:25:27,000 >> WHEN YOU TALKED ABOUT THE 1666 01:25:27,000 --> 01:25:27,920 MICROBIOME AND GENOMIC. 1667 01:25:27,920 --> 01:25:29,840 I WANT TO GET A FEEL FOR WHAT 1668 01:25:29,840 --> 01:25:32,320 KIND OF COMPUTATION YOU WERE 1669 01:25:32,320 --> 01:25:38,560 USING, WHAT KIND OF METHODS, 1670 01:25:38,560 --> 01:25:40,520 THERE'S ALL SORTS OF PROBLEMS IN 1671 01:25:40,520 --> 01:25:44,480 MICROBIOMICS AND METAGENOMICS 1672 01:25:44,480 --> 01:25:45,120 AND DISTINGUISHING SPECIES AND 1673 01:25:45,120 --> 01:25:45,480 GENES. 1674 01:25:45,480 --> 01:25:50,520 I WANT TO GET A FEEL FOR SOME OF 1675 01:25:50,520 --> 01:25:51,960 THE COMPUTATION. 1676 01:25:51,960 --> 01:25:53,320 I HEARD A LOT OF INTERESTING 1677 01:25:53,320 --> 01:25:55,920 BIOLOGY RESULTS BUT WAS CONFUSED 1678 01:25:55,920 --> 01:25:58,240 WHY YOU WERE CALLING THIS 1679 01:25:58,240 --> 01:25:59,800 COMPARATIVE GENOMICS. 1680 01:25:59,800 --> 01:26:04,600 >> SO THE APPROACH I USED IS 1681 01:26:04,600 --> 01:26:06,120 BASED ON EXPERIMENTS I KNOW 1682 01:26:06,120 --> 01:26:10,600 WHICH GENES AND SPECIES AND MOST 1683 01:26:10,600 --> 01:26:14,600 TIME IS IN FUNCTION. 1684 01:26:14,600 --> 01:26:25,280 I RETRIEVED THOSE GENES AND WE 1685 01:26:25,280 --> 01:26:27,760 USED THAT TO BE AT A 1686 01:26:27,760 --> 01:26:28,240 MULTI-MODAL.
1687 01:26:28,240 --> 01:26:29,680 THAT'S THE MOST COMMON APPROACH 1688 01:26:29,680 --> 01:26:33,640 WE USED AND SCANNING THE GENOMES 1689 01:26:33,640 --> 01:26:38,120 OF OTHER BACTERIA SPECIES AND WE 1690 01:26:38,120 --> 01:26:43,520 STUDY THE GENOMIC CONTEXT AND 1691 01:26:43,520 --> 01:26:47,840 ASSOCIATE GENES TO THE FUNCTION 1692 01:26:47,840 --> 01:26:52,920 AND IN THIS WAY WE CAN TRANSFER 1693 01:26:52,920 --> 01:26:56,120 FROM ONE SPECIES TO ANOTHER 1694 01:26:56,120 --> 01:26:57,680 AND EXPAND THE CATEGORIES OF 1695 01:26:57,680 --> 01:27:01,920 BACTERIA WE KNOW ARE FUNCTIONAL. 1696 01:27:01,920 --> 01:27:03,760 MULTI-MODAL IS ONE COMMONLY USED 1697 01:27:03,760 --> 01:27:07,640 AND WE USED HHPRED AND OTHER 1698 01:27:07,640 --> 01:27:09,200 COMMON TOOLS TO DO THAT. 1699 01:27:09,200 --> 01:27:17,040 >> LOOK AT OUR MDBG PROGRAM. 1700 01:27:17,040 --> 01:27:18,600 IT'S IN CELL SYSTEMS AND THERE 1701 01:27:18,600 --> 01:27:22,240 WAS AN AWARD FOR THAT AND ME AND 1702 01:27:22,240 --> 01:27:26,840 RAYAN CHIKHI, FROM 2021. 1703 01:27:26,840 --> 01:27:30,480 IT GETS THE LARGEST PAN GENOME 1704 01:27:30,480 --> 01:27:34,240 ASSEMBLIES QUICKLY, ORDERS 1705 01:27:34,240 --> 01:27:35,560 OF MAGNITUDE FASTER WITH VERY 1706 01:27:35,560 --> 01:27:37,880 LITTLE MEMORY, TWO ORDERS OF 1707 01:27:37,880 --> 01:27:41,080 MAGNITUDE FASTER MEMORY AND IT'S 1708 01:27:41,080 --> 01:27:46,440 FOR LONG SEQUENCING DATA WHICH 1709 01:27:46,440 --> 01:27:49,800 IT SEEMS LIKE YOU HAD SOME OF 1710 01:27:49,800 --> 01:27:51,440 AND ASSEMBLED ALL THE METAGENOME 1711 01:27:51,440 --> 01:27:52,920 DATA WE COULD GET OUR HANDS ON. 1712 01:27:52,920 --> 01:27:56,320 THAT WOULD BE A GOOD TOOL. 1713 01:27:56,320 --> 01:27:59,520 NOW YOU MADE IT CLEAR TO ME, 1714 01:27:59,520 --> 01:27:59,840 GREAT. 1715 01:27:59,840 --> 01:28:05,280 WHAT KIND OF COMPUTATIONAL 1716 01:28:05,280 --> 01:28:07,760 TECHNIQUES YOU WERE USING.
1717 01:28:07,760 --> 01:28:10,600 I HAD ANOTHER QUESTION AND THIS 1718 01:28:10,600 --> 01:28:11,880 TELLS ME HOW YOU'RE 1719 01:28:11,880 --> 01:28:13,200 DISTINGUISHING SPECIES AND 1720 01:28:13,200 --> 01:28:14,600 GENES, BETTER, RIGHT? 1721 01:28:14,600 --> 01:28:17,880 THE COMPARATIVE GENOMICS. 1722 01:28:17,880 --> 01:28:21,360 OKAY. 1723 01:28:21,360 --> 01:28:23,720 I HAD ANOTHER QUESTION HOW YOU 1724 01:28:23,720 --> 01:28:26,600 WERE ANALYZING FUNCTION 1725 01:28:26,600 --> 01:28:29,680 COMPUTATIONALLY ONCE YOU DO 1726 01:28:29,680 --> 01:28:29,920 THIS. 1727 01:28:29,920 --> 01:28:31,600 >> DOES THAT QUESTION MEAN THAT 1728 01:28:31,600 --> 01:28:37,680 OUR COMPUTATION -- OKAY, I SEE. 1729 01:28:37,680 --> 01:28:40,240 SO GENERALLY I USED -- 1730 01:28:40,240 --> 01:28:42,520 >> HOW ARE YOU COMPUTING 1731 01:28:42,520 --> 01:28:42,800 ORTHOLOGS? 1732 01:28:42,800 --> 01:28:46,840 I KNOW YOU SAID HHPRED BUT 1733 01:28:46,840 --> 01:28:52,120 THERE ARE BETTER ORTHOLOG PROGRAMS 1734 01:28:52,120 --> 01:29:11,680 OUT THERE. 1735 01:29:11,680 --> 01:29:22,520 >> YOU LINKED THEM TO FUNCTION? 1736 01:29:22,520 --> 01:29:23,840 WHICH DATABASE? 1737 01:29:23,840 --> 01:29:29,040 I KNOW YOU SAID. 1738 01:29:29,040 --> 01:29:33,240 >> WE LOOKED AT THE ONES 1739 01:29:33,240 --> 01:29:35,520 COMMONLY USED BUT I'M NOT DOING 1740 01:29:35,520 --> 01:29:38,440 DATA ON LARGER SCALES BUT 1741 01:29:38,440 --> 01:29:41,360 ASSURING THE ACCURACY MORE 1742 01:29:41,360 --> 01:29:41,680 MANUAL. 1743 01:29:41,680 --> 01:29:53,480 I ALSO USE THOSE TOOLS BUT 1744 01:29:53,480 --> 01:30:06,280 WE ALSO GENERATE AND LOOK AT THE 1745 01:30:06,280 --> 01:30:13,800 PHYLOGENETIC TREES AND I FEEL WE 1746 01:30:13,800 --> 01:30:14,480 HAVE THE FUNCTION. 1747 01:30:14,480 --> 01:30:16,080 >> I GET THAT, GREAT.
1748 01:30:16,080 --> 01:30:22,520 IN TERMS OF WASTE WATER AND 1749 01:30:22,520 --> 01:30:31,040 COVID I GET YOUR CONTRIBUTION IS 1750 01:30:31,040 --> 01:30:35,760 A NEW CORRELATION METRIC FOR 1751 01:30:35,760 --> 01:30:36,320 THAT. 1752 01:30:36,320 --> 01:30:37,680 I UNDERSTOOD AS IT WENT ALONG 1753 01:30:37,680 --> 01:30:39,880 AND WAS WONDERING WHAT YOUR ROLE 1754 01:30:39,880 --> 01:30:41,680 WAS IN THE PROJECT BECAUSE 1755 01:30:41,680 --> 01:30:42,680 YOU'RE DOING COMPUTATIONAL 1756 01:30:42,680 --> 01:30:45,680 ANALYSIS AND YOU SAID THERE WERE 1757 01:30:45,680 --> 01:30:47,480 NO EXISTING BIOINFORMATIC 1758 01:30:47,480 --> 01:30:50,520 APPROACHES FOR DETECTING SNVs 1759 01:30:50,520 --> 01:30:54,680 AND WAS WONDERING IF YOU WERE 1760 01:30:54,680 --> 01:30:58,240 AWARE OF THE ERIC ALM LAB 1761 01:30:58,240 --> 01:30:59,360 WORK ON WASTE WATER COVID 1762 01:30:59,360 --> 01:31:00,320 DETECTION IN THE BOSTON AREA. 1763 01:31:00,320 --> 01:31:03,160 >> YES, I'M AWARE OF THAT. 1764 01:31:03,160 --> 01:31:11,080 HE WAS MY POST-DOC SUPERVISOR. 1765 01:31:11,080 --> 01:31:13,120 >> ERIC ALM WAS YOUR POST-DOC 1766 01:31:13,120 --> 01:31:13,480 SUPERVISOR? 1767 01:31:13,480 --> 01:31:14,320 >> YES. 1768 01:31:14,320 --> 01:31:18,520 >> OH, GOOD. 1769 01:31:18,520 --> 01:31:22,920 YOU COMPARED WITH PANGOLIN AND 1770 01:31:22,920 --> 01:31:24,920 I WAS WONDERING IF THAT WAS THE SPIKE 1771 01:31:24,920 --> 01:31:27,040 PROTEIN BECAUSE THAT -- IT'S 1772 01:31:27,040 --> 01:31:30,320 KNOWN TO BE THE CLOSEST HUMAN 1773 01:31:30,320 --> 01:31:31,400 SPECIES IN TERMS OF THE SPIKE 1774 01:31:31,400 --> 01:31:34,520 PROTEIN, NOT HUMAN, CLOSEST TO 1775 01:31:34,520 --> 01:31:39,560 THE HUMAN SPECIES FOR THE SPIKE 1777 01:31:40,080 --> 01:31:40,320 PROTEIN. 1778 01:31:40,320 --> 01:31:44,040 >> I THINK WE'RE TALKING ABOUT 1779 01:31:44,040 --> 01:31:51,600 DIFFERENT PANGOLINS HERE.
1780 01:31:51,600 --> 01:31:56,520 >> WE HAVE DIFFERENT SPECIES. 1781 01:31:56,520 --> 01:31:58,520 >> OH, MY GOSH. 1782 01:31:58,520 --> 01:32:02,520 THANK YOU. 1783 01:32:02,520 --> 01:32:07,320 I'M JUST CLARIFYING. 1784 01:32:07,320 --> 01:32:07,520 OKAY. 1785 01:32:07,520 --> 01:32:10,560 WHY IS IT IMPORTANT THERE'S 1786 01:32:10,560 --> 01:32:12,560 SINGLE NUCLEOTIDE VARIATIONS 1787 01:32:12,560 --> 01:32:17,840 ACROSS THE SAMPLE? 1788 01:32:17,840 --> 01:32:29,920 >> IT'S TO ASSIGN THE LINEAGES AND 1789 01:32:29,920 --> 01:32:33,040 HAVE DIFFERENT MUTATIONS. 1790 01:32:33,040 --> 01:32:34,320 >> IT'S KNOWN DIFFERENT VARIANTS 1791 01:32:34,320 --> 01:32:40,560 ARE GETTING MORE AND MORE 1792 01:32:40,560 --> 01:32:40,840 MUTATIONS. 1793 01:32:40,840 --> 01:32:54,360 COMBINATORIAL MUTATIONS. 1794 01:32:54,360 --> 01:32:58,520 WE CAN ACCESS VARIANTS IN THE 1795 01:32:58,520 --> 01:33:05,000 POPULATION. 1796 01:33:05,000 --> 01:33:06,600 IN TERMS OF THE GRAPHS I THINK 1797 01:33:06,600 --> 01:33:13,120 YOU SHOULD STILL LOOK AT OUR 1798 01:33:13,120 --> 01:33:16,600 MDBG PAPER. 1799 01:33:16,600 --> 01:33:19,840 IT'S MINIMIZER SPACE AND I THINK 1800 01:33:19,840 --> 01:33:25,680 YOUR PLAN FOR DE BRUIJN GRAPHS WILL 1801 01:33:25,680 --> 01:33:38,920 TAKE TOO MUCH TIME AND SPACE AND 1802 01:33:38,920 --> 01:33:42,440 NEW COMPUTATIONAL METHODS DO 1803 01:33:42,440 --> 01:33:43,840 YOU SEE DEVELOPING LIKE 1804 01:33:43,840 --> 01:33:48,880 FUNCTIONAL MODULES? 1805 01:33:48,880 --> 01:33:51,400 >> THERE'S TWO PATHS IT'S 1806 01:33:51,400 --> 01:34:00,040 IMPORTANT FOR DIFFERENT TOOLS. 1807 01:34:00,040 --> 01:34:06,760 AND WE WANT TO INCORPORATE AND 1808 01:34:06,760 --> 01:34:08,600 MAKE A PLATFORM SO NO MATTER 1809 01:34:08,600 --> 01:34:11,800 WHAT TOOLS WE USE TO SEARCH THE 1810 01:34:11,800 --> 01:34:14,560 FUNCTION YOU CAN LOOK AT THE 1811 01:34:14,560 --> 01:34:14,760 DATA. 1812 01:34:14,760 --> 01:34:16,000 THAT'S ONE PATH.
1813 01:34:16,000 --> 01:34:20,040 THE OTHER IS THE DATA PATH AND 1814 01:34:20,040 --> 01:34:23,360 POTENTIAL FILES OF THE 1815 01:34:23,360 --> 01:34:29,840 INFORMATION SO THAT AFTER WE 1816 01:34:29,840 --> 01:34:38,200 HAVE GENOMIC DATA YOU CAN USE 1817 01:34:38,200 --> 01:34:43,120 THE PROFILE TO SEARCH THE DATA 1818 01:34:43,120 --> 01:34:43,760 WITHIN YOUR OWN DATA AND HAVE THE 1819 01:34:43,760 --> 01:34:47,120 FUNCTIONS THERE. 1820 01:34:47,120 --> 01:34:47,680 >> OKAY. 1821 01:34:47,680 --> 01:34:53,960 THANK YOU FOR ANSWERING MY 1822 01:34:53,960 --> 01:34:54,280 QUESTION. 1824 01:34:54,680 --> 01:34:57,200 JESSE, PLEASE ASK YOUR QUESTION. 1825 01:34:57,200 --> 01:34:59,520 >> YES, IT'S EXCITING WORK 1826 01:34:59,520 --> 01:35:01,320 BEFORE I SWITCHED TO PUBLIC 1827 01:35:01,320 --> 01:35:03,120 HEALTH I WOULD HAVE TOTALLY 1828 01:35:03,120 --> 01:35:04,680 REACHED OUT TO COLLABORATE. 1829 01:35:04,680 --> 01:35:07,800 MAYBE LATER WE CAN TALK ABOUT 1830 01:35:07,800 --> 01:35:09,360 SOME OF THE TRANSLATIONAL 1831 01:35:09,360 --> 01:35:09,600 THINGS. 1832 01:35:09,600 --> 01:35:16,760 I WONDERED FOR THE 1834 01:35:18,320 --> 01:35:20,840 BIOINFORMATICS INFRASTRUCTURE 1835 01:35:20,840 --> 01:35:23,320 FOR COV BECAUSE THE WASTE WATER 1836 01:35:23,320 --> 01:35:25,320 IS GOING TO BE CRITICAL AS 1837 01:35:25,320 --> 01:35:29,560 PEOPLE GET TESTED LESS AND LESS. 1838 01:35:29,560 --> 01:35:31,720 WHAT DISSEMINATION HAVE YOU DONE 1839 01:35:31,720 --> 01:35:32,320 ON THAT? 1840 01:35:32,320 --> 01:35:34,000 I'M SURE YOU'RE PUBLISHING ON IT 1841 01:35:34,000 --> 01:35:37,040 BUT THESE DAYS I WORK WITH STATE 1842 01:35:37,040 --> 01:35:38,200 LABS AND PUBLIC HEALTH 1843 01:35:38,200 --> 01:35:39,800 DEPARTMENTS AND I THINK THEY 1844 01:35:39,800 --> 01:35:43,080 WOULD FIND THE TOOLS USEFUL.
1845 01:35:43,080 --> 01:35:45,160 I WANT TO MAKE SURE YOU MAKE IT 1846 01:35:45,160 --> 01:35:46,520 AVAILABLE OR KNOWN ABOUT IN THE 1847 01:35:46,520 --> 01:36:02,520 OTHER AVENUES. 1848 01:36:02,520 --> 01:36:08,920 >> I MAKE THE DATA PUBLIC FOR 1849 01:36:08,920 --> 01:36:10,240 PRESENTATION. 1850 01:36:10,240 --> 01:36:10,640 >> OKAY. 1851 01:36:10,640 --> 01:36:11,680 BEEN A LITTLE BUSY. 1852 01:36:11,680 --> 01:36:13,480 I CAN TRY TO POINT PEOPLE IN 1853 01:36:13,480 --> 01:36:13,840 THAT DIRECTION. 1854 01:36:13,840 --> 01:36:14,120 >> AWESOME. 1855 01:36:14,120 --> 01:36:28,240 THANK YOU. 1856 01:36:28,240 --> 01:36:29,840 I HAVE A COUPLE OTHERS IF OTHERS 1857 01:36:29,840 --> 01:36:34,080 DON'T HAVE QUESTIONS. 1858 01:36:34,080 --> 01:36:35,520 >> LOOKS LIKE GRACIELA HAS A 1859 01:36:35,520 --> 01:36:45,800 QUESTION. 1860 01:36:45,800 --> 01:36:52,080 >> GREAT WORK, CONGRATULATIONS. 1861 01:36:52,080 --> 01:36:54,400 I WAS WONDERING WHETHER YOU HAVE 1862 01:36:54,400 --> 01:36:58,520 AN INTERFACE OR A WAY TO MAKE 1863 01:36:58,520 --> 01:36:59,920 THE FINDINGS AVAILABLE. 1864 01:36:59,920 --> 01:37:01,120 I DIDN'T CATCH IT IN YOUR 1865 01:37:01,120 --> 01:37:03,560 PRESENTATION SO I WANTED TO MAKE 1866 01:37:03,560 --> 01:37:05,240 SURE WHETHER YOU HAD THAT TO THE 1867 01:37:05,240 --> 01:37:07,000 PUBLIC OR TO OTHER SCIENTISTS. 1868 01:37:07,000 --> 01:37:08,160 I GUESS THERE ARE TWO LEVELS HERE 1869 01:37:08,160 --> 01:37:14,000 I WAS CURIOUS ABOUT. 1870 01:37:14,000 --> 01:37:15,680 >> THIS IS A COLLABORATIVE 1871 01:37:15,680 --> 01:37:29,520 PROJECT. 1872 01:37:29,520 --> 01:37:31,840 AND DISPLAYED THE DATA AND WE'RE 1873 01:37:31,840 --> 01:37:34,560 WORKING ON THAT PATH. 1874 01:37:34,560 --> 01:37:47,160 >> ALL RIGHT.
1875 01:37:47,160 --> 01:37:49,760 >> I THOUGHT THE WORK YOU 1876 01:37:49,760 --> 01:37:54,800 DESCRIBED SWITCHING TO THE COVID 1877 01:37:54,800 --> 01:37:59,600 RELATED RESEARCH WAS AN EXAMPLE 1878 01:37:59,600 --> 01:38:01,040 OF RAPID PIVOTING ADDRESSING 1879 01:38:01,040 --> 01:38:01,720 IMPORTANT QUESTIONS. 1880 01:38:01,720 --> 01:38:05,040 I THOUGHT THAT WAS VERY 1881 01:38:05,040 --> 01:38:05,400 INSPIRING. 1882 01:38:05,400 --> 01:38:09,520 I HAD A QUESTION ON THE SARS 1883 01:38:09,520 --> 01:38:10,320 COV2 RESEARCH. 1884 01:38:10,320 --> 01:38:14,640 YOU NOTED CHIMERIC SARS COV2 1885 01:38:14,640 --> 01:38:15,560 HOST RNA. 1886 01:38:15,560 --> 01:38:17,280 I'M WONDERING IF YOU CAN COMMENT 1887 01:38:17,280 --> 01:38:18,680 BRIEFLY MORE ON THAT IN TERMS OF 1888 01:38:18,680 --> 01:38:20,040 WHAT YOU THINK THE FREQUENCY OF 1889 01:38:20,040 --> 01:38:22,600 THAT MIGHT BE AND WHAT SOME OF 1890 01:38:22,600 --> 01:38:29,160 THE CONSEQUENCES MIGHT BE. 1891 01:38:29,160 --> 01:38:34,560 >> YEAH, SO THE REASON I HAVE 1892 01:38:34,560 --> 01:38:39,640 THIS IDEA WAS THE OMICRON ORIGIN 1893 01:38:39,640 --> 01:38:45,240 WE DON'T KNOW WHERE IT COMES FROM 1894 01:38:45,240 --> 01:38:50,560 IF IT COMES FROM HOST YOU CAN 1895 01:38:50,560 --> 01:38:58,160 MASTER THE HOST AND THE CHIMERIC 1896 01:38:58,160 --> 01:39:17,680 WAY AND WE LOOKED AT THE 1897 01:39:17,680 --> 01:39:26,600 FREQUENCY AND IT POINTS TO ONE 1898 01:39:26,600 --> 01:39:31,920 PERSON MAYBE AND THIS INDICATED 1899 01:39:31,920 --> 01:39:41,320 THE CHIMERA CAN EXIST AND IT HAS 1900 01:39:41,320 --> 01:39:45,480 A COMPARTMENT. 1901 01:39:45,480 --> 01:39:49,640 AND WE LOOKED AT THE WAYS. 1902 01:39:49,640 --> 01:39:54,920 THAT'S THE FIRST WE FOUND AND 1903 01:39:54,920 --> 01:40:02,560 THEN WE GO FURTHER TO FIND IT 1904 01:40:02,560 --> 01:40:12,240 CAN TRACK THE ORIGIN AND LIKELY 1905 01:40:12,240 --> 01:40:17,880 IT COMES FROM A HOST.
1906 01:40:17,880 --> 01:40:23,400 SO THIS MEANS THERE'S ALWAYS A 1907 01:40:23,400 --> 01:40:40,680 CHANCE THAT AND MAYBE WE LOOKED 1908 01:40:40,680 --> 01:40:45,960 AT DIFFERENT ANIMALS I THINK 1909 01:40:45,960 --> 01:40:50,920 MAYBE IN THE FUTURE WE CAN LOOK 1910 01:40:50,920 --> 01:40:53,640 AT THE GENOME ORIGIN FROM 1911 01:40:53,640 --> 01:40:54,440 DIFFERENT HOSTS. 1912 01:40:54,440 --> 01:40:54,920 >> THANK YOU. 1913 01:40:54,920 --> 01:40:55,520 >> GREAT. 1914 01:40:55,520 --> 01:41:00,320 WE ARE NOW GOING TO START OUR 1915 01:41:00,320 --> 01:41:02,520 BREAKOUT ROOM AND GO TO CLOSED 1916 01:41:02,520 --> 01:41:04,880 SESSION WITH DR. JIANG AND HAVE 1917 01:41:04,880 --> 01:41:06,400 A CLOSED SESSION FOR JUST THE 1918 01:41:06,400 --> 01:41:07,240 REVIEW COMMITTEE. 1919 01:41:07,240 --> 01:41:10,080 WE'LL GO AHEAD AND KEEP THE 1920 01:41:10,080 --> 01:41:11,680 REGULAR AMOUNT OF TIME AND TAKE 1921 01:41:11,680 --> 01:41:14,680 THE FIVE MINUTES FROM OUR CLOSED 1922 01:41:14,680 --> 01:41:17,480 SCIENTIFIC COUNSELOR SESSION. 1923 01:41:17,480 --> 01:41:18,800 TODAY I WANT TO TALK TO YOU 1924 01:41:18,800 --> 01:41:21,280 ABOUT COMPUTATIONAL PREDICTION 1925 01:41:21,280 --> 01:41:22,600 AND EXPERIMENTAL VALIDATION OF 1926 01:41:22,600 --> 01:41:25,280 FOLD-SWITCHING PROTEINS. 1927 01:41:25,280 --> 01:41:27,560 SO IF YOU LOOK AT THE DATABASE 1928 01:41:27,560 --> 01:41:37,280 RIGHT NOW YOU'LL SEE IT HAS OVER 1929 01:41:37,280 --> 01:41:37,920 220 MILLION HIGH-QUALITY 1930 01:41:37,920 --> 01:41:41,280 NON-REDUNDANT PROTEIN SEQUENCES. 1931 01:41:41,280 --> 01:41:49,160 BY CONTRAST, THERE ARE 18,000 1932 01:41:49,160 --> 01:41:51,720 HIGH QUALITY PROTEIN STRUCTURES 1933 01:41:51,720 --> 01:41:52,320 AVAILABLE IN THE PROTEIN DATA 1934 01:41:52,320 --> 01:41:57,160 BANK.
1935 01:41:57,160 --> 01:42:02,400 THOUGH IT'S AN ENORMOUS AMOUNT 1936 01:42:02,400 --> 01:42:05,240 OF WORK IT PALES IN COMPARISON TO THE PROTEIN 1937 01:42:05,240 --> 01:42:08,960 SEQUENCES OUT THERE AND MY LAB 1938 01:42:08,960 --> 01:42:11,920 IS INTERESTED IN KNOWING WHAT'S 1939 01:42:11,920 --> 01:42:13,360 GOING ON IN THE DARK PART OF THE 1940 01:42:13,360 --> 01:42:14,880 PROTEIN UNIVERSE. 1941 01:42:14,880 --> 01:42:18,080 WITH 16,000 PROTEINS WE KNOW THE 1942 01:42:18,080 --> 01:42:19,880 AMINO ACID SEQUENCE IS IMPORTANT 1943 01:42:19,880 --> 01:42:25,440 IN DETERMINING THE STRUCTURE AND 1944 01:42:25,440 --> 01:42:26,080 FUNCTIONS AND THAT BACKS UP WHAT 1945 01:42:26,080 --> 01:42:28,080 WAS SAID MANY DECADES AGO 1946 01:42:28,080 --> 01:42:38,120 NOW. 1947 01:42:38,120 --> 01:42:43,880 WE HAVE THE DOMAIN AND WE HAVE 1948 01:42:43,880 --> 01:42:44,720 THE SACS 5 INVOLVED IN 1949 01:42:44,720 --> 01:42:46,560 SIGNALLING BUT THE PROTEINS I 1950 01:42:46,560 --> 01:42:48,880 WANT TO TALK ABOUT TODAY ARE 1951 01:42:48,880 --> 01:42:57,280 NEITHER SINGLE FOLDING NOR 1952 01:42:57,280 --> 01:42:58,280 DISORDERED. 1953 01:42:58,280 --> 01:43:01,280 AND I MEAN A RE-ARRANGEMENT OR 1954 01:43:01,280 --> 01:43:03,640 REMODELLING OF SECONDARY AND 1955 01:43:03,640 --> 01:43:07,680 TERTIARY STRUCTURE THAT LEADS TO 1956 01:43:07,680 --> 01:43:12,400 CHANGES IN FUNCTION OR 1957 01:43:12,400 --> 01:43:20,160 REGULATION.
1958 01:43:20,160 --> 01:43:24,320 AND THE MECHANISM CHALLENGES 1959 01:43:24,320 --> 01:43:28,800 CURRENT PERCEPTIONS OF 1960 01:43:28,800 --> 01:43:31,960 ENERGETICS AND DYNAMICS AND NEW 1961 01:43:31,960 --> 01:43:34,000 BIOPHYSICAL PRINCIPLES 1962 01:43:37,280 --> 01:43:43,240 CAN BE REVEALED 1963 01:43:43,240 --> 01:43:43,760 AND SWITCHES CHANGE THEIR 1964 01:43:43,760 --> 01:43:44,400 RESPONSE TO A DESIRED TRIGGER AND 1965 01:43:44,400 --> 01:43:46,320 MAY LEAD TO NEW THERAPEUTIC 1966 01:43:46,320 --> 01:43:47,040 INTERVENTIONS THAT TARGET FOLD 1967 01:43:47,040 --> 01:44:04,280 SWITCHING PROTEINS. 1968 01:44:04,280 --> 01:44:07,920 WE LOOK AT GASTROENTERITIS AND 1969 01:44:07,920 --> 01:44:10,720 A FOLD-SWITCHING PROTEIN HAS 1970 01:44:10,720 --> 01:44:12,720 BEEN FOUND IN COVID-19. 1971 01:44:12,720 --> 01:44:16,320 WHAT MY LAB WOULD FIRST LIKE TO 1972 01:44:16,320 --> 01:44:18,360 DO IS DEVELOP COMPUTATIONAL 1973 01:44:18,360 --> 01:44:21,280 APPROACHES TO IDENTIFY THEM FROM 1974 01:44:21,280 --> 01:44:24,800 THEIR DYNAMIC SEQUENCES. 1975 01:44:24,800 --> 01:44:28,480 AND YOU MAY SAY, LAUREN, DON'T YOU 1976 01:44:28,480 --> 01:44:31,480 READ THE NEWS? ALPHAFOLD2 1977 01:44:31,480 --> 01:44:33,280 CAN ALREADY PREDICT FROM 1978 01:44:33,280 --> 01:44:36,680 SEQUENCE SO WHY DON'T YOU JUST 1979 01:44:36,680 --> 01:44:36,960 USE THAT. 1980 01:44:36,960 --> 01:44:43,560 BUT IT DIDN'T DO SO WELL 1981 01:44:43,560 --> 01:44:58,960 PREDICTING BOTH AND WE HAVE TWO 1982 01:44:58,960 --> 01:45:01,480 FOLDS AND TWO SECONDARY 1983 01:45:01,480 --> 01:45:01,800 STRUCTURES. 1984 01:45:01,800 --> 01:45:13,280 THAT'S IN THE PDB.
1985 01:45:13,280 --> 01:45:16,520 HERE WE'RE MEASURING PREDICTION 1986 01:45:16,520 --> 01:45:18,880 ACCURACY AND WE CAN USE A 1987 01:45:18,880 --> 01:45:21,280 MEASURE TO GET SIMILAR RESULTS 1988 01:45:21,280 --> 01:45:23,680 AND IF IT CAN CAPTURE BOTH 1989 01:45:23,680 --> 01:45:25,400 CONFORMATIONS THROUGH LOOKING AT 1990 01:45:25,400 --> 01:45:28,080 MULTIPLES OF ITS MODELS, YOU'D 1991 01:45:28,080 --> 01:45:31,360 HAVE A FAIRLY EQUAL DISTRIBUTION 1992 01:45:31,360 --> 01:45:32,720 OF POINTS ALONG THE IDENTITY 1993 01:45:32,720 --> 01:45:33,480 LINE BUT THAT'S NOT WHAT YOU SEE 1994 01:45:33,480 --> 01:45:41,280 HERE. 1995 01:45:41,280 --> 01:45:43,960 AND PREDICTIONS ARE BIASED 1996 01:45:43,960 --> 01:45:46,880 TOWARDS ONE FOLD. 1997 01:45:46,880 --> 01:45:52,840 AND SO ALPHAFOLD2, AS POWERFUL 1998 01:45:52,840 --> 01:45:55,040 AND AMAZING A TOOL AS IT IS, IS NOT ABLE TO 1999 01:45:55,040 --> 01:45:56,800 CAPTURE FOLD SWITCHING BEHAVIOR. 2000 01:45:56,800 --> 01:46:00,520 SO WE DO NEED ALTERNATIVE 2001 01:46:00,520 --> 01:46:01,360 METHODS TO LOOK AT FOLD 2002 01:46:01,360 --> 01:46:01,800 SWITCHING. 2003 01:46:01,800 --> 01:46:03,800 SO I WANTED TO APPROACH THIS 2004 01:46:03,800 --> 01:46:05,480 PROBLEM WITH A HYPOTHESIS AND 2005 01:46:05,480 --> 01:46:09,160 IT'S A SIMPLE ONE. 2006 01:46:09,160 --> 01:46:10,680 FOLD SWITCHING PROTEINS BECAUSE 2007 01:46:10,680 --> 01:46:12,280 THEY'RE CHANGING THEIR SECONDARY 2008 01:46:12,280 --> 01:46:15,280 STRUCTURES ARE LIKELY TO HAVE 2009 01:46:15,280 --> 01:46:17,280 SECONDARY STRUCTURE PROPENSITIES 2010 01:46:17,280 --> 01:46:23,680 FOR BOTH FOLDS NOT JUST ONE. 2011 01:46:23,680 --> 01:46:27,160 AND WE'LL TALK ABOUT HOW WE TEST THE 2012 01:46:27,160 --> 01:46:28,960 HYPOTHESIS ON THE NUSG PROTEIN 2013 01:46:28,960 --> 01:46:29,200 FAMILY.
2014 01:46:29,200 --> 01:46:31,920 THIS FAMILY OF PROTEINS IS THE 2015 01:46:31,920 --> 01:46:34,000 ONLY KNOWN FAMILY OF 2016 01:46:34,000 --> 01:46:35,440 TRANSCRIPTION FACTORS CONSERVED 2017 01:46:35,440 --> 01:46:40,240 ALL THE WAY FROM BACTERIA TO 2018 01:46:40,240 --> 01:46:40,480 HUMANS. 2019 01:46:40,480 --> 01:46:42,440 IT HAS ONE MEMBER WITH THE 2020 01:46:42,440 --> 01:46:47,160 STRUCTURE CALLED RFAH. 2021 01:46:47,160 --> 01:46:49,800 ALL OF THESE PROTEINS HAVE AN 2022 01:46:49,800 --> 01:46:51,040 N-TERMINAL DOMAIN WITH THE 2023 01:46:51,040 --> 01:46:56,080 STRUCTURE YOU CAN SEE IN GRAY. 2024 01:46:56,080 --> 01:47:00,480 THAT BINDS TO RNA POLYMERASE 2025 01:47:00,480 --> 01:47:02,400 FOSTERING TRANSCRIPTIONAL 2026 01:47:02,400 --> 01:47:08,480 READTHROUGH AND RFAH HAS A DOMAIN 2027 01:47:08,480 --> 01:47:15,880 THAT FOLDS INTO AN ALPHA HELICAL 2028 01:47:15,880 --> 01:47:19,880 BUNDLE CONFORMATION AND THAT 2029 01:47:19,880 --> 01:47:25,280 REGULATES RNA TO TRANSCRIPTIONAL 2030 01:47:25,280 --> 01:47:28,640 SPECIFICITY AND UPON BINDING OPS 2031 01:47:28,640 --> 01:47:31,000 DNA AND RNA POLYMERASE THE 2032 01:47:31,000 --> 01:47:33,480 C-TERMINAL DOMAIN, WE DON'T KNOW 2033 01:47:33,480 --> 01:47:37,160 HOW YET, DETACHES FROM THE 2034 01:47:37,160 --> 01:47:39,680 N-TERMINAL DOMAIN AND REFOLDS INTO 2035 01:47:39,680 --> 01:47:49,280 THE BETA ROLL THAT CAN THEN RECRUIT 2036 01:47:49,280 --> 01:47:51,440 AND FOSTER EFFICIENT 2037 01:47:51,440 --> 01:47:51,720 TRANSLATION.
2038 01:47:51,720 --> 01:47:53,280 THERE'S ONLY ONE STRUCTURE THAT 2039 01:47:53,280 --> 01:47:57,080 DOES THIS IN THE PDB AND THAT'S 2040 01:47:57,080 --> 01:48:01,240 RFAH I'M SHOWING YOU HERE BUT 2041 01:48:01,240 --> 01:48:04,160 THERE ARE NUMEROUS NUSG 2042 01:48:04,160 --> 01:48:05,720 STRUCTURES WITH THE SAME DOMAIN 2043 01:48:05,720 --> 01:48:07,400 STRUCTURE DISCUSSED BEFORE BUT 2044 01:48:07,400 --> 01:48:09,120 THE C-TERMINAL DOMAIN AS FAR AS 2045 01:48:09,120 --> 01:48:13,240 ANYBODY KNOWS ONLY ASSUMES THE 2046 01:48:13,240 --> 01:48:23,200 BETA ROLL FOLD. 2047 01:48:23,200 --> 01:48:25,600 AND LOOKING AT THE C-TERMINAL 2048 01:48:25,600 --> 01:48:27,080 DOMAIN TO LOOK WHETHER IT'S ALPHA OR 2049 01:48:27,080 --> 01:48:30,200 BETA OR JUST BETA ALL THE TIME. 2050 01:48:30,200 --> 01:48:32,280 USUALLY WHEN ONE WANTS TO 2051 01:48:32,280 --> 01:48:34,680 PREDICT THE STRUCTURE OF THE 2052 01:48:34,680 --> 01:48:36,280 PROTEIN, IT WORKS LIKE THIS. 2053 01:48:36,280 --> 01:48:37,960 YOU TAKE THE FULL LENGTH 2054 01:48:37,960 --> 01:48:41,080 SEQUENCE OF THE PROTEIN AND 2055 01:48:41,080 --> 01:48:42,800 SEARCH FOR HOMOLOGOUS SEQUENCES 2056 01:48:42,800 --> 01:48:45,240 AND PROFILE THE AMINO ACIDS, 2057 01:48:45,240 --> 01:48:47,000 THERE'S MANY WAYS OF DOING THIS 2058 01:48:47,000 --> 01:48:51,320 THROUGH HIDDEN MARKOV MODELS 2059 01:48:51,320 --> 01:48:54,760 AND CO-EVOLUTION AND THERE'S 2060 01:48:54,760 --> 01:48:56,280 MANY WAYS THEY'RE PROFILED AND 2061 01:48:56,280 --> 01:48:58,160 THEY CAN BE MATCHED TO KNOWN 2062 01:48:58,160 --> 01:48:58,360 FOLDS.
2063 01:48:58,360 --> 01:49:01,880 THE PROBLEM IS THE SEQUENCE 2064 01:49:01,880 --> 01:49:05,760 PROFILES OF NUSG AND RFAH ARE 2065 01:49:05,760 --> 01:49:07,560 SIMILAR ENOUGH THEY PREDICT BOTH 2066 01:49:07,560 --> 01:49:10,080 OF THEM IN THE GROUND STATE WILL 2067 01:49:10,080 --> 01:49:12,680 ASSUME THIS BETA ROLL 2068 01:49:12,680 --> 01:49:16,000 CONFORMATION IN THE C-TERMINAL 2069 01:49:16,000 --> 01:49:21,840 DOMAIN. THIS IS NOT ADEQUATE TO 2070 01:49:21,840 --> 01:49:22,480 DISTINGUISH BECAUSE THE SEQUENCE 2071 01:49:22,480 --> 01:49:23,400 PROFILES ARE SIMILAR. 2072 01:49:23,400 --> 01:49:25,280 TO GET AROUND THE PROBLEM WE 2073 01:49:25,280 --> 01:49:29,280 CAME UP WITH AN APPROACH CALLED 2074 01:49:29,280 --> 01:49:33,280 SECONDARY PROPENSITY COMPARISON 2075 01:49:33,280 --> 01:49:36,160 AND THESE ARE THE SAME AS WHAT 2076 01:49:36,160 --> 01:49:38,760 YOU SEE UP HERE TO THE MATCH 2077 01:49:38,760 --> 01:49:42,400 PROFILE PART AND WE ALSO TAKE A 2078 01:49:42,400 --> 01:49:44,040 CROPPED PIECE OF THE SEQUENCE IN 2079 01:49:44,040 --> 01:49:46,120 THIS CASE WHAT CORRESPONDS TO 2080 01:49:46,120 --> 01:49:47,280 THE C-TERMINAL DOMAIN. 2081 01:49:47,280 --> 01:49:49,800 WE GET TWO SEQUENCES WITH TWO 2082 01:49:49,800 --> 01:49:52,000 DIFFERENT PROFILES AND RUN THEM 2083 01:49:52,000 --> 01:49:54,520 THROUGH A SECONDARY STRUCTURE 2084 01:49:54,520 --> 01:49:56,880 PREDICTOR AND THE ONE OF CHOICE 2085 01:49:56,880 --> 01:50:00,680 IS JPRED4 AND WE FOUND JPRED4 2086 01:50:00,680 --> 01:50:03,480 WORKS BETTER AND WE CAN TALK 2087 01:50:03,480 --> 01:50:06,080 ABOUT THAT LATER IF YOU 2088 01:50:06,080 --> 01:50:08,280 WANT AND JPRED4 PREDICTS 2089 01:50:08,280 --> 01:50:11,920 BOTH THE C-TERMINAL DOMAIN AND 2090 01:50:11,920 --> 01:50:14,120 THE FULL LENGTH ASSUME THE BETA 2091 01:50:14,120 --> 01:50:16,800 SHEET CONFORMATION AND PREDICT 2092 01:50:16,800 --> 01:50:19,080 IT DOES NOT SWITCH FOLDS.
2093 01:50:19,080 --> 01:50:22,360 HOWEVER, FOR RFAH YOU CAN SEE WE 2094 01:50:22,360 --> 01:50:24,080 END UP WITH A DIFFERENT OUTCOME 2095 01:50:24,080 --> 01:50:27,160 HERE AND THE FULL LENGTH IS 2096 01:50:27,160 --> 01:50:28,280 PREDICTED TO BE LIKE WHAT YOU'D 2097 01:50:28,280 --> 01:50:31,240 EXPECT FROM THE HOMOLOGY 2098 01:50:31,240 --> 01:50:34,040 PROFILING ALGORITHMS BUT WHEN WE 2099 01:50:34,040 --> 01:50:37,040 LOOK AT THE CROPPED C-TERMINAL 2100 01:50:37,040 --> 01:50:38,640 DOMAIN YOU SEE THE COMBINATION 2101 01:50:38,640 --> 01:50:43,960 SIMILAR TO WHAT WE WERE 2102 01:50:43,960 --> 01:50:49,280 HYPOTHESIZING. SECONDARY STRUCTURE 2103 01:50:49,280 --> 01:50:52,200 PREDICTOR ALGORITHMS RARELY 2104 01:50:52,200 --> 01:50:53,040 CONFUSE ALPHA HELIX PREDICTIONS AND BETA 2105 01:50:53,040 --> 01:50:55,920 SHEET PREDICTIONS IN SINGLE 2106 01:50:55,920 --> 01:50:57,280 FOLDING PROTEINS. 2107 01:50:57,280 --> 01:51:03,120 WE INTERPRET THESE ALPHA HELIX 2108 01:51:03,120 --> 01:51:06,720 DISCREPANCIES, ESPECIALLY MANY IN 2109 01:51:06,720 --> 01:51:08,640 A ROW, AS A STRONG SIGNAL FOR 2110 01:51:08,640 --> 01:51:09,280 FOLD SWITCHING. 2111 01:51:09,280 --> 01:51:14,360 SO IN THIS CASE WE PREDICT RFAH 2112 01:51:14,360 --> 01:51:16,360 SWITCHES FOLDS, WHICH IS 2113 01:51:16,360 --> 01:51:17,480 CONSISTENT WITH WHAT WE SEE 2114 01:51:17,480 --> 01:51:23,080 EXPERIMENTALLY. 2115 01:51:23,080 --> 01:51:24,880 AND THE APPROACH WORKS NOT ONLY 2116 01:51:24,880 --> 01:51:28,120 FOR RFAH BUT FOR A NUMBER OF 2117 01:51:28,120 --> 01:51:29,280 FOLD SWITCHING PROTEINS WITH 2118 01:51:29,280 --> 01:51:31,440 GOOD STATISTICAL SIGNIFICANCE 2119 01:51:31,440 --> 01:51:31,960 AND GOOD CORRELATION.
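The comparison described above, between the secondary structure predicted for the full-length sequence and for the cropped C-terminal domain, can be sketched roughly as follows. This is a minimal illustration and not the speaker's software: the function name, the H/E/- string encoding, and the toy prediction strings are all assumptions, and in practice the strings would come from a predictor such as JPred4.

```python
def helix_strand_discrepancies(full_pred, cropped_pred, start):
    """Count positions where one prediction says helix (H) and the other strand (E).

    full_pred    -- per-residue prediction for the full-length sequence
    cropped_pred -- per-residue prediction for the cropped domain alone
    start        -- 0-based index in full_pred where the cropped domain begins
    Returns (total discrepancies, longest run of consecutive discrepancies).
    """
    region = full_pred[start:start + len(cropped_pred)]
    if len(region) != len(cropped_pred):
        raise ValueError("cropped prediction does not fit inside the full-length one")
    total = longest = run = 0
    for a, b in zip(region, cropped_pred):
        if {a, b} == {"H", "E"}:  # helix in one prediction, strand in the other
            total += 1
            run += 1
            longest = max(longest, run)
        else:
            run = 0
    return total, longest


# Hypothetical predictions (not real data): H = helix, E = strand, - = coil.
full_length = "----HHHHHHHH--HHHH"
cropped_ctd = "EEEEEE-EEEEEEE"
print(helix_strand_discrepancies(full_length, cropped_ctd, start=4))
```

Reporting the longest run separately reflects the remark that many discrepancies in a row are treated as the stronger signal; any threshold applied to these counts would be a further assumption.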
2120 01:51:31,960 --> 01:51:36,400 IT SUGGESTS WE CAN USE IT BEYOND 2121 01:51:36,400 --> 01:51:38,120 RFAH BUT FOR VARIOUS REASONS 2122 01:51:38,120 --> 01:51:42,840 HAVING TO DO WITH EXPERIMENTAL 2123 01:51:42,840 --> 01:51:44,120 TRACTABILITY WE'RE STARTING WITH 2124 01:51:44,120 --> 01:51:45,920 THE NUSG SEQUENCES AND WE CAN MAP 2125 01:51:45,920 --> 01:51:47,880 OUT THE SPACE AND FIND OVER 2126 01:51:47,880 --> 01:51:50,880 15,000 NON-REDUNDANT SEQUENCES. 2127 01:51:50,880 --> 01:51:55,920 WE CAN CLUSTER THOSE USING 2128 01:51:55,920 --> 01:51:59,320 AGGLOMERATIVE CLUSTERING AND END 2129 01:51:59,320 --> 01:52:03,800 UP WITH 304 CLUSTERS AND CAN 2130 01:52:03,800 --> 01:52:06,640 CONNECT CLUSTERS I AND J IF THE SEQUENCES 2131 01:52:06,640 --> 01:52:09,040 IN BOTH CLUSTERS ARE AT LEAST 2132 01:52:09,040 --> 01:52:09,280 24% IDENTICAL ON AVERAGE. 2133 01:52:09,280 --> 01:52:12,320 THIS IS A CUT-OFF TO CONDITION 2134 01:52:12,320 --> 01:52:13,240 SOMEONE'S EXPECTATIONS ON 2135 01:52:13,240 --> 01:52:14,320 WHETHER THE PROTEIN WOULD OR 2136 01:52:14,320 --> 01:52:17,240 WOULD NOT ASSUME THE SAME FOLD 2137 01:52:17,240 --> 01:52:21,240 AND THEN WE CAN USE VARIABLE 2138 01:52:21,240 --> 01:52:22,880 STRUCTURE COMPARISON TO INFER 2139 01:52:22,880 --> 01:52:24,320 FOLD SWITCHING OR SINGLE 2140 01:52:24,320 --> 01:52:25,080 FOLDING. 2141 01:52:25,080 --> 01:52:27,120 WHEN WE DO THIS WE COME UP WITH 2142 01:52:27,120 --> 01:52:30,000 A FORCE-DIRECTED GRAPH, SO THE 2143 01:52:30,000 --> 01:52:31,360 SEQUENCES, OR CLUSTERS, 2144 01:52:31,360 --> 01:52:34,880 CLOSER IN SPACE TO ONE ANOTHER 2145 01:52:34,880 --> 01:52:35,800 HAVE CLOSER AVERAGE OVERALL 2146 01:52:35,800 --> 01:52:37,280 SEQUENCE IDENTITY TO ONE 2147 01:52:37,280 --> 01:52:41,280 ANOTHER. 2148 01:52:41,280 --> 01:52:48,160 AND THERE ARE OVER 100 2149 01:52:48,160 --> 01:52:50,440 SEQUENCES. ANY TWO CONNECTED BY AN 2150 01:52:50,440 --> 01:52:53,280 EDGE HAVE AT LEAST AN AVERAGE 2151 01:52:53,280 --> 01:53:00,520 IDENTITY OF 24%.
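As a rough illustration of the graph construction just described, the sketch below links two clusters when the average pairwise identity between their sequences is at least 24%. Only the 24% cutoff comes from the talk. The helper names, the assumption that sequences are already aligned to equal length, and the toy sequences are hypothetical.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent identity of two pre-aligned sequences, ignoring gap columns."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def mean_identity(cluster_a, cluster_b):
    """Average identity over every sequence pair drawn from the two clusters."""
    values = [percent_identity(a, b) for a in cluster_a for b in cluster_b]
    return sum(values) / len(values)


def cluster_edges(clusters, cutoff=24.0):
    """Return the (i, j) cluster pairs whose mean identity meets the cutoff."""
    return [
        (i, j)
        for i, j in combinations(sorted(clusters), 2)
        if mean_identity(clusters[i], clusters[j]) >= cutoff
    ]


# Toy clusters (hypothetical sequences, far smaller than the real data set).
clusters = {
    "A": ["MKTAYIAK", "MKTAYLAK"],
    "B": ["MKSAYIGK"],
    "C": ["GGPLWWHE"],
}
print(cluster_edges(clusters))
```

The resulting edge list could then be handed to any force-directed layout tool; that step is left out here because the talk does not say which tool was used.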
2152 01:53:00,520 --> 01:53:05,160 WE FIND IN CERTAIN PLACES OF 2153 01:53:05,160 --> 01:53:07,360 SEQUENCE SPACE MANY CLUSTERS ARE 2154 01:53:07,360 --> 01:53:09,040 PREDICTED TO SWITCH FOLDS AND 2155 01:53:09,040 --> 01:53:10,640 IN OTHER PLACES MANY ARE NOT AND IN 2156 01:53:10,640 --> 01:53:15,600 SOME CASES WE SEE A MIXTURE. 2157 01:53:15,600 --> 01:53:18,200 OVERALL, WE PREDICT THAT THE 2158 01:53:18,200 --> 01:53:19,880 TOTAL PERCENTAGE OF 2159 01:53:19,880 --> 01:53:22,400 FOLD-SWITCHING 2160 01:53:22,400 --> 01:53:24,600 PROTEINS IS ABOUT 25%, WHICH IS 2161 01:53:24,600 --> 01:53:25,920 SIGNIFICANTLY HIGHER THAN WHAT 2162 01:53:25,920 --> 01:53:27,160 WE ESTIMATED FOR PROTEINS 2163 01:53:27,160 --> 01:53:30,200 OVERALL. 2164 01:53:30,200 --> 01:53:32,840 AND THIS IS CONSISTENT WITH NUSG 2165 01:53:32,840 --> 01:53:36,080 DATA SETS WE'D EXPECT TO NOT 2166 01:53:36,080 --> 01:53:42,280 SWITCH FOLDS BUT HAVE SOME SORT 2167 01:53:42,280 --> 01:53:43,440 OF SPLITTING FOR OTHER 2168 01:53:43,440 --> 01:53:45,880 UNANNOTATED PROTEINS. 2169 01:53:45,880 --> 01:53:51,680 SO PREDICTIONS ARE NICE BUT IT'S 2170 01:53:51,680 --> 01:53:55,960 NICE TO BE ABLE TO VALIDATE OUR 2171 01:53:55,960 --> 01:54:01,280 PREDICTIONS EXPERIMENTALLY. 2172 01:54:01,280 --> 01:54:13,160 WE USE CIRCULAR DICHROISM SPECTROSCOPY AND 2173 01:54:13,160 --> 01:54:16,760 WE COMPARE THE SPECTRA BETWEEN PROTEINS. 2174 01:54:16,760 --> 01:54:19,280 CD DOES AN ESPECIALLY GOOD 2175 01:54:19,280 --> 01:54:27,360 JOB OF PICKING OUT ALPHA HELICAL 2176 01:54:27,360 --> 01:54:28,200 STRUCTURES, WHEREAS BETA 2177 01:54:28,200 --> 01:54:31,120 SHEET HAS A LESS INTENSE SIGNAL 2178 01:54:31,120 --> 01:54:35,960 WITH ONE MINIMUM AROUND 217 AND 2179 01:54:35,960 --> 01:54:41,280 THERE'S A FLAT SIGNAL AND HAS A 2180 01:54:41,280 --> 01:54:43,040 MINIMUM AROUND 195.
2181 01:54:43,040 --> 01:54:44,240 THE ASSUMPTION WE'RE MAKING IS 2182 01:54:44,240 --> 01:54:49,960 THE N-TERMINAL DOMAINS OF ALL 2183 01:54:49,960 --> 01:54:54,080 NUSG PROTEINS HAVE MORE OR LESS 2184 01:54:54,080 --> 01:54:56,840 THE SAME STRUCTURE COMPOSITION. 2185 01:54:56,840 --> 01:54:58,680 THERE ARE QUITE A FEW SOLVED 2186 01:54:58,680 --> 01:55:00,720 PROTEINS IN THE PROTEIN FAMILY 2187 01:55:00,720 --> 01:55:01,960 AVAILABLE IN THE PROTEIN DATA BANK AND 2188 01:55:01,960 --> 01:55:22,480 ALL DO ASSUME THE SAME AND THIS 2189 01:55:22,480 --> 01:55:24,120 ASSUMES WHETHER IT'S ON THE FOLD 2190 01:55:24,120 --> 01:55:28,400 AND WE CAN TEST THIS ON THE 2191 01:55:28,400 --> 01:55:30,160 PROTEIN THAT MIGHT HAVE SOLVED 2192 01:55:30,160 --> 01:55:30,480 STRUCTURES. 2193 01:55:30,480 --> 01:55:31,680 WE DO TWO DIFFERENT PREPS ON TWO 2194 01:55:31,680 --> 01:55:32,880 DIFFERENT DAYS. 2195 01:55:32,880 --> 01:55:36,280 YOU CAN SEE THAT RFAH INDEED HAS 2196 01:55:36,280 --> 01:55:41,280 A MORE INTENSE SIGNAL WITH TWO 2197 01:55:41,280 --> 01:55:47,400 PRONOUNCED MINIMA AND NUSG HAS 2198 01:55:47,400 --> 01:55:50,880 A LESS PRONOUNCED MINIMUM AND WE 2199 01:55:50,880 --> 01:55:52,680 DID DIFFERENT PREPS ON DIFFERENT 2200 01:55:52,680 --> 01:55:52,880 DAYS. 2201 01:55:52,880 --> 01:55:59,720 WE CAN LINEARLY DECOMPOSE THE 2202 01:55:59,720 --> 01:56:01,360 TWO SPECTRA TO GET PERCENTAGES 2203 01:56:01,360 --> 01:56:04,680 AND TAKE A RATIO OF THOSE SO IF 2204 01:56:04,680 --> 01:56:07,680 WE TAKE THE RATIO OF ALPHA HELIX 2205 01:56:07,680 --> 01:56:09,680 TO BETA STRAND IN A GIVEN 2206 01:56:09,680 --> 01:56:12,520 STRUCTURE YOU CAN SEE THERE'S 2207 01:56:12,520 --> 01:56:14,360 SIGNIFICANTLY MORE BETA STRAND 2208 01:56:14,360 --> 01:56:18,680 IN NUSG AND SIGNIFICANTLY MORE 2209 01:56:18,680 --> 01:56:21,680 HELIX IN THE OTHER, AS WE'D 2210 01:56:21,680 --> 01:56:22,360 EXPECT.
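The linear decomposition mentioned above can be sketched as an ordinary least-squares fit of a measured spectrum to a set of basis spectra, followed by a helix-to-strand ratio. This is an assumption-laden illustration: the talk does not say which basis spectra or fitting software were used, and the numbers below are invented placeholders rather than real reference spectra.

```python
def solve(matrix, rhs):
    """Solve a small linear system by Gauss-Jordan elimination with pivoting."""
    n = len(rhs)
    m = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def decompose(spectrum, basis):
    """Least-squares weights of each basis spectrum (via the normal equations)."""
    names = list(basis)
    gram = [[sum(x * y for x, y in zip(basis[p], basis[q])) for q in names]
            for p in names]
    proj = [sum(x * y for x, y in zip(basis[p], spectrum)) for p in names]
    return dict(zip(names, solve(gram, proj)))


# Placeholder basis spectra sampled at four wavelengths (invented values).
basis = {
    "helix": [-30.0, -28.0, -32.0, -5.0],
    "strand": [-5.0, -18.0, -10.0, -2.0],
}
# Synthetic "measured" spectrum built as 60% helix plus 40% strand.
measured = [0.6 * h + 0.4 * s for h, s in zip(basis["helix"], basis["strand"])]

weights = decompose(measured, basis)
ratio = weights["helix"] / weights["strand"]
print(weights, ratio)
```

A real analysis would typically add a coil component and constrain the weights to be non-negative; both are omitted here to keep the sketch short.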
2211 01:56:22,360 --> 01:56:24,360 WE SELECT OUR SEQUENCE AND 2212 01:56:24,360 --> 01:56:26,160 PREDICT WHETHER IT SWITCHES 2213 01:56:26,160 --> 01:56:32,080 FOLDS AND CAN TEST USING CD FOR 2214 01:56:32,080 --> 01:56:32,320 OPENERS. 2215 01:56:32,320 --> 01:56:35,360 WE ACTUALLY SELECTED 16 BUT ONLY 2216 01:56:35,360 --> 01:56:37,920 10 WORKED, SEQUENCES FROM ALL 2217 01:56:37,920 --> 01:56:40,480 OVER SEQUENCE SPACE. 2218 01:56:40,480 --> 01:56:42,280 THEY'RE HIGHLY DIVERSE. MOST 2219 01:56:42,280 --> 01:56:45,080 SEQUENCES HAVE LESS THAN 30% 2220 01:56:45,080 --> 01:56:47,040 PAIRWISE IDENTITY TO ONE 2221 01:56:47,040 --> 01:56:47,520 ANOTHER. 2222 01:56:47,520 --> 01:56:51,520 THEY'RE FROM ALL DIFFERENT 2223 01:56:51,520 --> 01:56:52,120 BACTERIAL PHYLA AND DIFFERENT PARTS OF 2224 01:56:52,120 --> 01:56:53,080 SEQUENCE SPACE. 2225 01:56:53,080 --> 01:56:55,840 YOU CAN SEE FOR THESE FOUR 2226 01:56:55,840 --> 01:56:57,880 SEQUENCES THAT WE PREDICTED TO 2227 01:56:57,880 --> 01:57:01,800 NOT SWITCH FOLDS, THEY HAVE 2228 01:57:01,800 --> 01:57:05,880 CONSIDERABLY MORE BETA SHEET 2229 01:57:05,880 --> 01:57:09,240 CHARACTER AND CONSIDERABLY LESS 2230 01:57:09,240 --> 01:57:11,880 ALPHA HELICAL CHARACTER, WHEREAS 2231 01:57:11,880 --> 01:57:14,520 FOR OUR PREDICTED FOLD SWITCHERS 2232 01:57:14,520 --> 01:57:29,120 THERE'S MORE ALPHA HELICAL 2233 01:57:29,120 --> 01:57:30,840 STRUCTURE AND THAT GIVES 2234 01:57:30,840 --> 01:57:33,080 CONFIDENCE THAT WHAT GOES ABOVE IT 2235 01:57:33,080 --> 01:57:35,680 PROBABLY ALSO HAS AN ALPHA 2236 01:57:35,680 --> 01:57:37,800 HELICAL DOMAIN AND YOU CAN SEE 2237 01:57:37,800 --> 01:57:41,280 THE CD SPECTRA OF OUR SINGLE 2238 01:57:41,280 --> 01:57:44,680 FOLDING VARIANTS CLUSTER 2239 01:57:44,680 --> 01:57:46,600 TOGETHER NICELY AND MOST OF THE 2240 01:57:46,600 --> 01:57:49,440 FOLD SWITCHING VARIANTS DO AS 2241 01:57:49,440 --> 01:57:49,680 WELL.
2242 01:57:49,680 --> 01:57:51,440 AGAIN WE HAVE THE ASSUMPTION THE 2243 01:57:51,440 --> 01:57:52,880 N-TERMINAL DOMAIN ALWAYS 2244 01:57:52,880 --> 01:57:54,240 MAINTAINS THE SAME STRUCTURE. 2245 01:57:54,240 --> 01:57:56,560 SO WE WANTED TO LOOK AT A COUPLE 2246 01:57:56,560 --> 01:57:59,520 VARIANTS IN MORE DETAIL AND FOR 2247 01:57:59,520 --> 01:58:06,360 THAT WE TURNED TO NMR SPECTROSCOPY 2248 01:58:06,360 --> 01:58:07,920 AND USED AN EXPERIMENT. 2249 01:58:07,920 --> 01:58:12,280 ON THE X AXIS WE HAVE HYDROGEN 2250 01:58:12,280 --> 01:58:14,880 CHEMICAL SHIFTS AND EACH SHIFT 2251 01:58:14,880 --> 01:58:20,040 CORRESPONDS IN THIS CASE TO A 2252 01:58:20,040 --> 01:58:25,880 BACKBONE AMIDE AND EACH AMINO 2253 01:58:25,880 --> 01:58:28,160 ACID WILL HAVE A SHIFT AND 2254 01:58:28,160 --> 01:58:31,920 THERE ARE SOME CASES WHERE THERE ARE 2255 01:58:31,920 --> 01:58:33,280 CHEMICAL SHIFTS BUT MOST 2256 01:58:33,280 --> 01:58:41,280 CHEMICAL SHIFTS DO CORRESPOND TO 2257 01:58:41,280 --> 01:58:43,960 BACKBONE AMIDES AND IF WE 2258 01:58:43,960 --> 01:58:45,240 CHANGE CONDITIONS WE HAVE TWO 2259 01:58:45,240 --> 01:58:46,280 POTENTIAL POPULATIONS. 2260 01:58:46,280 --> 01:58:48,960 ONE IS THAT THE CHEMICAL SHIFTS 2261 01:58:48,960 --> 01:58:50,920 STAY THE SAME AND THAT WOULD 2262 01:58:50,920 --> 01:58:51,720 MEAN THE PROTEIN'S ENVIRONMENT 2263 01:58:51,720 --> 01:58:53,760 IS ESSENTIALLY THE SAME OR THE 2264 01:58:53,760 --> 01:58:55,920 CHEMICAL SHIFTS COULD CHANGE AND 2265 01:58:55,920 --> 01:58:58,320 THAT WOULD MEAN THE PROTEIN'S 2266 01:58:58,320 --> 01:59:00,520 ENVIRONMENT HAS ALSO CHANGED. 2267 01:59:00,520 --> 01:59:03,480 AND SO WE CAN DO THIS EXPERIMENT 2268 01:59:03,480 --> 01:59:06,720 ON VARIANT 8 WHICH IS PREDICTED 2269 01:59:06,720 --> 01:59:08,600 TO NOT SWITCH FOLDS.
2270 01:59:08,600 --> 01:59:11,440 ALL C-TERMINAL DOMAINS OF ALL 2271 01:59:11,440 --> 01:59:12,360 VARIANTS IF EXPRESSED IN 2272 01:59:12,360 --> 01:59:15,080 ISOLATION ARE EXPECTED TO ASSUME 2273 01:59:15,080 --> 01:59:19,320 A BETA FOLD AND SO YOU CAN SEE 2274 01:59:19,320 --> 01:59:21,880 IF WE SUPERIMPOSE THE 2275 01:59:21,880 --> 01:59:25,880 C-TERMINAL DOMAIN OF VARIANT 8 WE 2276 01:59:25,880 --> 01:59:31,720 GET 98% PEAK OVERLAP, STRONGLY 2277 01:59:31,720 --> 01:59:33,280 SHOWING THE CHEMICAL ENVIRONMENT 2278 01:59:33,280 --> 01:59:35,560 HAS NOT CHANGED AND THE PROTEIN 2279 01:59:35,560 --> 01:59:37,760 IS NOT SWITCHING FOLDS. BUT HERE 2280 01:59:37,760 --> 01:59:39,280 IN THE WHOLE PROTEIN IT SHOULD BE 2281 01:59:39,280 --> 01:59:42,320 ALPHA HELIX BUT IN THE 2282 01:59:42,320 --> 01:59:43,680 C-TERMINAL DOMAIN IT SHOULD BE BETA 2283 01:59:43,680 --> 01:59:43,960 SHEET. 2284 01:59:43,960 --> 01:59:47,040 YOU CAN SEE IN THIS CASE THERE'S 2285 01:59:47,040 --> 01:59:49,400 VERY LITTLE PEAK OVERLAP BETWEEN 2286 01:59:49,400 --> 01:59:52,160 THE C-TERMINAL DOMAIN AND WHOLE 2287 01:59:52,160 --> 01:59:55,040 PROTEIN, ONLY 12%. 2288 01:59:55,040 --> 01:59:56,080 HOWEVER, CHANGES IN PEAK OVERLAP 2289 01:59:56,080 --> 01:59:58,280 COULD BE FOR OTHER REASONS THAN 2290 01:59:58,280 --> 01:59:58,680 SWITCHING FOLDS.
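One simple way to put a number on "peak overlap" is to count how many peaks in one spectrum have a partner in the other within a small tolerance. The talk does not state how its 98% and 12% figures were computed, so the sketch below is only one plausible definition: the two-dimensional peak format (hydrogen shift plus a second dimension, assumed here to be nitrogen), the tolerances, and the toy peak lists are all assumptions.

```python
def peak_overlap(reference, other, tol_h=0.03, tol_n=0.3):
    """Percent of reference peaks that have a match in the other peak list.

    Each peak is a (hydrogen_ppm, nitrogen_ppm) tuple. A match requires both
    coordinates to agree within the given tolerances (assumed values).
    """
    if not reference:
        raise ValueError("reference peak list is empty")
    matched = sum(
        any(abs(h - h2) <= tol_h and abs(n - n2) <= tol_n for h2, n2 in other)
        for h, n in reference
    )
    return 100.0 * matched / len(reference)


# Toy peak lists (invented values, not measured data).
isolated_domain = [(8.20, 120.1), (7.95, 118.4), (8.60, 125.0), (9.10, 110.2)]
whole_protein = [(8.21, 120.0), (7.95, 118.5), (8.90, 121.0), (9.11, 110.3)]
print(peak_overlap(isolated_domain, whole_protein))
```

As the speaker notes, a low overlap on its own does not prove fold switching, which is why the peaks were subsequently assigned.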
2291 01:59:58,680 --> 02:00:03,280 SO TO DIG DEEPER WE FIRST 2292 02:00:03,280 --> 02:00:05,080 MEASURED OR ASSIGNED THE C 2293 02:00:05,080 --> 02:00:06,880 TERMINAL DOMAIN'S EXPRESSED IN 2294 02:00:06,880 --> 02:00:08,520 ISOLATION OF BOTH VARIANT 8 AND 2295 02:00:08,520 --> 02:00:11,600 VARIANT 5 AND YOU CAN SEE THEY 2296 02:00:11,600 --> 02:00:17,680 BOTH HAVE STRONG BETA SHEET 2297 02:00:17,680 --> 02:00:26,720 CHARACTER AS REPORTED BY THE 2298 02:00:26,720 --> 02:00:29,400 PEOPLE ON THE CALL AND BY 2299 02:00:29,400 --> 02:00:32,280 CONTRAST YOU CAN SEE WE'RE IN 2300 02:00:32,280 --> 02:00:34,960 ISOLATION BUT IT WAS NEARLY ALL 2301 02:00:34,960 --> 02:00:37,480 BETA SHEET OR COIL WHEN 2302 02:00:37,480 --> 02:00:40,160 EXPRESSED IN THE WHOLE PROTEIN 2303 02:00:40,160 --> 02:00:41,960 IT'S ALPHA HELIX. 2304 02:00:41,960 --> 02:00:46,520 THIS IS MUCH MORE HIGH 2305 02:00:46,520 --> 02:00:47,800 RESOLUTION SUPPORT SHOWING AT 2306 02:00:47,800 --> 02:00:49,280 LEAST VARIANT 5 IS SWITCHING 2307 02:00:49,280 --> 02:00:49,680 FOLDS. 2308 02:00:49,680 --> 02:00:51,080 NOW AN INTERESTING QUESTION THAT 2309 02:00:51,080 --> 02:00:54,240 COMES UP IS WELL, HOW DO OTHER 2310 02:00:54,240 --> 02:00:57,280 PROTEIN PREDICTION ALGORITHMS DO 2311 02:00:57,280 --> 02:00:58,680 ON VARIANT 5? 2312 02:00:58,680 --> 02:01:00,320 IN EVERY SINGLE CASE THEY 2313 02:01:00,320 --> 02:01:01,920 PREDICT THE GROUND STATE FOLD OF 2314 02:01:01,920 --> 02:01:05,360 VARIANT 5 IS BETA BARREL AND I'M 2315 02:01:05,360 --> 02:01:07,120 ONLY SHOWING ONE MODEL THAT'S 2316 02:01:07,120 --> 02:01:09,040 BECAUSE THERE'S A LOT OF 2317 02:01:09,040 --> 02:01:09,280 MOVEMENT. 2318 02:01:09,280 --> 02:01:10,880 YOU CAN'T SEE THINGS AS WELL IF 2319 02:01:10,880 --> 02:01:20,120 I WERE TO SUPER IMPOSE THEM ALL. 2320 02:01:20,120 --> 02:01:22,920 AND IT SEEMS LIKE WE ARE PICKING 2321 02:01:22,920 --> 02:01:25,680 UP THE STATE OF THE ART METHODS 2322 02:01:25,680 --> 02:01:31,400 ARE NOT IDENTIFYING YET. 
2323 02:01:31,400 --> 02:01:34,880 AND IT SEEMS TO BE TRUE FOR ALL 2324 02:01:34,880 --> 02:01:37,200 SIX VARIANTS WE PREDICTED TO 2325 02:01:37,200 --> 02:01:39,680 SWITCH FOLDS WITH THE ONE 2326 02:01:39,680 --> 02:01:41,920 EXCEPTION OF VARIANT 3 AND 2327 02:01:41,920 --> 02:01:42,320 ALPHAFOLD2. 2328 02:01:42,320 --> 02:01:44,160 THE IMPORTANT THING TO POINT OUT 2329 02:01:44,160 --> 02:01:47,520 IS THE STRUCTURE OF VARIANT 3 2330 02:01:47,520 --> 02:01:50,000 WITH THE EXACT SAME SEQUENCE WAS 2331 02:01:50,000 --> 02:01:51,880 IN ALPHAFOLD2'S TRAINING SET. 2332 02:01:51,880 --> 02:01:53,720 IT'S NOT THAT IT PREDICTED ON 2333 02:01:53,720 --> 02:01:55,680 ITS OWN THAT THE PROTEIN HAS A GROUND 2334 02:01:55,680 --> 02:01:58,480 STATE ALPHA HELICAL STRUCTURE 2335 02:01:58,480 --> 02:02:01,280 BUT RATHER IT'S BASICALLY JUST 2336 02:02:01,280 --> 02:02:03,080 SPITTING OUT WHAT IT WAS GIVEN 2337 02:02:03,080 --> 02:02:09,880 IN ITS TRAINING SET. 2338 02:02:09,880 --> 02:02:12,320 AND THERE ARE OTHER STATE-OF-THE-ART 2339 02:02:12,320 --> 02:02:15,040 METHODS THAT CAN'T PICK IT UP. 2340 02:02:15,040 --> 02:02:16,680 YOU MAY ASK WHY AREN'T THEY 2341 02:02:16,680 --> 02:02:17,680 PICKING IT UP AND SOMETHING 2342 02:02:17,680 --> 02:02:21,600 INTERESTING IS IF WE LOOK AT 2343 02:02:21,600 --> 02:02:24,280 ALPHAFOLD2 PREDICTION 2344 02:02:24,280 --> 02:02:27,640 CONFIDENCES FOR SINGLE FOLDERS 2345 02:02:27,640 --> 02:02:31,280 AND DISORDERED PROTEINS, WHICH ARE 2346 02:02:31,280 --> 02:02:32,880 CONFORMATIONALLY HETEROGENEOUS, 2347 02:02:32,880 --> 02:02:35,040 THE CONFIDENCES ARE LOW 2348 02:02:35,040 --> 02:02:36,280 FOR A LOT OF DISORDERED PROTEINS 2349 02:02:36,280 --> 02:02:36,880 SO IT KNOWS WHAT IT DOESN'T 2350 02:02:36,880 --> 02:02:43,920 KNOW.
2351 02:02:43,920 --> 02:02:46,920 AND FOR FOLD SWITCHERS OFTEN THE 2352 02:02:46,920 --> 02:02:49,280 CONFIDENCES ARE HIGH, NOT AS HIGH AS FOR 2353 02:02:49,280 --> 02:02:53,200 SINGLE FOLDERS BUT HIGHER THAN FOR 2354 02:02:53,200 --> 02:02:54,080 DISORDERED PROTEINS, AND THERE COULD BE 2355 02:02:54,080 --> 02:02:54,960 TWO REASONS. 2356 02:02:54,960 --> 02:02:57,080 ONE IS BECAUSE OF THE MULTIPLE 2357 02:02:57,080 --> 02:03:02,000 SEQUENCE ALIGNMENTS OR ONE COULD 2358 02:03:02,000 --> 02:03:08,480 BE ITS LEARNING MODEL FOR 2359 02:03:08,480 --> 02:03:08,760 STRUCTURE. 2360 02:03:08,760 --> 02:03:11,080 WE LOOKED AT AVERAGE PREDICTION 2361 02:03:11,080 --> 02:03:12,480 CONFIDENCE FOR FIVE MODELS AND 2362 02:03:12,480 --> 02:03:15,520 YOU CAN SEE THEY'RE COMPLETELY 2363 02:03:15,520 --> 02:03:15,880 UNCORRELATED. 2364 02:03:15,880 --> 02:03:17,320 THERE'S ALSO BASICALLY NO 2365 02:03:17,320 --> 02:03:18,440 CORRELATION BETWEEN AVERAGE 2366 02:03:18,440 --> 02:03:20,560 PREDICTION CONFIDENCE AND 2367 02:03:20,560 --> 02:03:21,080 EVOLUTIONARY RATES. 2368 02:03:21,080 --> 02:03:24,280 SO IT SEEMS LIKE AT LEAST THE 2369 02:03:24,280 --> 02:03:25,400 OBVIOUS PROPERTIES OF MULTIPLE 2370 02:03:25,400 --> 02:03:29,040 SEQUENCE ALIGNMENTS THAT MIGHT 2371 02:03:29,040 --> 02:03:30,280 AFFECT PREDICTION CONFIDENCES 2372 02:03:30,280 --> 02:03:32,840 DON'T, AND SO WE THINK 2373 02:03:32,840 --> 02:03:35,680 ALPHAFOLD2 LIKELY SEARCHES FOR ONE 2374 02:03:35,680 --> 02:03:38,040 MOST PROBABLE CONFORMATION AND 2375 02:03:38,040 --> 02:03:40,080 MISSES OTHER PROTEIN PROPERTIES 2376 02:03:40,080 --> 02:03:44,720 SUCH AS FOLD SWITCHING. 2377 02:03:44,720 --> 02:03:48,000 OTHERS HAVE ALSO SHOWN THAT 2378 02:03:48,000 --> 02:03:49,120 IT CAN'T 2379 02:03:49,120 --> 02:03:50,000 CAPTURE FOLDING PATHWAYS FOR 2380 02:03:50,000 --> 02:03:56,040 PROTEINS. 2381 02:03:56,040 --> 02:03:59,680 THEN THE QUESTION IS HOW CAN WE 2382 02:03:59,680 --> 02:04:00,880 PREDICT FOLD SWITCHING?
2383 02:04:00,880 --> 02:04:05,280 IT WOULD BE NICE TO BE ABLE TO 2384 02:04:05,280 --> 02:04:06,560 PREDICT THREE DIMENSIONAL 2385 02:04:06,560 --> 02:04:08,840 STRUCTURES OF TWO FOLDS FROM ONE 2386 02:04:08,840 --> 02:04:09,200 SEQUENCE. 2387 02:04:09,200 --> 02:04:11,360 I WANT TO REFERENCE WORK DONE BY 2388 02:04:11,360 --> 02:04:13,200 JOSEPH SCHAFER IN THE LAB A 2389 02:04:13,200 --> 02:04:14,600 POST-DOC THAT STARTED IN AUGUST 2390 02:04:14,600 --> 02:04:16,120 AND HE'LL TELL YOU MORE DETAILS 2391 02:04:16,120 --> 02:04:19,440 OF WHAT HE'S BEEN DOING BUT 2392 02:04:19,440 --> 02:04:21,240 THROUGH SOME SOPHISTICATED 2393 02:04:21,240 --> 02:04:24,760 FILTERING AND CLUSTERING WE HAVE 2394 02:04:24,760 --> 02:04:28,280 BEEN ABLE TO IDENTIFY DUAL-FOLD 2395 02:04:28,280 --> 02:04:30,920 CONTACTS IN FOLD-SWITCHING 2396 02:04:30,920 --> 02:04:31,280 SEQUENCES. 2397 02:04:31,280 --> 02:04:33,480 I'LL EXPLAIN WHAT THIS. 2398 02:04:33,480 --> 02:04:35,240 THIS IS A CONTACT MAP. 2399 02:04:35,240 --> 02:04:41,280 ANY DOT YOU CAN SEE IN BLACK, 2400 02:04:41,280 --> 02:04:43,520 WHITE OR GRAY IS AN 2401 02:04:43,520 --> 02:04:46,880 EXPERIMENTALLY DETERMINED 2402 02:04:46,880 --> 02:04:49,000 CONTACT AND FIVE ATOMS WITHIN 2403 02:04:49,000 --> 02:04:50,520 FIVE ANGSTROMS TO ONE ANOTHER 2404 02:04:50,520 --> 02:04:53,120 AND THEY'RE UNIQUE TO THE BETA 2405 02:04:53,120 --> 02:04:56,000 SHEET FOLD HERE AND ANYTHING 2406 02:04:56,000 --> 02:04:59,280 WHITE IS UNIQUE TO THE ALPHA 2407 02:04:59,280 --> 02:05:00,000 HELICAL FOLD HERE. 
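The contact-map idea described above can be illustrated with a short sketch that marks two residues as in contact when any of their atoms lie within five angstroms, then separates contacts unique to each fold from shared ones. Only the five-angstrom cutoff comes from the talk. The function name, the minimum sequence separation, and the toy coordinates are hypothetical.

```python
from itertools import combinations
from math import dist


def contacts(residues, cutoff=5.0, min_separation=3):
    """Return residue pairs (i, j) with any two atoms within `cutoff` angstroms.

    residues maps a residue number to a list of (x, y, z) atom coordinates.
    Pairs closer than `min_separation` in sequence are skipped (an assumed
    filter that removes trivial neighbor contacts).
    """
    found = set()
    for i, j in combinations(sorted(residues), 2):
        if j - i < min_separation:
            continue
        if any(dist(a, b) <= cutoff for a in residues[i] for b in residues[j]):
            found.add((i, j))
    return found


# Toy coordinates for the same three residues in two folds (invented values).
fold_one = {1: [(0.0, 0.0, 0.0)], 5: [(3.0, 0.0, 0.0)], 9: [(20.0, 0.0, 0.0)]}
fold_two = {1: [(0.0, 0.0, 0.0)], 5: [(12.0, 0.0, 0.0)], 9: [(15.0, 0.0, 0.0)]}

contacts_one = contacts(fold_one)
contacts_two = contacts(fold_two)
unique_to_one = contacts_one - contacts_two
unique_to_two = contacts_two - contacts_one
shared = contacts_one & contacts_two
print(unique_to_one, unique_to_two, shared)
```

Predicted contacts from a co-evolution analysis could then be checked against each of the three sets to see which fold they support.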
2408 02:05:00,000 --> 02:05:04,920 YOU CAN SEE WE PICK UP CONTACT 2409 02:05:04,920 --> 02:05:07,520 IN THE MATCH AND PICK UP CORRECT 2410 02:05:07,520 --> 02:05:10,160 CONTACTS FOR BOTH STRUCTURES AND 2411 02:05:10,160 --> 02:05:12,360 WE'RE OPTIMISTIC WE COULD ABLE 2412 02:05:12,360 --> 02:05:21,960 TO MAP THESE CONTACTS ON TO 2413 02:05:21,960 --> 02:05:23,240 THREE DIMENSIONAL STRUCTURES AND 2414 02:05:23,240 --> 02:05:25,840 WE WANT TO IN THE FUTURE. 2415 02:05:25,840 --> 02:05:30,600 THIS DOESN'T JUST WORK FOR RFAH 2416 02:05:30,600 --> 02:05:35,280 BUT FOR THE PROTEIN AND WHAT 2417 02:05:35,280 --> 02:05:37,880 REGULATES THE CIRCADIAN CLOCK 2418 02:05:37,880 --> 02:05:39,720 AND WHAT IS INVOLVED IN 2419 02:05:39,720 --> 02:05:40,680 BACTERIAL CELL DIVISION. 2420 02:05:40,680 --> 02:05:43,920 WE'RE OPTIMISTIC THIS CAN BE USE 2421 02:05:43,920 --> 02:05:49,080 FOR A NUMBER OF FOLD-SWITCHING 2422 02:05:49,080 --> 02:05:52,680 PROTEINS AND INDICATES FOLD 2423 02:05:52,680 --> 02:05:54,880 SWITCHING IS SELECTED FOR 2424 02:05:54,880 --> 02:06:00,040 EVOLUTIONARY AND IT HIGHLIGHTS 2425 02:06:00,040 --> 02:06:04,320 THE FUNCTIONAL IMPORTANCE IN THE 2426 02:06:04,320 --> 02:06:07,560 LAST FIVE MINUTES I WANT TO TALK 2427 02:06:07,560 --> 02:06:08,080 ABOUT FUTURE DIRECTION. 
2428 02:06:08,080 --> 02:06:13,600 ONE IS OBSERVATIONAL AND ONE IS 2429 02:06:13,600 --> 02:06:20,680 MECHANISTIC AND OF THE PROJECTS 2430 02:06:20,680 --> 02:06:24,520 FIVE ARE EXPERIMENTS AND WE WANT 2431 02:06:24,520 --> 02:06:27,040 TO LOOK AT ANALYSIS AND A 2432 02:06:27,040 --> 02:06:29,080 TECHNICIAN IN MY LAB HAS BEEN 2433 02:06:29,080 --> 02:06:32,080 DEVELOPING TO DEVELOP A HIGH 2434 02:06:32,080 --> 02:06:41,480 THROUGHPUT PRU TEEN PURIFICATION 2435 02:06:41,480 --> 02:06:43,280 MODEL AND IT LOOKS LIKE WE 2436 02:06:43,280 --> 02:06:48,720 SHOULD BE ABLE TO PRODUCE UP TO 2437 02:06:48,720 --> 02:06:58,760 18 RFAH PROTEINS AND RUN 2438 02:06:58,760 --> 02:07:01,200 CIRCULAR PREDICTIONS AND SEE 2439 02:07:01,200 --> 02:07:03,480 WHERE THEY WORK AND FAIL AND 2440 02:07:03,480 --> 02:07:04,320 WHY. 2441 02:07:04,320 --> 02:07:06,680 WE WANT TO PREDICT WHETHER A 2442 02:07:06,680 --> 02:07:08,560 PROTEIN SWITCHES FOLD BUT WHICH 2443 02:07:08,560 --> 02:07:11,920 FOLDS IT ASSUMES AND EVENTUALLY 2444 02:07:11,920 --> 02:07:15,920 WE'D LIKE TO GENERATE THREE 2445 02:07:15,920 --> 02:07:17,200 DIMENSIONAL MODELS FOR 2446 02:07:17,200 --> 02:07:33,880 STRUCTURES AND FOLD SWITCHING 2447 02:07:33,880 --> 02:07:36,480 SEEMS TO BE PRESERVED IN 2448 02:07:36,480 --> 02:07:39,600 DIFFERENT LIFE AND WE WOULD LIKE 2449 02:07:39,600 --> 02:07:40,600 TO KNOW WHETHER ALL FOLD 2450 02:07:40,600 --> 02:07:42,640 SWITCHING PROTEINS EVOLVE FROM 2451 02:07:42,640 --> 02:07:45,000 ONE COMMON ANCESTOR OR IT'S 2452 02:07:45,000 --> 02:07:46,800 EVOLVED MORE THAN ONCE AND 2453 02:07:46,800 --> 02:07:49,880 MOVING TO THE MECHANISTIC SIDE 2454 02:07:49,880 --> 02:07:51,080 WE'RE INTERESTED IN IDENTIFYING 2455 02:07:51,080 --> 02:07:55,960 LOCAL INTERACTIONS THAT FOSTER 2456 02:07:55,960 --> 02:07:57,240 RFAH FOLD SWITCHING. 
2457 02:07:57,240 --> 02:07:59,640 WE HYPOTHESIZED, AS I SAID IN THE 2458 02:07:59,640 --> 02:08:01,720 BEGINNING, THAT PROPENSITIES ARE LOCAL 2459 02:08:01,720 --> 02:08:03,280 AT LEAST SOMEWHAT IN NATURE AND 2460 02:08:03,280 --> 02:08:08,680 WE SHOULD BE ABLE TO MAKE LOCAL 2461 02:08:08,680 --> 02:08:09,920 MUTATIONS THAT AFFECT THE RFAH 2462 02:08:09,920 --> 02:08:12,760 IN THIS CASE AND WE HAVE 2463 02:08:12,760 --> 02:08:13,520 PRELIMINARY EVIDENCE SUGGESTING 2464 02:08:13,520 --> 02:08:16,680 THIS APPROACH DOES WORK AND WE 2465 02:08:16,680 --> 02:08:19,360 IDENTIFIED A PAIR OF PROTEINS IN 2466 02:08:19,360 --> 02:08:23,240 THE RFAH FAMILY THAT ARE 76% 2467 02:08:23,240 --> 02:08:24,360 IDENTICAL AND FROM THE DATA WE 2468 02:08:24,360 --> 02:08:27,160 HAVE IT APPEARS ONE SWITCHES 2469 02:08:27,160 --> 02:08:29,000 FOLDS AND THE OTHER DOESN'T AND 2470 02:08:29,000 --> 02:08:31,040 WE SEE THIS AS A GREAT 2471 02:08:31,040 --> 02:08:32,000 OPPORTUNITY TO SEARCH MANY 2472 02:08:32,000 --> 02:08:42,280 POSSIBLE MUTATIONAL PATHWAYS TO 2473 02:08:42,280 --> 02:08:44,240 IDENTIFY SEQUENCES AND LOOK AT 2474 02:08:44,240 --> 02:08:47,120 THE MINIMAL NUMBER OF MUTATIONS 2475 02:08:47,120 --> 02:08:49,280 NEEDED TO CAUSE IT TO SWITCH FOLDS AND 2476 02:08:49,280 --> 02:08:51,160 FINALLY WE'D LIKE TO MEASURE THE 2477 02:08:51,160 --> 02:08:52,680 FOLDING FREE ENERGIES OF 2478 02:08:52,680 --> 02:08:55,960 CONFORMATIONS OF RFAH AND OF THE 2479 02:08:55,960 --> 02:08:58,760 C-TERMINAL AND N-TERMINAL DOMAIN 2480 02:08:58,760 --> 02:08:59,480 INTERFACE.
2481 02:08:59,480 --> 02:09:03,640 JOSEPH SCHAFER HAS ALSO WORKED 2482 02:09:03,640 --> 02:09:05,880 ON A VARIANT WE THINK MIGHT 2483 02:09:05,880 --> 02:09:09,080 ALLOW A THERMODYNAMIC CYCLE TO 2484 02:09:09,080 --> 02:09:11,840 TEASE OUT ALL THREE OF THE 2485 02:09:11,840 --> 02:09:13,680 ENERGIES, AND WE'RE HOPING THAT PUTTING THIS 2486 02:09:13,680 --> 02:09:16,480 TOGETHER WILL GIVE US A BETTER 2487 02:09:16,480 --> 02:09:18,600 PREDICTIVE AND BIOPHYSICAL 2488 02:09:18,600 --> 02:09:21,040 FRAMEWORK FOR FOLD SWITCHING TO 2489 02:09:21,040 --> 02:09:22,280 EVENTUALLY TAKE SEQUENCES OF 2490 02:09:22,280 --> 02:09:23,480 WHOLE GENOMES AND PUT THEM 2491 02:09:23,480 --> 02:09:25,280 THROUGH OUR SOFTWARE AND 2492 02:09:25,280 --> 02:09:27,680 PREDICT WHERE THIS IS 2493 02:09:27,680 --> 02:09:28,880 HAPPENING BIOLOGICALLY AND BE 2494 02:09:28,880 --> 02:09:30,600 ABLE TO TEST IT IN THE LAB AND 2495 02:09:30,600 --> 02:09:32,080 POTENTIALLY DISCOVER BIOLOGICAL 2496 02:09:32,080 --> 02:09:33,280 PATHWAYS THAT HAVEN'T BEEN FOUND 2497 02:09:33,280 --> 02:09:36,600 YET. 2498 02:09:36,600 --> 02:09:38,000 WITH THAT I'D LIKE TO THANK ALL 2499 02:09:38,000 --> 02:09:41,720 THESE PEOPLE, ESPECIALLY THESE 2500 02:09:41,720 --> 02:09:43,400 PEOPLE WHO STARTED IN THE LAB IN 2501 02:09:43,400 --> 02:09:45,280 AUGUST AND HAVE BEEN DOING A 2502 02:09:45,280 --> 02:09:48,160 FANTASTIC JOB. I THANK YOU 2503 02:09:48,160 --> 02:09:49,960 FOR YOUR ATTENTION AND I'M HAPPY TO 2504 02:09:49,960 --> 02:09:57,880 TAKE ANY QUESTIONS. 2505 02:09:57,880 --> 02:10:01,240 >> THANK YOU VERY MUCH, LAUREN. 2506 02:10:01,240 --> 02:10:03,400 I'LL NOW OPEN IT UP TO LAUREN. 2507 02:10:03,400 --> 02:10:04,960 THAT WAS A GREAT PRESENTATION 2508 02:10:04,960 --> 02:10:06,840 AND YOU GOT DONE A COUPLE 2509 02:10:06,840 --> 02:10:08,440 MINUTES EARLY. 2510 02:10:08,440 --> 02:10:17,240 >> THANK YOU FOR THE GREAT -- 2511 02:10:17,240 --> 02:10:19,560 SORRY, YOU FIRST.
2512 02:10:19,560 --> 02:10:22,320 SAID YOU TRIED 16 BUT 10 JUST 2513 02:10:22,320 --> 02:10:22,560 WORKED. 2514 02:10:22,560 --> 02:10:23,600 WHAT'S WORKED MEANS? 2515 02:10:23,600 --> 02:10:29,080 >> WHAT WORK MEANS IS THAT THE 2516 02:10:29,080 --> 02:10:31,680 OTHER SIX -- SOME DIDN'T EXPRESS 2517 02:10:31,680 --> 02:10:34,960 AT ALL AND SOME DIDN'T EXPRESS 2518 02:10:34,960 --> 02:10:35,200 SOLUBLY. 2519 02:10:35,200 --> 02:10:40,040 SO WE WERE NOT ABLE TO 2520 02:10:40,040 --> 02:10:41,200 CHARACTERIZE THEM BECAUSE WE 2521 02:10:41,200 --> 02:10:41,840 COULDN'T GET ENOUGH PROTEIN TO 2522 02:10:41,840 --> 02:10:50,280 DO IT. 2523 02:10:50,280 --> 02:10:55,000 >> ONE QUESTION AND ONE -- HOW 2524 02:10:55,000 --> 02:11:01,960 MANY OF THE 93 PROTEINS YOU 2525 02:11:01,960 --> 02:11:07,840 TESTED HAVE THEM AND THE SECOND 2526 02:11:07,840 --> 02:11:14,840 QUESTION IS DO YOU THINK THAT 2527 02:11:14,840 --> 02:11:16,080 PROTEIN STRUCTURE SHOULD CHANGE 2528 02:11:16,080 --> 02:11:21,080 BECAUSE OF THESE PROPERTIES OR 2529 02:11:21,080 --> 02:11:21,960 HOW DO YOU SCALE THIS 2530 02:11:21,960 --> 02:11:22,880 REPRESENTATION IN A STANDARD 2531 02:11:22,880 --> 02:11:23,880 WAY? 2532 02:11:23,880 --> 02:11:29,280 >> TO ANSWER YOUR FIRST 2533 02:11:29,280 --> 02:11:37,280 QUESTIOQUESTIO 2534 02:11:37,280 --> 02:11:38,880 QUESTION, THEY REGULATE 2535 02:11:38,880 --> 02:11:40,280 EXPRESSION OF BACTERIA AND IT'S 2536 02:11:40,280 --> 02:11:42,440 RESPONSIBLE FOR MAKING US SICK 2537 02:11:42,440 --> 02:11:47,560 WHEN WE GET FOOD POISONING. 2538 02:11:47,560 --> 02:11:49,280 SO I THINK THERE ARE EVEN 2539 02:11:49,280 --> 02:11:51,600 PROTEINS THAT SWITCH FOLD ARE IN 2540 02:11:51,600 --> 02:11:53,280 BACTERIA AND VIRUSES MATTER TO 2541 02:11:53,280 --> 02:11:53,440 US. 
2542 02:11:53,440 --> 02:11:58,600 THERE ARE HUMAN PROTEINS THAT 2543 02:11:58,600 --> 02:12:05,760 SWITCH FOLDS, THERE'S A HUMAN 2544 02:12:05,760 --> 02:12:07,960 CHEMOKINE PROTEIN AND HAS TWO 2545 02:12:07,960 --> 02:12:11,680 FOLDS AND MAKES SENSE BECAUSE 2546 02:12:11,680 --> 02:12:13,160 IT'S SECRETED SO THE SWITCH IS 2547 02:12:13,160 --> 02:12:15,280 BASED ON THE CHANGING BONDS AND 2548 02:12:15,280 --> 02:12:17,240 WE SEE THAT BY CO-EVOLUTIONARY 2549 02:12:17,240 --> 02:12:22,480 ANALYSIS SO THAT AGAIN SUGGESTS 2550 02:12:22,480 --> 02:12:25,280 THAT THIS IS BEHAVIOR 2551 02:12:25,280 --> 02:12:27,520 SELECTABLE. 2552 02:12:27,520 --> 02:12:29,640 AS TO YOUR SECOND QUESTION SO IN 2553 02:12:29,640 --> 02:12:31,280 TERMS OF DO WE NEED TO CHANGE 2554 02:12:31,280 --> 02:12:31,520 THINGS? 2555 02:12:31,520 --> 02:12:33,080 I THINK THIS IS WHY IT'S SO 2556 02:12:33,080 --> 02:12:34,440 IMPORTANT FOR US TO BE ABLE TO 2557 02:12:34,440 --> 02:12:36,080 SCALE UP OUR METHODS AND BE ABLE 2558 02:12:36,080 --> 02:12:39,320 TO TEST IT ON MANY DIFFERENT 2559 02:12:39,320 --> 02:12:42,440 PROTEINS BECAUSE RIGHT NOW WE 2560 02:12:42,440 --> 02:12:44,240 REALLY HAVE NO CLUE HOW WIDE 2561 02:12:44,240 --> 02:12:46,480 SPREAD FOLD SWITCHING IS AS A 2562 02:12:46,480 --> 02:12:47,040 PHENOMENON. 2563 02:12:47,040 --> 02:12:47,800 WE DON'T KNOW. 2564 02:12:47,800 --> 02:12:50,520 I'M ALMOST SURE IT'S NOT NEARLY 2565 02:12:50,520 --> 02:12:54,160 AS COMMON AS IUPs OR 2566 02:12:54,160 --> 02:12:56,040 SINGLE-FOLDING PROTEINS BUT WE 2567 02:12:56,040 --> 02:12:57,680 DON'T KNOW WHERE IT LIES IN THE 2568 02:12:57,680 --> 02:13:04,920 SPACE 10% OF PROTEINS AND IS IT 2569 02:13:04,920 --> 02:13:07,240 5 AND WE WANT TO PREDICT ON THE 2570 02:13:07,240 --> 02:13:13,120 WHOLE GENOME AND VALIDATE THE 2571 02:13:13,120 --> 02:13:13,920 PREDICTION EXPERIMENTALLY AND 2572 02:13:13,920 --> 02:13:21,240 INFER WHAT THE NUMBERS MIGHT BE. 
2573 02:13:21,240 --> 02:13:32,040 WIRE DEVELOPING OTHER ASSAYS AND 2574 02:13:32,040 --> 02:13:35,840 IN THE FUTURE IN MY LAB I THINK 2575 02:13:35,840 --> 02:13:37,280 WE'LL DEFINITELY NEED TO DEVELOP 2576 02:13:37,280 --> 02:13:41,320 HIGHER THROUGHPUT EXPERIMENT TO 2577 02:13:41,320 --> 02:13:45,920 LOOK AT FOLD SWITCHING AND WE'RE 2578 02:13:45,920 --> 02:13:50,000 LOOKING AT MASS SPECKS AWAY TO 2579 02:13:50,000 --> 02:13:53,280 DO THAT IN COMBINATION WITH 2580 02:13:53,280 --> 02:13:55,760 OTHER METHODS AND MAY BE ABLE TO 2581 02:13:55,760 --> 02:13:57,640 PROFILE THEIR STRUCTURES AND 2582 02:13:57,640 --> 02:13:58,880 MAYBE IN FOUR YEARS I'LL BE ABLE 2583 02:13:58,880 --> 02:14:01,280 TO TELL YOU MORE ABOUT THAT. 2584 02:14:01,280 --> 02:14:11,320 >> THANK YOU. 2585 02:14:11,320 --> 02:14:13,920 >> I'M NOT LOOKING IF ANYONE IS 2586 02:14:13,920 --> 02:14:14,680 LOOKING AT THE RAISED HANDS BUT 2587 02:14:14,680 --> 02:14:16,200 I'LL ASK MY QUESTION. 2588 02:14:16,200 --> 02:14:21,720 YOUR WORK IS REALLY INTERESTING. 2589 02:14:21,720 --> 02:14:25,240 I'M EXCITED TO SEE ALPHA FOLD 2590 02:14:25,240 --> 02:14:28,880 TALKED ABOUT THIS MUCH BUT I'M 2591 02:14:28,880 --> 02:14:30,160 CURIOUS HAVE YOU REACHED OUT TO 2592 02:14:30,160 --> 02:14:32,280 THE DEEP MIND TEAM OR TRIED TO 2593 02:14:32,280 --> 02:14:34,880 USE THAT NETWORK BECAUSE THAT 2594 02:14:34,880 --> 02:14:40,720 NETWORK CAN -- IF YOU WERE TO 2595 02:14:40,720 --> 02:14:46,920 PREDICT A PROTEIN WITH DIFFERENT 2596 02:14:46,920 --> 02:14:48,960 FOLDING STATES AND 2597 02:14:48,960 --> 02:14:52,000 EXPERIMENTALLY VERIFY IT AFTER 2598 02:14:52,000 --> 02:14:52,200 THAT. 2599 02:14:52,200 --> 02:14:54,800 >> I HAVE NOT REACHED OUT TO 2600 02:14:54,800 --> 02:14:56,960 ALPHA FOLD YET AND I HAVE 2601 02:14:56,960 --> 02:14:57,760 THOUGHT ABOUT IT BUT IF YOU 2602 02:14:57,760 --> 02:14:59,440 THINK IT'S A GOOD IDEA. 2603 02:14:59,440 --> 02:15:00,880 >> THEY LOVE COLLABORATIONS. 
2604 02:15:00,880 --> 02:15:06,600 WE HAVE NO EXPERIMENTALISTS. 2605 02:15:06,600 --> 02:15:06,960 >> GREAT. 2606 02:15:06,960 --> 02:15:12,120 I'M HAPPY TO DO THAT. 2607 02:15:12,120 --> 02:15:14,200 WITH THAT CO-EVOLUTIONARY 2608 02:15:14,200 --> 02:15:19,000 ANALYSIS THE WAY WE GOT THE DUAL 2609 02:15:19,000 --> 02:15:19,960 FOLD CONTACTS WE HAD TO BE 2610 02:15:19,960 --> 02:15:22,160 CHOOSY IN OUR ALIGNMENTS. 2611 02:15:22,160 --> 02:15:24,400 I KNOW OTHER GROUPS HAVE USED 2612 02:15:24,400 --> 02:15:26,600 ALPHAFOLD 2 ON SHALLOWER 2613 02:15:26,600 --> 02:15:29,000 ALIGNMENTS TO PREDICT 2614 02:15:29,000 --> 02:15:29,640 CONFORMATIONAL VARIABILITY. 2615 02:15:29,640 --> 02:15:30,840 I HAVEN'T SEEN IT DO THAT IN 2616 02:15:30,840 --> 02:15:33,200 THE SECONDARY STRUCTURE BUT WE 2617 02:15:33,200 --> 02:15:33,760 ACTUALLY TRIED. 2618 02:15:33,760 --> 02:15:37,760 SO WE TOOK OUR MSAs THAT GOT OUT 2619 02:15:37,760 --> 02:15:42,080 THE DUAL FOLD PROTENSE 2620 02:15:42,080 --> 02:15:44,080 -- 2621 02:15:44,080 --> 02:15:45,840 PROPENSITIES AND PUT THEM IN 2622 02:15:45,840 --> 02:15:48,320 ALPHAFOLD AND IT DID NOT WORK. 2623 02:15:48,320 --> 02:15:50,520 >> IT SHOULDN'T WORK IF YOU WERE 2624 02:15:50,520 --> 02:15:53,640 JUST DETECT THE ORIGINAL 2625 02:15:53,640 --> 02:15:56,840 ALPHAFOLD AND IT MAY BE WORK AND THIS 2626 02:15:56,840 --> 02:16:01,040 IS WHERE WE MAY TALK ABOUT 2627 02:16:01,040 --> 02:16:02,680 COLLABORATING REDUCING THE LAYER 2628 02:16:02,680 --> 02:16:05,280 TO A SINGLE STRUCTURE, DUAL 2629 02:16:05,280 --> 02:16:09,000 STRUCTURE, SOMEWHERE AND FORCE 2630 02:16:09,000 --> 02:16:09,560 THE PREDICTION. 2631 02:16:09,560 --> 02:16:11,360 I'M JUST THROWING IT OUT THERE. 2632 02:16:11,360 --> 02:16:14,360 >> THAT WOULD BE AMAZING. 2633 02:16:14,360 --> 02:16:17,280 WE'D BE TOTALLY OPEN TO THAT. 2634 02:16:17,280 --> 02:16:23,080 >> COOL. 
2635 02:16:23,080 --> 02:16:25,320 >> ANY OTHER QUESTIONS 2636 02:16:25,320 --> 02:16:29,280 SPECIFICALLY FROM THOSE NOT ON 2637 02:16:29,280 --> 02:16:46,960 THE SCIENTIFIC COUNSEL? 2638 02:16:46,960 --> 02:16:49,240 >> WE HAVE THE CLOSED SESSION 2639 02:16:49,240 --> 02:16:50,240 FOR MORE FROM THE SCIENTIFIC 2640 02:16:50,240 --> 02:17:03,840 COUNCIL AS WELL. 2641 02:17:03,840 --> 02:17:09,600 >> I'M HAPPY TO ASK MORE 2642 02:17:09,600 --> 02:17:13,640 QUESTIONS AND WE LOOKED AT MINOR 2643 02:17:13,640 --> 02:17:15,320 CHANGES IN SUBSTRATE AND CAN 2644 02:17:15,320 --> 02:17:16,520 AFFECT SECONDARY STRUCTURE AND 2645 02:17:16,520 --> 02:17:17,680 THINGS LIKE THAT. 2646 02:17:17,680 --> 02:17:20,000 HAVE YOU TRIED USING THOSE 2647 02:17:20,000 --> 02:17:21,040 APPROACHES AS YOU'RE JUST 2648 02:17:21,040 --> 02:17:21,880 CHANGING A FEW SEQUENCES? 2649 02:17:21,880 --> 02:17:23,640 >> SO HERE'S THE PLAN. 2650 02:17:23,640 --> 02:17:24,720 WE HAVEN'T GOTTEN THERE YET BUT 2651 02:17:24,720 --> 02:17:25,960 THIS IS ABOUT WHAT WE'RE DOING. 2652 02:17:25,960 --> 02:17:28,560 THE VARIANTS -- SO WE HAVE THIS 2653 02:17:28,560 --> 02:17:30,400 PROJECT NOW I'M TRYING TO FINISH 2654 02:17:30,400 --> 02:17:37,000 WHERE WE MADE VARIANTS OF RFAH 2655 02:17:37,000 --> 02:17:43,440 THAT SEEM TO AFFECT THE 2656 02:17:43,440 --> 02:17:43,880 HELICITY. 2657 02:17:43,880 --> 02:17:45,520 CAN I SHARE THE DATA WITH YOU? 2658 02:17:45,520 --> 02:17:47,920 I HAVE IT ON MY POWERPOINT. 2659 02:17:47,920 --> 02:17:48,720 IT'S THIS ONE. 2660 02:17:48,720 --> 02:17:52,400 SO WE MADE A MUTATION HERE. 
2661 02:17:52,400 --> 02:17:55,800 SO THERE'S A HELIX CAP HIGHLY 2662 02:17:55,800 --> 02:17:57,760 CONSERVED IN THE RFAHs AND WE 2663 02:17:57,760 --> 02:18:03,880 CHANGE THE ACCEPTOR TO A PROLINE 2664 02:18:03,880 --> 02:18:06,680 THINKING THAT WILL GET RID OF 2665 02:18:06,680 --> 02:18:14,280 THE HELIX CAP AND HERE'S THE 2666 02:18:14,280 --> 02:18:16,920 HELICITY AND SIMILARLY HERE 2667 02:18:16,920 --> 02:18:21,960 THERE'S A NICE CONTACT THAT'S 2668 02:18:21,960 --> 02:18:24,000 HIGHLY CONSERVED AND IF WE 2669 02:18:24,000 --> 02:18:26,760 CHANGE IT TO A GLYCINE UP HERE 2670 02:18:26,760 --> 02:18:34,080 YOU CAN SEE AGAIN THE HELICITY'S 2671 02:18:34,080 --> 02:18:34,320 DECREASE. 2672 02:18:34,320 --> 02:18:40,320 WE WANT TO MAKE SURE WE'RE 2673 02:18:40,320 --> 02:18:43,880 SEEING WHAT WE THINK WE'RE 2674 02:18:43,880 --> 02:18:46,680 SEEING AND I TALKED WITH NHLBI 2675 02:18:46,680 --> 02:18:48,440 AND THEY'RE HAPPY TO COLLABORATE TO 2676 02:18:48,440 --> 02:18:51,040 SEE WARPING IN THE SECONDARY 2677 02:18:51,040 --> 02:18:53,240 STRUCTURE OR BOTH STRUCTURES 2678 02:18:53,240 --> 02:18:54,960 BEING SAMPLED. 2679 02:18:54,960 --> 02:18:58,080 THAT'S ON OUR RADAR WE JUST 2680 02:18:58,080 --> 02:19:01,240 HAVEN'T GOTTEN THERE YET. 2681 02:19:01,240 --> 02:19:18,480 >> THAT'S REALLY COOL. 2682 02:19:18,480 --> 02:19:18,840 >> OKAY. 2683 02:19:18,840 --> 02:19:20,080 THERE ARE NO FURTHER PARTICULARLY 2684 02:19:20,080 --> 02:19:22,480 GROUP QUESTIONS I PROPOSE WE 2685 02:19:22,480 --> 02:19:25,960 CATCH UP AND GO TO THE BREAKOUT 2686 02:19:25,960 --> 02:19:27,080 ROOMS SO LAUREN AND NEIL WILL 2687 02:19:27,080 --> 02:19:29,280 JOIN US AND THE BOARD OF 2688 02:19:29,280 --> 00:00:00,000 SCIENTIFIC COUNSEL MEMBERS.
