subtitlecat.com

All language subtitles for GENERA FOTOS DEL MISMO PERSONAJE (Erhan Meydan)

These are the user uploaded subtitles that are being translated:

With many artificial intelligence applications, we are all now producing images. At least we are trying. I also use many of them for both my own work and experiments. Each has its own strengths and weaknesses. In this video, I will talk about an artificial intelligence application that produces very successful, very realistic images and that can address several important issues at once.

The main issue is that the successful AI image production tools are paid. They're right to charge, because producing anything with artificial intelligence requires significant computing power. There's no point in me agreeing with them. After all, these are all commercial ventures and naturally they want to make money. The easiest way to do these operations without spending money is to put our own computer in the place of that computing power; in other words, to leave all the load on our own computer. This leads us to open source software. We don't need to pay anything for these. Completely free.
They are there for development and continue to be developed. After reaching a certain point, companies that wish can switch to a paid version. In fact, OpenAI has such a story. When they first started, as the name suggests, OpenAI began with this open software. Then they evolved into a company. Open software is extremely secure. I have been using it for a long time and I haven't seen any problems. Since it's open, if there were any virus, no one would install it, or those who know would immediately see it and request its removal. These types of open software are generally hosted on GitHub.

In the past, installing this open software on our computers required some technical knowledge. Now, with the Pinokio software, that's no longer necessary. You can handle everything very easily. By the way, the biggest advantage of generating visuals on your own computer is that you don't face any copyright issues. The second problem this software solves is character continuity. It's one of the most frequently asked questions I receive.
So, how do we ensure the continuity of a character we've created? We created a female character. It also solves how to place this character in different settings: in Uludağ during winter when it's snowing, on the beach in summer, at the cinema, or in a studio.

By the way, I couldn't help but wonder if this topic is related to a news article I read recently. Someone created an AI influencer named Aitana and it's earning $11,000 a month. It's a figure that boggles the mind. I also found the Instagram account of the AI influencer and it's still active. In summary, with this technique, you can create such a character and maintain its presence in various environments. By the way, I think 2024 will be the year of AI influencers. I can almost hear you saying, "Who cares about your opinion? Just send the goods." I'm starting right away.

I'm on the Pinokio page. As always, I will add all the links and prompts I use in the video description. I'm clicking the download button.
On the page that opens, you can download Pinokio according to the platform you are using. There are platform options available such as Windows, Mac (M1, M2, M3), Intel Mac, and Linux. Since I am using a computer with an M1 Apple silicon processor, I click on the M1, M2, M3 Mac link. On the page that opens, I click on the link "Click to download Pinokio for M1, M2, M3 Macs". It started downloading the latest version. The download is complete.

I double click on the file I downloaded to open it. In the window that opens, I first drag the Pinokio application and drop it onto the Applications folder. Then I right click on the patch.command file and click Open in the menu that appears. A warning appears saying the developer is not verified. I know it's not a problem. I press the Open button again. The Terminal application opens on my computer. Here it asks me to enter my computer's password. I will enter it, but I want to clarify one point here. In the terminal, the cursor does not move as you type.
It looks like you're not typing, but you actually are. I enter my password and press the Enter key on the keyboard. As you can see, the "process completed" notification appeared. Now I return to the Applications folder and double click on the Pinokio application to open it.

Pinokio has opened. From this page, I can choose where the Pinokio application's files will be stored on my computer and select a light or dark theme. It can stay like this. I press the Save button. Now my Pinokio application is ready. From here I click on the "Visit Discover Page" button or the Discover icon over there, and all the applications I can use within Pinokio have opened. There are many applications, like InvokeAI, StreamDiffusion, and DreamTalk. I explained in detail what we can do with many of them in a previous video. I'm leaving the link. You can watch it from there.

These applications don't have a direct connection with Pinokio. You can think of Pinokio as a player. In short, it lets you use whichever application is available on the counter.
All the applications within Pinokio are on GitHub. If you want, instead of using Pinokio, you can go to GitHub, download the files, download the necessary software and versions to run the application, run the software, and set up all the settings to use the artificial intelligence applications. Or, instead of all this, you can set up the Pinokio I showed you and have it do these processes for you.

I'm returning to the Discover page. The AI app is Fooocus. Yes, it's spelled with three O's. I clicked on it. On the page that opens, there's a link to the GitHub project. Again, as I mentioned earlier, those who want can also set it up from there. There's also information that Fooocus uses the Stable Diffusion infrastructure. There's also a link to the Twitter account of cocktailpeanut, the creator of Pinokio, which you can follow if you want. I'm clicking on the download button. As you can see on the page that opens, many software components that Fooocus needs to work, like Git, Zip, and Node.js, are missing.
Thanks to Pinokio, I don't have to deal with where these are located, where to download them from, or which version to download. I click on the Install button and start downloading all of them. The downloads are complete. I click on the OK button. Now we can move on to Fooocus.

Just now, when I clicked to install Fooocus, Pinokio helped me download the software necessary to use Fooocus. Now I will download Fooocus itself. I click on the download button. It downloaded. I click on it again. From the page that opens, I click on the Install link in the menu on the left. A warning came up under the "Installation required" heading that another piece of software is missing. I click on the Install button again. The installation of the missing brew software has also started. Completed. I click on the OK button. Now the source files for Fooocus have started downloading. These are quite large files. Please wait patiently for them to download. I must reiterate that you are downloading the entire AI model to your computer. I mentioned this before.
Download the model to your computer, open it in Pinokio, and start running it. Once you're sure it's working, you can turn off the internet and continue with visual production. So, in summary, you download it, but you only download it once. After this, you can create unlimited, free, royalty-free productions, as many as you want.

The download is complete. I click the Start button from the menu on the left. From the submenu that opens, I can start using the mode, or rather the model, that I want. Today, I will proceed with the realistic mode. I click on the realistic mode. I started downloading its models. The model is about 6.5 GB in size. I'm waiting for it to be downloaded as well. The download is complete. To be sure, I restarted Pinokio. I click the Start button from the menu on the left and select the realistic mode from the submenu that opens. I saw the message "App started successfully". The processes are complete.
If I want, I can click on the link here to open the Fooocus application in an external browser, or I can click the Web UI button from the menu on the left to have the Fooocus application run directly within Pinokio.

The most important feature of Fooocus, as I mentioned at the beginning, is close-up, or focus, which I guess is where the name comes from. It produces very, very realistic visuals. Right now, without touching any settings and using the default settings it opens with, when I paste a prompt into the box and produce any visual, you will see that it creates incredibly realistic visuals. I'm pasting my prompt right away: "A very beautiful Turkish girl fitness at the gym". I meant to say a beautiful Turkish girl doing fitness at the gym. Let's see what kind of visual will come out. By the way, you absolutely need to enter the prompts in English. It doesn't understand Turkish. You can use DeepL for quick translation. By the way, I don't want to seem sexist. Since the topic started with the Aitana AI influencer, I went with women.
Of course, my female viewers can also produce visuals of handsome men. As you can appreciate, it would look a bit odd for me to produce visuals of handsome men. Did I seem homophobic this time? I wonder. It's very difficult, you know. Anyway, I'm continuing. We will see how knowledgeable our AI model is about Turkish girls.

I'm pressing the Generate button and waiting. This waiting is related to the graphics card power of my computer, or rather your computer. Computers with powerful graphics cards are faster, while those with weaker graphics cards are slower. You can also watch it step by step. I speed it up in editing. Of course, I don't have such a computer. The first image is completed. I'm clicking on it and enlarging it. How is it? Not too bad. I guess it looks quite realistic. The hands turned out a bit problematic. I think the background and face came out nicely. The left arm looks a bit injured. I guess it happened while lifting weights.
Whatever happened. The second one is forming. The clothes changed. It drew a typical Turkish girl. The second one is also completed. This one looks nice, too. We'd have a hard time distinguishing whether it's a photo or AI generated. We know there's no such person on Earth.

Let's enter another prompt. This time, let's not make it a person, but a room in a house. I want to show you different things: "A 1960s style room with orange objects". I mean, I'm asking for a 1960s style room with orange objects. Let's see what will come out. I'm pressing the Generate button and waiting. It seems a nice room is coming. Let's see it complete. The first one is completed. I'm clicking on it to open. I think it turned out nice. It has the feel of the 1960s homes we see in movies. It's predominantly orange, just as I wanted. If you want to save it, you can click on the download icon here. I named it 01 and saved it in PNG format. The second one doesn't seem bad either. It looks like it won't have a television in it right now.
There will be a fireplace. It's done. This is another house interior. It even accounted for the reflection in the mirror. I'm saving this as 02.png.

Now, let's start examining the features of Fooocus. For this, I'm clicking on the Input Image checkbox. Details have opened up at the bottom. I have four options here. We will discuss all of them in order. First, let's look at the Upscale or Variation tab. By uploading an image to the "Drop image here" section, we can create slightly varied versions with Vary (Subtle), strongly varied versions with Vary (Strong), and enlarge them with Upscale. I should mention that the upscaler here does not enlarge the image by stretching it. It recreates the image using the uploaded one as a reference.

Let's do an example. I click on the "Drop image here" section. Let me pick the 1960s orange room image I just saved to my computer. What is its size? 1.3 MB, for example. Let's go check its dimensions in Finder. I right click on the image and select Get Info. It's 896 x 1152 pixels.
I go back to Pinokio and upload my image by clicking on "Drop image here". Then I click on Upscale (2x) from the options below. I press the Generate button and wait. Meanwhile, every time you try something different in Fooocus, it will download other models. Wait patiently for them to download. Since everyone uses it for different purposes, it doesn't download the entire package at once. That would be silly. The sizes would become quite large. If you're not going to upscale at all, why would you download its model?

Completed. It created the first one. I click on it and open it. It has become much higher quality compared to the previous one. Let me download it to my computer. I click on the download icon and save it as zero- qpng. I immediately check Finder. I right click on the first image and select Get Info. Its size was 1.3 megabytes with dimensions of 896 x 1152 pixels. I right click on the new one and select Get Info. Its size has become 4.7 megabytes. The dimensions have also changed to 1792 x 2304. It was this. Now it's this.
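
The size change reported above is plain arithmetic, and a minimal sketch makes it explicit. The helper name `upscaled_size` is hypothetical and is not part of Fooocus or Pinokio; it only reproduces the numbers read out in the video.

```python
# Hypothetical helper for illustration only; it is not Fooocus code.
def upscaled_size(width, height, factor):
    """Return the pixel dimensions after scaling both sides by `factor`."""
    return int(width * factor), int(height * factor)


original = (896, 1152)  # dimensions shown by Get Info for the first image
new = upscaled_size(original[0], original[1], 2)

# Doubling both sides quadruples the number of pixels.
pixel_ratio = (new[0] * new[1]) / (original[0] * original[1])

print(new)          # (1792, 2304)
print(pixel_ratio)  # 4.0
```

The file size grew from 1.3 MB to 4.7 MB rather than exactly fourfold, which is unsurprising: PNG is compressed, so file size does not track pixel count one to one.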
It's not just enlargement but reinterpretation. It improved sections. It corrected the distortion in this corner as well.

We will move on from the Input Image section and show the Advanced section, because the other tabs of Input Image are linked to the Advanced section. I click on the Advanced checkbox and the advanced settings open. In the Performance section, I have three options: Speed, Quality, and Extreme Speed. When the AI creates photos, it goes step by step, in stages. It draws something more at each stage. The more stages there are, the higher the quality. Think of it like sanding. When you sand it 30 times, it becomes shiny. The Speed option completes the visual in 30 steps. Extreme Speed completes it in eight steps. Quality completes it in 60 steps. You can proceed with whichever one you prefer depending on the power of your computer's graphics card. I would like to clarify this. The quality difference in the visual is actually a difference in how much the AI is used.
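
The three presets can be compared with a small sketch. The step counts are the ones stated above; the dictionary, the function name, and the assumption that generation time grows linearly with the number of steps are illustrative and are not taken from Fooocus itself.

```python
# Step counts as stated in the video; everything else here is illustrative.
PERFORMANCE_STEPS = {"Extreme Speed": 8, "Speed": 30, "Quality": 60}


def relative_cost(preset, baseline="Speed"):
    """Rough cost of a preset relative to the baseline.

    Assumes time scales linearly with the number of steps, which is a
    simplification and not something the video measures.
    """
    return PERFORMANCE_STEPS[preset] / PERFORMANCE_STEPS[baseline]


for name, steps in PERFORMANCE_STEPS.items():
    print(f"{name}: {steps} steps, about {relative_cost(name):.2f}x the Speed preset")
```

Under that assumption, Quality takes roughly twice as long as Speed, and Extreme Speed roughly a quarter as long.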
In other words, 60 steps means thinking 60 times. Eight steps means thinking eight times. As the steps increase, you give the AI more chances to think, but naturally the time also extends.

Under Performance, there are the Aspect Ratios, the visual dimensions. It supports all visual dimensions included in Stable Diffusion. Since we're here, I'm choosing 1024 x 1024. The Image Number determines the number of visuals it will generate each time I press the Generate button. By default, it comes as two. If you want, you can increase this number up to 32. I'm leaving it at two. The Random checkbox allows the seed value to be assigned randomly. If you uncheck it, you can enter the seed value manually.

The second tab is Styles. This is actually the strongest aspect of Fooocus. There are over a hundred styles, and you can select and use whichever you want. When you hover over these styles, a pop-up cat image appears. From these images, you can roughly understand what that style does. For example, Fooocus Cinematic is a cinematic cat image.
There are bokeh lights and such in the background. What else? Adorable 3D Character, meaning a cute three-dimensional character. The watercolor style, and others like it. All of these are styles that have undergone very extensive training. For example, the watercolor style: I'm not entirely sure, but I guess it completed its training by scanning hundreds of thousands of watercolor images. Or car advertisement images, for instance: it has been trained with thousands of car advertisement photos. To use them, simply click on them to activate.

The other tab is Model, which is the section we selected when opening Fooocus. We are currently working with the realistic one, but you can change it from here as well. In the Advanced tab there is a Guidance Scale. As you increase its value, the cleanliness, vibrancy, and artistry increase. It becomes more beautiful, yet nonsensical hallucinations appear. I generally try not to exceed a value of seven. But make sure to try different values yourself to gain experience.
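
As a rough intuition for why a higher Guidance Scale exaggerates the result, here is a minimal sketch of classifier-free guidance, the technique commonly used in Stable Diffusion pipelines. That Fooocus applies its Guidance Scale in exactly this way is an assumption; the video does not say so, and the function name and the toy numbers are hypothetical.

```python
# Illustrative sketch of classifier-free guidance; not taken from Fooocus.
def apply_guidance(unconditional, conditional, scale):
    """Move the prediction away from the prompt-free one, toward the prompt.

    With scale 1.0 the result equals the conditional prediction; larger
    values push further in the same direction.
    """
    return [u + scale * (c - u) for u, c in zip(unconditional, conditional)]


uncond = [0.0, 1.0]  # toy prediction made without the prompt
cond = [1.0, 3.0]    # toy prediction made with the prompt

print(apply_guidance(uncond, cond, 1.0))  # [1.0, 3.0]
print(apply_guidance(uncond, cond, 7.0))  # [7.0, 15.0]
```

The larger the scale, the further the output is pushed past what the prompt alone predicts, which fits the observation above that high values look more vivid but start to produce nonsensical details.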
I can't 426 00:14:57,279 --> 00:14:58,720 change it right now because we're in 427 00:14:58,720 --> 00:15:00,480 extreme speed mode. I'm going back to 428 00:15:00,480 --> 00:15:02,480 the settings. If I set it to speed, 429 00:15:02,480 --> 00:15:04,320 meaning if I take it to 30 steps, it 430 00:15:04,320 --> 00:15:06,160 will be activated. Image sharpness is 431 00:15:06,160 --> 00:15:07,839 already understood from its name. 432 00:15:07,839 --> 00:15:09,680 Increasing it enhances the sharpness. 433 00:15:09,680 --> 00:15:11,920 Now, let's create an image using styles 434 00:15:11,920 --> 00:15:15,120 from the styles menu. I'm choosing Van Gogh. 435 00:15:15,120 --> 00:15:17,680 I'm entering my prompt. An appetizing 436 00:15:17,680 --> 00:15:19,279 fruit platter of summer fruits on a 437 00:15:19,279 --> 00:15:21,600 wooden table. So, I want an appetizing 438 00:15:21,600 --> 00:15:23,680 fruit platter of summer fruits on a 439 00:15:23,680 --> 00:15:26,320 wooden table. Aspect ratio, meaning the 440 00:15:26,320 --> 00:15:29,240 dimensions. I choose 441 00:15:29,240 --> 00:15:31,920 1,344x74 pixels. I'm not changing the 442 00:15:31,920 --> 00:15:33,839 guidance scale. I press the generate 443 00:15:33,839 --> 00:15:35,560 button and it starts creating. 444 00:15:35,560 --> 00:15:38,399 Completed. Let's see. Very stylish. Like 445 00:15:38,399 --> 00:15:40,560 a Van Gogh painting. If Van Gogh saw how 446 00:15:40,560 --> 00:15:42,320 easily and quickly this was done, I 447 00:15:42,320 --> 00:15:43,760 think he would cut off another ear. 448 00:15:43,760 --> 00:15:45,519 Normally, we enter prompts in writing, 449 00:15:45,519 --> 00:15:47,199 but we can also enter them as a visual 450 00:15:47,199 --> 00:15:49,759 prompt in Fooocus. In fact, we can also 451 00:15:49,759 --> 00:15:51,440 control what Fooocus should do with these 452 00:15:51,440 --> 00:15:54,320 visual prompts. 
I activate the input 453 00:15:54,320 --> 00:15:56,720 image checkbox to be able to upload my 454 00:15:56,720 --> 00:15:59,440 image. Then I go to the image prompt in 455 00:15:59,440 --> 00:16:01,680 my second tab. I open the advanced 456 00:16:01,680 --> 00:16:03,360 checkbox located at the bottom of this 457 00:16:03,360 --> 00:16:05,920 tab so I can see all the features. Now I 458 00:16:05,920 --> 00:16:07,680 click on the first box and upload one of 459 00:16:07,680 --> 00:16:10,320 my own photos from my computer. I also 460 00:16:10,320 --> 00:16:12,560 open the advanced settings. I choose the 461 00:16:12,560 --> 00:16:16,240 image size as 1024x1024 pixels. I 462 00:16:16,240 --> 00:16:18,639 switch to the styles tab. I remove the 463 00:16:18,639 --> 00:16:21,040 default ones. I have the idea of 464 00:16:21,040 --> 00:16:23,920 robotizing my uploaded image. For this 465 00:16:23,920 --> 00:16:26,240 reason, I type robot into the search 466 00:16:26,240 --> 00:16:28,480 box. A futuristic cybernetic robot 467 00:16:28,480 --> 00:16:30,079 appeared and I select it. There's a 468 00:16:30,079 --> 00:16:32,240 futuristic biomedical cyberpunk option. 469 00:16:32,240 --> 00:16:34,320 I select that too. Now I've reached the 470 00:16:34,320 --> 00:16:35,920 place where I can adjust the impact of 471 00:16:35,920 --> 00:16:37,600 the uploaded image on the new image that 472 00:16:37,600 --> 00:16:39,360 will be created. First, there's the stop 473 00:16:39,360 --> 00:16:42,240 at option. Stop at. Look, I just 474 00:16:42,240 --> 00:16:44,800 had an epiphany. Stop at, stop it, stop 475 00:16:44,800 --> 00:16:47,680 the engine. Stop at comes from here. 476 00:16:47,680 --> 00:16:49,920 Look, I just figured it out. Stop. I 477 00:16:49,920 --> 00:16:51,519 mentioned that the image is created in 478 00:16:51,519 --> 00:16:54,560 steps. For example, speed is 30 steps. 479 00:16:54,560 --> 00:16:56,079 Stop at. 
It determines where to stop 480 00:16:56,079 --> 00:16:58,480 reading the image during these 30 steps, 481 00:16:58,480 --> 00:17:00,839 thus affecting the newly created 482 00:17:00,839 --> 00:17:04,559 image. For example, right now it's 0.5, 483 00:17:04,559 --> 00:17:06,880 meaning half. In speed mode, it 484 00:17:06,880 --> 00:17:08,319 won't look at this image after the 485 00:17:08,319 --> 00:17:10,000 first 15 steps. It will work 486 00:17:10,000 --> 00:17:12,400 independently. Or if I set it to one, it 487 00:17:12,400 --> 00:17:14,559 will continue to read the sample image 488 00:17:14,559 --> 00:17:17,280 throughout the entire production. On the 489 00:17:17,280 --> 00:17:19,839 right side is weight. This weight 490 00:17:19,839 --> 00:17:21,520 determines how much the new image will 491 00:17:21,520 --> 00:17:23,439 be influenced by the image I added. I'm 492 00:17:23,439 --> 00:17:26,559 coming to stop at. I set it to 0.8 493 00:17:26,559 --> 00:17:28,240 because I want the influence to continue 494 00:17:28,240 --> 00:17:30,160 for a long time. I'm not entering a 495 00:17:30,160 --> 00:17:32,240 written prompt and I press the generate 496 00:17:32,240 --> 00:17:33,919 button. It's completed. It was 497 00:17:33,919 --> 00:17:36,000 influenced by the blue t-shirt. Since 498 00:17:36,000 --> 00:17:38,720 stop at is high, it also resembled me. 499 00:17:38,720 --> 00:17:40,240 Since the weight is in the middle, it 500 00:17:40,240 --> 00:17:41,840 was able to change the background and 501 00:17:41,840 --> 00:17:43,440 such. We've reached the point I 502 00:17:43,440 --> 00:17:45,200 mentioned at the beginning of the video. 503 00:17:45,200 --> 00:17:47,440 Let's create a character. Let's create 504 00:17:47,440 --> 00:17:48,960 visuals based on the clothes this 505 00:17:48,960 --> 00:17:50,880 character wears, the place they go, and 506 00:17:50,880 --> 00:17:52,640 the time. I access the advanced 507 00:17:52,640 --> 00:17:55,360 settings. 
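The stop at arithmetic described above (0.5 of 30 steps means the reference image is read for the first 15 steps) can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not Fooocus's internal code, and both function names are hypothetical.

```python
def last_guided_step(stop_at: float, total_steps: int) -> int:
    """Last step (counting from 1) at which the reference image is still read."""
    if not 0.0 <= stop_at <= 1.0:
        raise ValueError("stop_at must be between 0 and 1")
    return round(stop_at * total_steps)


def reference_active(step: int, stop_at: float, total_steps: int) -> bool:
    """True while the reference image still influences the given step."""
    return step <= last_guided_step(stop_at, total_steps)


if __name__ == "__main__":
    total = 30  # speed mode, as mentioned in the video
    for stop_at in (0.5, 0.8, 1.0):
        print(stop_at, "-> guided through step", last_guided_step(stop_at, total))
```

With 30 steps, 0.5 ends the influence after step 15, 0.8 after step 24, and 1.0 keeps it until the last step.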
I ignore the input image. 508 00:17:55,360 --> 00:17:57,520 First, I select the quality. I will 509 00:17:57,520 --> 00:17:59,440 create my own character, so it should be 510 00:17:59,440 --> 00:18:01,120 high quality from the start. The 511 00:18:01,120 --> 00:18:03,520 existing styles are sufficient. I remove 512 00:18:03,520 --> 00:18:06,080 the negative and added masterpiece. Also 513 00:18:06,080 --> 00:18:07,679 reviewing the dimensions of the visual 514 00:18:07,679 --> 00:18:10,559 that will be created. It's 1:1, so it 515 00:18:10,559 --> 00:18:14,880 should be 1,024x 1,024 pixels. 516 00:18:14,880 --> 00:18:17,039 I increase the guidance scale a bit from 517 00:18:17,039 --> 00:18:19,679 the advanced tab. I want a more vibrant 518 00:18:19,679 --> 00:18:22,960 visual, so I paste my prompt. Close up 519 00:18:22,960 --> 00:18:25,039 portrait of a very beautiful brunette 520 00:18:25,039 --> 00:18:27,679 girl with blue eyes taken. Video in 521 00:18:27,679 --> 00:18:30,000 photo studio, meaning a close-up 522 00:18:30,000 --> 00:18:31,520 portrait of a very beautiful brunette 523 00:18:31,520 --> 00:18:33,360 girl with blue eyes taken in a photo 524 00:18:33,360 --> 00:18:35,600 studio. Desertia. Artificial 525 00:18:35,600 --> 00:18:38,400 intelligence continues. When producing 526 00:18:38,400 --> 00:18:40,720 visuals, it can't fully convert indoor 527 00:18:40,720 --> 00:18:42,640 lighting to outdoor lighting or vice 528 00:18:42,640 --> 00:18:45,360 versa. For this reason, it's important 529 00:18:45,360 --> 00:18:47,280 to decide on this when creating the 530 00:18:47,280 --> 00:18:49,400 initial prompt. Will it be indoors or 531 00:18:49,400 --> 00:18:51,520 outdoors? I'm pressing the generate 532 00:18:51,520 --> 00:18:53,280 button. Let's see what kind of result it 533 00:18:53,280 --> 00:18:55,120 will give. It's important for us to see 534 00:18:55,120 --> 00:18:56,960 the face of the visual clearly. 
This 535 00:18:56,960 --> 00:18:58,720 will be very important when creating new 536 00:18:58,720 --> 00:19:01,120 visuals. The first visual is completed. 537 00:19:01,120 --> 00:19:03,280 A beautiful girl was created. Looks 538 00:19:03,280 --> 00:19:04,960 real. It's almost impossible to 539 00:19:04,960 --> 00:19:07,039 distinguish this from a photograph. This 540 00:19:07,039 --> 00:19:09,280 is good for me. I'm downloading it to my 541 00:19:09,280 --> 00:19:11,679 computer. One of my predictions for 2024 542 00:19:11,679 --> 00:19:13,520 is artificial intelligence that can 543 00:19:13,520 --> 00:19:15,280 produce videos indistinguishable from 544 00:19:15,280 --> 00:19:17,200 reality, like the photo we just saw. 545 00:19:17,200 --> 00:19:19,200 Therefore, this technique will 546 00:19:19,200 --> 00:19:21,360 become even more important. Then, 547 00:19:21,360 --> 00:19:23,440 instead of just photos, we will produce 548 00:19:23,440 --> 00:19:25,200 videos with our character in different 549 00:19:25,200 --> 00:19:27,679 outfits and various locations. I'm 550 00:19:27,679 --> 00:19:29,440 activating the input image and coming to 551 00:19:29,440 --> 00:19:31,760 the image prompt section. I'm uploading 552 00:19:31,760 --> 00:19:34,480 the model image I just downloaded. I'm 553 00:19:34,480 --> 00:19:36,240 also opening the advanced option in the 554 00:19:36,240 --> 00:19:38,400 image prompt section. There are four 555 00:19:38,400 --> 00:19:40,160 options here. We've talked about the 556 00:19:40,160 --> 00:19:42,400 image prompt. There's also PyraCanny, 557 00:19:42,400 --> 00:19:45,760 CPDS, and FaceSwap. I'm choosing 558 00:19:45,760 --> 00:19:47,679 FaceSwap. We will discuss the remaining two 559 00:19:47,679 --> 00:19:50,640 as well. If you notice, stop at 560 00:19:50,640 --> 00:19:53,679 automatically became 0.9. 
By entering a 561 00:19:53,679 --> 00:19:55,600 prompt, it wants the image I'm going to 562 00:19:55,600 --> 00:19:57,679 create to take this image's face to the 563 00:19:57,679 --> 00:20:00,559 final step. I'm boosting it even more. 564 00:20:00,559 --> 00:20:02,480 I'm setting it to one. It doesn't stop 565 00:20:02,480 --> 00:20:04,480 influencing and is present in all steps 566 00:20:04,480 --> 00:20:06,960 until the end. I'm entering my prompt 567 00:20:06,960 --> 00:20:09,360 fitness at the gym. Pink hair. So, it 568 00:20:09,360 --> 00:20:11,039 will be a girl with pink hair working 569 00:20:11,039 --> 00:20:12,720 out. I'm setting the performance to 570 00:20:12,720 --> 00:20:14,480 speed. I click generate. Let's see how 571 00:20:14,480 --> 00:20:16,240 it turns out. The first one is 572 00:20:16,240 --> 00:20:18,400 completed. The image turned out nice. 573 00:20:18,400 --> 00:20:20,400 The fingers are a bit problematic. It 574 00:20:20,400 --> 00:20:21,600 would be better if she didn't wear 575 00:20:21,600 --> 00:20:23,440 earrings at the gym. But we got the 576 00:20:23,440 --> 00:20:25,360 image we added. I'm saving it to my 577 00:20:25,360 --> 00:20:27,360 computer by clicking the download icon. 578 00:20:27,360 --> 00:20:29,440 The second one is also completed. It 579 00:20:29,440 --> 00:20:30,960 looks better since there are no hands 580 00:20:30,960 --> 00:20:33,120 around. Let me try another image. I'm 581 00:20:33,120 --> 00:20:35,200 entering my prompt. A pose in an elegant 582 00:20:35,200 --> 00:20:39,120 red evening party dress, blonde hair. So 583 00:20:39,120 --> 00:20:41,280 I said a pose in an elegant red party 584 00:20:41,280 --> 00:20:44,480 dress, blonde hair. I'm pressing the 585 00:20:44,480 --> 00:20:47,799 generate button. First one done. Very 586 00:20:47,799 --> 00:20:50,080 stylish. There's a bokeh effect in the 587 00:20:50,080 --> 00:20:51,840 background. The girl looks quite pretty. 
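The workflow so far (one saved face image, stop at set to one, a different text prompt per picture) can be written down as plain data. The Python sketch below only organizes those settings. It does not call Fooocus, the file name `character.png` is a made-up placeholder, and `build_jobs` is a hypothetical helper.

```python
def build_jobs(face_image: str, prompts: list, stop_at: float = 1.0) -> list:
    """One job per prompt, all sharing the same reference face."""
    return [
        {
            "reference_image": face_image,  # the downloaded character portrait
            "reference_type": "FaceSwap",   # option chosen in the video
            "stop_at": stop_at,             # 1.0 keeps the face to the final step
            "prompt": prompt,
        }
        for prompt in prompts
    ]


if __name__ == "__main__":
    jobs = build_jobs(
        "character.png",  # hypothetical file name
        [
            "fitness at the gym, pink hair",
            "a pose in an elegant red evening party dress, blonde hair",
        ],
    )
    for job in jobs:
        print(job["prompt"], "| face from", job["reference_image"])
```

Only the prompt changes between jobs, which is why the same character shows up in every result.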
588 00:20:51,840 --> 00:20:53,760 The second one is also completed. This 589 00:20:53,760 --> 00:20:56,080 time Fooocus thought of a sitting pose. I 590 00:20:56,080 --> 00:20:58,080 think this one turned out nice, too. 591 00:20:58,080 --> 00:21:00,640 Let's try another example. I'm entering 592 00:21:00,640 --> 00:21:03,520 my prompt right away. A girl in a beret 593 00:21:03,520 --> 00:21:05,679 and coat walking down the street on a 594 00:21:05,679 --> 00:21:08,159 very cold snowy winter day. So, a girl 595 00:21:08,159 --> 00:21:09,919 wearing a beret and coat walking down 596 00:21:09,919 --> 00:21:11,760 the street on a very cold and snowy 597 00:21:11,760 --> 00:21:14,240 winter day. I'm clicking generate. Let's see what 598 00:21:14,240 --> 00:21:16,159 comes out. The first one is completed. I 599 00:21:16,159 --> 00:21:17,679 think this one turned out very nice, 600 00:21:17,679 --> 00:21:20,559 too. Coat, beret, a snowy street there. 601 00:21:20,559 --> 00:21:23,679 Second one done. Nice pose, too. Let's 602 00:21:23,679 --> 00:21:26,080 diversify a bit more. For example, we 603 00:21:26,080 --> 00:21:27,679 saw a photo and the character in that 604 00:21:27,679 --> 00:21:29,600 photo struck a pose. We want our 605 00:21:29,600 --> 00:21:31,919 character to strike the same pose. Let's 606 00:21:31,919 --> 00:21:33,919 see what we'll do. I have a pose here. 607 00:21:33,919 --> 00:21:36,400 The model has struck a pose like this. I 608 00:21:36,400 --> 00:21:38,480 want my character to do the same. I'm 609 00:21:38,480 --> 00:21:40,559 returning to Fooocus. Let's keep our face 610 00:21:40,559 --> 00:21:42,799 swap photo. I'm not changing it. This 611 00:21:42,799 --> 00:21:45,440 time I'm choosing PyraCanny. PyraCanny is for 612 00:21:45,440 --> 00:21:47,520 copying the movements. I'm going to the 613 00:21:47,520 --> 00:21:49,840 second photo box. I'm adding a stance 614 00:21:49,840 --> 00:21:52,480 pose here. I'm uploading my image. 
It 615 00:21:52,480 --> 00:21:54,320 will only take the character stance. I'm 616 00:21:54,320 --> 00:21:55,919 not touching the settings. I'm leaving 617 00:21:55,919 --> 00:21:57,600 it as default. I'm not entering a 618 00:21:57,600 --> 00:21:59,520 prompt. I can enter one if I want. just 619 00:21:59,520 --> 00:22:01,440 the visual dimensions. I'm choosing a 620 00:22:01,440 --> 00:22:03,039 vertical image. I click the generate 621 00:22:03,039 --> 00:22:04,880 button and it starts. It will take the 622 00:22:04,880 --> 00:22:06,480 face from the first image and the pose 623 00:22:06,480 --> 00:22:08,159 from the second image. Let's see what 624 00:22:08,159 --> 00:22:10,320 comes out. The first image is completed. 625 00:22:10,320 --> 00:22:12,240 Since I didn't enter a prompt, it did it 626 00:22:12,240 --> 00:22:14,400 on its own. It could take the face and 627 00:22:14,400 --> 00:22:16,240 pose as they are. There's a bit of 628 00:22:16,240 --> 00:22:18,640 distortion in the arm. The second pose 629 00:22:18,640 --> 00:22:20,640 is also completed. I think this one 630 00:22:20,640 --> 00:22:22,480 turned out better, but there's still a 631 00:22:22,480 --> 00:22:24,559 problem with the hand. Let me try 632 00:22:24,559 --> 00:22:27,200 entering a prompt. I said it on the 633 00:22:27,200 --> 00:22:29,679 beach in summer. I say generate. The 634 00:22:29,679 --> 00:22:31,360 first one is completed. I think it 635 00:22:31,360 --> 00:22:32,720 turned out much better when I entered a 636 00:22:32,720 --> 00:22:34,720 prompt. The eyes are a bit problematic. 637 00:22:34,720 --> 00:22:36,480 This became a good example. I will show 638 00:22:36,480 --> 00:22:38,480 how to fix these. The second one is 639 00:22:38,480 --> 00:22:40,559 completed. The face and eyes are nice. 640 00:22:40,559 --> 00:22:42,000 It just looks like there's a bit of a 641 00:22:42,000 --> 00:22:44,640 spinal issue. Anyway, we'll try it out. 
642 00:22:44,640 --> 00:22:46,559 We'll continue until we find the one we 643 00:22:46,559 --> 00:22:48,240 like the most. We've reached the last 644 00:22:48,240 --> 00:22:50,640 section within the image prompt. CPDS, 645 00:22:50,640 --> 00:22:52,320 which stands for contrast preserving 646 00:22:52,320 --> 00:22:54,240 decolorization structure. It's not 647 00:22:54,240 --> 00:22:56,320 something very important. Unfortunately, 648 00:22:56,320 --> 00:22:58,159 you are also exposed to my obsession 649 00:22:58,159 --> 00:23:00,080 with trying to do everything perfectly. 650 00:23:00,080 --> 00:23:01,760 I get this silly feeling that if I see 651 00:23:01,760 --> 00:23:03,440 it there, I should explain it. I should 652 00:23:03,440 --> 00:23:05,440 tell what it is. But there's nothing to 653 00:23:05,440 --> 00:23:07,840 be done. As I said, you are also exposed 654 00:23:07,840 --> 00:23:09,760 to this. In short, it extracts the 655 00:23:09,760 --> 00:23:11,600 contrast from the image you add and 656 00:23:11,600 --> 00:23:13,679 applies it to the newly created image. I 657 00:23:13,679 --> 00:23:16,480 check the CPDS check box. From the third 658 00:23:16,480 --> 00:23:19,679 photo edition box, I upload the same 659 00:23:19,679 --> 00:23:21,679 pose photo from my computer. Now, it 660 00:23:21,679 --> 00:23:23,440 will also take the contrast of this pose 661 00:23:23,440 --> 00:23:27,080 photo. I select the dimensions of 768x 662 00:23:27,080 --> 00:23:30,000 1,344 pixels from here. And again, I 663 00:23:30,000 --> 00:23:31,520 press the generate button without 664 00:23:31,520 --> 00:23:33,360 entering a prompt. Let's see what will 665 00:23:33,360 --> 00:23:35,440 happen. Completed. As you can see, the 666 00:23:35,440 --> 00:23:37,280 colors have become nicer because the 667 00:23:37,280 --> 00:23:39,120 other one is a real photo. This one was 668 00:23:39,120 --> 00:23:40,799 influenced by it. This is the other 669 00:23:40,799 --> 00:23:42,880 visual. 
I think both turned out very 670 00:23:42,880 --> 00:23:45,280 beautiful. Within Fooocus, you can modify 671 00:23:45,280 --> 00:23:47,760 visuals, expand them, and intervene in 672 00:23:47,760 --> 00:23:49,679 selected areas. 673 00:23:49,679 --> 00:23:51,360 I'm coming to the third tab within the 674 00:23:51,360 --> 00:23:54,039 input image, the Inpaint or Outpaint 675 00:23:54,039 --> 00:23:56,559 tab. Let's start with the first example. 676 00:23:56,559 --> 00:23:58,159 There was a snowy street visual I 677 00:23:58,159 --> 00:23:59,919 created. Let's take that for example. 678 00:23:59,919 --> 00:24:01,760 Drop image here. I click on the click to 679 00:24:01,760 --> 00:24:03,600 upload link and upload the visual from 680 00:24:03,600 --> 00:24:05,520 my computer. I can immediately say to 681 00:24:05,520 --> 00:24:07,039 extend this from the left and right 682 00:24:07,039 --> 00:24:08,880 under the outpaint direction heading. I 683 00:24:08,880 --> 00:24:12,400 check the left and right checkboxes. 684 00:24:12,400 --> 00:24:14,640 Then I just press the generate button. 685 00:24:14,640 --> 00:24:16,799 Let's see what happens. The first one is 686 00:24:16,799 --> 00:24:18,880 completed. As you can see, it expanded 687 00:24:18,880 --> 00:24:20,559 the visual beautifully. There's nothing 688 00:24:20,559 --> 00:24:22,799 disturbing at all. As you might guess, I 689 00:24:22,799 --> 00:24:24,559 can also expand by entering a written 690 00:24:24,559 --> 00:24:26,640 prompt. So, prompts I write can be added 691 00:24:26,640 --> 00:24:28,720 to the areas that will be expanded. 692 00:24:28,720 --> 00:24:30,960 Like, I don't know, add a tree or add a 693 00:24:30,960 --> 00:24:32,720 street lamp. Here, we can make small 694 00:24:32,720 --> 00:24:34,640 aesthetic adjustments with Fooocus. 695 00:24:34,640 --> 00:24:37,039 Painless and effortless. I'm in the 696 00:24:37,039 --> 00:24:38,880 Inpaint or Outpaint tab. 
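Extending an image to the left and right amounts to placing it on a wider canvas and letting the model fill the new strips. The Python sketch below shows only that canvas arithmetic. The 30% growth per side, the 1024x1024 example size, and the function name `outpaint_canvas` are assumptions for illustration, since the video does not say how much Fooocus adds on each side.

```python
def outpaint_canvas(width: int, height: int, directions: set, grow: float = 0.3):
    """Return (new_width, new_height, (x_offset, y_offset)) of the enlarged canvas.

    `grow` is the fraction of the original size added on each ticked side
    (an assumed value, not taken from Fooocus).
    """
    pad_x = int(width * grow)
    pad_y = int(height * grow)
    left = pad_x if "left" in directions else 0
    right = pad_x if "right" in directions else 0
    top = pad_y if "top" in directions else 0
    bottom = pad_y if "bottom" in directions else 0
    # The original image sits at (left, top) inside the new canvas;
    # everything outside it is the area the model has to invent.
    return width + left + right, height + top + bottom, (left, top)


if __name__ == "__main__":
    print(outpaint_canvas(1024, 1024, {"left", "right"}))
```

A written prompt such as "add a tree" would then apply only to the newly added strips, as described above.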
I immediately 697 00:24:38,880 --> 00:24:40,720 select improve detail from the method 698 00:24:40,720 --> 00:24:42,480 section. We had created an image with 699 00:24:42,480 --> 00:24:44,640 problematic eyes. If you remember, let's 700 00:24:44,640 --> 00:24:46,240 make an adjustment to it. I click on the 701 00:24:46,240 --> 00:24:48,000 click to upload link. I take our patient 702 00:24:48,000 --> 00:24:49,440 with the problematic eyes from my 703 00:24:49,440 --> 00:24:51,440 computer. I need to mark the problematic 704 00:24:51,440 --> 00:24:54,240 area here. From here I can see the 705 00:24:54,240 --> 00:24:57,200 shortcuts to use this canvas. I can also 706 00:24:57,200 --> 00:24:58,799 adjust the size of my brush from the 707 00:24:58,799 --> 00:25:01,039 right side. Now since the eyes are 708 00:25:01,039 --> 00:25:02,799 problematic, I will select them by 709 00:25:02,799 --> 00:25:05,039 painting over them. I hold down the 710 00:25:05,039 --> 00:25:07,600 shift key and scroll the mouse wheel. 711 00:25:07,600 --> 00:25:10,159 The image zooms in. I paint the eyes 712 00:25:10,159 --> 00:25:11,760 with the precision of a doctor. All 713 00:25:11,760 --> 00:25:14,159 right, I remove the zoom with the R key. 714 00:25:14,159 --> 00:25:16,159 There are also quick prompts here. It 715 00:25:16,159 --> 00:25:18,440 seems like the AI knows where it has 716 00:25:18,440 --> 00:25:20,799 issues. I click on the beautiful eyes 717 00:25:20,799 --> 00:25:23,840 link. It added it to the prompt section. 718 00:25:23,840 --> 00:25:26,159 Nice. I scroll up and press the generate 719 00:25:26,159 --> 00:25:28,240 button. Look, it zoomed in on the area I 720 00:25:28,240 --> 00:25:30,240 drew in my workspace and I can see the 721 00:25:30,240 --> 00:25:32,720 work there step by step or rather stage 722 00:25:32,720 --> 00:25:34,799 by stage. As you can see, it corrects 723 00:25:34,799 --> 00:25:37,279 the errors. The first one is completed. 
724 00:25:37,279 --> 00:25:38,720 The problems with the eyes have been 725 00:25:38,720 --> 00:25:40,799 largely resolved. So, what else can we 726 00:25:40,799 --> 00:25:43,360 do? We can add, remove, or transform 727 00:25:43,360 --> 00:25:45,039 something. I'm returning to the Inpaint 728 00:25:45,039 --> 00:25:47,919 or Outpaint tab. I erase the eyes I drew 729 00:25:47,919 --> 00:25:50,480 by pressing the eraser icon. I select 730 00:25:50,480 --> 00:25:52,799 modify content from the method section. 731 00:25:52,799 --> 00:25:54,559 I will try to change the bracelet on the 732 00:25:54,559 --> 00:25:57,520 arm. I zoom in on the image again. I 733 00:25:57,520 --> 00:25:59,600 select the bracelet here. In other 734 00:25:59,600 --> 00:26:02,240 words, I paint the bracelet. I completed 735 00:26:02,240 --> 00:26:04,720 it. I exit the zoom by pressing the R 736 00:26:04,720 --> 00:26:06,960 key on the keyboard. I come to the 737 00:26:06,960 --> 00:26:08,880 Inpaint Additional Prompt box and enter 738 00:26:08,880 --> 00:26:10,720 the red wristband prompt. I press the 739 00:26:10,720 --> 00:26:12,400 generate button. Let's see if the 740 00:26:12,400 --> 00:26:14,240 bracelet will change. The preview of the 741 00:26:14,240 --> 00:26:16,159 bracelet has appeared. It's creating it 742 00:26:16,159 --> 00:26:18,400 slowly. Only that area is visible 743 00:26:18,400 --> 00:26:20,400 because I selected it. It's completed. 744 00:26:20,400 --> 00:26:22,320 As you can see, the red wristband has 745 00:26:22,320 --> 00:26:24,000 appeared. It also matched with the 746 00:26:24,000 --> 00:26:26,159 shorts. From here on, you can do 747 00:26:26,159 --> 00:26:28,559 anything you want. For example, you can 748 00:26:28,559 --> 00:26:30,480 change the hair color to pink. We've 749 00:26:30,480 --> 00:26:32,320 reached the final section. 
In this 750 00:26:32,320 --> 00:26:34,080 section, you can add a photo or an 751 00:26:34,080 --> 00:26:36,159 anime image and have it convert it 752 00:26:36,159 --> 00:26:38,480 into a text prompt or in other words 753 00:26:38,480 --> 00:26:41,120 into text for you. It reads the image 754 00:26:41,120 --> 00:26:43,120 and converts what it understands into 755 00:26:43,120 --> 00:26:45,760 text for you. I'm in the describe tab. 756 00:26:45,760 --> 00:26:47,840 Here we can have it describe a photo or 757 00:26:47,840 --> 00:26:49,760 anime image that I input. I clicked on 758 00:26:49,760 --> 00:26:51,279 the click to upload link and added my 759 00:26:51,279 --> 00:26:53,520 model from my computer. Let's see how it 760 00:26:53,520 --> 00:26:55,039 will describe it. I'm clicking on the 761 00:26:55,039 --> 00:26:57,760 describe this image into prompt button. It 762 00:26:57,760 --> 00:26:59,679 will immediately describe the image in 763 00:26:59,679 --> 00:27:01,520 its own way and enter the resulting 764 00:27:01,520 --> 00:27:04,240 text into the prompt box above. Yes, 765 00:27:04,240 --> 00:27:06,880 it wrote in the generate box. A woman 766 00:27:06,880 --> 00:27:10,240 with a big earring next to blue eyes. 767 00:27:10,240 --> 00:27:12,240 So, a beautiful woman with a big earring 768 00:27:12,240 --> 00:27:14,080 next to blue eyes. Let's add another 769 00:27:14,080 --> 00:27:16,080 example. Let me add the model we used 770 00:27:16,080 --> 00:27:18,240 for this pose. I'm clicking on the click 771 00:27:18,240 --> 00:27:19,840 to upload link and uploading the image 772 00:27:19,840 --> 00:27:22,320 of the model I used for the pose. I 773 00:27:22,320 --> 00:27:23,919 clicked on the describe this image into 774 00:27:23,919 --> 00:27:26,799 prompt button. It responded, "A woman 775 00:27:26,799 --> 00:27:29,679 with glasses leans against her neck 776 00:27:29,679 --> 00:27:33,200 wearing a floral shirt." 
So it said, "A 777 00:27:33,200 --> 00:27:35,120 woman with glasses is leaning against 778 00:27:35,120 --> 00:27:38,000 her neck wearing a floral shirt." I'm 779 00:27:38,000 --> 00:27:40,640 pressing the generate button. Let's see 780 00:27:40,640 --> 00:27:43,520 what it will create. It didn't get it, so 781 00:27:43,520 --> 00:27:45,440 she's not leaning there. It's like she's 782 00:27:45,440 --> 00:27:47,279 placing her hand. There's a feminine 783 00:27:47,279 --> 00:27:49,600 pose. It couldn't figure that part out. 784 00:27:49,600 --> 00:27:52,400 Anyway, it created it. It didn't fully 785 00:27:52,400 --> 00:27:53,679 understand or couldn't create the 786 00:27:53,679 --> 00:27:55,520 visual. I don't know exactly about that 787 00:27:55,520 --> 00:27:57,360 part, but I think the visual turned out 788 00:27:57,360 --> 00:27:59,440 very realistic and beautiful. If you've 789 00:27:59,440 --> 00:28:00,960 made it this far in the video, I 790 00:28:00,960 --> 00:28:02,720 congratulate you. It's been a bit of a 791 00:28:02,720 --> 00:28:04,640 long video, but I think if I'm 792 00:28:04,640 --> 00:28:06,559 explaining something, I should touch on 793 00:28:06,559 --> 00:28:08,240 everything. I'm still dragging it out 794 00:28:08,240 --> 00:28:10,159 and can't stop. If you have any 795 00:28:10,159 --> 00:28:11,600 questions, you can write them as 796 00:28:11,600 --> 00:28:13,520 comments under the video. I'm leaving my 797 00:28:13,520 --> 00:28:15,360 social media accounts here. You can 798 00:28:15,360 --> 00:28:18,240 follow me from there and ask questions. 799 00:28:18,240 --> 00:28:20,399 That's it for this week. See you in the 800 00:28:20,399 --> 00:28:24,120 next video. Goodbye.