Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,240 --> 00:00:01,839
With many artificial intelligence
2
00:00:01,839 --> 00:00:03,760
applications, we are all now producing
3
00:00:03,760 --> 00:00:06,400
images. At least we are trying. I also
4
00:00:06,400 --> 00:00:08,240
use many of them for both my own work
5
00:00:08,240 --> 00:00:10,320
and experiments. Each has its own
6
00:00:10,320 --> 00:00:12,559
strengths and weaknesses. In this video,
7
00:00:12,559 --> 00:00:14,000
I will talk about an artificial
8
00:00:14,000 --> 00:00:15,759
intelligence application that produces
9
00:00:15,759 --> 00:00:18,080
very successful, very realistic images
10
00:00:18,080 --> 00:00:19,439
that can address several important
11
00:00:19,439 --> 00:00:21,920
issues at once. The main issue is the
12
00:00:21,920 --> 00:00:24,160
successful AI image production tools are
13
00:00:24,160 --> 00:00:27,359
paid. They're right because producing
14
00:00:27,359 --> 00:00:29,439
anything with artificial intelligence
15
00:00:29,439 --> 00:00:31,519
requires significant computing power.
16
00:00:31,519 --> 00:00:32,960
There's no point in me agreeing with
17
00:00:32,960 --> 00:00:34,640
them. After all, these are all
18
00:00:34,640 --> 00:00:36,399
commercial ventures and naturally they
19
00:00:36,399 --> 00:00:38,320
want to make money. The easiest way to
20
00:00:38,320 --> 00:00:40,000
do these operations without spending
21
00:00:40,000 --> 00:00:41,920
money is to place our own computer at
22
00:00:41,920 --> 00:00:44,000
the point we call that computer. In
23
00:00:44,000 --> 00:00:45,840
other words, to leave all the load on
24
00:00:45,840 --> 00:00:48,160
our own computer. This leads us to open
25
00:00:48,160 --> 00:00:49,920
source software. We don't need to pay
26
00:00:49,920 --> 00:00:52,320
anything for these. Completely free.
27
00:00:52,320 --> 00:00:54,079
They are there for development and
28
00:00:54,079 --> 00:00:56,079
continue to be developed. After reaching
29
00:00:56,079 --> 00:00:58,160
a certain point, companies that wish can
30
00:00:58,160 --> 00:01:00,480
switch to a paid version. In fact,
31
00:01:00,480 --> 00:01:02,800
OpenAI has such a story. When they first
32
00:01:02,800 --> 00:01:05,199
started, as the name suggests, OpenAI
33
00:01:05,199 --> 00:01:07,520
began with this open software. Then they
34
00:01:07,520 --> 00:01:09,760
evolved into a company extremely secure
35
00:01:09,760 --> 00:01:11,520
open software. I have been using them
36
00:01:11,520 --> 00:01:13,360
for a long time. I haven't seen any
37
00:01:13,360 --> 00:01:15,360
problems. Since it's open, if there were
38
00:01:15,360 --> 00:01:17,520
any virus, no one would install it or
39
00:01:17,520 --> 00:01:19,200
those who know would immediately see it
40
00:01:19,200 --> 00:01:21,360
and request its removal. These types of
41
00:01:21,360 --> 00:01:23,200
open software are generally hosted on
42
00:01:23,200 --> 00:01:25,439
GitHub. In the past, installing these
43
00:01:25,439 --> 00:01:27,680
open software on our computers required
44
00:01:27,680 --> 00:01:29,680
some technical knowledge. Now with the
45
00:01:29,680 --> 00:01:31,680
Pinocchio software, that's no longer
46
00:01:31,680 --> 00:01:33,680
necessary. You can handle everything
47
00:01:33,680 --> 00:01:35,840
very easily. By the way, the biggest
48
00:01:35,840 --> 00:01:37,920
advantage of generating visuals on your
49
00:01:37,920 --> 00:01:39,520
own computer is that you don't face any
50
00:01:39,520 --> 00:01:41,600
copyright issues. The secondary solver
51
00:01:41,600 --> 00:01:43,200
of this software is character
52
00:01:43,200 --> 00:01:44,720
continuity. It's one of the most
53
00:01:44,720 --> 00:01:46,560
frequently asked questions I receive.
54
00:01:46,560 --> 00:01:48,560
So, how do we ensure the continuity of a
55
00:01:48,560 --> 00:01:50,720
character we've created? We created a
56
00:01:50,720 --> 00:01:52,799
female character. It also solves how to
57
00:01:52,799 --> 00:01:54,159
place this character in different
58
00:01:54,159 --> 00:01:56,399
settings like in Ulude during winter
59
00:01:56,399 --> 00:01:58,880
when it's snowing on the beach in summer
60
00:01:58,880 --> 00:02:02,079
at the cinema or in a studio. By the
61
00:02:02,079 --> 00:02:03,920
way, I couldn't help but wonder if this
62
00:02:03,920 --> 00:02:05,759
topic is related to a news article I
63
00:02:05,759 --> 00:02:08,560
read recently. Someone created an AI
64
00:02:08,560 --> 00:02:11,080
influencer named Itana and it's earning
65
00:02:11,080 --> 00:02:13,599
$11,000 a month. It's a figure that
66
00:02:13,599 --> 00:02:15,680
boggles the mind. I also found the
67
00:02:15,680 --> 00:02:17,599
Instagram account of the AI influencer
68
00:02:17,599 --> 00:02:19,840
and it's still active. In summary, with
69
00:02:19,840 --> 00:02:21,599
this technique, you can create such a
70
00:02:21,599 --> 00:02:23,680
character and maintain its presence in
71
00:02:23,680 --> 00:02:26,080
various environments. By the way, I
72
00:02:26,080 --> 00:02:28,400
think 2024 will be the year of AI
73
00:02:28,400 --> 00:02:30,160
influencers. I can almost hear you
74
00:02:30,160 --> 00:02:31,920
saying, "Who cares about your opinion?
75
00:02:31,920 --> 00:02:34,000
Just send the goods." I'm starting right
76
00:02:34,000 --> 00:02:35,879
away. I'm on the
77
00:02:35,879 --> 00:02:38,319
Pinocchio. As always, I will add all the
78
00:02:38,319 --> 00:02:40,239
links and prompts I use in the video
79
00:02:40,239 --> 00:02:41,840
description. I'm clicking the download
80
00:02:41,840 --> 00:02:43,840
button. On the page that opens, you can
81
00:02:43,840 --> 00:02:45,440
download Pinocchio according to the
82
00:02:45,440 --> 00:02:47,040
platform you are using. There are
83
00:02:47,040 --> 00:02:48,800
platform options available such as
84
00:02:48,800 --> 00:02:52,720
Windows, Mac, M1, M2, M3, Intel Mac, and
85
00:02:52,720 --> 00:02:54,800
Linux. Since I am using a computer with
86
00:02:54,800 --> 00:02:57,040
an M1 Apple silicon processor, I click
87
00:02:57,040 --> 00:03:00,160
on the M1, M2, M3 Mac link. On the page
88
00:03:00,160 --> 00:03:02,080
that opens, I click on the link click to
89
00:03:02,080 --> 00:03:06,800
download Pinocchio for M1, M2, M3 Max.
90
00:03:06,800 --> 00:03:08,319
It started downloading the latest
91
00:03:08,319 --> 00:03:10,239
version. The download is complete. I
92
00:03:10,239 --> 00:03:11,920
double click on the file I downloaded to
93
00:03:11,920 --> 00:03:13,760
open it. In the window that opens, I
94
00:03:13,760 --> 00:03:15,840
first drag the Pinocchio application and
95
00:03:15,840 --> 00:03:18,159
drop it onto the applications folder.
96
00:03:18,159 --> 00:03:20,000
Then I right click on the patch command
97
00:03:20,000 --> 00:03:21,920
file and click on open link from the
98
00:03:21,920 --> 00:03:24,080
menu that appears. A warning appears
99
00:03:24,080 --> 00:03:26,159
saying the developer is not verified. I
100
00:03:26,159 --> 00:03:28,159
know it's not a problem. I press the
101
00:03:28,159 --> 00:03:30,080
open button again. The terminal
102
00:03:30,080 --> 00:03:32,239
application opens on my computer. Here
103
00:03:32,239 --> 00:03:33,840
it asks me to enter my computer's
104
00:03:33,840 --> 00:03:35,680
password. I will enter it, but I want to
105
00:03:35,680 --> 00:03:37,760
clarify one point here. In the terminal,
106
00:03:37,760 --> 00:03:40,560
the cursor does not move as you type. It
107
00:03:40,560 --> 00:03:42,239
looks like you're not typing, but you
108
00:03:42,239 --> 00:03:45,120
actually are. I enter my password and
109
00:03:45,120 --> 00:03:46,879
press the enter key on the keyboard. As
110
00:03:46,879 --> 00:03:48,799
you can see, the process completed.
111
00:03:48,799 --> 00:03:50,879
Notification appeared. Now I return to
112
00:03:50,879 --> 00:03:52,799
the application folder and double click
113
00:03:52,799 --> 00:03:55,519
on the Pinocchio application to open it.
114
00:03:55,519 --> 00:03:58,319
Pinocchio has opened. From this page, I
115
00:03:58,319 --> 00:03:59,599
can choose where the Pinocchio
116
00:03:59,599 --> 00:04:01,760
applications files will be stored on my
117
00:04:01,760 --> 00:04:04,159
computer and select a light or dark
118
00:04:04,159 --> 00:04:06,400
theme. It can stay like this. I press
119
00:04:06,400 --> 00:04:08,560
the save button. Now my Pinocchio
120
00:04:08,560 --> 00:04:10,799
application is ready. From here I click
121
00:04:10,799 --> 00:04:12,720
on the visit discover page button or the
122
00:04:12,720 --> 00:04:14,560
discover icon over there and all the
123
00:04:14,560 --> 00:04:16,560
applications I can use within Pinocchio
124
00:04:16,560 --> 00:04:18,320
have opened. There are many applications
125
00:04:18,320 --> 00:04:21,199
like invokeai, stream diffusion, dream
126
00:04:21,199 --> 00:04:23,520
talk. I explained in detail what we can
127
00:04:23,520 --> 00:04:25,040
do with many of them in a previous
128
00:04:25,040 --> 00:04:27,280
video. I'm leaving the link. You can
129
00:04:27,280 --> 00:04:29,280
watch it from there. These applications
130
00:04:29,280 --> 00:04:30,639
don't have a direct connection with
131
00:04:30,639 --> 00:04:32,720
Pinocchio. You can think of Pinocchio as
132
00:04:32,720 --> 00:04:34,560
a player. In short, it lets you use
133
00:04:34,560 --> 00:04:36,320
whichever application is available on
134
00:04:36,320 --> 00:04:38,080
the counter. All the applications within
135
00:04:38,080 --> 00:04:40,160
Pinocchio are on GitHub if you want
136
00:04:40,160 --> 00:04:42,000
instead of using Pinocchio. You can go
137
00:04:42,000 --> 00:04:44,400
to GitHub, download the files, download
138
00:04:44,400 --> 00:04:46,400
the necessary software and versions to
139
00:04:46,400 --> 00:04:48,240
run the application, run the software,
140
00:04:48,240 --> 00:04:49,919
and set up all the settings to use the
141
00:04:49,919 --> 00:04:52,479
artificial intelligence applications. Or
142
00:04:52,479 --> 00:04:54,240
instead of all this, you can set up the
143
00:04:54,240 --> 00:04:56,160
Pinocchio I showed you and have it do
144
00:04:56,160 --> 00:04:58,320
these processes for you. I'm returning
145
00:04:58,320 --> 00:05:01,560
to the discover page. The AI app is
146
00:05:01,560 --> 00:05:04,800
Focus. Yes, it's spelled with three O's.
147
00:05:04,800 --> 00:05:07,120
I clicked on it. On the page that opens,
148
00:05:07,120 --> 00:05:08,639
there's a link to the GitHub project.
149
00:05:08,639 --> 00:05:10,720
Again, as I mentioned earlier, those who
150
00:05:10,720 --> 00:05:12,479
want can also set it up from here.
151
00:05:12,479 --> 00:05:14,400
There's also information that Focus uses
152
00:05:14,400 --> 00:05:16,320
the stable diffusion infrastructure.
153
00:05:16,320 --> 00:05:17,840
There's also a link to the Twitter
154
00:05:17,840 --> 00:05:19,520
account of Cocktail Peanut, the creator
155
00:05:19,520 --> 00:05:21,280
of Pinocchio, which you can follow if
156
00:05:21,280 --> 00:05:23,320
you want. I'm clicking on the download
157
00:05:23,320 --> 00:05:25,680
button. As you can see on the page that
158
00:05:25,680 --> 00:05:28,240
opens, many software components like
159
00:05:28,240 --> 00:05:31,120
Git, Zip, Node, JavaScript are missing
160
00:05:31,120 --> 00:05:33,360
for Fukus to work. Thanks to Pinocchio,
161
00:05:33,360 --> 00:05:34,800
I don't have to deal with where these
162
00:05:34,800 --> 00:05:37,039
are located, where to download them from
163
00:05:37,039 --> 00:05:38,680
or which version to
164
00:05:38,680 --> 00:05:41,120
download. I click on the install button
165
00:05:41,120 --> 00:05:42,880
and start downloading all of them. The
166
00:05:42,880 --> 00:05:44,960
downloads are complete. I click on the
167
00:05:44,960 --> 00:05:46,919
okay button. Now we can move on to
168
00:05:46,919 --> 00:05:49,680
Fukus. Just now when I click to install
169
00:05:49,680 --> 00:05:51,840
Fukus, Pinocchio helped me download the
170
00:05:51,840 --> 00:05:54,160
necessary software to use Fukus. Now I
171
00:05:54,160 --> 00:05:55,919
will download Fukus. I click on the
172
00:05:55,919 --> 00:05:58,080
download button. It downloaded. I click
173
00:05:58,080 --> 00:06:00,240
on it again. From the page that opens, I
174
00:06:00,240 --> 00:06:02,000
click on the install link. From the menu
175
00:06:02,000 --> 00:06:03,840
on the left, a warning came up under the
176
00:06:03,840 --> 00:06:05,280
installation required heading that
177
00:06:05,280 --> 00:06:07,039
another software is missing. I click on
178
00:06:07,039 --> 00:06:08,800
the install button again. The
179
00:06:08,800 --> 00:06:10,160
installation of the missing brew
180
00:06:10,160 --> 00:06:13,120
software has also started completed. I
181
00:06:13,120 --> 00:06:15,199
click on the okay button. Now the source
182
00:06:15,199 --> 00:06:17,560
files for focus have started
183
00:06:17,560 --> 00:06:19,520
downloading. These are quite large
184
00:06:19,520 --> 00:06:21,600
files. Please wait patiently for them to
185
00:06:21,600 --> 00:06:23,440
download. I must reiterate that you are
186
00:06:23,440 --> 00:06:25,880
downloading the entire AI model to your
187
00:06:25,880 --> 00:06:28,160
computer. I mentioned this before.
188
00:06:28,160 --> 00:06:29,759
Download the model to your computer,
189
00:06:29,759 --> 00:06:31,680
open it in Pinocchio, and start running
190
00:06:31,680 --> 00:06:33,600
it. If you're sure it's working, you can
191
00:06:33,600 --> 00:06:35,280
turn off the internet and continue with
192
00:06:35,280 --> 00:06:37,520
visual production. So, in summary, you
193
00:06:37,520 --> 00:06:39,280
download it, but you only download it
194
00:06:39,280 --> 00:06:41,600
once. After this, you can create
195
00:06:41,600 --> 00:06:43,919
unlimited, free, royalty-free
196
00:06:43,919 --> 00:06:46,479
productions as much as you want. The
197
00:06:46,479 --> 00:06:48,240
download is complete. I click the start
198
00:06:48,240 --> 00:06:50,160
button from the menu on the left. From
199
00:06:50,160 --> 00:06:52,319
the open submen, I can start using the
200
00:06:52,319 --> 00:06:54,639
mode or rather the model that I want.
201
00:06:54,639 --> 00:06:56,639
Today, I will proceed with the realistic
202
00:06:56,639 --> 00:06:58,960
mode. I click on the realistic mode. I
203
00:06:58,960 --> 00:07:00,800
started downloading its models. The
204
00:07:00,800 --> 00:07:04,080
model is about 6 and 1/2 GB in size. I'm
205
00:07:04,080 --> 00:07:05,919
waiting for it to be downloaded as well.
206
00:07:05,919 --> 00:07:08,240
The download is complete. To be sure, I
207
00:07:08,240 --> 00:07:10,560
restarted Pinocchio again. I click the
208
00:07:10,560 --> 00:07:12,319
start button from the menu on the left
209
00:07:12,319 --> 00:07:14,160
and select the realistic mode from the
210
00:07:14,160 --> 00:07:16,560
opened submen. I saw the message app
211
00:07:16,560 --> 00:07:19,039
started successfully. The processes are
212
00:07:19,039 --> 00:07:21,599
complete. If I want, I can click on the
213
00:07:21,599 --> 00:07:24,000
link here to open the focus application
214
00:07:24,000 --> 00:07:26,240
in an external browser or I can click
215
00:07:26,240 --> 00:07:28,319
the web UI button from the menu on the
216
00:07:28,319 --> 00:07:30,880
left to have the focus application run
217
00:07:30,880 --> 00:07:32,840
directly within
218
00:07:32,840 --> 00:07:35,199
Pinocchio. The most important feature of
219
00:07:35,199 --> 00:07:37,199
focus as I mentioned at the beginning is
220
00:07:37,199 --> 00:07:39,120
close-up or focus which I guess is where
221
00:07:39,120 --> 00:07:41,440
the name comes from. It produces very
222
00:07:41,440 --> 00:07:43,759
very realistic visuals. Right now,
223
00:07:43,759 --> 00:07:45,680
without touching any settings and using
224
00:07:45,680 --> 00:07:47,360
the default settings it opens with, when
225
00:07:47,360 --> 00:07:49,039
I paste a prompt into the box and
226
00:07:49,039 --> 00:07:50,960
produce any visual, you will see that it
227
00:07:50,960 --> 00:07:53,120
creates incredibly realistic visuals.
228
00:07:53,120 --> 00:07:55,680
I'm pasting my prompt right away. A very
229
00:07:55,680 --> 00:07:58,400
beautiful Turkish girl fitness at the
230
00:07:58,400 --> 00:08:00,800
gym. I meant to say a beautiful Turkish
231
00:08:00,800 --> 00:08:03,199
girl doing fitness at the gym. Let's see
232
00:08:03,199 --> 00:08:05,120
what kind of visual will come out. By
233
00:08:05,120 --> 00:08:07,120
the way, you absolutely need to enter
234
00:08:07,120 --> 00:08:08,720
the prompts in English. It doesn't
235
00:08:08,720 --> 00:08:10,800
understand Turkish. You can use deep l
236
00:08:10,800 --> 00:08:12,960
for quick translation. By the way, I
237
00:08:12,960 --> 00:08:14,879
don't want the same sexes. Since the
238
00:08:14,879 --> 00:08:17,199
topic started with itana AI influencer,
239
00:08:17,199 --> 00:08:20,080
I went with women. Of course, my female
240
00:08:20,080 --> 00:08:21,840
viewers can also produce visuals of
241
00:08:21,840 --> 00:08:23,840
handsome men. As you can appreciate, it
242
00:08:23,840 --> 00:08:25,440
would look a bit odd for me to produce
243
00:08:25,440 --> 00:08:27,280
visuals of handsome men. Did I seem
244
00:08:27,280 --> 00:08:29,520
homophobic this time? I wonder. It's
245
00:08:29,520 --> 00:08:32,200
very difficult, you know. Anyway, I'm
246
00:08:32,200 --> 00:08:33,919
continuing. We will see how
247
00:08:33,919 --> 00:08:35,599
knowledgeable our AI model is about
248
00:08:35,599 --> 00:08:37,599
Turkish girls. I'm pressing the generate
249
00:08:37,599 --> 00:08:39,599
button and waiting. This waiting is
250
00:08:39,599 --> 00:08:41,440
related to the graphics card power of my
251
00:08:41,440 --> 00:08:43,440
computer or rather your computer.
252
00:08:43,440 --> 00:08:45,279
Computers with powerful graphics cards
253
00:08:45,279 --> 00:08:47,200
are faster while those with weaker
254
00:08:47,200 --> 00:08:50,000
graphics cards are slower. You can also
255
00:08:50,000 --> 00:08:52,640
see it step by step. I speed it up in
256
00:08:52,640 --> 00:08:54,560
editing. Of course, I don't have such a
257
00:08:54,560 --> 00:08:56,640
computer. The first image is completed.
258
00:08:56,640 --> 00:08:58,640
I'm clicking on it and enlarging it. How
259
00:08:58,640 --> 00:09:01,120
is it? Not too bad. I guess it looks
260
00:09:01,120 --> 00:09:03,040
quite realistic. The hands turned out a
261
00:09:03,040 --> 00:09:04,959
bit problematic. I think the background
262
00:09:04,959 --> 00:09:07,040
and face came out nicely. The left arm
263
00:09:07,040 --> 00:09:09,040
looks a bit injured. I guess it happened
264
00:09:09,040 --> 00:09:10,560
while lifting weights. Whatever
265
00:09:10,560 --> 00:09:12,880
happened, the second one forms, changes
266
00:09:12,880 --> 00:09:14,720
clothes, it drew a model of a typical
267
00:09:14,720 --> 00:09:16,640
Turkish girl. The second one is also
268
00:09:16,640 --> 00:09:18,720
completed. This one looks nice, too.
269
00:09:18,720 --> 00:09:20,240
We'd have a hard time distinguishing
270
00:09:20,240 --> 00:09:23,040
whether it's a photo or AI generated. We
271
00:09:23,040 --> 00:09:24,959
know there's no such person on Earth.
272
00:09:24,959 --> 00:09:26,880
Let's enter another prompt. This time,
273
00:09:26,880 --> 00:09:28,720
let's not make it a person, but a room
274
00:09:28,720 --> 00:09:30,640
or room in a house. I want to show you
275
00:09:30,640 --> 00:09:32,959
different things. A 1960s style room
276
00:09:32,959 --> 00:09:35,760
with orange objects. I mean, I'm saying
277
00:09:35,760 --> 00:09:38,880
a 1960s style room with orange objects.
278
00:09:38,880 --> 00:09:40,240
Let's see what will come out. I'm
279
00:09:40,240 --> 00:09:41,519
pressing the generate button and
280
00:09:41,519 --> 00:09:43,440
waiting. It seems a nice room is coming.
281
00:09:43,440 --> 00:09:45,200
Let's see it complete. The first one is
282
00:09:45,200 --> 00:09:47,200
completed. I'm clicking on it to open. I
283
00:09:47,200 --> 00:09:48,800
think it turned out nice. It has the
284
00:09:48,800 --> 00:09:51,360
feel of 1960s homes we see in movies.
285
00:09:51,360 --> 00:09:53,200
It's predominantly orange, just as I
286
00:09:53,200 --> 00:09:54,800
wanted. If you want to save it, you can
287
00:09:54,800 --> 00:09:56,560
click on the download icon here to save
288
00:09:56,560 --> 00:09:59,200
it. I named it 01 and saved it in PNG
289
00:09:59,200 --> 00:10:01,120
format. The second one doesn't seem bad
290
00:10:01,120 --> 00:10:02,560
either. It looks like it won't have a
291
00:10:02,560 --> 00:10:04,240
television in it right now. There will
292
00:10:04,240 --> 00:10:07,279
be a fireplace. It's done. This is
293
00:10:07,279 --> 00:10:08,880
another house interior. So, they
294
00:10:08,880 --> 00:10:10,480
considered the reflection of the mirror.
295
00:10:10,480 --> 00:10:13,200
I'm saving this as 02png. Now, let's
296
00:10:13,200 --> 00:10:15,040
start examining the features of focus.
297
00:10:15,040 --> 00:10:16,560
For this, I'm clicking on the input
298
00:10:16,560 --> 00:10:18,800
image checkbox. Details have opened up
299
00:10:18,800 --> 00:10:21,200
at the bottom. I have four options here.
300
00:10:21,200 --> 00:10:22,959
We will discuss all of them in order.
301
00:10:22,959 --> 00:10:25,040
First, let's look at the upscale or
302
00:10:25,040 --> 00:10:27,760
variation tab. By uploading an image to
303
00:10:27,760 --> 00:10:29,839
the drop image here section, we can
304
00:10:29,839 --> 00:10:31,760
create slightly varied versions with
305
00:10:31,760 --> 00:10:34,240
various strongly varied versions with
306
00:10:34,240 --> 00:10:36,920
very strong and enlarge them with
307
00:10:36,920 --> 00:10:38,880
upscaler. I should mention that the
308
00:10:38,880 --> 00:10:40,959
upscaler here does not enlarge the image
309
00:10:40,959 --> 00:10:43,279
by stretching it. It recreates the image
310
00:10:43,279 --> 00:10:45,519
using the uploaded one as a reference.
311
00:10:45,519 --> 00:10:47,680
Let's do an example. I click on the drop
312
00:10:47,680 --> 00:10:49,440
image here section. Let me enter the
313
00:10:49,440 --> 00:10:52,000
1960s orange room image I just created
314
00:10:52,000 --> 00:10:54,240
for my computer. What is its size? for
315
00:10:54,240 --> 00:10:57,440
example, 1.3 MGB. Let's go check its
316
00:10:57,440 --> 00:10:59,200
dimensions in Finder. I right click on
317
00:10:59,200 --> 00:11:03,000
the image and select get info. It's 896x
318
00:11:03,000 --> 00:11:06,640
1,152 pixels. I go back to Pinocchio and
319
00:11:06,640 --> 00:11:08,560
upload my image by clicking on drop
320
00:11:08,560 --> 00:11:10,720
image here. Then I click on upscale to
321
00:11:10,720 --> 00:11:13,519
2x from the options below. I press the
322
00:11:13,519 --> 00:11:16,079
generate button and wait. Meanwhile,
323
00:11:16,079 --> 00:11:17,519
every time you try something different
324
00:11:17,519 --> 00:11:19,680
on focus, it will download other models.
325
00:11:19,680 --> 00:11:21,600
Wait patiently for it to download. Since
326
00:11:21,600 --> 00:11:23,440
everyone uses it for different purposes,
327
00:11:23,440 --> 00:11:25,120
it doesn't download the entire package
328
00:11:25,120 --> 00:11:27,360
at once. That would be silly. The sizes
329
00:11:27,360 --> 00:11:29,200
would become quite large. If you're not
330
00:11:29,200 --> 00:11:30,959
going to upscale at all, why would you
331
00:11:30,959 --> 00:11:33,519
download its model completed? It created
332
00:11:33,519 --> 00:11:35,360
the first one. I click on it and open
333
00:11:35,360 --> 00:11:37,040
it. It has become much higher quality
334
00:11:37,040 --> 00:11:39,200
compared to the previous one. Let me
335
00:11:39,200 --> 00:11:41,040
download it to my computer. I click on
336
00:11:41,040 --> 00:11:43,640
the download icon and save it as zero-
337
00:11:43,640 --> 00:11:46,399
qpng. I immediately check Finder. I
338
00:11:46,399 --> 00:11:47,920
right click on the first image and
339
00:11:47,920 --> 00:11:50,720
select get info. Its size was 1.3
340
00:11:50,720 --> 00:11:54,399
megabytes with dimensions of 896 x 1152
341
00:11:54,399 --> 00:11:56,240
pixels. I right click on the new one and
342
00:11:56,240 --> 00:11:58,959
select get info. Its size has become 4 7
343
00:11:58,959 --> 00:12:00,800
megabytes. The dimensions have also
344
00:12:00,800 --> 00:12:04,880
changed to 1792. By 2304 it was this.
345
00:12:04,880 --> 00:12:07,519
Now it's this. It's not just enlargement
346
00:12:07,519 --> 00:12:09,839
but reinterpretation. It improved
347
00:12:09,839 --> 00:12:12,240
sections. It corrected the distortion in
348
00:12:12,240 --> 00:12:14,320
this corner as well. We will continue
349
00:12:14,320 --> 00:12:16,240
from the input image section and show
350
00:12:16,240 --> 00:12:18,160
the advanced section because the other
351
00:12:18,160 --> 00:12:19,760
tabs of the input image are linked to
352
00:12:19,760 --> 00:12:21,440
the advanced section. I click on the
353
00:12:21,440 --> 00:12:23,200
advanced checkbox and the advanced
354
00:12:23,200 --> 00:12:25,040
settings open the performance section.
355
00:12:25,040 --> 00:12:27,600
Here I have three options. Speed,
356
00:12:27,600 --> 00:12:29,839
quality, extreme speed. In other words,
357
00:12:29,839 --> 00:12:32,560
when the AI creates photos, it goes step
358
00:12:32,560 --> 00:12:35,440
by step like in stages. It draws
359
00:12:35,440 --> 00:12:37,440
something more at each stage. The more
360
00:12:37,440 --> 00:12:39,200
stages there are, the higher the
361
00:12:39,200 --> 00:12:41,440
quality. Think of it like sanding. When
362
00:12:41,440 --> 00:12:44,000
you sand it 30 times, it becomes shiny.
363
00:12:44,000 --> 00:12:46,000
The speed option completes the visual in
364
00:12:46,000 --> 00:12:48,480
30 steps. Extreme speed completes it in
365
00:12:48,480 --> 00:12:50,959
eight steps. Quality completes it in 60
366
00:12:50,959 --> 00:12:52,880
steps. You can proceed with whichever
367
00:12:52,880 --> 00:12:54,800
one you prefer depending on the power of
368
00:12:54,800 --> 00:12:56,800
your computer's graphics card. I would
369
00:12:56,800 --> 00:12:58,800
like to clarify this. The quality
370
00:12:58,800 --> 00:13:00,720
difference related to this visual is
371
00:13:00,720 --> 00:13:02,720
actually a difference in AI usage. In
372
00:13:02,720 --> 00:13:05,360
other words, 60 steps mean thinking 60
373
00:13:05,360 --> 00:13:07,360
times. Eight steps mean thinking eight
374
00:13:07,360 --> 00:13:09,600
times. As the steps increase, you give
375
00:13:09,600 --> 00:13:11,440
the AI more chances to think, but
376
00:13:11,440 --> 00:13:14,160
naturally the time also extends. Under
377
00:13:14,160 --> 00:13:16,160
performance, there are aspect ratio
378
00:13:16,160 --> 00:13:18,720
visual dimensions. It supports all
379
00:13:18,720 --> 00:13:20,560
visual dimensions included in stable
380
00:13:20,560 --> 00:13:22,160
diffusion. Since we're here, I'm
381
00:13:22,160 --> 00:13:25,440
choosing 1024x 1024. The image number
382
00:13:25,440 --> 00:13:27,279
determines the number of visuals it will
383
00:13:27,279 --> 00:13:29,040
generate each time I press the generate
384
00:13:29,040 --> 00:13:31,839
button. By default, it comes as two. If
385
00:13:31,839 --> 00:13:33,360
you want, you can increase this number
386
00:13:33,360 --> 00:13:36,000
up to 32. I'm leaving it at two. The
387
00:13:36,000 --> 00:13:38,560
random checkbox allows the seat value to
388
00:13:38,560 --> 00:13:41,279
be assigned randomly. If you uncheck it,
389
00:13:41,279 --> 00:13:43,279
you can enter the seat value manually.
390
00:13:43,279 --> 00:13:46,000
The second tab is styles. This is
391
00:13:46,000 --> 00:13:48,160
actually the strongest aspect of focus.
392
00:13:48,160 --> 00:13:50,079
There are over a 100 styles you can
393
00:13:50,079 --> 00:13:52,000
select and use whichever you want. When
394
00:13:52,000 --> 00:13:54,160
you hover over these styles, a pop-up
395
00:13:54,160 --> 00:13:57,120
cat image appears. From these images,
396
00:13:57,120 --> 00:13:58,639
you can roughly understand what that
397
00:13:58,639 --> 00:14:01,040
style does. For example, focus cinematic
398
00:14:01,040 --> 00:14:02,880
is a cinematic cat image. There are
399
00:14:02,880 --> 00:14:04,880
bokeh lights and such in the background.
400
00:14:04,880 --> 00:14:07,360
What else? Adorable 3D character,
401
00:14:07,360 --> 00:14:08,720
meaning a cute three-dimensional
402
00:14:08,720 --> 00:14:10,959
character. The watercolor style, those
403
00:14:10,959 --> 00:14:13,199
like this. All of these are styles that
404
00:14:13,199 --> 00:14:15,600
have undergone very extensive training.
405
00:14:15,600 --> 00:14:17,920
For example, the watercolor style. I'm
406
00:14:17,920 --> 00:14:19,600
not entirely sure, but I guess it
407
00:14:19,600 --> 00:14:21,120
completed its training by scanning
408
00:14:21,120 --> 00:14:22,720
hundreds of thousands of watercolor
409
00:14:22,720 --> 00:14:24,959
images. Car advertisement images, for
410
00:14:24,959 --> 00:14:26,720
instance. It has been trained with
411
00:14:26,720 --> 00:14:29,040
thousands of car advertisement photos.
412
00:14:29,040 --> 00:14:30,959
To use them, simply click on them to
413
00:14:30,959 --> 00:14:32,639
activate. The other tab is the model
414
00:14:32,639 --> 00:14:34,160
which is the section we selected when
415
00:14:34,160 --> 00:14:35,920
opening focus. We are currently working
416
00:14:35,920 --> 00:14:37,839
with a realistic image but you can
417
00:14:37,839 --> 00:14:39,920
change it from here as well. In the
418
00:14:39,920 --> 00:14:42,800
advanced tab there is a gagen scale. As
419
00:14:42,800 --> 00:14:44,959
you increase its value the cleanliness,
420
00:14:44,959 --> 00:14:47,440
vibrancy and artistry increase. It
421
00:14:47,440 --> 00:14:49,040
becomes more beautiful yet there are
422
00:14:49,040 --> 00:14:51,360
nonsensical hallucinations. I generally
423
00:14:51,360 --> 00:14:53,680
try not to exceed a value of seven. But
424
00:14:53,680 --> 00:14:55,040
make sure to try different values
425
00:14:55,040 --> 00:14:57,279
yourself to gain experience. I can't
426
00:14:57,279 --> 00:14:58,720
change it right now because we're in
427
00:14:58,720 --> 00:15:00,480
extreme speed mode. I'm going back to
428
00:15:00,480 --> 00:15:02,480
the settings. If I set it to speed,
429
00:15:02,480 --> 00:15:04,320
meaning if I take it to 30 steps, it
430
00:15:04,320 --> 00:15:06,160
will be activated. Image sharpness is
431
00:15:06,160 --> 00:15:07,839
already understood from its name.
432
00:15:07,839 --> 00:15:09,680
Increasing it enhances the sharpness.
433
00:15:09,680 --> 00:15:11,920
Now, let's create an image using styles
434
00:15:11,920 --> 00:15:15,120
from the styles MK. I'm choosing Fango.
435
00:15:15,120 --> 00:15:17,680
I'm entering my prompt. An appetizing
436
00:15:17,680 --> 00:15:19,279
fruit platter of summer fruits on a
437
00:15:19,279 --> 00:15:21,600
wooden table. So, I want an appetizing
438
00:15:21,600 --> 00:15:23,680
fruit platter of summer fruits on a
439
00:15:23,680 --> 00:15:26,320
wooden table. Aspect ratio, meaning the
440
00:15:26,320 --> 00:15:29,240
dimensions. I choose
441
00:15:29,240 --> 00:15:31,920
1,344x74 pixels. I'm not changing the
442
00:15:31,920 --> 00:15:33,839
guidance scale. I press the generate
443
00:15:33,839 --> 00:15:35,560
button and it starts creating.
444
00:15:35,560 --> 00:15:38,399
Completed. Let's see. Very stylish. Like
445
00:15:38,399 --> 00:15:40,560
a Fanggo painting. If Fango saw how
446
00:15:40,560 --> 00:15:42,320
easily and quickly this was done, I
447
00:15:42,320 --> 00:15:43,760
think he would cut off another ear.
448
00:15:43,760 --> 00:15:45,519
Normally, we enter prompts in writing,
449
00:15:45,519 --> 00:15:47,199
but we can enter them with a visual
450
00:15:47,199 --> 00:15:49,759
prompt in Focus. In fact, we can also
451
00:15:49,759 --> 00:15:51,440
control what Fukus should do with these
452
00:15:51,440 --> 00:15:54,320
visual prompts. I activate the input
453
00:15:54,320 --> 00:15:56,720
image checkbox to be able to upload my
454
00:15:56,720 --> 00:15:59,440
image. Then I go to the image prompt in
455
00:15:59,440 --> 00:16:01,680
my second tab. I open the advanced
456
00:16:01,680 --> 00:16:03,360
checkbox located at the bottom of this
457
00:16:03,360 --> 00:16:05,920
tab so I can see all the features. Now I
458
00:16:05,920 --> 00:16:07,680
click on the first box and upload one of
459
00:16:07,680 --> 00:16:10,320
my own photos from my computer. I also
460
00:16:10,320 --> 00:16:12,560
open the advanced settings. I choose the
461
00:16:12,560 --> 00:16:16,240
image size as 1024x 1024 pixels. I
462
00:16:16,240 --> 00:16:18,639
switch to the styles tab. I remove the
463
00:16:18,639 --> 00:16:21,040
default ones. I have the idea of
464
00:16:21,040 --> 00:16:23,920
robotizing my uploaded image. For this
465
00:16:23,920 --> 00:16:26,240
reason, I type robot into the search
466
00:16:26,240 --> 00:16:28,480
box. A futuristic cybernetic robot
467
00:16:28,480 --> 00:16:30,079
appeared and I select it. There's a
468
00:16:30,079 --> 00:16:32,240
futuristic biomedical cyberpunk option.
469
00:16:32,240 --> 00:16:34,320
I select that too. Now I've reached the
470
00:16:34,320 --> 00:16:35,920
place where I can adjust the impact of
471
00:16:35,920 --> 00:16:37,600
the uploaded image on the new image that
472
00:16:37,600 --> 00:16:39,360
will be created. First, there's the stop
473
00:16:39,360 --> 00:16:42,240
it option. Stop at Look, this is I just
474
00:16:42,240 --> 00:16:44,800
had an epiphany. Stop at Stop it. Stop
475
00:16:44,800 --> 00:16:47,680
the engine. Stop at it comes from here.
476
00:16:47,680 --> 00:16:49,920
Look, I just figured it out. Stop. I
477
00:16:49,920 --> 00:16:51,519
mentioned that the image is created in
478
00:16:51,519 --> 00:16:54,560
steps. For example, speed is 30 steps.
479
00:16:54,560 --> 00:16:56,079
Stop. It determines where to stop
480
00:16:56,079 --> 00:16:58,480
reading the image during these 30 steps,
481
00:16:58,480 --> 00:17:00,839
thus affecting the newly created
482
00:17:00,839 --> 00:17:04,559
image. For example, right now it's 0.5,
483
00:17:04,559 --> 00:17:06,880
meaning half. If I set speed to one, it
484
00:17:06,880 --> 00:17:08,319
won't look at this image. After the
485
00:17:08,319 --> 00:17:10,000
first 15 steps, it will work
486
00:17:10,000 --> 00:17:12,400
independently. Or if I set it to one, it
487
00:17:12,400 --> 00:17:14,559
will continue to read the sample image
488
00:17:14,559 --> 00:17:17,280
throughout the entire production. on the
489
00:17:17,280 --> 00:17:19,839
right side is weight. This weight
490
00:17:19,839 --> 00:17:21,520
determines how much the new image will
491
00:17:21,520 --> 00:17:23,439
be influenced by the image I added. I'm
492
00:17:23,439 --> 00:17:26,559
coming to stop at. I set it to 0.8
493
00:17:26,559 --> 00:17:28,240
because I want the influence to continue
494
00:17:28,240 --> 00:17:30,160
for a long time. I'm not entering a
495
00:17:30,160 --> 00:17:32,240
written prompt and I press the generate
496
00:17:32,240 --> 00:17:33,919
button. It's completed. It was
497
00:17:33,919 --> 00:17:36,000
influenced by the blue t-shirt. Since
498
00:17:36,000 --> 00:17:38,720
stop at is high, it also resembled me.
499
00:17:38,720 --> 00:17:40,240
Since the weight is in the middle, it
500
00:17:40,240 --> 00:17:41,840
was able to change the background and
501
00:17:41,840 --> 00:17:43,440
such. We've reached the point I
502
00:17:43,440 --> 00:17:45,200
mentioned at the beginning of the video.
503
00:17:45,200 --> 00:17:47,440
Let's create a character. Let's create
504
00:17:47,440 --> 00:17:48,960
visuals based on the clothes this
505
00:17:48,960 --> 00:17:50,880
character wears, the place they go, and
506
00:17:50,880 --> 00:17:52,640
the time. I access the advanced
507
00:17:52,640 --> 00:17:55,360
settings. I ignore the input image.
508
00:17:55,360 --> 00:17:57,520
First, I select the quality. I will
509
00:17:57,520 --> 00:17:59,440
create my own character, so it should be
510
00:17:59,440 --> 00:18:01,120
high quality from the start. The
511
00:18:01,120 --> 00:18:03,520
existing styles are sufficient. I remove
512
00:18:03,520 --> 00:18:06,080
the negative and added masterpiece. Also
513
00:18:06,080 --> 00:18:07,679
reviewing the dimensions of the visual
514
00:18:07,679 --> 00:18:10,559
that will be created. It's 1:1, so it
515
00:18:10,559 --> 00:18:14,880
should be 1,024x 1,024 pixels.
516
00:18:14,880 --> 00:18:17,039
I increase the guidance scale a bit from
517
00:18:17,039 --> 00:18:19,679
the advanced tab. I want a more vibrant
518
00:18:19,679 --> 00:18:22,960
visual, so I paste my prompt. Close up
519
00:18:22,960 --> 00:18:25,039
portrait of a very beautiful brunette
520
00:18:25,039 --> 00:18:27,679
girl with blue eyes taken. Video in
521
00:18:27,679 --> 00:18:30,000
photo studio, meaning a close-up
522
00:18:30,000 --> 00:18:31,520
portrait of a very beautiful brunette
523
00:18:31,520 --> 00:18:33,360
girl with blue eyes taken in a photo
524
00:18:33,360 --> 00:18:35,600
studio. Desertia. Artificial
525
00:18:35,600 --> 00:18:38,400
intelligence continues. When producing
526
00:18:38,400 --> 00:18:40,720
visuals, it can't fully convert indoor
527
00:18:40,720 --> 00:18:42,640
lighting to outdoor lighting or vice
528
00:18:42,640 --> 00:18:45,360
versa. For this reason, it's important
529
00:18:45,360 --> 00:18:47,280
to decide on this when creating the
530
00:18:47,280 --> 00:18:49,400
initial prompt. Will it be indoors or
531
00:18:49,400 --> 00:18:51,520
outdoors? I'm pressing the generate
532
00:18:51,520 --> 00:18:53,280
button. Let's see what kind of result it
533
00:18:53,280 --> 00:18:55,120
will give. It's important for us to see
534
00:18:55,120 --> 00:18:56,960
the face of the visual clearly. This
535
00:18:56,960 --> 00:18:58,720
will be very important when creating new
536
00:18:58,720 --> 00:19:01,120
visuals. The first visual is completed.
537
00:19:01,120 --> 00:19:03,280
A beautiful girl was created. Looks
538
00:19:03,280 --> 00:19:04,960
real. It's almost impossible to
539
00:19:04,960 --> 00:19:07,039
distinguish this from a photograph. This
540
00:19:07,039 --> 00:19:09,280
is good for me. I'm downloading it to my
541
00:19:09,280 --> 00:19:11,679
computer. One of my predictions for 2024
542
00:19:11,679 --> 00:19:13,520
is artificial intelligence that can
543
00:19:13,520 --> 00:19:15,280
produce videos indistinguishable from
544
00:19:15,280 --> 00:19:17,200
reality like the one we just saw in the
545
00:19:17,200 --> 00:19:19,200
photo. Therefore, this technique will
546
00:19:19,200 --> 00:19:21,360
become even more important. Then,
547
00:19:21,360 --> 00:19:23,440
instead of just photos, we will produce
548
00:19:23,440 --> 00:19:25,200
videos with our character in different
549
00:19:25,200 --> 00:19:27,679
outfits and various locations. I'm
550
00:19:27,679 --> 00:19:29,440
activating the input image and coming to
551
00:19:29,440 --> 00:19:31,760
the image prompt section. I'm uploading
552
00:19:31,760 --> 00:19:34,480
the model image I just downloaded. I'm
553
00:19:34,480 --> 00:19:36,240
also opening the advanced option in the
554
00:19:36,240 --> 00:19:38,400
image prompt section. There are four
555
00:19:38,400 --> 00:19:40,160
options here. We've talked about the
556
00:19:40,160 --> 00:19:42,400
image prompt. There's also Pyrokini,
557
00:19:42,400 --> 00:19:45,760
CPDS, and phase swap. I'm choosing face
558
00:19:45,760 --> 00:19:47,679
swap. We will discuss the remaining two
559
00:19:47,679 --> 00:19:50,640
as well. If you notice, stop at
560
00:19:50,640 --> 00:19:53,679
automatically became 0.9. By entering a
561
00:19:53,679 --> 00:19:55,600
prompt, it wants the image I'm going to
562
00:19:55,600 --> 00:19:57,679
create to take this image's face to the
563
00:19:57,679 --> 00:20:00,559
final step. I'm boosting it even more.
564
00:20:00,559 --> 00:20:02,480
I'm setting it to one. It doesn't stop
565
00:20:02,480 --> 00:20:04,480
influencing and is present in all steps
566
00:20:04,480 --> 00:20:06,960
until the end. I'm entering my prompt
567
00:20:06,960 --> 00:20:09,360
fitness at the gym. Pink hair. So, it
568
00:20:09,360 --> 00:20:11,039
will be a girl with pink hair working
569
00:20:11,039 --> 00:20:12,720
out. I'm setting the performance to
570
00:20:12,720 --> 00:20:14,480
speed. I click generate. Let's see how
571
00:20:14,480 --> 00:20:16,240
it turns out. The first one is
572
00:20:16,240 --> 00:20:18,400
completed. The image turned out nice.
573
00:20:18,400 --> 00:20:20,400
The fingers are a bit problematic. It
574
00:20:20,400 --> 00:20:21,600
would be better if she didn't wear
575
00:20:21,600 --> 00:20:23,440
earrings at the gym. But we got the
576
00:20:23,440 --> 00:20:25,360
image we added. I'm saving it to my
577
00:20:25,360 --> 00:20:27,360
computer by clicking the download icon.
578
00:20:27,360 --> 00:20:29,440
The second one is also completed. It
579
00:20:29,440 --> 00:20:30,960
looks better since there are no hands
580
00:20:30,960 --> 00:20:33,120
around. Let me try another image. I'm
581
00:20:33,120 --> 00:20:35,200
entering my prompt. A pose in an elegant
582
00:20:35,200 --> 00:20:39,120
red evening party dress, blonde hair. So
583
00:20:39,120 --> 00:20:41,280
I said a pose in an elegant red party
584
00:20:41,280 --> 00:20:44,480
dress, blonde hair. I'm pressing the
585
00:20:44,480 --> 00:20:47,799
generate button. First one done. Very
586
00:20:47,799 --> 00:20:50,080
stylish. There's a bokeh effect in the
587
00:20:50,080 --> 00:20:51,840
background. The girl looks quite pretty.
588
00:20:51,840 --> 00:20:53,760
The second one is also completed. This
589
00:20:53,760 --> 00:20:56,080
time focus thought of a sitting pose. I
590
00:20:56,080 --> 00:20:58,080
think this one turned out nice, too.
591
00:20:58,080 --> 00:21:00,640
Let's try another example. I'm entering
592
00:21:00,640 --> 00:21:03,520
my prompt right away. A girl in a beret
593
00:21:03,520 --> 00:21:05,679
and coat walking down the street on a
594
00:21:05,679 --> 00:21:08,159
very cold snowy winter day. So, a girl
595
00:21:08,159 --> 00:21:09,919
wearing a beret and coat walking down
596
00:21:09,919 --> 00:21:11,760
the street on a very cold and snowy
597
00:21:11,760 --> 00:21:14,240
winter day. I'm saying generate see what
598
00:21:14,240 --> 00:21:16,159
comes out the first one is completed. I
599
00:21:16,159 --> 00:21:17,679
think this one turned out very nice,
600
00:21:17,679 --> 00:21:20,559
too. Coat, beret, a snowy street there.
601
00:21:20,559 --> 00:21:23,679
Second one done. Nice pose, too. Let's
602
00:21:23,679 --> 00:21:26,080
diversify a bit more. For example, we
603
00:21:26,080 --> 00:21:27,679
saw a photo and the character in that
604
00:21:27,679 --> 00:21:29,600
photo struck a pose. We want our
605
00:21:29,600 --> 00:21:31,919
character to strike the same pose. Let's
606
00:21:31,919 --> 00:21:33,919
see what we'll do. I have a pose here.
607
00:21:33,919 --> 00:21:36,400
The model has struck a pose like this. I
608
00:21:36,400 --> 00:21:38,480
want my character to do the same. I'm
609
00:21:38,480 --> 00:21:40,559
turning to focus. Let's keep our face
610
00:21:40,559 --> 00:21:42,799
swap photo. I'm not changing it. This
611
00:21:42,799 --> 00:21:45,440
time I'm choosing Pyraini. Pyroini is
612
00:21:45,440 --> 00:21:47,520
copying the movements. I'm going to the
613
00:21:47,520 --> 00:21:49,840
second photo box. I'm adding a stance
614
00:21:49,840 --> 00:21:52,480
pose here. I'm uploading my image. It
615
00:21:52,480 --> 00:21:54,320
will only take the character stance. I'm
616
00:21:54,320 --> 00:21:55,919
not touching the settings. I'm leaving
617
00:21:55,919 --> 00:21:57,600
it as default. I'm not entering a
618
00:21:57,600 --> 00:21:59,520
prompt. I can enter one if I want. just
619
00:21:59,520 --> 00:22:01,440
the visual dimensions. I'm choosing a
620
00:22:01,440 --> 00:22:03,039
vertical image. I click the generate
621
00:22:03,039 --> 00:22:04,880
button and it starts. It will take the
622
00:22:04,880 --> 00:22:06,480
face from the first image and the pose
623
00:22:06,480 --> 00:22:08,159
from the second image. Let's see what
624
00:22:08,159 --> 00:22:10,320
comes out. The first image is completed.
625
00:22:10,320 --> 00:22:12,240
Since I didn't enter a prompt, it did it
626
00:22:12,240 --> 00:22:14,400
on its own. It could take the face and
627
00:22:14,400 --> 00:22:16,240
pose as they are. There's a bit of
628
00:22:16,240 --> 00:22:18,640
distortion in the arm. The second pose
629
00:22:18,640 --> 00:22:20,640
is also completed. I think this one
630
00:22:20,640 --> 00:22:22,480
turned out better, but there's still a
631
00:22:22,480 --> 00:22:24,559
problem with the hand. Let me try
632
00:22:24,559 --> 00:22:27,200
entering a prompt. I said it on the
633
00:22:27,200 --> 00:22:29,679
beach in summer. I say generate. The
634
00:22:29,679 --> 00:22:31,360
first one is completed. I think it
635
00:22:31,360 --> 00:22:32,720
turned out much better when I entered a
636
00:22:32,720 --> 00:22:34,720
prompt. The eyes are a bit problematic.
637
00:22:34,720 --> 00:22:36,480
This became a good example. I will show
638
00:22:36,480 --> 00:22:38,480
how to fix these. The second one is
639
00:22:38,480 --> 00:22:40,559
completed. The face and eyes are nice.
640
00:22:40,559 --> 00:22:42,000
It just looks like there's a bit of a
641
00:22:42,000 --> 00:22:44,640
spinal issue. Anyway, we'll try it out.
642
00:22:44,640 --> 00:22:46,559
We'll continue until we find the one we
643
00:22:46,559 --> 00:22:48,240
like the most. We've reached the last
644
00:22:48,240 --> 00:22:50,640
section within the image prompt. CPDS,
645
00:22:50,640 --> 00:22:52,320
which stands for contrast preserving
646
00:22:52,320 --> 00:22:54,240
decolorization structure. It's not
647
00:22:54,240 --> 00:22:56,320
something very important. Unfortunately,
648
00:22:56,320 --> 00:22:58,159
you are also exposed to my obsession
649
00:22:58,159 --> 00:23:00,080
with trying to do everything perfectly.
650
00:23:00,080 --> 00:23:01,760
I get this silly feeling that if I see
651
00:23:01,760 --> 00:23:03,440
it there, I should explain it. I should
652
00:23:03,440 --> 00:23:05,440
tell what it is. But there's nothing to
653
00:23:05,440 --> 00:23:07,840
be done. As I said, you are also exposed
654
00:23:07,840 --> 00:23:09,760
to this. In short, it extracts the
655
00:23:09,760 --> 00:23:11,600
contrast from the image you add and
656
00:23:11,600 --> 00:23:13,679
applies it to the newly created image. I
657
00:23:13,679 --> 00:23:16,480
check the CPDS check box. From the third
658
00:23:16,480 --> 00:23:19,679
photo edition box, I upload the same
659
00:23:19,679 --> 00:23:21,679
pose photo from my computer. Now, it
660
00:23:21,679 --> 00:23:23,440
will also take the contrast of this pose
661
00:23:23,440 --> 00:23:27,080
photo. I select the dimensions of 768x
662
00:23:27,080 --> 00:23:30,000
1,344 pixels from here. And again, I
663
00:23:30,000 --> 00:23:31,520
press the generate button without
664
00:23:31,520 --> 00:23:33,360
entering a prompt. Let's see what will
665
00:23:33,360 --> 00:23:35,440
happen. Completed. As you can see, the
666
00:23:35,440 --> 00:23:37,280
colors have become nicer because the
667
00:23:37,280 --> 00:23:39,120
other one is a real photo. This one was
668
00:23:39,120 --> 00:23:40,799
influenced by it. This is the other
669
00:23:40,799 --> 00:23:42,880
visual. I think both turned out very
670
00:23:42,880 --> 00:23:45,280
beautiful. Within focus, you can modify
671
00:23:45,280 --> 00:23:47,760
visuals, expand them, and intervene in
672
00:23:47,760 --> 00:23:49,679
selected areas.
673
00:23:49,679 --> 00:23:51,360
I'm coming to the third tab within the
674
00:23:51,360 --> 00:23:54,039
input image the unpaint or I'll paint
675
00:23:54,039 --> 00:23:56,559
tab. Let's start with the first example.
676
00:23:56,559 --> 00:23:58,159
There was a snowy street visual I
677
00:23:58,159 --> 00:23:59,919
created. Let's take that for example.
678
00:23:59,919 --> 00:24:01,760
Drop image here. I click on the click to
679
00:24:01,760 --> 00:24:03,600
upload link and upload the visual from
680
00:24:03,600 --> 00:24:05,520
my computer. I can immediately say to
681
00:24:05,520 --> 00:24:07,039
extend this from the left and right
682
00:24:07,039 --> 00:24:08,880
under the outpaint direction heading. I
683
00:24:08,880 --> 00:24:12,400
check the left and right checkboxes.
684
00:24:12,400 --> 00:24:14,640
Then I just press the generate button.
685
00:24:14,640 --> 00:24:16,799
Let's see what happens. The first one is
686
00:24:16,799 --> 00:24:18,880
completed. As you can see, it expanded
687
00:24:18,880 --> 00:24:20,559
the visual beautifully. There's nothing
688
00:24:20,559 --> 00:24:22,799
disturbing at all. As you might guess, I
689
00:24:22,799 --> 00:24:24,559
can also expand by entering a written
690
00:24:24,559 --> 00:24:26,640
prompt. So, prompts I write can be added
691
00:24:26,640 --> 00:24:28,720
to the areas that will be expanded.
692
00:24:28,720 --> 00:24:30,960
Like, I don't know, add a tree or add a
693
00:24:30,960 --> 00:24:32,720
street lamp. Here, we can make small
694
00:24:32,720 --> 00:24:34,640
aesthetic adjustments with focus.
695
00:24:34,640 --> 00:24:37,039
Painless and effortless. I'm in the in
696
00:24:37,039 --> 00:24:38,880
paint or out paint tab. I immediately
697
00:24:38,880 --> 00:24:40,720
select improve detail from the method
698
00:24:40,720 --> 00:24:42,480
section. We had created an image with
699
00:24:42,480 --> 00:24:44,640
problematic eyes. If you remember, let's
700
00:24:44,640 --> 00:24:46,240
make an adjustment to it. I click on the
701
00:24:46,240 --> 00:24:48,000
click to upload link. I take our patient
702
00:24:48,000 --> 00:24:49,440
with the problematic eyes from my
703
00:24:49,440 --> 00:24:51,440
computer. I need to mark the problematic
704
00:24:51,440 --> 00:24:54,240
area here. From here I can see the
705
00:24:54,240 --> 00:24:57,200
shortcuts to use this canvas. I can also
706
00:24:57,200 --> 00:24:58,799
adjust the size of my brush from the
707
00:24:58,799 --> 00:25:01,039
right side. Now since the eyes are
708
00:25:01,039 --> 00:25:02,799
problematic, I will select them by
709
00:25:02,799 --> 00:25:05,039
painting over them. I hold down the
710
00:25:05,039 --> 00:25:07,600
shift key and scroll the mouse wheel.
711
00:25:07,600 --> 00:25:10,159
The image zooms in. I paint the eyes
712
00:25:10,159 --> 00:25:11,760
with the precision of a doctor. All
713
00:25:11,760 --> 00:25:14,159
right, I remove the zoom with the R key.
714
00:25:14,159 --> 00:25:16,159
There are also quick prompts here. It
715
00:25:16,159 --> 00:25:18,440
seems like the AI knows where it has
716
00:25:18,440 --> 00:25:20,799
issues. I click on the beautiful eyes
717
00:25:20,799 --> 00:25:23,840
link. It added it to the prompt section.
718
00:25:23,840 --> 00:25:26,159
Nice. I scroll up and press the generate
719
00:25:26,159 --> 00:25:28,240
button. Look, it zoomed in on the area I
720
00:25:28,240 --> 00:25:30,240
drew in my workspace and I can see the
721
00:25:30,240 --> 00:25:32,720
work there step by step or rather stage
722
00:25:32,720 --> 00:25:34,799
by stage. As you can see, it corrects
723
00:25:34,799 --> 00:25:37,279
the errors. The first one is completed.
724
00:25:37,279 --> 00:25:38,720
The problems with the eyes have been
725
00:25:38,720 --> 00:25:40,799
largely resolved. So, what else can we
726
00:25:40,799 --> 00:25:43,360
do? We can add, remove, or transform
727
00:25:43,360 --> 00:25:45,039
something. I'm returning to the in paint
728
00:25:45,039 --> 00:25:47,919
or outpaint tab. I erase the eyes I drew
729
00:25:47,919 --> 00:25:50,480
by pressing the silicone. I select
730
00:25:50,480 --> 00:25:52,799
modify content from the method section.
731
00:25:52,799 --> 00:25:54,559
I will try to change the bracelet on the
732
00:25:54,559 --> 00:25:57,520
arm. I zoom in on the image again. I
733
00:25:57,520 --> 00:25:59,600
select the bracelet here. In other
734
00:25:59,600 --> 00:26:02,240
words, I paint the bracelet. I completed
735
00:26:02,240 --> 00:26:04,720
it. I exit the zoom by pressing the R
736
00:26:04,720 --> 00:26:06,960
key on the keyboard. I come to the end
737
00:26:06,960 --> 00:26:08,880
paint additional prompt box and enter
738
00:26:08,880 --> 00:26:10,720
the red wristband prompt. I press the
739
00:26:10,720 --> 00:26:12,400
generate button. Let's see if the
740
00:26:12,400 --> 00:26:14,240
bracelet will change. The preview of the
741
00:26:14,240 --> 00:26:16,159
bracelet has appeared. It's creating it
742
00:26:16,159 --> 00:26:18,400
slowly. Only that area is visible
743
00:26:18,400 --> 00:26:20,400
because I selected it. It's completed.
744
00:26:20,400 --> 00:26:22,320
As you can see, the red wristband has
745
00:26:22,320 --> 00:26:24,000
appeared. It also matched with the
746
00:26:24,000 --> 00:26:26,159
shorts. From here on, you can do
747
00:26:26,159 --> 00:26:28,559
anything you want. For example, you can
748
00:26:28,559 --> 00:26:30,480
change the hair color to pink. We've
749
00:26:30,480 --> 00:26:32,320
reached the final section. In this
750
00:26:32,320 --> 00:26:34,080
section, you can add an image or an
751
00:26:34,080 --> 00:26:36,159
animation image and have it convert it
752
00:26:36,159 --> 00:26:38,480
into a text prompt or in other words
753
00:26:38,480 --> 00:26:41,120
into text for you. It reads the image
754
00:26:41,120 --> 00:26:43,120
and converts what it understands into
755
00:26:43,120 --> 00:26:45,760
text for you. I'm in the describe tab.
756
00:26:45,760 --> 00:26:47,840
Here we can have it describe a photo or
757
00:26:47,840 --> 00:26:49,760
anime image that I input. I clicked on
758
00:26:49,760 --> 00:26:51,279
the click to upload link and edit my
759
00:26:51,279 --> 00:26:53,520
model from my computer. Let's see how it
760
00:26:53,520 --> 00:26:55,039
will describe it. I'm clicking on the
761
00:26:55,039 --> 00:26:57,760
describe this image into prop button. It
762
00:26:57,760 --> 00:26:59,679
will immediately describe the image in
763
00:26:59,679 --> 00:27:01,520
its own way and enter the resulting
764
00:27:01,520 --> 00:27:04,240
text. Enter the prompt box above. Yes,
765
00:27:04,240 --> 00:27:06,880
it wrote in the generate box. A woman
766
00:27:06,880 --> 00:27:10,240
with a big earring next to blue eyes.
767
00:27:10,240 --> 00:27:12,240
So, a beautiful woman with a big earring
768
00:27:12,240 --> 00:27:14,080
next to blue eyes. Let's add another
769
00:27:14,080 --> 00:27:16,080
example. Let me add the model we used
770
00:27:16,080 --> 00:27:18,240
for this pose. I'm clicking on the click
771
00:27:18,240 --> 00:27:19,840
to upload link and uploading the image
772
00:27:19,840 --> 00:27:22,320
of the model I used for the pose. I
773
00:27:22,320 --> 00:27:23,919
clicked on the describe this image into
774
00:27:23,919 --> 00:27:26,799
prompt button. It responded, "A woman
775
00:27:26,799 --> 00:27:29,679
with glasses leans against her neck
776
00:27:29,679 --> 00:27:33,200
wearing a floral shirt." So I said, "A
777
00:27:33,200 --> 00:27:35,120
woman with glasses is leaning against
778
00:27:35,120 --> 00:27:38,000
her neck wearing a floral shirt." I'm
779
00:27:38,000 --> 00:27:40,640
pressing the generate button. Let's see
780
00:27:40,640 --> 00:27:43,520
what it will create. It didn't get so
781
00:27:43,520 --> 00:27:45,440
she's not leaning there. It's like she's
782
00:27:45,440 --> 00:27:47,279
placing her hand. There's a feminine
783
00:27:47,279 --> 00:27:49,600
pose. It couldn't figure that part out.
784
00:27:49,600 --> 00:27:52,400
Anyway, it created it. It didn't fully
785
00:27:52,400 --> 00:27:53,679
understand or couldn't create the
786
00:27:53,679 --> 00:27:55,520
visual. I don't know exactly about that
787
00:27:55,520 --> 00:27:57,360
part, but I think the visual turned out
788
00:27:57,360 --> 00:27:59,440
very realistic and beautiful. If you've
789
00:27:59,440 --> 00:28:00,960
made it this far in the video, I
790
00:28:00,960 --> 00:28:02,720
congratulate you. It's been a bit of a
791
00:28:02,720 --> 00:28:04,640
long video, but I think if I'm
792
00:28:04,640 --> 00:28:06,559
explaining something, I should touch on
793
00:28:06,559 --> 00:28:08,240
everything. I'm still dragging it out
794
00:28:08,240 --> 00:28:10,159
and can't stop. If you have any
795
00:28:10,159 --> 00:28:11,600
questions, you can write them as
796
00:28:11,600 --> 00:28:13,520
comments under the video, leaving my
797
00:28:13,520 --> 00:28:15,360
social media accounts here. You can
798
00:28:15,360 --> 00:28:18,240
follow me from there and ask questions.
799
00:28:18,240 --> 00:28:20,399
That's it for this week. See you in the
800
00:28:20,399 --> 00:28:24,120
next video. Goodbye.58272
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.