[GUIDE] - DeepFaceLab 2.0 Guide

TMBDF · May 17, 2022

city said:
TMBDF said:

city said:

Hello guys! A complete newbie here, but I can't find a solution for a memory error.

So basically I'm getting a memory error every several hours of training, even with Quick96. I have an RTX 2080 8GB, so this is really weird. I know it's not recommended to use a laptop, but this is what I can afford for now. I've had this laptop for a year, so maybe it's because the GPU is starting to wear out? I used to train using faceswap (though the results were very subpar) and could go pretty much all day, but now, even with the same app and settings, it hits a memory error after around 100k iterations.

Is there anything I could probably do to solve this other than call it a day and build a PC?

There's no such thing as a GPU wearing out. Excessive heat could make it fail, but that would be a complete failure, not memory errors, unless you have an overclock applied, which can cause errors due to instability.

What kind of errors do you get exactly? Please post the message you get.

Thanks for the quick response!

No I never overclocked the cpu or gpu.

The error message is basically something like "Unable to allocate 40 MiB for an array with shape (...)"

This isn't helpful, post the full error.

jfietearp · May 22, 2022

Been testing several projects for weeks now, and I have some questions

1. Does merging run fully on the CPU? Can I do GPU-heavy tasks during merging?
2. Is it possible to change the default values? For merging, I already have a set of values that generally looks promising; for example, "blur" is always a high number. It doesn't bother me much, but it'd be great if I didn't have to adjust all of these values every time I do an interactive merge.
3. Is it possible to autosave the Quick96 model?
4. Does data_dst quality affect the swap quality (assuming it's equal to or lower than data_src)? For example, would a 1080p-1080p swap be different from a 1080p-720p one?

That's all for now. Thank you!

TMBDF · May 22, 2022

1. No, the GPU is used when merging with GPU. You can do some light GPU-accelerated work, but only if it doesn't need a lot of VRAM.
2. No, default values can't be changed unless you alter the code. There is a merger session file that is saved after you finish merging; you could try copying this file to your model folder and selecting yes when asked whether you want to use the last merging session. However, I'm not sure it will still work if the number of frames is different.
3. What do you mean by autosaving the Quick96 model? Do you mean enabling backups every x hours? If there is an option for it, then yes; if there is no autobackup option, then no, unless you change the code. Don't use Quick96 to produce videos: it's a model meant only for very low VRAM GPUs and for testing ideas quickly. It is basically a 96 res DF-UD full face SAEHD model with locked-down settings.
4. Yes. The quality, resolution, sharpness, noise level and compression of data_dst will affect the quality of the generated faces, since the model attempts to create faces that look like SRC but match DST.

NeedForSpeed73 · May 24, 2022

I wanted to thank you all for putting up this guide and to share my experience. I have a 12900K with 32GB of DDR5 and an RTX 3090 on Windows 11. I wasn't able to run SAEHD training without running into memory errors (OOM or just lock-ups), even with all memory-consuming settings disabled. After trying almost everything, from memory settings in the BIOS (disabling XMP) to underclocking the GPU, I finally realized that the problem was related to the pagefile, because I once noticed the error occurred while SAEHD was writing backup data to the SSD.
So my suggestion is: if you have memory problems, go to the pagefile settings and set it to a fixed size, because it looks like Windows isn't able to react fast enough to SAEHD's memory needs. Go to Settings->System Information->Advanced System Settings->Advanced Tab->Performance Settings Button->Advanced Tab->Virtual Memory Set (button)->Custom Size radio button and set (for the disk you have the pagefile on) a minimum of 1.5 times your physical memory (as suggested by Microsoft) and a maximum of a little less than the free space you have on that same disk.
This made all the memory problems go away for me (and I was able to confirm the cause by reverting it to auto and getting an error on the first SAEHD run).
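
As a rough illustration of the sizing rule described above, here is a minimal sketch that turns RAM and free disk space into the two values the Custom Size dialog expects (in MB). The function name, the `headroom_gb` margin and its default of 5 GB are assumptions made for this example; only the 1.5x minimum and the "a little less than free space" maximum come from the post.

```python
def suggest_pagefile_mb(ram_gb, free_disk_gb, multiplier=1.5, headroom_gb=5):
    """Return (minimum_mb, maximum_mb) for a custom-size pagefile.

    multiplier=1.5 follows the rule quoted in the post; headroom_gb is an
    assumed safety margin so the disk is never filled completely.
    """
    minimum_mb = int(ram_gb * multiplier * 1024)
    maximum_mb = int((free_disk_gb - headroom_gb) * 1024)
    if maximum_mb < minimum_mb:
        raise ValueError("not enough free disk space for this pagefile size")
    return minimum_mb, maximum_mb


# Example: 32 GB of RAM and 200 GB free on the pagefile disk.
print(suggest_pagefile_mb(32, 200))
```

Passing a different `multiplier` lets you try other ratios if 1.5x turns out to be too small on your system.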

P.S.: excuse me if any of the settings I posted have a wrong name, I've Windows set in my local language (Italian) so I just translated them myself while writing this message.

MrFakeNoob · May 26, 2022

Nice guide, I'm now getting almost instant results on 8 batches with a 550ms avg turn times. Looks like ready to merge after 2 full passes

Great work!!!

TMBDF · May 26, 2022

MrFakeNoob said:
Nice guide, I'm now getting almost instant results on 8 batches with a 550ms avg turn times. Looks like ready to merge after 2 full passes
Great work!!!

Glad it helped, consider a donation so I can keep maintaining it.

city · May 27, 2022

TMBDF said:
This isn't helpful, post the full error.

sorry, just had time for training, problem still occurs even after restarting

TMBDF · May 27, 2022

city said:
sorry, just had time for training, problem still occurs even after restarting

Hmm, so it occurs after some time. It might still be a form of OOM. Does it happen when you start doing something on the PC, like launching an app? Or does it always happen after a similar number of hours? Perhaps try lowering your batch size by 1 and see if it still crashes after some time. I don't see any mention of the pagefile or anything else in the error that would suggest that's the issue. If this is your main and only GPU, try not to run any apps that need a lot of GPU power and VRAM, as that can push it over the edge into OOM.
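
One way to check whether VRAM use creeps up before the crash is to log it at intervals in a second console while training runs. The sketch below is only an illustration: it assumes `nvidia-smi` is on the PATH and that the first output line belongs to the GPU used for training, and the function names, log file name and 60-second interval are made up for this example. It does not track system RAM.

```python
import subprocess
import time

# nvidia-smi query that prints "used, total" in MiB, one line per GPU.
QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_vram_line(line):
    """Parse one 'used, total' line (values in MiB) into two ints."""
    used, total = (int(part.strip()) for part in line.split(","))
    return used, total


def log_vram(interval_s=60, log_path="vram_log.txt"):
    """Append a timestamped VRAM reading for the first GPU until interrupted."""
    while True:
        out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
        used, total = parse_vram_line(out.splitlines()[0])
        with open(log_path, "a") as f:
            f.write(f"{time.strftime('%H:%M:%S')} {used}/{total} MiB\n")
        time.sleep(interval_s)
```

Call `log_vram()` from a separate Python session and stop it with Ctrl+C. If the last readings before a crash are far below the card's total, VRAM is probably not the cause.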

jfietearp · May 30, 2022

TMBDF said:
1. No, the GPU is used when merging with GPU. You can do some light GPU-accelerated work, but only if it doesn't need a lot of VRAM.
2. No, default values can't be changed unless you alter the code. There is a merger session file that is saved after you finish merging; you could try copying this file to your model folder and selecting yes when asked whether you want to use the last merging session. However, I'm not sure it will still work if the number of frames is different.
3. What do you mean by autosaving the Quick96 model? Do you mean enabling backups every x hours? If there is an option for it, then yes; if there is no autobackup option, then no, unless you change the code. Don't use Quick96 to produce videos: it's a model meant only for very low VRAM GPUs and for testing ideas quickly. It is basically a 96 res DF-UD full face SAEHD model with locked-down settings.
4. Yes. The quality, resolution, sharpness, noise level and compression of data_dst will affect the quality of the generated faces, since the model attempts to create faces that look like SRC but match DST.

2. Are there any guides somewhere that can help me alter the code? I know a bit about programming, so I might try and see how it goes.

3. Does that mean I can re-create Quick96 using the SAEHD batch file? Most of the time I'm just doing this for fun, and at least for me Quick96 seems good enough, plus it only takes several hours to fully train the model.

TMBDF · May 30, 2022

jfietearp said:
2. Are there any guides somewhere that can help me alter the code? I know a bit about programming, so I might try and see how it goes.

3. Does that mean I can re-create Quick96 using the SAEHD batch file? Most of the time I'm just doing this for fun, and at least for me Quick96 seems good enough, plus it only takes several hours to fully train the model.

There are no guides about modifying DFL code, and iperov doesn't comment the code much so you will have to figure it out on your own.

Yes

jfietearp · Jun 3, 2022

TMBDF said:
There are no guides about modifying DFL code, and iperov doesn't comment the code much so you will have to figure it out on your own.

Yes

thanks, I'll maybe just try and find out

also another question, is it not possible to use decimal values for bitrate in the converting process? or did i miss something?

TMBDF · Jun 3, 2022

jfietearp said:
thanks, I'll maybe just try and find out

also another question, is it not possible to use decimal values for bitrate in the converting process? or did i miss something?

Merge to MP4 uses a CRF value to set the overall quality: the bitrate is kept high enough throughout the video to achieve that quality, so you can't control the exact bitrate or the size of the output file. If you want to render at a specific bitrate, you'll have to change the merge-to-MP4 part of the code (it's all done with ffmpeg). Alternatively, you can render the video yourself by importing the merged image sequence (and the masks, if you want to do some compositing work too) into other software and rendering it there.
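
For anyone who wants to try the "render it yourself" route from the command line, the sketch below builds an ffmpeg call that encodes an image sequence at a fixed target bitrate, including decimal values. It is not DFL's own code. The frame pattern `merged/%05d.png`, the output name and the helper name are assumptions for illustration, so check how your merged frames are actually named. The command does not copy audio from the original clip.

```python
def build_ffmpeg_cmd(frames_pattern, output_path, fps, bitrate_mbps):
    """Return an ffmpeg argument list that encodes an image sequence
    with libx264 at a target average bitrate given in Mbit/s."""
    if bitrate_mbps <= 0:
        raise ValueError("bitrate must be positive")
    # Convert to whole kbit/s so decimal values such as 7.5 are accepted.
    bitrate_kbps = round(bitrate_mbps * 1000)
    return [
        "ffmpeg",
        "-framerate", str(fps),       # frame rate of the input image sequence
        "-i", frames_pattern,         # e.g. merged/%05d.png (assumed naming)
        "-c:v", "libx264",
        "-b:v", f"{bitrate_kbps}k",   # target average video bitrate
        "-pix_fmt", "yuv420p",        # widely compatible pixel format
        output_path,
    ]


cmd = build_ffmpeg_cmd("merged/%05d.png", "result.mp4", 30, 7.5)
print(" ".join(cmd))
```

The list can be passed to `subprocess.run(cmd, check=True)` once ffmpeg is installed and on the PATH.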

TMBDF · Jun 4, 2022

I'm going to be rewriting the guide and adding new pictures to show some things that require visual representation. What are the things you think could be expanded upon in it? What do you think is still missing, or should be explained in simpler terms or more often?

bingoli · Jun 5, 2022

TMBDF said:
I'm going to be rewriting the guide and adding new pictures to show some things that require visual representation. What are the things you think could be expanded upon in it? What do you think is still missing, or should be explained in simpler terms or more often?

maybe some example pics for WF, F, Head and so on and also for face style power settings.

helpimbeing · Jun 6, 2022

Hi there, I am getting a similar error to some other posters here, though I searched through several pages and couldn't find a solution posted.

I am running a 10GB RTX 3080 and am getting OOM errors when running train SAEHD at anything but the absolute minimum settings. Quick96 runs without any problems. I have hardware accelerated GPU scheduling enabled and no overclocking. I'm not running any other programs that are using significant amounts of memory. Is this a problem or is the 3080 just not good enough? I've seen people running with 1080ti so I'm not sure why the 3080 wouldn't be good enough.

Running batch size 2 with no optimizer on GPU and no Adabelief works:

Running trainer.

Choose one of saved models, or enter a name to create a new model.
[r] : rename
[d] : delete

[0] : new - latest
: test
test

Model first run.

Choose one or several GPU idxs (separated by comma).

[CPU] : CPU
[0] : NVIDIA GeForce RTX 3080

[0] Which GPU indexes to choose? : 0
0

[1] Autobackup every N hour ( 0..24 ?:help ) : 1
1
[y] Write preview history ( y/n ?:help ) : n
[0] Target iteration :
0
[n] Flip SRC faces randomly ( y/n ?:help ) : n
[y] Flip DST faces randomly ( y/n ?:help ) : y
[2] Batch_size ( ?:help ) : 2
2
[64] Resolution ( 64-640 ?:help ) : 64
64
[wf] Face type ( h/mf/f/wf/head ?:help ) : wf
wf
[df-ud] AE architecture ( ?:help ) : df
df
[32] AutoEncoder dimensions ( 32-1024 ?:help ) : 32
32
[16] Encoder dimensions ( 16-256 ?:help ) : 16
16
[16] Decoder dimensions ( 16-256 ?:help ) : 16
16
[16] Decoder mask dimensions ( 16-256 ?:help ) : 16
16
[y] Masked training ( y/n ?:help ) : y
[y] Eyes and mouth priority ( y/n ?:help ) : y
[n] Uniform yaw distribution of samples ( y/n ?:help ) : n
[y] Blur out mask ( y/n ?:help ) : y
[n] Place models and optimizer on GPU ( y/n ?:help ) : n
[y] Use AdaBelief optimizer? ( y/n ?:help ) : n
[n] Use learning rate dropout ( n/y/cpu ?:help ) : n
n
[y] Enable random warp of samples ( y/n ?:help ) : y
[0.05] Random hue/saturation/light intensity ( 0.0 .. 0.3 ?:help ) : 0.05
0.05
[0.0] GAN power ( 0.0 .. 5.0 ?:help ) : 0
0.0
[0.0] 'True face' power. ( 0.0000 .. 1.0 ?:help ) :
0.0
[0.005] Face style power ( 0.0..100.0 ?:help ) : 0.005
0.005
[2.0] Background style power ( 0.0..100.0 ?:help ) : 2
2.0
[rct] Color transfer for src faceset ( none/rct/lct/mkl/idt/sot ?:help ) : none
none
[y] Enable gradient clipping ( y/n ?:help ) : y
[n] Enable pretraining mode ( y/n ?:help ) : n
Initializing models: 100%|###############################################################| 5/5 [00:00<00:00, 19.59it/s]
Loading samples: 100%|############################################################| 9375/9375 [00:37<00:00, 250.83it/s]
Loading samples: 100%|############################################################| 1352/1352 [00:05<00:00, 240.95it/s]
================== Model Summary ===================
== ==
== Model name: test_SAEHD ==
== ==
== Current iteration: 0 ==
== ==
==---------------- Model Options -----------------==
== ==
== resolution: 64 ==
== face_type: wf ==
== models_opt_on_gpu: False ==
== archi: df ==
== ae_dims: 32 ==
== e_dims: 16 ==
== d_dims: 16 ==
== d_mask_dims: 16 ==
== masked_training: True ==
== eyes_mouth_prio: True ==
== uniform_yaw: False ==
== blur_out_mask: True ==
== adabelief: False ==
== lr_dropout: n ==
== random_warp: True ==
== random_hsv_power: 0.05 ==
== true_face_power: 0.0 ==
== face_style_power: 0.005 ==
== bg_style_power: 2.0 ==
== ct_mode: none ==
== clipgrad: True ==
== pretrain: False ==
== autobackup_hour: 1 ==
== write_preview_history: False ==
== target_iter: 0 ==
== random_src_flip: False ==
== random_dst_flip: True ==
== batch_size: 2 ==
== gan_power: 0.0 ==
== gan_patch_size: 48 ==
== gan_dims: 16 ==
== ==
==------------------ Running On ------------------==
== ==
== Device index: 0 ==
== Name: NVIDIA GeForce RTX 3080 ==
== VRAM: 7.27GB ==
== ==
====================================================
Starting. Press "Enter" to stop training and save model.

Trying to do the first iteration. If an error occurs, reduce the model parameters.

!!!
Windows 10 users IMPORTANT notice. You should set this setting in order to work correctly.

!!!
You are training the model from scratch. It is strongly recommended to use a pretrained model to speed up the training and improve the quality.

[12:20:08][#000002][0080ms][5.3105][5.2307]
Done.
Press any key to continue . . .

If I increase the batch size or run optimizer on GPU I get the following:

Running trainer.

Choose one of saved models, or enter a name to create a new model.
[r] : rename
[d] : delete

[0] : test2 - latest
[1] : test
[2] : new
: test3
test3

Model first run.

Choose one or several GPU idxs (separated by comma).

[CPU] : CPU
[0] : NVIDIA GeForce RTX 3080

[0] Which GPU indexes to choose? : 0
0

[1] Autobackup every N hour ( 0..24 ?:help ) : 1
1
[n] Write preview history ( y/n ?:help ) : y
[n] Choose image for the preview history ( y/n ) : n
[0] Target iteration :
0
[n] Flip SRC faces randomly ( y/n ?:help ) : n
[y] Flip DST faces randomly ( y/n ?:help ) : y
[2] Batch_size ( ?:help ) : 12
12
[64] Resolution ( 64-640 ?:help ) : 64
64
[wf] Face type ( h/mf/f/wf/head ?:help ) : wf
wf
[df] AE architecture ( ?:help ) : df
df
[32] AutoEncoder dimensions ( 32-1024 ?:help ) : 32
32
[16] Encoder dimensions ( 16-256 ?:help ) : 16
16
[16] Decoder dimensions ( 16-256 ?:help ) : 16
16
[16] Decoder mask dimensions ( 16-256 ?:help ) : 16
16
[y] Masked training ( y/n ?:help ) : y
[y] Eyes and mouth priority ( y/n ?:help ) : y
[n] Uniform yaw distribution of samples ( y/n ?:help ) : n
[y] Blur out mask ( y/n ?:help ) : y
[y] Place models and optimizer on GPU ( y/n ?:help ) : n
[n] Use AdaBelief optimizer? ( y/n ?:help ) : n
[n] Use learning rate dropout ( n/y/cpu ?:help ) : n
n
[y] Enable random warp of samples ( y/n ?:help ) : y
[0.05] Random hue/saturation/light intensity ( 0.0 .. 0.3 ?:help ) : 0.05
0.05
[0.0] GAN power ( 0.0 .. 5.0 ?:help ) :
0.0
[0.0] 'True face' power. ( 0.0000 .. 1.0 ?:help ) :
0.0
[0.005] Face style power ( 0.0..100.0 ?:help ) : 0.005
0.005
[2.0] Background style power ( 0.0..100.0 ?:help ) : 2
2.0
[none] Color transfer for src faceset ( none/rct/lct/mkl/idt/sot ?:help ) : rct
rct
[y] Enable gradient clipping ( y/n ?:help ) : y
[n] Enable pretraining mode ( y/n ?:help ) : n
Initializing models: 100%|###############################################################| 5/5 [00:00<00:00, 20.14it/s]
Loading samples: 100%|############################################################| 9375/9375 [00:37<00:00, 250.92it/s]
Loading samples: 100%|############################################################| 1352/1352 [00:05<00:00, 248.87it/s]
Process Process-46:
Traceback (most recent call last):
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleGeneratorFace.py", line 134, in batch_func
x, = SampleProcessor.process ([sample], self.sample_process_options, self.output_sample_types, self.debug, ct_sample=ct_sample)
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleProcessor.py", line 145, in process
img = get_eyes_mouth_mask()*mask
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleProcessor.py", line 80, in get_eyes_mouth_mask
return np.clip(mask, 0, 1)
File "<__array_function__ internals>", line 6, in clip
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\numpy\core\fromnumeric.py", line 2097, in clip
return _wrapfunc(a, 'clip', a_min, a_max, out=out, **kwargs)
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\numpy\core\fromnumeric.py", line 58, in _wrapfunc
return bound(*args, **kwds)
Process Process-48:
Traceback (most recent call last):
Process Process-35:
Process Process-29:
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleGeneratorFace.py", line 134, in batch_func
x, = SampleProcessor.process ([sample], self.sample_process_options, self.output_sample_types, self.debug, ct_sample=ct_sample)
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleProcessor.py", line 56, in process
sample_bgr = sample.load_bgr()
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\Sample.py", line 112, in load_bgr
img = cv2_imread (self.filename, loader_func=self.read_raw_file).astype(np.float32) / 255.0
MemoryError: Unable to allocate 12.0 MiB for an array with shape (1024, 1024, 3) and data type float32

During handling of the above exception, another exception occurred:

Process Process-39:
Traceback (most recent call last):
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\numpy\core\_methods.py", line 141, in _clip
um.clip, a, min, max, out=out, casting=casting, **kwargs)
Process Process-44:
Traceback (most recent call last):
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\numpy\core\_methods.py", line 94, in _clip_dep_invoke_with_casting
return ufunc(*args, out=out, **kwargs)
Traceback (most recent call last):
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleGeneratorFace.py", line 134, in batch_func
x, = SampleProcessor.process ([sample], self.sample_process_options, self.output_sample_types, self.debug, ct_sample=ct_sample)
Traceback (most recent call last):
Process Process-25:
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleGeneratorFace.py", line 134, in batch_func
x, = SampleProcessor.process ([sample], self.sample_process_options, self.output_sample_types, self.debug, ct_sample=ct_sample)
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleProcessor.py", line 192, in process
ct_sample_bgr = ct_sample.load_bgr()
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleGeneratorFace.py", line 134, in batch_func
x, = SampleProcessor.process ([sample], self.sample_process_options, self.output_sample_types, self.debug, ct_sample=ct_sample)
Traceback (most recent call last):
File "multiprocessing\process.py", line 258, in _bootstrap
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\Sample.py", line 112, in load_bgr
img = cv2_imread (self.filename, loader_func=self.read_raw_file).astype(np.float32) / 255.0
File "C:\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\samplelib\SampleProcessor.py", line 56, in process
sample_bgr = sample.load_bgr()
MemoryError: Unable to allocate 4.00 MiB for an array with shape (1024, 1024, 1) and data type float32

thewavyd · Jun 7, 2022

helpimbeing said:
Hi there, I am getting a similar error to some other posters here, though I searched through several pages and couldn't find a solution posted.

...

Have you tried messing with your pagefile size? I've read the general wisdom of setting the minimum to 1.5x your RAM, but on my system that didn't work. For DFL and some other ML projects I had to set a pretty high pagefile size on my SSD. Most likely this means I need a RAM upgrade to match the rest of my specs.

helpimbeing · Jun 8, 2022

thewavyd said:
Have you tried messing with your pagefile size? I've read the general wisdom of setting the minimum to 1.5x your RAM, but on my system that didn't work. For DFL and some other ML projects I had to set a pretty high pagefile size on my SSD. Most likely this means I need a RAM upgrade to match the rest of my specs.

I did try increasing it before since I had read some suggestions for it, but it didn't help. I just tried setting the pagefile max to the absolute maximum possible value and now I can at least run batch size 4 without optimizers. Still, I believe the training process should mostly be VRAM limited so I'm just surprised that a 10GB card is struggling this much with very low settings when 12GB cards work fine.

city · Jun 8, 2022

TMBDF said:
Hmm, so it occurs after some time. It might still be a form of OOM. Does it happen when you start doing something on the PC, like launching an app? Or does it always happen after a similar number of hours? Perhaps try lowering your batch size by 1 and see if it still crashes after some time. I don't see any mention of the pagefile or anything else in the error that would suggest that's the issue. If this is your main and only GPU, try not to run any apps that need a lot of GPU power and VRAM, as that can push it over the edge into OOM.

It always happens after several hours, and still occurred even after I restarted my laptop, ran no other programs, and just let it run.

I don't know if this helps, but this OOM-like error also occurs when I run another program for hours. So maybe it's a driver issue? I don't know; I've installed the latest driver from the Nvidia website, but that could still be the problem, as some articles suggest.

TMBDF · Jun 8, 2022

helpimbeing said:
Hi there, I am getting a similar error to some other posters here, though I searched through several pages and couldn't find a solution posted.

I am running a 10GB RTX 3080 and am getting OOM errors when running train SAEHD at anything but the absolute minimum settings. Quick96 runs without any problems. I have hardware accelerated GPU scheduling enabled and no overclocking. I'm not running any other programs that are using significant amounts of memory. Is this a problem or is the 3080 just not good enough? I've seen people running with 1080ti so I'm not sure why the 3080 wouldn't be good enough.

Running batch size 2 with no optimizer on GPU and no Adabelief works:

If I increase the batch size or run optimizer on GPU I get the following:

This is not an OOM error. It doesn't say anywhere that it has run out of memory (VRAM); it says it couldn't allocate memory, which is not the same thing, at least in the case of DFL. As you've noted, it can't be OOM, since a 3080, while it only has 10GB, should be able to run a model this ridiculously small and lightweight.
First of all, try older drivers or use studio drivers. Before you install new drivers, run DDU in safe mode to completely delete the old drivers, then do a clean install. Disable any overclocking software. Make sure your page file size is 4x your RAM (you should have 32GB of RAM minimum just to be safe, so a 128GB pagefile; if you can do more, do 256GB on the OS drive and an extra 64-128GB on another SSD; don't put it on an HDD since they are slow). Make sure you have nothing heavy running in the background and download the newest version of DFL. Then try again and test using the following model:
DF-UD WF 256, default dims, AdaBelief enabled, models and optimizer on GPU enabled, HSV 0, gradient clipping disabled, RCT color transfer, pretraining enabled, and the rest left at default values. See if it runs. Set the batch size to 4, as that's the minimum value you should use; 2 is too low, so there's no sense testing at such a low value.
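
To make the distinction concrete: the MemoryError lines in the traceback come from NumPy calls in the sample generator processes, so they are allocations in system RAM (backed by the pagefile), not VRAM. The sizes in those messages can be checked by hand, as in this small sketch. The helper name is made up for the example, and 4 bytes per element is the size of float32.

```python
def array_mib(shape, itemsize=4):
    """Size in MiB of a dense array with the given shape.

    itemsize=4 bytes corresponds to float32, as in the error messages.
    """
    elements = 1
    for dim in shape:
        elements *= dim
    return elements * itemsize / (1024 * 1024)


print(array_mib((1024, 1024, 3)))  # image loaded as float32
print(array_mib((1024, 1024, 1)))  # single-channel mask as float32
```

These match the 12.0 MiB and 4.00 MiB figures in the errors. Requests this small failing suggests the system as a whole had run out of committable memory, which fits the pagefile advice above.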

helpimbeing · Jun 8, 2022

TMBDF said:
This is not an OOM error. It doesn't say anywhere that it has run out of memory (VRAM); it says it couldn't allocate memory, which is not the same thing, at least in the case of DFL. As you've noted, it can't be OOM, since a 3080, while it only has 10GB, should be able to run a model this ridiculously small and lightweight.
First of all, try older drivers or use studio drivers. Before you install new drivers, run DDU in safe mode to completely delete the old drivers, then do a clean install. Disable any overclocking software. Make sure your page file size is 4x your RAM (you should have 32GB of RAM minimum just to be safe, so a 128GB pagefile; if you can do more, do 256GB on the OS drive and an extra 64-128GB on another SSD; don't put it on an HDD since they are slow). Make sure you have nothing heavy running in the background and download the newest version of DFL. Then try again and test using the following model:
DF-UD WF 256, default dims, AdaBelief enabled, models and optimizer on GPU enabled, HSV 0, gradient clipping disabled, RCT color transfer, pretraining enabled, and the rest left at default values. See if it runs. Set the batch size to 4, as that's the minimum value you should use; 2 is too low, so there's no sense testing at such a low value.

Thanks for the help! I ran DDU and installed studio instead of game drivers and was able to run at the settings you provided, albeit still pretty slowly.

I played around with some of the settings and found that enabling either BSP or FSP gave me an actual OOM error, I guess this means I'm already running very close to the GPU's limit?

Error: OOM when allocating tensor with shape[4,22,384,384] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
[[node gradients/Conv2D_45_grad/Conv2DBackpropInput (defined at C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\core\leras\ops\__init__.py:55) ]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info. This isn't available when running in Eager mode.

Errors may have originated from an input operation.
Input Source operations connected to node gradients/Conv2D_45_grad/Conv2DBackpropInput:
decoder_dst/out_convm/weight/read (defined at C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\core\leras\layers\Conv2D.py:61)

Original stack trace for 'gradients/Conv2D_45_grad/Conv2DBackpropInput':
File "threading.py", line 884, in _bootstrap
File "threading.py", line 916, in _bootstrap_inner
File "threading.py", line 864, in run
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\mainscripts\Trainer.py", line 58, in trainerThread
debug=debug)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\models\ModelBase.py", line 193, in __init__
self.on_initialize()
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\models\Model_SAEHD\Model.py", line 547, in on_initialize
gpu_G_loss_gvs += [ nn.gradients ( gpu_G_loss, self.src_dst_trainable_weights )]
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\core\leras\ops\__init__.py", line 55, in tf_gradients
grads = gradients.gradients(loss, vars, colocate_gradients_with_ops=True )
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\gradients_impl.py", line 172, in gradients
unconnected_gradients)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\gradients_util.py", line 682, in _GradientsHelper
lambda: grad_fn(op, *out_grads))
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\gradients_util.py", line 338, in _MaybeCompile
return grad_fn() # Exit early
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\gradients_util.py", line 682, in <lambda>
lambda: grad_fn(op, *out_grads))
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\nn_grad.py", line 590, in _Conv2DGrad
data_format=data_format),
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\gen_nn_ops.py", line 1291, in conv2d_backprop_input
name=name)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\framework\op_def_library.py", line 750, in _apply_op_helper
attrs=attr_protos, op_def=op_def)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\framework\ops.py", line 3569, in _create_op_internal
op_def=op_def)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\framework\ops.py", line 2045, in __init__
self._traceback = tf_stack.extract_stack_for_node(self._c_op)

...which was originally created as op 'Conv2D_45', defined at:
File "threading.py", line 884, in _bootstrap
[elided 3 identical lines from previous traceback]
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\models\ModelBase.py", line 193, in __init__
self.on_initialize()
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\models\Model_SAEHD\Model.py", line 410, in on_initialize
gpu_pred_dst_dst, gpu_pred_dst_dstm = self.decoder_dst(gpu_dst_code)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\core\leras\models\ModelBase.py", line 117, in __call__
return self.forward(*args, **kwargs)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\core\leras\archis\DeepFakeArchi.py", line 253, in forward
m = tf.nn.sigmoid(self.out_convm(m))
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\core\leras\layers\LayerBase.py", line 14, in __call__
return self.forward(*args, **kwargs)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\DeepFaceLab\core\leras\layers\Conv2D.py", line 101, in forward
x = tf.nn.conv2d(x, weight, strides, 'VALID', dilations=dilations, data_format=nn.data_format)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\util\dispatch.py", line 206, in wrapper
return target(*args, **kwargs)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\nn_ops.py", line 2397, in conv2d
name=name)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\ops\gen_nn_ops.py", line 972, in conv2d
data_format=data_format, dilations=dilations, name=name)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\framework\op_def_library.py", line 750, in _apply_op_helper
attrs=attr_protos, op_def=op_def)

Traceback (most recent call last):
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\client\session.py", line 1375, in _do_call
return fn(*args)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\client\session.py", line 1360, in _run_fn
target_list, run_metadata)
File "C:\Users\admin\Downloads\DeepFaceLab_NVIDIA_RTX3000_series_build_11_20_2021\DeepFaceLab_NVIDIA_RTX3000_series\_internal\python-3.6.8\lib\site-packages\tensorflow\python\client\session.py", line 1453, in _call_tf_sessionrun
run_metadata)
tensorflow.python.framework.errors_impl.ResourceExhaustedError: OOM when allocating tensor with shape[4,22,384,384] and type float on /job:localhost/replica:0/task:0/device:GPU:0 by allocator GPU_0_bfc
[[{{node gradients/Conv2D_45_grad/Conv2DBackpropInput}}]]
Hint: If you want to see a list of allocated tensors when OOM happens, add report_tensor_allocations_upon_oom to RunOptions for current allocation info. This isn't available when running in Eager mode.

================== Model Summary ===================
== ==
== Model name: new_SAEHD ==
== ==
== Current iteration: 0 ==
== ==
==---------------- Model Options -----------------==
== ==
== resolution: 384 ==
== face_type: wf ==
== models_opt_on_gpu: True ==
== archi: df-ud ==
== ae_dims: 256 ==
== e_dims: 64 ==
== d_dims: 64 ==
== d_mask_dims: 22 ==
== masked_training: True ==
== uniform_yaw: False ==
== lr_dropout: n ==
== random_warp: True ==
== gan_power: 0.0 ==
== true_face_power: 0.0 ==
== face_style_power: 0.001 ==
== bg_style_power: 0.0 ==
== ct_mode: mkl ==
== clipgrad: True ==
== pretrain: False ==
== autobackup_hour: 1 ==
== write_preview_history: False ==
== target_iter: 0 ==
== random_flip: False ==
== batch_size: 4 ==
== eyes_mouth_prio: True ==
== blur_out_mask: True ==
== adabelief: True ==
== random_hsv_power: 0.05 ==
== random_src_flip: False ==
== random_dst_flip: True ==
== gan_patch_size: 32 ==
== gan_dims: 16 ==
== ==
==------------------ Running On ------------------==
== ==
== Device index: 0 ==
== Name: NVIDIA GeForce RTX 3080 ==
== VRAM: 7.27GB ==
== ==
====================================================
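
As a rough sanity check on the error above, the size of the tensor that failed to allocate can be worked out from the shape in the OOM message. This is only a sketch: it assumes "type float" means 4-byte float32, and that the shape [4,22,384,384] reads as batch_size, d_mask_dims, resolution, resolution, which appears to match the summary above. The helper name tensor_bytes is made up for illustration and is not part of DeepFaceLab or TensorFlow.

```python
from functools import reduce
from operator import mul

def tensor_bytes(shape, bytes_per_element=4):
    """Size in bytes of a dense tensor; 4 bytes per element assumes float32."""
    return reduce(mul, shape, 1) * bytes_per_element

# Shape copied from the ResourceExhaustedError message.
oom_shape = (4, 22, 384, 384)

size_bytes = tensor_bytes(oom_shape)
size_mib = size_bytes / 2**20

print(f"{size_bytes} bytes = {size_mib} MiB")
```

If those assumptions hold, the single allocation that failed is about 49.5 MiB. That is small next to the 7.27GB reported under "Running On", which suggests the VRAM was already nearly full from the rest of the model at resolution 384 and batch size 4, rather than this one tensor being the problem.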
