Furkan Gözükara

SECourses Musubi Tuner - 1-Click to Install App for LoRA Training and Full Fine Tuning Qwen Image, Qwen Image Edit, Wan 2.1 and Wan 2.2 Models with Musubi Tuner with Ready Presets

Added 2025-09-29 13:46:00 +0000 UTC

Patreon exclusive posts index to find our scripts easily, Patreon scripts updates history to see which updates arrived to which scripts and amazing Patreon special generative scripts list that you can use in any of your task.

Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord

Please also Star, Watch and Fork our Stable Diffusion & Generative AI GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)

=======

Latest Zip File : SECourses_Musubi_Trainer_v18.4.zip

This is the initial release of the App
- App screenshots gallery : https://www.reddit.com/r/SECourses/comments/1n2qo9g/secourses_musubi_tuner_v1_published_a_training/
- Stage 1 Research results gallery : https://www.reddit.com/r/SECourses/comments/1n8bcrj/qwen_image_lora_trainings_stage_1_results_and/
Currently 1-click to install on Windows, RunPod and Massed Compute
- We also have 1-click necessary model downloader script
- The model downloader will download qwen_2.5_vl_7b_bf16.safetensors, qwen_image_bf16.safetensors and qwen_train_vae.safetensors
- Please use model downloader to not have any issues because your selected models may be wrong
  - Moreover, the Musubi Tuner automatically does FP8 and FP8 scaled conversion while loading BF16 model into RAM
- I specifically developed this model downaloder with like UGET method so that it is both ultra fast and ultra robust - not using Hugging Face downloader anymore
  - Moreover the downloader script verifies SHA 256 hash of the models and prevents any possibly corrupted model downloads
This is an interface app that is based on famous Kohya Musubi Tuner
- It has all the features of Kohya Musubi Tuner + extra features with huge amount of details and very easy to use
- Musubi Tuner Official repo : https://github.com/kohya-ss/musubi-tuner
The installer will install with Torch 2.8 and CUDA 12.9 and pre-compiled xFormers, Triton, Flash Attention and Sage Attention libraries
- We literally support all of the GPUs like RTX 3000, 4000, 5000 series or, A40, L40, A100, H100, B200 etc
- Flash Attention and Sage Attention may not work on RTX 2000 or 1000 series but I have plan for them hopefully soon with newest libraries compiling myself
Currently it fully supports Qwen Image model LoRA training + Qwen2.5-VL image captioning + Qwen Image Edit Training+ Qwen Image Full Fine Tuning / DreamBooth
The followings are also fully supported and their research in progress
- Wan 2.1 LoRA training - example config :
  - Wan_2.1_text_to_video_test_lora_8500_mb.toml
- Wan 2.2 LoRA training - example configs :
  - Wan_2.2_Text_to_Video_Low_Noise_Test_LoRA_Config_10250_MB
  - Wan_2.2_Text_to_Video_Both_Low_and_High_Noise_Test_LoRA_Config_12000_MB.toml
Musubi Tuner tab is from original fork - so this is not tested or supported please use Qwen Image LoRA and Image Captioning tabs
Please check all screenshots in details to see what the app really supports

28 October 2025 Update V18.7

Bug fixes and a new amazing feature called as Image Preprocessing and FP8 Model Converter
Scaled FP8 Model Converter is used to batch convert Qwen Image Fine-Tuned / DreamBooth models into scaled FP8 from BF16 weights
- Extremely useful if you don't have over 60 GB GPUs and also reduces size to half
- Almost same quality with using intelligent FP8 Scaled weight conversation
What this tool does is that, it preprocess your given folder image with Kohya Musubi tuner actual training code
So your pre-processed images is the images that are actually being trained by the trainer
Extremely useful to see your resolution, aspect ratios and what bucketing does
Moreover, currently Kohya doesn't use exif data image orientation so if you might get mis-oriented images surprise, use this tool to see and you can use fixed dataset as well
Detailed debug options added to latent caching section so you can enable debug and 1 by 1 look at the processed images
You can use pre-processed dataset as your training dataset if you wish saves time and resources

21 October 2025 Update V18.0

Qwen Image Edit Plus (2509) Fully supported now
Just load the DreamBooth or LoRA Config and then follow the below steps
Original files are included inside Qwen_Image_Training_Configs folder check each one
Extract latest zip file, overwrite your existing files and run Windows_Install_and_Update.bat to update
Model downloader now supports downloading Qwen Image Edit Plus (2509) model set download as well
- Run Windows_Download_Training_Model_Files.bat to download desired model
Qwen Image Fine Tuning configs completed finally
- Working on updating inference presets in SwarmUI

The logic of Qwen Image Edit Model training shown below

21 October 2025 Update V17.9

Qwen Image Full Fine Tuning / DreamBooth research finally completed
- Hopefully full tutorial coming and more information will be shared soon
We have made configs starting from 5750_MB (6 GB GPUs) to 84000_MB (96 GB GPUs) GPUs
The lower VRAM requiring configs will use more RAM since we do block swapping and also they will be slower
I have prepared 3 different config set
200_epoch folder - best quality - lower learning rate - more epochs (more epochs takes more time linearly)
150_epoch - good quality - a little bit higher learning rate - medium epochs
75_epoch - lower quality - higher learning rate - lower epochs - faster training
All qwen configs moved into Qwen_Image_Training_Configs and seperated as LoRA_Training and Fine_Tuning_Training-DreamBooth sub folders
All configs updated including LoRA and below 20 GB configs modified like this
- T5 text encoder caching will be made on CPU with BF16 - still really fast
After tutorial for this hopefully I will research Wan 2.2 training for ultra realistic image gen + video gen

20 October 2025 Update V17.8

Windows_Download_Training_Model_Files.bat updated and now you can download Wan 2.2 Text to Video and Image to Video models
- I have converted official FP32 Wan weights into BF16 so use our downloader to download BF16 weights of Wan 2.2
- BF16 yields much better quality at training and also slightly better or sometimes significantly better quality at inference
Example Wan 2.2 train configs added
- Wan_2.2_Text_to_Video_Low_Noise_Test_LoRA_Config_10250_MB.toml
  - Uses 10250 MB VRAM
- Wan_2.2_Text_to_Video_Both_Low_and_High_Noise_Test_LoRA_Config_12000_MB.toml
  - Uses 12000 MB VRAM
- Example configs are using BF16 weights of Wan 2.2 now
Massed_Compute_Instructions_READ.txt and Runpod_Instructions_Trainer_READ.txt updated to have Wan 2.2 model download options
Wan training research results coming soon hopefully
Qwen Image fine tuning completed i am finalizing to publish hopefully

3 October 2025 Update V17.5

Please extract latest zip file, overwrite older files
- Delete SECourses_Musubi_Trainer\musubi-tuner folder and run Windows_Install_and_Update.bat to update
Switch to low memory branch file removed since it is now merged
Bug fix on Linux systems - RunPod & Massed Compute for Wan training
Wan models training directory explanation improved : https://pasteboard.co/y5fI2Sf8Bc0L.png

29 September 2025 Status Update V17

Wan training support added
I think it is supporting all of the Wan trainings that Musubi Tuner supports
So far only Wan 2.1 Text to Video model LoRA training tested and a demo config put as : Wan_2.1_text_to_video_test_lora_8500_mb.toml
Windows_Download_Training_Model_Files.bat updated to download Wan 2.1 Text to Video models
The models will be downloaded into Training_Models_Qwen and Training_Models_Wan according to your selection
Model downloader updated for RunPod and Massed Compute as well
You can read full changelogs on app and below
Extract zip file, overwrite and run Windows_Install_and_Update.bat for update

27 September 2025 Status Update V16

Qwen Image LoRA trainings research and development completed fully
- I have done over 50 full trainings to find out best parameters and workflow
- You can see training checkpoints and files here : https://huggingface.co/MonsterMMORPG/Qwen_Image_LoRA_Training_Research/tree/main
- To access files you can message me and i can let you with a special price but normally this is not needed by you
I have updated all the configs after recent experiments
I feel like 6e-05 is working slightly better than 5e-05 so updated configs to that
The latest experiments put into Final_Qwen_LoRA_Test_Results.zip file
I have trained different learning rates up to 1000 epochs on 28 images dataset and you can see grid results in the attached Final_Qwen_LoRA_Test_Results.zip file
So if you need faster training, you can possibly increase learning rate to like 8e-5 or 9e-5 and do like 100 or less but it is R&D depending on your needs
The app is now fully supporting Qwen Image full Fine Tuning / DreamBooth
I added a demo config file : dreambooth.toml
- This demo config toml uses 5 GB VRAM only when you also run Windows_Switch_Low_RAM_Branch_Temporary.bat
- There is a pull request of Kohya which reduces used shared VRAM signficiantly but it is not merged yet so we switch to that branch with that bat file
- When you run Windows_Download_Training_Model_Files.bat again it will switch back to main branch
I am starting to fully research Qwen Image full Fine Tuning and also adding Wan 2.2 and Wan 2.1 training capability to the app hopefully as soon as possible
Installers updated to Torch 2.8, CUDA 12.9, Flash Attention 2.8.3, Sage Attention 2.2, xFormers 0.0.33
I have pre-compiled libraries for both Windows and Linux and working amazing

Windows Requirements

Python 3.10.11, FFmpeg, CUDA 12.9, cuDNN 9.12 or above, C++ tools, MSVC and Git
- Don't worry CUDA 12.9 works with all GPUs
- Follow this requirements tutorial video exactly : https://youtu.be/DrhUHnYfwC0
- Follow its updated post with links and screenshots exactly : https://www.patreon.com/posts/click-to-open-post-used-in-tutorial-111553210

5 September 2025 Status Update V14

Some bug fixes and Configs_v1 added that you can use
New option added for sample generation Disable Automatic Prompt Enhancement
- Thus you can provide fully customized prompt txt
Space character having paths will now work but still dont use space character in any path
- .e.g my awesome images - is wrong but my_awesome_images is right
We support as low as 6 GB GPUs
Tier 1 is better than Tier 2 and Tier 2 better than Tier 3 and so on
There shouldn't be any big difference between Tier 1 and Tier 2
Tier 2 and Tier 3 also should be pretty close in terms of quality
To update download latest zip file, extract and overwrite older files and run Windows_Install_and_Update.bat

Some Raw Generation Results of 2_september_best_config.toml

No Uscale, No Face Restore
All 8 Steps with 8-steps LoRA and Preset We have
Trained for 200 Epochs, 28 images = 5600 steps
All are 1328x1328px native res

31 August 2025 Update V11

Now you can immediately and properly stop batch image captioning as well
Some Critical config save load bugs fixed please upgrade
- e.g. : FIXED: Critical checkpoint removal bug - Checkpoints were being deleted immediately after saving when save_last_n_epochs=0
- Now we give GPU ID = 0 as a default, if you have multiple GPUs like onboard GPU and external GPU, make sure the ID matches to your external GPU ID
- It is set in Distributed GPUs and that may look like multiple GPU config but when no multiple GPU enabled, it still uses it
UI significantly improved please check newest screenshots given at the very top
The downloader app improved and now it will show the progress of each model on a single line not in multiple lines as it downloads
Added comprehensive parameter support for Qwen Image training with 100% coverage of Musubi Tuner parameters
NEW: Integrated search bar in Qwen Image Training tab - quickly find any setting without opening all panels
Tab renamed from "Qwen Image LoRA" to "Qwen Image Training" to reflect both LoRA and Fine-tuning support
Implemented Qwen-Image-Edit mode support for control image training (experimental - not fully tested)
Added control image resolution settings for Edit mode (dataset_qwen_image_edit_control_resolution_width/height)
Introduced dataset_qwen_image_edit_no_resize_control option for maintaining original control image sizes - this is for Qwen Image Edit model - not tested yet
Started implementation of Qwen Image Fine-Tuning mode (DreamBooth) - parameter infrastructure in place
Enhanced FP8 quantization descriptions with clearer GPU compatibility information
Improved timestep sampling with better documentation of qwen_shift vs standard shift methods
Added advanced flow matching parameters (logit_mean, logit_std, mode_scale) for fine-tuned control
Implemented complete VAE optimization settings (tiling, chunk_size, spatial_tile_sample_min_size)
Enhanced parameter descriptions throughout the GUI for better user understanding
All critical default values confirmed to match official Qwen Image documentation
Parameter accuracy validated at 100% for all implemented features
Some configs save and load were broken and all fixed hopefully
- Please report if you notice error
- e.g. one of the fixed one is GPU ID set
Please use latest zip file, overwrite previous files and just run Windows_Install_and_Update.bat

30 August 2025 Update V7

Dataset TOML file generate error fixed
Qwen2.5-VL image captioning turns out working perfect on Windows
- It turns out my model file was corrupted even though it was same size
- Therefore I have updated the model downloader and now it will check and verify SHA 256 of files therefore it will be 100% accurate
Prompt file selection folder icon issue fixed
Downloader file will use generated venv of installation
- Make sure to run it after installation completed
Fixed skip existing captions functionality in Image Captioning with Qwen2.5-VL
- Previously skipping was happening after caption generation which was destroying the skip logic
- Now properly checks for existing captions before processing, significantly improving efficiency
- Added full batch captioning status display in command line with progress tracking and ETA
- Enhanced config save/load functionality for better reliability
- Improved interface of Image Captioning with Qwen2.5-VL for better user experience
- Various error fixes in the Qwen2.5-VL captioning pipeline
Fixed broken config save and load functionality for Optimizer Arguments and Scheduler Arguments
Improved Stop Training button responsiveness - now appears much earlier when Text Encoder caching starts
- Enhanced training control for better user experience
A new full dedicated section for Sample generation implemented
- It will automatically format your given sample txt file with the settings you set on GUI
- So you just type prompts into txt file with new lines e.g.
  - ohwx man wearing a very nice amazing suit
  - ohwx man driving a luxury car
Please use latest zip file, overwrite previous files and just run Windows_Install_and_Update.bat

How To Give Accurate Dataset for Dataset Prepare

Example folder : E:\training_imgs_28_1328
- So you give above path into UI
Inside this folder example sub folder for dataset images
- E:\training_imgs_28_1328\1_ohwx man
Inside E:\training_imgs_28_1328\1_ohwx man
- I have my images as below

How To Install and Use:

Windows:

Full tutorial coming soon hopefully
Use same folder logic of Kohya and use Generate Dataset Configuration button it will handle all
- e.g. Parent Folder > sub folder like 1_ohwx man
Just use Windows_Install_and_Update.bat for install and update

Massed Compute (Recommend Cloud) :

Please register via this link : https://vm.massedcompute.com/signup?linkId=lp_034338&sourceId=secourses&tenantId=massed-compute
- We have a special coupon for all GPUs : SECourses
  If you want to learn more about GPUs and prices read this link : https://www.patreon.com/posts/126671823
  Select RTX A6000 or Better GPU - like L40S or A6000 ADA or A100 or H100 or now RTX 6000 PRO
- Then select our image SECourses from Creator dropdown
- Then follow Massed_Compute_Instructions_READ.txt
- Same as my any other Massed Compute installer script
- Example tutorial for learn how to install and use Massed Compute
  - (Starts at 12:58) : https://youtu.be/KW-MHmoNcqo?si=G1WbG-Qw4ujWvOtG&t=778

RunPod (Cloud):

Please register via this link : https://get.runpod.io/955rkuppqv4h
- Then follow Runpod_Instructions_READ.txt
- Same as my any other RunPod installer script
- Use the template written in Runpod_Instructions_READ.txt file
- Example tutorial for learn how to install and use RunPod
  - (starts at 22:03) : https://youtu.be/KW-MHmoNcqo?si=QN8X8Sjn13ZYu-EU&t=1323