NokiMo
Furkan Gözükara
Furkan Gözükara

patreon


All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One

Patreon exclusive posts index to find our scripts easily, Patreon exclusive posts index to see which scripts updated or added last and amazing Patreon special generative scripts list that you can use in any of your task.

Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord

Please also Star, Watch and Fork our Stable Diffusion & Generative AI  GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)

=======

App Screenshots Gallery here (Public) : https://www.reddit.com/r/SECourses/comments/1mfedbw/all_joy_captions_joycaption_apps_1click/

For mass editing captions > https://www.patreon.com/posts/108992085

Latest installers : Joy_Caption_v40.zip

I have developed 4-seperate amazingly advanced Gradio APPs for

21 September 2025 Update

18 September 2025 Update

Windows Requirements

22 August 2025 Update

17 August 2025 Update

17 May 2025 Update

5 January 2025 Update

29 December 2024 Update

Features

All apps has the following amazing features. If any of them are broken please report and let me know

How To Install And Use

Using 4-bit Quantization reduces VRAM usage but also slows down.

When using multi-GPU batch captioning, the displayed statistics on CMD could be wrong but still should work perfectly fine.

Massed Compute (Recommend Cloud) :

RunPod (Cloud):

All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One

Comments

we have this app give it a try : https://www.patreon.com/posts/108992085

Furkan Gözükara

What is the best tool for dataset exploration? It seems I have too many repeating tags and very few unique ones. I’d like to thoroughly review a 'cloud' of the data so that I can sort or remove images based on it

eman eliforp

you can write custom prompts. or you mean something else?

Furkan Gözükara

Is it possible to add an option in Extra Options of how the object such as a car is viewed. Like the front and side of the car is visible of the image.

Hockey

you are welcome

Furkan Gözükara

I haven’t upgraded since V19, and I’m really impressed with the improvements. Thank you for all the effort you’ve put into this.

benjamin macauley

you did set hugging face token and it is expired. open a cmd and run this : huggingface-cli logout and restart download

Furkan Gözükara

if anyone is having issues with downloading AI models ##\Joy_Caption_v38 as remote repo cannot be accessed in snapshot_download (401 Client Error: Unauthorized for url: https://huggingface.co/api/models/MonsterMMORPG/CapFiles/revision/main## you can use: set HUGGINGFACE_HUB_TOKEN=hf_xxxxxxxxxx && set HF_TOKEN=%HUGGINGFACE_HUB_TOKEN% && cd /d C:\path\to\our\folder && Windows_Download_Models.bat

Christoph Behrmann

no it definitely works with cuda 12.8

Furkan Gözükara

WARNING. For this auto installer to work you need to have installed Python 3.10.11, Git, FFmpeg, CUDA 12.8 and C++ tools --this is first line when starting 'Windows_Install.bat' -- when running with my 3090 I was thinking it does only work with CUDA 12.6 ?

Christoph Behrmann

it took a while but i added for you 22 August 2025 Update With V37 now there is extra option for Beta One app If you enable Unload model and clear VRAM after each caption before first run, it will fully unload model after each caption and clear VRAM

Furkan Gözükara

This is working very well where many others do not, thank you! Is it possible to have this unload the model, or clear cache / vram after generating a caption (or give a button to do this function if it isn't automatic)? In windows, when I generate the caption (using nf4), then put that caption into comfy, i see that my resources are still consumed ( I am running a 4080 super (16GB) with 64GB DDR5) and it forces me to restart joycaption. Also if I could save the checkbox preferences that's a nice to have. Thank you again for your content and contributions Edit: I should have mentioned, I am using Beta One

Joe

can you eloborate more what you mean?

Furkan Gözükara

you can use here to join discord : https://discord.com/servers/software-engineering-courses-secourses-772774097734074388

Furkan Gözükara

Can you update discord invite link?

Philipp Ludwig

Can you compare 2 image with this app? (Flux kontext training)

gianni angelini

thank you for reporting with V36 now it will sort first and then process. just extract new zip file and overwrite older files

Furkan Gözükara

did you try option 4 ? looks like option 1 cant run in 8 gb - also did you enable quantization options? check interface

Furkan Gözükara

trying to get joy Caption to work, even after several installations i got CUDA out of Memory errors. Anyone could help?? SECourses JoyCaption Image Captioning App - Select to Start 1. Joy Caption Pre Alpha 2. Joy Caption Alpha One 3. Joy Caption Alpha Two 4. Joy Caption Beta One 5. Exit Enter your choice (1-5): 1 Loading CLIP Using a slow image processor as `use_fast` is unset and a slow processor was saved with this model. `use_fast=True` will be the default behavior in v4.52, even if the model was saved with a slow processor. This will result in minor differences in outputs. You'll still be able to use a slow processor with `use_fast=False`. [2025-08-16 21:16:29,497] [INFO] [real_accelerator.py:239:get_accelerator] Setting ds_accelerator to cuda (auto detect) test.c LINK : fatal error LNK1181: Eingabedatei "aio.lib" kann nicht geöffnet werden. test.c LINK : fatal error LNK1181: Eingabedatei "cufile.lib" kann nicht geöffnet werden. W0816 21:16:37.515000 17928 venv\Lib\site-packages\torch\distributed\elastic\multiprocessing\redirects.py:29] NOTE: Redirects are currently not supported in Windows or MacOs. Loading tokenizer * Running on local URL: http://127.0.0.1:7860 * To create a public link, set `share=True` in `launch()`. Processing C:\Users\theow\AppData\Local\Temp\gradio\43d88ba8957c6994609b3a0f883f08115c192acb053be29e3d5236165d4ba990\image027.jpg on GPU 0... Loading model for GPU 0 Loading checkpoint shards: 100%|█████████████████████████████████████████████████████████| 4/4 [00:39<00:00, 9.91s/it] Error processing image C:\Users\theow\AppData\Local\Temp\gradio\43d88ba8957c6994609b3a0f883f08115c192acb053be29e3d5236165d4ba990\image027.jpg on GPU 0: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 8.00 GiB of which 0 bytes is free. Of the allocated memory 7.05 GiB is allocated by PyTorch, and 194.77 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation. See documentation for Memory Management (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables) Image shape: (1365, 1024, 3) Processing C:\Users\theow\AppData\Local\Temp\gradio\210c49e2c309d72f744d0ba227a4b8cc1fdf827b312e37f6aca32b7df7840ba0\image028.jpg on GPU 0... Loading model for GPU 0 Loading checkpoint shards: 0%| | 0/4 [00:08

Ingo Weigel

I am using batch processing with Beta One, and it is absolutely brilliant - I have for example 500 sub-folders - and it does NOT go thru (sub-)folders from A-Z, it looks like some random order... I was on the road for 2 days and it was doing its job for 200k+ images... just to let you know about "A-Z order NOT working on my end" ...better saying: I cannot make out any order :)

Christoph Behrmann

sadly not. just have python 3.10.11 installed in your system. it doesnt have to be default. my installer will use it

Furkan Gözükara

Hi, could you please modify the installer so that it can (also) use Python 3.12? 3.10 is pretty old and most other current tools need 3.11 or newer. You did this Python selection code already nicely for the ComfyUI installer, so should not be a big deal to add this to Joy Caption, right?

Jason Dawn

ok but which version of the app :D Joy_Caption_Alpha_One or Joy_Caption_Alpha_Two etc

Furkan Gözükara

Thanks much for this amazing resource. There appears to be an issue with v34 - changing caption type has no effect. they all come out as "detailed" and none of the options seem to be taking effect?

Krush Oldwin

fixed the error. please use latest zip file. sorry for this.

Furkan Gözükara

it is related to linux. 100% safe to ignore on windows. pytorch related not me :D

Furkan Gözükara

it is related to linux. 100% safe to ignore on windows. pytorch related not me :D

Furkan Gözükara

it is related to linux. 100% safe to ignore on windows. pytorch related not me :D

Furkan Gözükara

With all your release lately I have this: LINK : fatal error LNK1181: cannot open input file 'aio.lib' issue and I'm not alone. Are you planning to fix that or at least point at how to fix it.

RayHell

Enter your choice (1-5): 4 [2025-05-18 20:53:26,278] [INFO] [real_accelerator.py:239:get_accelerator] Setting ds_accelerator to cuda (auto detect) test.c LINK : fatal error LNK1181: 'aio.lib' giriş dosyası açılamıyor test.c LINK : fatal error LNK1181: 'cufile.lib' giriş dosyası açılamıyor W0518 20:53:34.810000 15380 venv\Lib\site-packages\torch\distributed\elastic\multiprocessing\redirects.py:29] NOTE: Redirects are currently not supported in Windows or MacOs. 🛠️ System configuration: Python : 3.10.11 (H:\Joy_Caption_v33\venv\Scripts\python.exe) PyTorch : 2.7.0+cu128 ‣ CUDA build : 12.8 transformers : 4.51.3 bitsandbytes : 0.45.5 liger_kernel : 0.5.8 GPUs (total 1): • [0] NVIDIA GeForce RTX 3080 Ti | compute 8.6 | 12.0 GiB * Running on local URL: http://127.0.0.1:7860 * To create a public link, set `share=True` in `launch()`.

Cemil Hacimahmutoglu

In the batch caption option of Joycaption beta 1, it seems to create .txt files for each but for quite a few times when processing the images in batch mode it says "[date & time] [INFO] [real_accelerator.py:239:get_accelerator] Setting ds_accelerator to cuda (auto detect) test.c LINK : fatal error LNK1181: cannot open input file 'aio.lib' test.c LINK : fatal error LNK1181: cannot open input file 'cufile.lib'" Do those matter? Is that something to do with trying to do CUDA acceleration and it failing but then still continuing to caption but not in an accelerated way? Also quite a few times when processing the images in batch mode it says " NOTE: Redirects are currently not supported in Windows or MacOs.". I don't think it needs to output that multiple times if it's not really needed. It doesn't seem to be outputting to the console the number of images it's processed in batch mode so far and which one it's on and how many are left to do like Joycaption alpha 2 did. And it might be good if it also output a message to the console when it's finished processing the batch if possible (I think the alpha 2 did that). Otherwise there's nothing shown to indicate it's finished. It could list at the end the number processed, number skipped etc. how long it took, like before if possible. Also when you press "start batch processing" that button doesn't change or become disabled until it finishes and no "cancel/stop batch processing" button shows. It would help if a cancel/stop batch processing button showed in case needed like you could with Joycaption Alpha 2 (eg. in case you've not set the right things and need to stop it quickly). edit: though it does show a processing % at the top of the GUI in batch mode. But showing some processing info in the console could help too.

cool1

No matching distribution found for triton>=2.3.1

shen oracle

you are welcome. also there was an error fixed with v33

Furkan Gözükara

i have the same problem hopefully solved soon

darcy

Same :(

Marko Radosavljevic

Apparently venv is not installed correctly, modules are missing

Harkonnen

Thanks for the update. Though when I select to start (eg. 3 or 4) it says: "ModuleNotFoundError: No module named 'gradio'" Also when running the download models.bat it didn't seem to get to 100% on everything but then said they're all downloaded and rerunning "download models" again again says they're all downloaded. So I don't know if I have partially downloaded model files now. Ideally it could be able to detect that and continue where it left off if any are incomplete.

cool1

Thanks Mate, appreciate that. :-)

Bartosz Polecki

yes please set your virtual RAM to 100 GB. look youtube so easy to set. your RAM is not being sufficient

Furkan Gözükara

i'm a bit dense when it comes to these things, but you said this would work for 8GB GPU systems? i have an 8 GB gaming laptop, but whenever i've tried to run this even with fp16 and 4-bit Quantization i get the error "The paging file is too small for this operation to complete." i assume i have to do something with the RAM memory, but i wanted to hear what suggestions you have

Rikard

Duplicate remover app published it took 2 weeks but finally done :) I coded for you : https://www.patreon.com/posts/120352012

Furkan Gözükara

i agree with you. i will try to make better hopefully

Furkan Gözükara

You need to include or put the update bat file inside the one-click installer script with all of your one-click installer scripts because it is a hard and time-consuming process. Second, in this way, ssd stroage tbw performance get expired fastly because ai tools are large file tools and it makes ssd obsolete

Anshul Gupta

For now, one way is with Digikam (a free image organizer/viewer). You can right click on a folder (that's in an album) and select "Find duplicates" and you can specify a similarity range %. Though you first need to click "update fingerprints" on the images as far as I know. (if it hadn't already been done for those images). Though it would be easier if there was just a simple way for a basic option in a program to just select a folder and it told you the duplicates (ie. a more basic/lightweight option than using digikam for it).

cool1

Thanks a lot. I'll try that. edit: It seems to work okay now thanks. The stopping and starting batch processing seems okay and it's not showing those errors on each file now. It takes quite a long time for the first image in the batch after re-starting the batch (about 2 mins) but I assume it's loading the model then. I don't remember the alpha 1 taken that long. Maybe the model is a bigger than alpha 1. But it's working. It's a lot faster at re-starting batch processing when it wasn't stopped with the "stop batch processing" option (doesn't take mins on the first image). edit: the first image in the batch has sped up a lot now after re-starting batch processing after stopping it part way through (it doesn't take 2 mins now just seconds). Maybe there was something else running like Adobe's creative cloud UI helper that was making it slower before.

cool1

coding it right now for you stay tuned

Furkan Gözükara

How to eliminate duplicated/similar images? Which tool is best?

eman eliforp

download v31 fixed all errors. also read changelogs

Furkan Gözükara

I've just tried deleting all the .txt files from that folder and re-pressing "start batch processing" and it still does the same thing. It still immediately shows "Starting batch processing. Found 1 GPU(s).. Batch processing complete. Batch processing complete. Processed: 0..." (if I use that some time after stopping batch processing. I assume if I ctrl C out of the program and re-start it it would work again, but only until I stopped batch processing once - which seems to take longer than alpha 1 did). I can go back to Alpha 1 or Pre Alpha for now though. I assume if the same stop/start method was used that was used in alpha 1 was used it might work the same way as that, which was working. But if you changed if for other things like multi-GPU maybe the default way could work the same way as alpha 1 (most people probably only have 1 GPU).

cool1

try fresh folder no txt files and see if works

Furkan Gözükara

Thank you. I downloaded 29_2 and extracted that. It shows as V29 in the gradio top of the screen but it works differently so it must have extracted okay. When I ran it it and then pressed "stop batch processing" it immediately said in the command window "Stop Clicked Stopping batch processing. Please wait for current tasks to finish...". So I waited and then it stopped soon after (probably not processing more than the first image). Though it seemed to take longer to actually stop than Alpha1. It still gives the error messages about existing .txt files being there but I can ignore those. But after it has fully stopped (it's said "Batch processing complete. Batch processing complete. Processed: 0..." and I try to to start the batch processing again by pressing "Start batch processing" it immediately says "Starting batch processing. Found 1 GPU. what the input & output folders are, and immediately "Batch processing complete. Batch processing complete. Processed: 0...". So each time I press "start batch processing" (some time after it's fully stopped after using "stop batch processing") it won't restart again. "Overwrite existing caption file" option is still checked so it shouldn't be failing because of existing .txt files.

cool1

you were right. for some reason it had overwritten older file. please try Joy_Caption_v29_v2.zip

Furkan Gözükara

I've tried that. I've let it go past 1 too and then kept pressing it, nothing happens, it just seems to keep going on the next .txt files until it gets the end, no matter how many times I press "stop batch processing". Alpha 1 worked ok for this. Gpu IDs box is set to 0 (the default). I don't know if that affects it. I only have 1 GPU (3090).

cool1

i tried regular batch it worked. start processing let it process like 1 image and then stop you will see.

Furkan Gözükara

I extracted the v29 files. It shows v29 in the gradio top of screen for Alpha 2, but when I press "stop batch processing" now while it's processing a batch, nothing seems to happen. It doesn't give the error message now though. The code in alpha 2 for checking for stopping seems different to alpha 1's checking in process_gpu_batch. I don't know if copying the way it was doing it in Alpha 1 might make it stop the batch processing in the same way as alpha1 as that one worked ok.

cool1

you are welcome

Furkan Gözükara

Thanks a lot

cool1

fixed with v29 zip

Furkan Gözükara

Also another issue with JoyCaption Alpha 2 is pressing the "stop batch processing" button doesn't stop it from processing the images in the folder. It says the following error: ...\Joy_Caption_Alpha_Two.py", line 649, in stop_batch_processing_fn stop_flag.value = True NameError: name 'stop_flag' is not defined then continues captioning the rest of the images in the folder.

cool1

yes you can ignore it.

Furkan Gözükara

It seem like there might be a bit of a bug/issue with batch captioning with JoyCaption Alpha 2. I batch captioned a folder that already had .txt files with short captions for each image - so captioning to make them longer. I had "Overwrite existing caption file" checked at the time I selected "batch caption". It seems to have captioned every image in the folder, updating the .txt files with the longer captions, but after each one it shows in the command window "Error processing image [folder & filename]: [folder & filename] is the same as [folder & filename].". Then at the end of the batch it shows in the command prompt "Processed 0/45 | skipped: 0| Failed: 45" (the GUI "batch processing progress" window also mentions the same number failed) but that's incorrect as it seems to have added longer captions to all the existing .txt files in the folder and not failed on them.

cool1

In their page about it they scored JoyCaption Alpha 2 at 22.8 for avg score (top) but that's only slightly higher than the 22.6 that they scored Pre Alpha. Though alpha 2 has more options and they did add more training stuff to it. Though they said standard deviation for Pre Alpha and Alpha 2 are both 5.32 so they think that's equal (I would have thought one would be better if the alpha 2 is scored slightly better). Also they only got those scores from checking 15 images. Ideally the could have tested more. It still sometimes mixes up the left and right side (eg. talking about the person's right hand when it's actually their left hand, and re-running it can make it correct it. It might get that wrong 25% of the time though I haven't done a proper test). I noticed in Alpha 1 version, when captioning a single image that you drag to the input image section, it output the images with the same filename as the source image to the output folder (so you'd have multiple images being output there). But the new Joycaption Alpha 2 version is outputting them all as "temp_input_image.png". I assume that's intentional, though it's different than before. In JoyCaption Alpha 2 it shows "Gradio" in the browser tab. Ideally that could show "JoyCaption Alpha 2" there so it's clearer when switching between tabs/windows though it's not vital.

cool1

yes could be

Furkan Gözükara

When I mentioned the windows process sometimes going to about 2.5% CPU, I think now it was probably nothing to do with JoyCaption or "stop batch processing". It's "Shell Infrastructure Host" that goes to that much and perplexity.ai says that could be caused by a memory leak in the windows photo viewer and I'd used that (legacy version). So it was probably that. I'll use a different image viewer in future so it shouldn't have that problem.

cool1

the apps has custom prompt boxes you mean that?

Furkan Gözükara

Is it possible to add the prompt description box - where I can specify what the model should keep in "mind" as it captions. Prefix and suffix are the best but very limiting

Sphiwe April

just made v28 update 1 : fixed 2 : please read changes and see if fixes or elaborate more

Furkan Gözükara

Perfect timing on this release. Feature request for batch processing; the capability of working through subfolders within the target folder. Edit: A way to offload the model without closing and reopening it would also be appreciated.

H M

ye this model is not that great. but since people requested i added :D

Furkan Gözükara

Extra options causes error, spits out random caption and takes over a minute. i selected the first two options on extra options, also selected Do Not mention the image's resolution, and Do not use any ambigous language.

DarkView AI

you should test all. also fp16 is for older gpus dont use unless you needed. 4 bit is good for lower vram gpus especially if you dont have 24 gb gpu

Furkan Gözükara

Thanks a lot. In the Alpha 2 version it has 2 new checkboxes, one "Use FP16 Model Type" and one "Use 4-bit Quantization". What does it use for those if you don't select them? If they're not selected does it use a higher bit model type (like FP32?) and no quantization? I assume we should leave both those blank for the most accurate captions if we have enough VRAM and we are okay with it using that higher VRAM? In one of the top new options for Alpha 2 there's an option to "not include information about people/characters that cannot be changed (like ethnicity, gender etc)...". Have there been any tests anywhere to see if doing that actually helps? I don't know if by adding those descriptions in the captions if it might make you be more likely to need to specify them when prompting (after training), but I thought including them might also help it train them quicker (by starting learning with learned concepts that are similar to that). When you use the "stop batch processing" option, if you kept doing that a lot does that ever cause problems? Like is it closing all processes and freeing things fully? eg. a few times after using the previous Alpha 1 version I've had one of the windows processes start using about 2.5% CPU after stopping batch processing a few times (which made things feel slower) but I don't know if it was caused by that or other things (since I'd been using multiple apps), but it went to 0 after rebooting (task manager said it might cause instability if I end-tasked that windows process which I was trying to restart to get it to 0% CPU).

cool1

thank you so much

Furkan Gözükara

i think Joy Caption Pre Alpha but you can compare all 3

Furkan Gözükara

Amazing, thanks a lot! Which JoyCaption Model works best?

Markus

Hi. Its unbelievable. You are so efficient, so we just recently talked about this project. I really appreciate your work, a lot of time and effort spent for this /and for us, your subscribers. Thank you and All the best to you, Master!

Ant-2014


Related Creators