Furkan Gözükara

All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One

Added 2025-09-21 19:00:00 +0000 UTC

Patreon exclusive posts index to find our scripts easily, Patreon exclusive posts index to see which scripts updated or added last and amazing Patreon special generative scripts list that you can use in any of your task.

Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord

Please also Star, Watch and Fork our Stable Diffusion & Generative AI GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)

=======

App Screenshots Gallery here (Public) : https://www.reddit.com/r/SECourses/comments/1mfedbw/all_joy_captions_joycaption_apps_1click/

For mass editing captions > https://www.patreon.com/posts/108992085

Latest installers : Joy_Caption_v40.zip

I have developed 4-seperate amazingly advanced Gradio APPs for

Joy Caption Pre Alpha
Joy Caption Alpha One
Joy Caption Alpha Two
Joy Caption Beta One - Newest and Best
- Beta One is extremely fast, advanced, and supports full batch processing
When doing multi-GPU batch captioning displayed statistics could be wrong

21 September 2025 Update

With v40, we have added full preset save and load feature to Joy Caption Beta One
You can now even add new Extra Options (for non-custom prompts)
It will remember all your saved settings in preset and auto load next time you restart your app
Presets will be saved inside presets folder
Extract zip file and overwrite previous files for update that is sufficient to upgrade from V18

18 September 2025 Update

With v39, we have significantly improved Joy Caption Beta One
Now it fully and properly utilizes liger_kernel
The processing speed of BF16 is extremely fast
You will lose a lot of speed in Int8 or NF4 - so use only if you get out of VRAM error
Multiple GPU processing of Joy Caption Beta One completey rewritten and now works amazing and ultra fast
Installation libraries upgraded to Torch 2.8, Triton 3.4, CUDA 12.9 for Windows, RunPod and Massed Compute
- I am using all pre-compiled libraries so it should work fairly well as long as you have Python 3.10.11 and Git installed
Extract zip file and overwrite previous files. Moreover, i recommend delete venv folder and make a fresh install of venv

Windows Requirements

Python 3.10, FFmpeg, CUDA 12.9, C++ tools, MSVC and Git
If it doesn't work make sure to below tutorial and install everything exactly as shown in this below tutorial
https://youtu.be/DrhUHnYfwC0

22 August 2025 Update

With V38 now there is extra option for Beta One app
- If you enable Unload model and clear VRAM after each caption before first run, it will fully unload model after each caption and clear VRAM

17 August 2025 Update

With V36 now it will process with sorted by name all folders and sub-folders in all apps
- So it will sort files and folders by name first and then will do batch process
  - E.g. A > B > C etc
To update to V36 just extract files from zip and overwrite older files that is it

17 May 2025 Update

Joy Caption Beta One app added to our installers
Now fully supports RTX 5000 series GPUs with all kind of speed improvements
- Installs with xFormers, Flash Attention, Sage Attention, Triton, DeepSpeed
- Thus, Joy Caption Beta One works ultra fast
- If you get Triton error use Clear_Triton_Cache.bat
Delete older venv and make a fresh install

5 January 2025 Update

Joy Caption Alpha Two stop batch processing problem fixed
Joy Caption Alpha Two batch statistics display on CMD fixed
Joy Caption Alpha Two batch "Error processing image d.JPG: 'd.JPG' and 'd.JPG' are the same file" error fixed
Joy Caption Alpha Two even multi-GPU batch processing displays accurate statistics and progress on CMD right now

29 December 2024 Update

Batch processing now will process sub-folders as well
When using Pre Alpha or Alpha One, batch processing was slower on single GPU and was loading model again. This issue fixed

Features

All apps has the following amazing features. If any of them are broken please report and let me know

Use 4-bit Quantization
Input Folder for Batch Processing
Output Folder for Batch Processing (Optional)
Multi-GPU Batch captioning (e.g. 0,1,2)
Overwrite existing caption file - by default skips
Append new caption to existing caption
Remove newlines from generated captions
Cut off at last complete sentence
Discard repeating sentences
Don't save processed image
Resize Input Image Max Resolution
Caption Prefix and Caption Suffix

How To Install And Use

Using 4-bit Quantization reduces VRAM usage but also slows down.

When using multi-GPU batch captioning, the displayed statistics on CMD could be wrong but still should work perfectly fine.

Massed Compute (Recommend Cloud) :

Please register via this link : https://vm.massedcompute.com/signup?linkId=lp_034338&sourceId=secourses&tenantId=massed-compute
- Use our coupon SECourses
- Our coupon works on all GPUs now
  - H100 has amazing price and speed but you can use like RTX A6000 ADA as well
  - Full details here : https://www.patreon.com/posts/26671823
- Then select our image SECourses from Creator dropdown
- Then follow Massed_Compute_Instructions_READ.txt
- Same as my any other Massed Compute installer script
- Example tutorial for learn how to install and use Massed Compute
  - (Starts at 12:58) : https://youtu.be/KW-MHmoNcqo?si=G1WbG-Qw4ujWvOtG&t=778

RunPod (Cloud):

Please register via this link : https://get.runpod.io/955rkuppqv4h
- Then follow Runpod_Instructions_READ.txt
- Same as my any other RunPod installer script
- Use the template written in Runpod_Instructions_READ.txt file
- Example tutorial for learn how to install and use RunPod
  - (starts at 22:03) : https://youtu.be/KW-MHmoNcqo?si=QN8X8Sjn13ZYu-EU&t=1323

All Joy Captions (JoyCaption) Apps 1-Click Installers for Windows, RunPod, Massed Compute With Improved Gradio Apps - All Supporting Batch Processing, JoyCaption Pre-Alpha, Alpha One, Alpha Two and Joy Caption Beta One

Comments

we have this app give it a try : https://www.patreon.com/posts/108992085

Furkan Gözükara

2025-09-29 22:43:37 +0000 UTC

What is the best tool for dataset exploration? It seems I have too many repeating tags and very few unique ones. I’d like to thoroughly review a 'cloud' of the data so that I can sort or remove images based on it

eman eliforp

2025-09-29 19:05:45 +0000 UTC

you can write custom prompts. or you mean something else?

Furkan Gözükara

2025-09-21 17:46:34 +0000 UTC

Is it possible to add an option in Extra Options of how the object such as a car is viewed. Like the front and side of the car is visible of the image.

Hockey

2025-09-21 13:04:42 +0000 UTC

you are welcome

Furkan Gözükara

2025-09-20 07:43:36 +0000 UTC

I haven’t upgraded since V19, and I’m really impressed with the improvements. Thank you for all the effort you’ve put into this.

benjamin macauley

2025-09-19 22:00:50 +0000 UTC

you did set hugging face token and it is expired. open a cmd and run this : huggingface-cli logout and restart download

Furkan Gözükara

2025-08-28 14:56:09 +0000 UTC

if anyone is having issues with downloading AI models ##\Joy_Caption_v38 as remote repo cannot be accessed in snapshot_download (401 Client Error: Unauthorized for url: https://huggingface.co/api/models/MonsterMMORPG/CapFiles/revision/main## you can use: set HUGGINGFACE_HUB_TOKEN=hf_xxxxxxxxxx && set HF_TOKEN=%HUGGINGFACE_HUB_TOKEN% && cd /d C:\path\to\our\folder && Windows_Download_Models.bat

Christoph Behrmann

2025-08-28 14:33:14 +0000 UTC

no it definitely works with cuda 12.8

Furkan Gözükara

2025-08-28 12:57:16 +0000 UTC

WARNING. For this auto installer to work you need to have installed Python 3.10.11, Git, FFmpeg, CUDA 12.8 and C++ tools --this is first line when starting 'Windows_Install.bat' -- when running with my 3090 I was thinking it does only work with CUDA 12.6 ?

Christoph Behrmann

2025-08-28 12:54:06 +0000 UTC

it took a while but i added for you 22 August 2025 Update With V37 now there is extra option for Beta One app If you enable Unload model and clear VRAM after each caption before first run, it will fully unload model after each caption and clear VRAM

Furkan Gözükara

2025-08-21 21:26:56 +0000 UTC

This is working very well where many others do not, thank you! Is it possible to have this unload the model, or clear cache / vram after generating a caption (or give a button to do this function if it isn't automatic)? In windows, when I generate the caption (using nf4), then put that caption into comfy, i see that my resources are still consumed ( I am running a 4080 super (16GB) with 64GB DDR5) and it forces me to restart joycaption. Also if I could save the checkbox preferences that's a nice to have. Thank you again for your content and contributions Edit: I should have mentioned, I am using Beta One

Joe

2025-08-21 13:59:02 +0000 UTC

can you eloborate more what you mean?

Furkan Gözükara

2025-08-17 14:25:49 +0000 UTC

you can use here to join discord : https://discord.com/servers/software-engineering-courses-secourses-772774097734074388

Furkan Gözükara

2025-08-17 14:25:28 +0000 UTC

Can you update discord invite link?

Philipp Ludwig

2025-08-17 13:37:58 +0000 UTC

Can you compare 2 image with this app? (Flux kontext training)

gianni angelini

2025-08-17 07:41:57 +0000 UTC

thank you for reporting with V36 now it will sort first and then process. just extract new zip file and overwrite older files

Furkan Gözükara

2025-08-16 22:35:32 +0000 UTC

did you try option 4 ? looks like option 1 cant run in 8 gb - also did you enable quantization options? check interface

Furkan Gözükara

2025-08-16 21:46:32 +0000 UTC

trying to get joy Caption to work, even after several installations i got CUDA out of Memory errors. Anyone could help?? SECourses JoyCaption Image Captioning App - Select to Start 1. Joy Caption Pre Alpha 2. Joy Caption Alpha One 3. Joy Caption Alpha Two 4. Joy Caption Beta One 5. Exit Enter your choice (1-5): 1 Loading CLIP Using a slow image processor as `use_fast` is unset and a slow processor was saved with this model. `use_fast=True` will be the default behavior in v4.52, even if the model was saved with a slow processor. This will result in minor differences in outputs. You'll still be able to use a slow processor with `use_fast=False`. [2025-08-16 21:16:29,497] [INFO] [real_accelerator.py:239:get_accelerator] Setting ds_accelerator to cuda (auto detect) test.c LINK : fatal error LNK1181: Eingabedatei "aio.lib" kann nicht geöffnet werden. test.c LINK : fatal error LNK1181: Eingabedatei "cufile.lib" kann nicht geöffnet werden. W0816 21:16:37.515000 17928 venv\Lib\site-packages\torch\distributed\elastic\multiprocessing\redirects.py:29] NOTE: Redirects are currently not supported in Windows or MacOs. Loading tokenizer * Running on local URL: http://127.0.0.1:7860 * To create a public link, set `share=True` in `launch()`. Processing C:\Users\theow\AppData\Local\Temp\gradio\43d88ba8957c6994609b3a0f883f08115c192acb053be29e3d5236165d4ba990\image027.jpg on GPU 0... Loading model for GPU 0 Loading checkpoint shards: 100%|█████████████████████████████████████████████████████████| 4/4 [00:39<00:00, 9.91s/it] Error processing image C:\Users\theow\AppData\Local\Temp\gradio\43d88ba8957c6994609b3a0f883f08115c192acb053be29e3d5236165d4ba990\image027.jpg on GPU 0: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 8.00 GiB of which 0 bytes is free. Of the allocated memory 7.05 GiB is allocated by PyTorch, and 194.77 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation. See documentation for Memory Management (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables) Image shape: (1365, 1024, 3) Processing C:\Users\theow\AppData\Local\Temp\gradio\210c49e2c309d72f744d0ba227a4b8cc1fdf827b312e37f6aca32b7df7840ba0\image028.jpg on GPU 0... Loading model for GPU 0 Loading checkpoint shards: 0%| | 0/4 [00:08

Ingo Weigel

2025-08-16 19:40:17 +0000 UTC

I am using batch processing with Beta One, and it is absolutely brilliant - I have for example 500 sub-folders - and it does NOT go thru (sub-)folders from A-Z, it looks like some random order... I was on the road for 2 days and it was doing its job for 200k+ images... just to let you know about "A-Z order NOT working on my end" ...better saying: I cannot make out any order :)

Christoph Behrmann

2025-08-15 16:02:13 +0000 UTC

sadly not. just have python 3.10.11 installed in your system. it doesnt have to be default. my installer will use it

Furkan Gözükara

2025-08-03 11:04:00 +0000 UTC

Hi, could you please modify the installer so that it can (also) use Python 3.12? 3.10 is pretty old and most other current tools need 3.11 or newer. You did this Python selection code already nicely for the ComfyUI installer, so should not be a big deal to add this to Joy Caption, right?

Jason Dawn

2025-08-03 09:30:53 +0000 UTC

ok but which version of the app :D Joy_Caption_Alpha_One or Joy_Caption_Alpha_Two etc

Furkan Gözükara

2025-08-01 09:16:41 +0000 UTC

Thanks much for this amazing resource. There appears to be an issue with v34 - changing caption type has no effect. they all come out as "detailed" and none of the options seem to be taking effect?

Krush Oldwin

2025-08-01 02:26:01 +0000 UTC

fixed the error. please use latest zip file. sorry for this.

Furkan Gözükara

2025-05-19 00:40:27 +0000 UTC

it is related to linux. 100% safe to ignore on windows. pytorch related not me :D

Furkan Gözükara

2025-05-19 00:40:13 +0000 UTC

it is related to linux. 100% safe to ignore on windows. pytorch related not me :D

Furkan Gözükara

2025-05-19 00:40:02 +0000 UTC

it is related to linux. 100% safe to ignore on windows. pytorch related not me :D

Furkan Gözükara

2025-05-19 00:39:55 +0000 UTC

With all your release lately I have this: LINK : fatal error LNK1181: cannot open input file 'aio.lib' issue and I'm not alone. Are you planning to fix that or at least point at how to fix it.

RayHell

2025-05-18 20:20:36 +0000 UTC

Enter your choice (1-5): 4 [2025-05-18 20:53:26,278] [INFO] [real_accelerator.py:239:get_accelerator] Setting ds_accelerator to cuda (auto detect) test.c LINK : fatal error LNK1181: 'aio.lib' giriş dosyası açılamıyor test.c LINK : fatal error LNK1181: 'cufile.lib' giriş dosyası açılamıyor W0518 20:53:34.810000 15380 venv\Lib\site-packages\torch\distributed\elastic\multiprocessing\redirects.py:29] NOTE: Redirects are currently not supported in Windows or MacOs. 🛠️ System configuration: Python : 3.10.11 (H:\Joy_Caption_v33\venv\Scripts\python.exe) PyTorch : 2.7.0+cu128 ‣ CUDA build : 12.8 transformers : 4.51.3 bitsandbytes : 0.45.5 liger_kernel : 0.5.8 GPUs (total 1): • [0] NVIDIA GeForce RTX 3080 Ti | compute 8.6 | 12.0 GiB * Running on local URL: http://127.0.0.1:7860 * To create a public link, set `share=True` in `launch()`.

Cemil Hacimahmutoglu

2025-05-18 18:02:25 +0000 UTC

In the batch caption option of Joycaption beta 1, it seems to create .txt files for each but for quite a few times when processing the images in batch mode it says "[date & time] [INFO] [real_accelerator.py:239:get_accelerator] Setting ds_accelerator to cuda (auto detect) test.c LINK : fatal error LNK1181: cannot open input file 'aio.lib' test.c LINK : fatal error LNK1181: cannot open input file 'cufile.lib'" Do those matter? Is that something to do with trying to do CUDA acceleration and it failing but then still continuing to caption but not in an accelerated way? Also quite a few times when processing the images in batch mode it says " NOTE: Redirects are currently not supported in Windows or MacOs.". I don't think it needs to output that multiple times if it's not really needed. It doesn't seem to be outputting to the console the number of images it's processed in batch mode so far and which one it's on and how many are left to do like Joycaption alpha 2 did. And it might be good if it also output a message to the console when it's finished processing the batch if possible (I think the alpha 2 did that). Otherwise there's nothing shown to indicate it's finished. It could list at the end the number processed, number skipped etc. how long it took, like before if possible. Also when you press "start batch processing" that button doesn't change or become disabled until it finishes and no "cancel/stop batch processing" button shows. It would help if a cancel/stop batch processing button showed in case needed like you could with Joycaption Alpha 2 (eg. in case you've not set the right things and need to stop it quickly). edit: though it does show a processing % at the top of the GUI in batch mode. But showing some processing info in the console could help too.

cool1

2025-05-18 17:14:54 +0000 UTC

No matching distribution found for triton>=2.3.1

shen oracle

2025-05-18 11:40:10 +0000 UTC

you are welcome. also there was an error fixed with v33

Furkan Gözükara

2025-05-18 11:28:23 +0000 UTC

i have the same problem hopefully solved soon

darcy

2025-05-18 04:29:10 +0000 UTC

Same :(

Marko Radosavljevic

2025-05-17 21:35:14 +0000 UTC

Apparently venv is not installed correctly, modules are missing

Harkonnen

2025-05-17 21:27:22 +0000 UTC

Thanks for the update. Though when I select to start (eg. 3 or 4) it says: "ModuleNotFoundError: No module named 'gradio'" Also when running the download models.bat it didn't seem to get to 100% on everything but then said they're all downloaded and rerunning "download models" again again says they're all downloaded. So I don't know if I have partially downloaded model files now. Ideally it could be able to detect that and continue where it left off if any are incomplete.

cool1

2025-05-17 20:19:58 +0000 UTC

Thanks Mate, appreciate that. :-)

Bartosz Polecki

2025-05-17 17:11:55 +0000 UTC

yes please set your virtual RAM to 100 GB. look youtube so easy to set. your RAM is not being sufficient

Furkan Gözükara

2025-03-27 23:47:59 +0000 UTC

i'm a bit dense when it comes to these things, but you said this would work for 8GB GPU systems? i have an 8 GB gaming laptop, but whenever i've tried to run this even with fp16 and 4-bit Quantization i get the error "The paging file is too small for this operation to complete." i assume i have to do something with the RAM memory, but i wanted to hear what suggestions you have

Rikard

2025-03-27 13:03:09 +0000 UTC

Duplicate remover app published it took 2 weeks but finally done :) I coded for you : https://www.patreon.com/posts/120352012

Furkan Gözükara

2025-01-19 23:49:12 +0000 UTC

i agree with you. i will try to make better hopefully

Furkan Gözükara

2025-01-08 00:35:59 +0000 UTC

You need to include or put the update bat file inside the one-click installer script with all of your one-click installer scripts because it is a hard and time-consuming process. Second, in this way, ssd stroage tbw performance get expired fastly because ai tools are large file tools and it makes ssd obsolete

Anshul Gupta

2025-01-07 14:09:49 +0000 UTC

For now, one way is with Digikam (a free image organizer/viewer). You can right click on a folder (that's in an album) and select "Find duplicates" and you can specify a similarity range %. Though you first need to click "update fingerprints" on the images as far as I know. (if it hadn't already been done for those images). Though it would be easier if there was just a simple way for a basic option in a program to just select a folder and it told you the duplicates (ie. a more basic/lightweight option than using digikam for it).

cool1

2025-01-05 11:35:57 +0000 UTC

Thanks a lot. I'll try that. edit: It seems to work okay now thanks. The stopping and starting batch processing seems okay and it's not showing those errors on each file now. It takes quite a long time for the first image in the batch after re-starting the batch (about 2 mins) but I assume it's loading the model then. I don't remember the alpha 1 taken that long. Maybe the model is a bigger than alpha 1. But it's working. It's a lot faster at re-starting batch processing when it wasn't stopped with the "stop batch processing" option (doesn't take mins on the first image). edit: the first image in the batch has sped up a lot now after re-starting batch processing after stopping it part way through (it doesn't take 2 mins now just seconds). Maybe there was something else running like Adobe's creative cloud UI helper that was making it slower before.

cool1

2025-01-05 11:33:54 +0000 UTC

coding it right now for you stay tuned

Furkan Gözükara

2025-01-05 11:30:08 +0000 UTC

How to eliminate duplicated/similar images? Which tool is best?

eman eliforp

2025-01-05 11:28:34 +0000 UTC

download v31 fixed all errors. also read changelogs

Furkan Gözükara

2025-01-05 11:27:12 +0000 UTC

I've just tried deleting all the .txt files from that folder and re-pressing "start batch processing" and it still does the same thing. It still immediately shows "Starting batch processing. Found 1 GPU(s).. Batch processing complete. Batch processing complete. Processed: 0..." (if I use that some time after stopping batch processing. I assume if I ctrl C out of the program and re-start it it would work again, but only until I stopped batch processing once - which seems to take longer than alpha 1 did). I can go back to Alpha 1 or Pre Alpha for now though. I assume if the same stop/start method was used that was used in alpha 1 was used it might work the same way as that, which was working. But if you changed if for other things like multi-GPU maybe the default way could work the same way as alpha 1 (most people probably only have 1 GPU).

cool1

2025-01-05 09:12:10 +0000 UTC

try fresh folder no txt files and see if works

Furkan Gözükara

2025-01-05 00:22:45 +0000 UTC

Thank you. I downloaded 29_2 and extracted that. It shows as V29 in the gradio top of the screen but it works differently so it must have extracted okay. When I ran it it and then pressed "stop batch processing" it immediately said in the command window "Stop Clicked Stopping batch processing. Please wait for current tasks to finish...". So I waited and then it stopped soon after (probably not processing more than the first image). Though it seemed to take longer to actually stop than Alpha1. It still gives the error messages about existing .txt files being there but I can ignore those. But after it has fully stopped (it's said "Batch processing complete. Batch processing complete. Processed: 0..." and I try to to start the batch processing again by pressing "Start batch processing" it immediately says "Starting batch processing. Found 1 GPU. what the input & output folders are, and immediately "Batch processing complete. Batch processing complete. Processed: 0...". So each time I press "start batch processing" (some time after it's fully stopped after using "stop batch processing") it won't restart again. "Overwrite existing caption file" option is still checked so it shouldn't be failing because of existing .txt files.

cool1

2025-01-05 00:14:05 +0000 UTC

you were right. for some reason it had overwritten older file. please try Joy_Caption_v29_v2.zip

Furkan Gözükara

2025-01-04 23:55:48 +0000 UTC

I've tried that. I've let it go past 1 too and then kept pressing it, nothing happens, it just seems to keep going on the next .txt files until it gets the end, no matter how many times I press "stop batch processing". Alpha 1 worked ok for this. Gpu IDs box is set to 0 (the default). I don't know if that affects it. I only have 1 GPU (3090).

cool1

2025-01-04 23:47:57 +0000 UTC

i tried regular batch it worked. start processing let it process like 1 image and then stop you will see.

Furkan Gözükara

2025-01-04 23:41:32 +0000 UTC

I extracted the v29 files. It shows v29 in the gradio top of screen for Alpha 2, but when I press "stop batch processing" now while it's processing a batch, nothing seems to happen. It doesn't give the error message now though. The code in alpha 2 for checking for stopping seems different to alpha 1's checking in process_gpu_batch. I don't know if copying the way it was doing it in Alpha 1 might make it stop the batch processing in the same way as alpha1 as that one worked ok.

cool1

2025-01-04 23:40:03 +0000 UTC

you are welcome

Furkan Gözükara

2025-01-04 14:18:53 +0000 UTC

Thanks a lot

cool1

2025-01-04 14:17:23 +0000 UTC

fixed with v29 zip

Furkan Gözükara

2025-01-04 14:12:44 +0000 UTC

Also another issue with JoyCaption Alpha 2 is pressing the "stop batch processing" button doesn't stop it from processing the images in the folder. It says the following error: ...\Joy_Caption_Alpha_Two.py", line 649, in stop_batch_processing_fn stop_flag.value = True NameError: name 'stop_flag' is not defined then continues captioning the rest of the images in the folder.

cool1

2025-01-04 11:15:32 +0000 UTC

yes you can ignore it.

Furkan Gözükara

2025-01-03 23:40:47 +0000 UTC

It seem like there might be a bit of a bug/issue with batch captioning with JoyCaption Alpha 2. I batch captioned a folder that already had .txt files with short captions for each image - so captioning to make them longer. I had "Overwrite existing caption file" checked at the time I selected "batch caption". It seems to have captioned every image in the folder, updating the .txt files with the longer captions, but after each one it shows in the command window "Error processing image [folder & filename]: [folder & filename] is the same as [folder & filename].". Then at the end of the batch it shows in the command prompt "Processed 0/45 | skipped: 0| Failed: 45" (the GUI "batch processing progress" window also mentions the same number failed) but that's incorrect as it seems to have added longer captions to all the existing .txt files in the folder and not failed on them.

cool1

2025-01-03 23:01:52 +0000 UTC

In their page about it they scored JoyCaption Alpha 2 at 22.8 for avg score (top) but that's only slightly higher than the 22.6 that they scored Pre Alpha. Though alpha 2 has more options and they did add more training stuff to it. Though they said standard deviation for Pre Alpha and Alpha 2 are both 5.32 so they think that's equal (I would have thought one would be better if the alpha 2 is scored slightly better). Also they only got those scores from checking 15 images. Ideally the could have tested more. It still sometimes mixes up the left and right side (eg. talking about the person's right hand when it's actually their left hand, and re-running it can make it correct it. It might get that wrong 25% of the time though I haven't done a proper test). I noticed in Alpha 1 version, when captioning a single image that you drag to the input image section, it output the images with the same filename as the source image to the output folder (so you'd have multiple images being output there). But the new Joycaption Alpha 2 version is outputting them all as "temp_input_image.png". I assume that's intentional, though it's different than before. In JoyCaption Alpha 2 it shows "Gradio" in the browser tab. Ideally that could show "JoyCaption Alpha 2" there so it's clearer when switching between tabs/windows though it's not vital.

cool1

2025-01-03 22:43:36 +0000 UTC

yes could be

Furkan Gözükara

2024-12-31 10:13:22 +0000 UTC

When I mentioned the windows process sometimes going to about 2.5% CPU, I think now it was probably nothing to do with JoyCaption or "stop batch processing". It's "Shell Infrastructure Host" that goes to that much and perplexity.ai says that could be caused by a memory leak in the windows photo viewer and I'd used that (legacy version). So it was probably that. I'll use a different image viewer in future so it shouldn't have that problem.

cool1

2024-12-31 09:00:12 +0000 UTC

the apps has custom prompt boxes you mean that?

Furkan Gözükara

2024-12-30 14:45:54 +0000 UTC

Is it possible to add the prompt description box - where I can specify what the model should keep in "mind" as it captions. Prefix and suffix are the best but very limiting

Sphiwe April

2024-12-30 13:43:08 +0000 UTC

just made v28 update 1 : fixed 2 : please read changes and see if fixes or elaborate more

Furkan Gözükara

2024-12-29 20:10:29 +0000 UTC

Perfect timing on this release. Feature request for batch processing; the capability of working through subfolders within the target folder. Edit: A way to offload the model without closing and reopening it would also be appreciated.

H M

2024-12-29 14:01:29 +0000 UTC

ye this model is not that great. but since people requested i added :D

Furkan Gözükara

2024-12-29 09:20:19 +0000 UTC

Extra options causes error, spits out random caption and takes over a minute. i selected the first two options on extra options, also selected Do Not mention the image's resolution, and Do not use any ambigous language.

DarkView AI

2024-12-29 02:50:26 +0000 UTC

you should test all. also fp16 is for older gpus dont use unless you needed. 4 bit is good for lower vram gpus especially if you dont have 24 gb gpu

Furkan Gözükara

2024-12-28 19:14:09 +0000 UTC

Thanks a lot. In the Alpha 2 version it has 2 new checkboxes, one "Use FP16 Model Type" and one "Use 4-bit Quantization". What does it use for those if you don't select them? If they're not selected does it use a higher bit model type (like FP32?) and no quantization? I assume we should leave both those blank for the most accurate captions if we have enough VRAM and we are okay with it using that higher VRAM? In one of the top new options for Alpha 2 there's an option to "not include information about people/characters that cannot be changed (like ethnicity, gender etc)...". Have there been any tests anywhere to see if doing that actually helps? I don't know if by adding those descriptions in the captions if it might make you be more likely to need to specify them when prompting (after training), but I thought including them might also help it train them quicker (by starting learning with learned concepts that are similar to that). When you use the "stop batch processing" option, if you kept doing that a lot does that ever cause problems? Like is it closing all processes and freeing things fully? eg. a few times after using the previous Alpha 1 version I've had one of the windows processes start using about 2.5% CPU after stopping batch processing a few times (which made things feel slower) but I don't know if it was caused by that or other things (since I'd been using multiple apps), but it went to 0 after rebooting (task manager said it might cause instability if I end-tasked that windows process which I was trying to restart to get it to 0% CPU).

cool1

2024-12-28 17:17:06 +0000 UTC

thank you so much

Furkan Gözükara

2024-12-28 16:23:05 +0000 UTC

i think Joy Caption Pre Alpha but you can compare all 3

Furkan Gözükara

2024-12-28 16:22:57 +0000 UTC

Amazing, thanks a lot! Which JoyCaption Model works best?

Markus

2024-12-28 16:18:59 +0000 UTC

Hi. Its unbelievable. You are so efficient, so we just recently talked about this project. I really appreciate your work, a lot of time and effort spent for this /and for us, your subscribers. Thank you and All the best to you, Master!

Ant-2014

2024-12-28 16:06:32 +0000 UTC

Related Creators

kagamic58

fanbox

mbahndolo

patreon

ペニバン美術

patreon

THUNDERX FLP

patreon

紙

fanbox

lktv

patreon

taotaille

patreon

causa

patreon

patreon

炭酸草

fanbox

son2j

patreon

Niwashi

fanbox

S̷a̷t̷a̷n̷

gumroad

Sarz

patreon

kagemi

fanbox

あさひな

fanbox

picklescombo

fanbox

LionCheater

patreon

half-truth

fanbox

Omega Games - My Girlfriend's New Kink

patreon

Poseidon 🔱

gumroad

がるぼーい

fanbox

MinciVore

patreon

foxsaliant

patreon

TheeLitOne

patreon

HooAaam

fanbox

Race

gumroad

Gentleman910

patreon

devildaddy

patreon

Charlie, The Steak

patreon

NosukeDeLibido

fanbox

w4g4

patreon

eosphorite

patreon

紙場こたる

fanbox

Ennie Ticia

patreon

zhase

fanbox

CathrynWolfess

patreon

あさしん。

fanbox

LittlePaws

fanbox

cey

fanbox

GTNTR

patreon

ImperialPotter

patreon

Kalamazoo Aviation

patreon

Filth Hound Fetish Works

gumroad

Ando San

patreon

Angel1227

patreon

guanyu19

fanbox

citrusheart

patreon

pew

patreon

RHAP

patreon