IndexTTS2 SECourses Premium Voice Cloning and Generation App - 1-Click to Install on Windows, RunPod and Massed Compute - Generate Entire Audiobooks With Consistent High Quality Voice
Added 2025-09-19 23:30:01 +0000 UTCPatreon exclusive posts index to find our scripts easily, Patreon exclusive posts index to see which scripts updated or added last and amazing Patreon special generative scripts list that you can use in any of your task.
Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord
Please also Star, Watch and Fork our Stable Diffusion & Generative AI GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)
=======
Latest installer zip file : Index_TTS_v3_1.zip
Higher quality YouTube video: https://youtu.be/YbgFVKWB7hs
I have significantly improved the app published here : https://github.com/index-tts/index-tts
Hopefully many more features coming and this is only initial release
Just run Windows_Install_or_Update.bat for installation
You only need Python 3.10.11 and Git and FFmpeg installed
I am using fully pre-compiled libraries for both Windows and Linux thus it should work with maximum speed on literally every GPU like RTX 2000 series, 3000, 4000, 5000, H100, B200, etc
Read the Gradio app interface extremely carefully since I added literally every option with detailed information and explanation
Default values are supposed to be good but you can play with values to further improve

25 September 2025 Update V3
Since official repo was getting out of git LFS quota and causing errors, everything uploaded into a new repo
Make a fresh install and you can move checkpoints folder into new install
24 September 2025 Update V2
Automatic FFmpeg installation added to RunPod and Massed Compute
Save Used Reference Audio added
Prevent VRAM Accumulation added - useful when High number Beam Search Beams used
Higher number Beam Search Beams really improves quality like 8
Load from Audio File Path added
Useful when you want to upload big audio into like RunPod or Massed Compute rather than Gradio live share link
Just run Windows_Install_or_Update.bat to update
More features coming hopefully soon
Windows Requirements
Python 3.10.11, CUDA 12.9, C++ tools, MSVC, FFmpeg and Git
If it doesn't work make sure to below tutorial and install everything exactly as shown in this below tutorial
Follow this post entirely along with the video : https://www.patreon.com/posts/111553210
This above post is fully updated with links and screenshots, so easy to follow
Massed Compute (Recommend Cloud) :
Please register via this link : https://vm.massedcompute.com/signup?linkId=lp_034338&sourceId=secourses&tenantId=massed-compute
Use our coupon SECourses
Our coupon works on all GPUs now
I recommend RTX 6000 PRO but this app works on every GPU
Full details here : https://www.patreon.com/posts/26671823
Then select our image SECourses from Creator dropdown
Then follow Massed_Compute_Instructions_READ.txt
Same as my any other Massed Compute installer script
Example tutorial for learn how to install and use Massed Compute
(Starts at 12:58) : https://youtu.be/KW-MHmoNcqo?si=G1WbG-Qw4ujWvOtG&t=778
RunPod (Cloud):
Please register via this link : https://get.runpod.io/955rkuppqv4h
Then follow Runpod_Instructions_READ.txt
Same as my any other RunPod installer script
Use the template written in Runpod_Instructions_READ.txt file
Example tutorial for learn how to install and use RunPod
(starts at 22:03) : https://youtu.be/KW-MHmoNcqo?si=QN8X8Sjn13ZYu-EU&t=1323

Patreon exclusive posts index to find our scripts easily, Patreon exclusive posts index to see which scripts updated or added last and amazing Patreon special generative scripts list that you can use in any of your task.
Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord
Please also Star, Watch and Fork our Stable Diffusion & Generative AI GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)
=======
Latest installer zip file : Index_TTS_v3.zip
Higher quality YouTube video: https://youtu.be/YbgFVKWB7hs
I have significantly improved the app published here : https://github.com/index-tts/index-tts
Hopefully many more features coming and this is only initial release
Just run Windows_Install_or_Update.bat for installation
You only need Python 3.10.11 and Git and FFmpeg installed
I am using fully pre-compiled libraries for both Windows and Linux thus it should work with maximum speed on literally every GPU like RTX 2000 series, 3000, 4000, 5000, H100, B200, etc
Read the Gradio app interface extremely carefully since I added literally every option with detailed information and explanation
Default values are supposed to be good but you can play with values to further improve

25 September 2025 Update V3
Since official repo was getting out of git LFS quota and causing errors, everything uploaded into a new repo
Make a fresh install and you can move checkpoints folder into new install
24 September 2025 Update V2
Automatic FFmpeg installation added to RunPod and Massed Compute
Save Used Reference Audio added
Prevent VRAM Accumulation added - useful when High number Beam Search Beams used
Higher number Beam Search Beams really improves quality like 8
Load from Audio File Path added
Useful when you want to upload big audio into like RunPod or Massed Compute rather than Gradio live share link
Just run Windows_Install_or_Update.bat to update
More features coming hopefully soon
Windows Requirements
Python 3.10.11, CUDA 12.9, C++ tools, MSVC, FFmpeg and Git
If it doesn't work make sure to below tutorial and install everything exactly as shown in this below tutorial
Follow this post entirely along with the video : https://www.patreon.com/posts/111553210
This above post is fully updated with links and screenshots, so easy to follow
Massed Compute (Recommend Cloud) :
Please register via this link : https://vm.massedcompute.com/signup?linkId=lp_034338&sourceId=secourses&tenantId=massed-compute
Use our coupon SECourses
Our coupon works on all GPUs now
I recommend RTX 6000 PRO but this app works on every GPU
Full details here : https://www.patreon.com/posts/26671823
Then select our image SECourses from Creator dropdown
Then follow Massed_Compute_Instructions_READ.txt
Same as my any other Massed Compute installer script
Example tutorial for learn how to install and use Massed Compute
(Starts at 12:58) : https://youtu.be/KW-MHmoNcqo?si=G1WbG-Qw4ujWvOtG&t=778
RunPod (Cloud):
Please register via this link : https://get.runpod.io/955rkuppqv4h
Then follow Runpod_Instructions_READ.txt
Same as my any other RunPod installer script
Use the template written in Runpod_Instructions_READ.txt file
Example tutorial for learn how to install and use RunPod
(starts at 22:03) : https://youtu.be/KW-MHmoNcqo?si=QN8X8Sjn13ZYu-EU&t=1323

Comments
hi easy. it is not version issue. make a fresh install and send me cmd logs : monstermmorpg@gmail.com probably you dont have c++ tools and accurate python version 3.10.11
Furkan Gözükara
2025-10-19 10:24:15 +0000 UTCThe 3.1 installer will not recognize my rtx5080 and will only use the CPU. I reinstalled all dependencies multiple times and also followed the video instructions to ensure my machine was set up appropriately. No errors are being thrown and ffmpeg is installed. I reinstalled using the version 1 installer and the app utilizes the GPU appropriately.
billgill
2025-10-19 02:00:21 +0000 UTCyou dont have effmpeg follow requirements : https://youtu.be/DrhUHnYfwC0
Furkan Gözükara
2025-10-16 20:16:04 +0000 UTCThis worked this morning, and i did something like clicked on update instead of start, it then failed, i deleted everything and reinstalled v3 and get this error at the end... any idea pls?? >> Be patient, it may take a while to run in CPU mode. >> Text tokenizer loaded for preview functionality Warning: FFmpeg not found in PATH. Video/audio processing will not work. Please install FFmpeg: https://ffmpeg.org/download.html * Running on local URL: http://127.0.0.1:7860 * To create a public link, set `share=True` in `launch()`. Error: FFmpeg not found. Please ensure FFmpeg is installed and in PATH. Emo control mode:0,weight:0.65,vec:None >> Loading models for first synthesis... >> GPT weights restored from: ./checkpoints\gpt.pth preprocessor_config.json: 100%|███████████████████████████████████████████████████████████████| 275/275 [00:00> semantic_codec weights restored from: checkpoints\hub\models--amphion--MaskGCT\snapshots\265c6cef07625665d0c28d2faafb1415562379dc\semantic_codec\model.safetensors cfm loaded length_regulator loaded gpt_layer loaded >> s2mel weights restored from: ./checkpoints\s2mel.pth campplus_cn_common.bin: 100%|█████████████████████████████████████████████████████| 28.0M/28.0M [00:01<00:00, 15.0MB/s] >> campplus_model weights restored from: checkpoints\hub\models--funasr--campplus\snapshots\fb71fe990cbf6031ae6987a2d76fe64f94377b7e\campplus_cn_common.bin config.json: 1.41kB [00:00, ?B/s] Loading weights from nvidia/bigvgan_v2_22khz_80band_256x bigvgan_generator.pt: 100%|█████████████████████████████████████████████████████████| 449M/449M [00:23<00:00, 18.9MB/s] Removing weight norm... >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x >> All models loaded successfully! >> starting inference... Traceback (most recent call last): File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\queueing.py", line 745, in process_events response = await route_utils.call_process_api( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\route_utils.py", line 349, in call_process_api output = await app.get_blocks().process_api( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\blocks.py", line 2123, in process_api result = await self.call_function( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\blocks.py", line 1630, in call_function prediction = await anyio.to_thread.run_sync( # type: ignore File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\to_thread.py", line 56, in run_sync return await get_async_backend().run_sync_in_worker_thread( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 2485, in run_sync_in_worker_thread return await future File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 976, in run result = context.run(func, *args) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\utils.py", line 915, in wrapper response = f(*args, **kwargs) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\webui.py", line 445, in gen_single output = tts.infer(spk_audio_prompt=prompt, text=text, File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\indextts\infer_v2.py", line 563, in infer audio,sr = self._load_and_cut_audio(spk_audio_prompt,max_speaker_audio_length,verbose) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\indextts\infer_v2.py", line 417, in _load_and_cut_audio audio, sr = librosa.load(audio_path) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\librosa\core\audio.py", line 176, in load y, sr_native = __soundfile_load(path, offset, duration, dtype) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\librosa\core\audio.py", line 209, in __soundfile_load context = sf.SoundFile(path) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\soundfile.py", line 690, in __init__ self._file = self._open(file, mode_int, closefd) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\soundfile.py", line 1261, in _open raise TypeError("Invalid file: {0!r}".format(self.name)) TypeError: Invalid file: None
Neil Rhodes
2025-10-16 20:12:06 +0000 UTCyes because either your python installation is wrong or you are running as administrators. follow this requirements tutorial and don't run as administrator : https://youtu.be/DrhUHnYfwC0
Furkan Gözükara
2025-10-15 21:22:51 +0000 UTCit doesn't seem like this installer works correctly? Attempting to launch the windows_install_or_update.bat does not work ERROR: Could not install packages due to an OSError: [Errno 13] Permission denied:[path \\appdata\\local\\pip\\cache\\wheels\\c9\\69\\31\\d56d90b22a1777b0b231e234b00302a55be255930f8bd92dcd\\jieba-0.42.1-py3-none-any.whl'jieba-0.42.1-py3-none-any.whl'] Check the permissions. running as admin results in: Requirement already satisfied: pip in c:\windows\system32\premium_indextts2_secourses\venv\lib\site-packages (25.2) ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt' 'Windows_Model_Download_and_Fix.bat' is not recognized as an internal or external command, operable program or batch file.
zrikz
2025-10-15 20:53:40 +0000 UTCno worries I found a workaround
Trill OG
2025-10-15 09:17:26 +0000 UTChi there, is it possible to create more than a single queue for multiple output files in one whole session . Im wanting to generate individual output files for each chapter of my ebook without having to generate each chapter with a new session.
Trill OG
2025-10-14 18:49:34 +0000 UTCVibeVoice supporting Turkish and i will publish hopefully soon
Furkan Gözükara
2025-10-14 14:01:26 +0000 UTCHello, Is it support Turkish?
Ahmet Inceelli
2025-10-14 12:37:53 +0000 UTCfor Italian there is VibeVoice. hopefully will publish very soon sorry for delay
Furkan Gözükara
2025-10-03 10:58:23 +0000 UTCWhy does it work just in CPU mode? It's said the model works with different languages, and I fed it with an audio in Italian. It generated an output with a weird accent if it were trying to read Italian text with English pronunciation. Maybe, for Italian, there are different models?
Aldo Jones
2025-10-02 23:42:07 +0000 UTCthanks
Furkan Gözükara
2025-10-01 10:16:21 +0000 UTCHere is a useful prompt for AI to fix your Text for correct pronunciation, by all means use this in your text notes if you wish :) Goal: Process the provided text to ensure it is read aloud clearly, naturally, and accurately by a basic local Text-to-Speech (TTS) engine, eliminating all potential ambiguities, apostrophes, and punctuation-related errors that cause mispronunciation or awkward pacing. Instructions: Revise the text according to the following strict, non-negotiable rules. The output text must only contain standard letter characters (A-Z, a-z), numbers (0-9), commas (,), periods (.), question marks (?), exclamation points (!), and simple parentheses (). Eliminate All Contractions and Apostrophes: Spell out every single contraction and remove all apostrophes from the text entirely. (e.g., change "can't" to can not, "I'm" to I am, "it's" to it is or it has). Possessives must be handled by context or sentence restructuring. Spell Out Numbers and Abbreviations: Convert all numerical digits and common acronyms or initialisms to their fully spelled-out word form (e.g., change "16" to sixteen, "3:00 am" to three A M, "TBH" to to be honest). Standardize Punctuation and Flow: Correct any typos or instances of run-on words. Replace complex or grammatically ambiguous phrases with a clear, direct, and common alternative to ensure proper TTS cadence. Use Phonetic Respelling for Ambiguous Words: For words that a basic engine might struggle to pronounce clearly (especially slang, proper nouns, or technical terms), use a simple phonetic respelling immediately followed by the original word in parentheses. The Text to be Processed: [INSERT TEXT HERE]
Neil Rhodes
2025-10-01 09:27:28 +0000 UTCye i noticed same. i dont know atm sadly
Furkan Gözükara
2025-09-30 22:32:35 +0000 UTCHow do we get around the pronunciation issues like "We'd" has to be "weed" and "I'm" has to be "I am " and "read" in past tense has to be "red" (and many others) is there a tool for fixing this, It kinda breaks the immersion when having stories read back by your favoroite voice, or do we simply have to manually correct everything? anyone know?
Neil Rhodes
2025-09-30 08:50:16 +0000 UTCyes i know. please follow this video and this post exactly and reinstall. it should be fixed : https://youtu.be/DrhUHnYfwC0 https://www.patreon.com/posts/click-to-open-post-used-in-tutorial-111553210
Furkan Gözükara
2025-09-26 22:30:54 +0000 UTCHello, when I run the application it does not use my gpu. The command line says Be patient, it may take a while to run in CPU mode. I have a 5090 and it sits at 0% utilization when running this particular app. Anyone know how to fix this?
RenderDrgn
2025-09-26 14:50:47 +0000 UTChello. thanks. yes you need c++ tools and MSVC as well for this. please follow this tutorial and its updated post : https://youtu.be/DrhUHnYfwC0
Furkan Gözükara
2025-09-23 19:19:38 +0000 UTCHello, new to Patrion, long time follower, I am trying your installer and I am getting this failed item after install: LLVM ERROR: Symbol not found: __svml_cosf8_ha
James Charleston II
2025-09-23 16:04:32 +0000 UTCCurrently Chinese and English : https://github.com/index-tts/index-tts/issues/418
Furkan Gözükara
2025-09-22 07:48:29 +0000 UTCHow many languages support?
Hoàng Giang Sơn Trương
2025-09-22 02:27:37 +0000 UTCyep not a good idea at all. probably wont work either. but you can run on runpod or massed compute or computer with 8 gb GPU. 6 may also work but need to be tested
Furkan Gözükara
2025-09-21 20:06:41 +0000 UTCrunning Index_TTS with CPU-only on some Lenovo Thinkpad is no good idea?
Christoph Behrmann
2025-09-21 20:01:14 +0000 UTChello are you requesting a feature i am confused can you eloborate more
Furkan Gözükara
2025-09-21 19:16:02 +0000 UTCthanks. i am working on more features right now
Furkan Gözükara
2025-09-21 18:08:19 +0000 UTCyep thanks
Furkan Gözükara
2025-09-21 18:08:05 +0000 UTCyou are welcome. working on improvements right now
Furkan Gözükara
2025-09-21 18:07:38 +0000 UTCorjinal türkçe yok ama türkçe karakter kullanmadan okuyabiliyor. deneyebilirsin
Furkan Gözükara
2025-09-21 18:07:29 +0000 UTCteşekkürler hocam. Bunu türkçe desteği yok değilmi hocam
Cemil Hacimahmutoglu
2025-09-20 18:30:23 +0000 UTCJust what I needed. Thank you very much. =D
Hockey
2025-09-20 17:52:58 +0000 UTCIt seems that the functionality for automatic voiceover into another language while preserving emotions needs to be improved for automatic use. Lack: - Cutting the original audio into parts while maintaining the integrity of the sounds - Transcription of these parts into text and translation into the desired language - Batch re-sound of sliced texts in accordance with the emotions of the sliced audio
Dmitry
2025-09-20 15:14:43 +0000 UTCIt just works! amazing!
Neil Rhodes
2025-09-20 12:47:19 +0000 UTCYep next level
Furkan Gözükara
2025-09-20 10:35:26 +0000 UTCI wasn't expecting you to cover this based on your focus towards image and video. This is a huge development in voice clone and I've been messing with it for a while now. Nice to see you included it in your busy schedule. Thanks.
Lou
2025-09-20 10:32:00 +0000 UTCok this is actually insane, it perfectly emulates human speech with the pauses, uhms, breathing etc. just crazy
Hipno
2025-09-20 10:25:32 +0000 UTCif authors add yes for sure but i dont know how to
Furkan Gözükara
2025-09-20 07:30:56 +0000 UTCthanks for info i will update requirements
Furkan Gözükara
2025-09-20 07:30:39 +0000 UTCFor all those who can't get the installer to do it's thing, make sure you've got Cuda covered Pytorch before grabbing your regular packages. Cheat sheet for this is just to run the following two commands in cmd (assuming windows): 1) pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu129 and then 2) pip install huggingface_hub transformers the installer shouldn't hiccup after that, have fun cloning Mike Wazowski👍.
PS
2025-09-20 04:40:06 +0000 UTCI fixed it. I'll put it here if anyone else has the same problem. Courtesy Grok Expert. To fix this reliably and safely, download and install the official Intel oneAPI DPC++/C++ Compiler Runtime for Windows. https://registrationcenter-download.intel.com/akdlm/IRC_NAS/47a201d7-d4cd-4079-a2d8-0e66b860aaaa/w_dpcpp_cpp_runtime_p_2025.2.1.1001.exe Run as administrator. Restart machine. Next time I ran the TTS webUI it worked like a charm!
DanO..
2025-09-20 02:10:37 +0000 UTCHello, good afternoon. Do you think it could be implemented for the Spanish language as well?
Civitaier
2025-09-20 00:49:29 +0000 UTCIt broke my python. Came back with this, "LLVM ERROR: Symbol not found: __svml_cosf8_ha" any ideas? I'm really looking forward to this! (I also ran install/update again just in case.)
DanO..
2025-09-20 00:43:06 +0000 UTC