[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"workflow-asset-8b48f7ce":3,"seo:featured-workflow:8b48f7ce-4f09-11f1-9bc6-00163e2b0d79:en":84,"workflow-related-asset-8b48f7ce-8b48f7ce-4f09-11f1-9bc6-00163e2b0d79":85},{"id":4,"uuid":5,"slug":6,"title":7,"description":8,"author_id":9,"author_name":10,"author_avatar":11,"token_estimate":12,"time_saved":12,"model_used":13,"fork_count":12,"vote_count":12,"view_count":12,"parent_id":12,"parent_uuid":13,"lang_type":14,"steps":15,"tags":22,"has_voted":28,"visibility":18,"share_token":13,"is_featured":12,"content_hash":29,"asset_kind":30,"target_tools":31,"install_mode":35,"entrypoint":19,"risk_profile":36,"dependencies":38,"verification":44,"agent_metadata":47,"agent_fit":60,"trust":72,"provenance":81,"created_at":83,"updated_at":83},3660,"8b48f7ce-4f09-11f1-9bc6-00163e2b0d79","asset-8b48f7ce","GPT-SoVITS — Few-Shot Voice Cloning and Text-to-Speech","An open-source TTS system that can clone any voice from just one minute of audio data, combining GPT-style language modeling with VITS synthesis for natural speech generation.","8a911193-3180-11f1-9bc6-00163e2b0d79","AI Open Source","https:\u002F\u002Ftokrepo.com\u002Fapple-touch-icon.png",0,"","en",[16],{"id":17,"step_order":18,"title":19,"description":13,"prompt_template":20,"variables":13,"depends_on":21,"expected_output":13},4234,1,"GPT-SoVITS Overview","# GPT-SoVITS — Few-Shot Voice Cloning and Text-to-Speech\n\n## Quick Use\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002FRVC-Boss\u002FGPT-SoVITS.git\ncd GPT-SoVITS\npip install -r requirements.txt\npython webui.py\n# Open http:\u002F\u002Flocalhost:9874\n```\n\n## Introduction\nGPT-SoVITS is an open-source text-to-speech system that achieves voice cloning from as little as one minute of reference audio. It combines GPT-based language modeling for prosody with VITS (Variational Inference with adversarial learning for end-to-end TTS) for high-quality waveform synthesis.\n\n## What GPT-SoVITS Does\n- Clones a speaker's voice from 1-10 minutes of reference audio recordings\n- Generates natural-sounding speech in the cloned voice from text input\n- Supports cross-lingual voice cloning across Chinese, English, and Japanese\n- Provides a web UI for training, inference, and audio management\n- Includes tools for dataset preparation, annotation, and audio preprocessing\n\n## Architecture Overview\nGPT-SoVITS uses a two-stage pipeline. First, a GPT-based model predicts semantic tokens from text, capturing prosody and rhythm. Then a VITS-based model converts these tokens into a high-fidelity waveform matching the target speaker's voice characteristics. Speaker embedding is extracted from reference audio using a pretrained encoder, enabling few-shot adaptation.\n\n## Self-Hosting & Configuration\n- Requires Python 3.9+ with PyTorch and CUDA for GPU-accelerated training and inference\n- Pretrained base models are downloaded automatically on first run\n- Training a voice clone takes 30-60 minutes on a consumer GPU with 1 minute of audio\n- The web UI runs locally with no external API dependencies\n- Supports CPU-only inference at reduced speed for machines without GPUs\n\n## Key Features\n- One-minute voice cloning produces recognizable speaker identity and style\n- Cross-lingual synthesis supports Chinese, English, and Japanese text\n- Built-in dataset tools handle audio slicing, denoising, and automatic transcription\n- Fine-tuning from pretrained models converges quickly even on consumer hardware\n- Batch inference mode for generating large volumes of audio efficiently\n\n## Comparison with Similar Tools\n- **Bark** — generates speech with music and effects; GPT-SoVITS specializes in voice cloning fidelity\n- **Coqui TTS** — broader TTS toolkit; GPT-SoVITS achieves better few-shot cloning quality\n- **Fish Speech** — multilingual TTS; GPT-SoVITS offers a more mature training pipeline\n- **F5-TTS** — flow-matching approach; GPT-SoVITS uses GPT + VITS with established community support\n- **Kokoro** — lightweight TTS; GPT-SoVITS provides deeper voice cloning from minimal data\n\n## FAQ\n**Q: How much audio data is needed to clone a voice?**\nA: As little as 1 minute for basic cloning, though 5-10 minutes yields better results.\n\n**Q: Can it run on CPU only?**\nA: Yes, inference works on CPU but is significantly slower. Training requires a CUDA GPU.\n\n**Q: Is the output suitable for production use?**\nA: Quality is high for many use cases. Evaluate on your specific requirements.\n\n**Q: What audio formats are supported?**\nA: WAV is the primary format. MP3 and other formats are converted automatically during preprocessing.\n\n## Sources\n- https:\u002F\u002Fgithub.com\u002FRVC-Boss\u002FGPT-SoVITS","0",[23],{"id":24,"name":25,"slug":26,"icon":27},12,"Configs","config","⚙️",false,"5573cbd4107f4efe9bd79af170910ab17b5c2edcc483693c01a833fda23a200d","skill",[32,33,34],"claude_code","codex","gemini_cli","single",{"executes_code":28,"modifies_global_config":28,"requires_secrets":37,"uses_absolute_paths":28,"network_access":28},[],{"npm":39,"pip":40,"brew":42,"system":43},[],[41],"requirements.txt",[],[],{"commands":45,"expected_files":46},[],[19],{"asset_kind":30,"target_tools":48,"install_mode":35,"entrypoint":19,"risk_profile":49,"dependencies":51,"content_hash":29,"verification":56,"inferred":59},[32,33,34],{"executes_code":28,"modifies_global_config":28,"requires_secrets":50,"uses_absolute_paths":28,"network_access":28},[],{"npm":52,"pip":53,"brew":54,"system":55},[],[41],[],[],{"commands":57,"expected_files":58},[],[19],true,{"target":33,"score":61,"status":62,"policy":63,"why":64,"asset_kind":30,"install_mode":35},98,"native","allow",[65,66,67,68,69,70,71],"target_tools includes codex","asset_kind skill","install_mode single","markdown-only","policy allow","safe markdown-only Codex install","trust established",{"author_trust_level":73,"verified_publisher":28,"asset_signed_hash":29,"signature_status":74,"install_count":12,"report_count":12,"dangerous_capability_badges":75,"review_status":76,"signals":77},"established","hash_only",[],"unreviewed",[78,79,80],"author has published assets","content hash available","no dangerous capability badges",{"owner_uuid":9,"owner_name":10,"source_url":82,"content_hash":29,"visibility":18,"created_at":83,"updated_at":83},"https:\u002F\u002Ftokrepo.com\u002Fen\u002Fworkflows\u002Fasset-8b48f7ce","2026-05-14 04:23:03",null,[86,138,192,239],{"id":87,"uuid":88,"slug":89,"title":90,"description":91,"author_id":9,"author_name":10,"author_avatar":11,"token_estimate":12,"time_saved":12,"model_used":13,"fork_count":12,"vote_count":12,"view_count":92,"parent_id":12,"parent_uuid":13,"lang_type":14,"steps":93,"tags":94,"has_voted":28,"visibility":18,"share_token":13,"is_featured":12,"content_hash":96,"asset_kind":30,"target_tools":97,"install_mode":35,"entrypoint":98,"risk_profile":99,"dependencies":101,"verification":106,"agent_metadata":109,"agent_fit":121,"trust":123,"provenance":126,"created_at":128,"updated_at":129,"__relatedScore":130,"__relatedReasons":131,"__sharedTags":136},2511,"f07fee9a-45df-11f1-9bc6-00163e2b0d79","asset-f07fee9a","GPT-NeoX — Open-Source Large Language Model Training Library","A GPU-optimized library by EleutherAI for training large-scale autoregressive language models. GPT-NeoX powered the training of GPT-NeoX-20B and Pythia, providing the open-source community with tools for billion-parameter model training.",92,[],[95],{"id":24,"name":25,"slug":26,"icon":27},"4480e03ed6f0fdc0567c6e7e2b22bb3fa2f85d933d3e13f190dc95c13dff7289",[32,33,34],"GPT-NeoX Overview",{"executes_code":28,"modifies_global_config":28,"requires_secrets":100,"uses_absolute_paths":28,"network_access":28},[],{"npm":102,"pip":103,"brew":104,"system":105},[],[],[],[],{"commands":107,"expected_files":108},[],[98],{"asset_kind":30,"target_tools":110,"install_mode":35,"entrypoint":98,"risk_profile":111,"dependencies":113,"content_hash":96,"verification":118},[32,33,34],{"executes_code":28,"modifies_global_config":28,"requires_secrets":112,"uses_absolute_paths":28,"network_access":28},[],{"npm":114,"pip":115,"brew":116,"system":117},[],[],[],[],{"commands":119,"expected_files":120},[],[98],{"target":33,"score":61,"status":62,"policy":63,"why":122,"asset_kind":30,"install_mode":35},[65,66,67,68,69,70,71],{"author_trust_level":73,"verified_publisher":28,"asset_signed_hash":96,"signature_status":74,"install_count":12,"report_count":12,"dangerous_capability_badges":124,"review_status":76,"signals":125},[],[78,79,80],{"owner_uuid":9,"owner_name":10,"source_url":127,"content_hash":96,"visibility":18,"created_at":128,"updated_at":129},"https:\u002F\u002Ftokrepo.com\u002Fen\u002Fworkflows\u002Fasset-f07fee9a","2026-05-02 12:32:34","2026-05-14 04:49:59",91.9527244228309,[132,133,134,135],"topic-match","same-kind","same-target","same-author",[26,137],"configs",{"id":139,"uuid":140,"slug":141,"title":142,"description":143,"author_id":9,"author_name":10,"author_avatar":11,"token_estimate":12,"time_saved":12,"model_used":13,"fork_count":12,"vote_count":12,"view_count":144,"parent_id":12,"parent_uuid":13,"lang_type":14,"steps":145,"tags":146,"has_voted":28,"visibility":18,"share_token":13,"is_featured":12,"content_hash":148,"asset_kind":30,"target_tools":149,"install_mode":35,"entrypoint":150,"risk_profile":151,"dependencies":153,"verification":158,"agent_metadata":161,"agent_fit":173,"trust":180,"provenance":185,"created_at":187,"updated_at":188,"__relatedScore":189,"__relatedReasons":190,"__sharedTags":191},1041,"d3b64571-35cb-11f1-9bc6-00163e2b0d79","zoxide-smarter-cd-command-learns-your-habits-d3b64571","zoxide — A Smarter cd Command That Learns Your Habits","zoxide is a smarter cd command written in Rust. It tracks directories you visit and lets you jump to any of them with just a few keystrokes. Inspired by z and autojump. Works with bash, zsh, fish, PowerShell, nushell, and more.",105,[],[147],{"id":24,"name":25,"slug":26,"icon":27},"13b4327219e3bf09b65e9b88e820d25b8faaf260fc04c0fa2e3e1cf8e3024ddc",[32,33,34],"SKILL.md",{"executes_code":28,"modifies_global_config":28,"requires_secrets":152,"uses_absolute_paths":28,"network_access":59},[],{"npm":154,"pip":155,"brew":156,"system":157},[],[],[],[],{"commands":159,"expected_files":160},[],[13],{"asset_kind":30,"target_tools":162,"install_mode":35,"entrypoint":150,"risk_profile":163,"dependencies":165,"content_hash":148,"verification":170},[32,33,34],{"executes_code":28,"modifies_global_config":28,"requires_secrets":164,"uses_absolute_paths":28,"network_access":59},[],{"npm":166,"pip":167,"brew":168,"system":169},[],[],[],[],{"commands":171,"expected_files":172},[],[13],{"target":33,"score":174,"status":175,"policy":176,"why":177,"asset_kind":30,"install_mode":35},64,"needs_confirmation","confirm",[65,66,67,178,179,71],"policy confirm","risk_profile.network_access is true",{"author_trust_level":73,"verified_publisher":28,"asset_signed_hash":148,"signature_status":74,"install_count":12,"report_count":12,"dangerous_capability_badges":181,"review_status":76,"signals":183},[182],"network_access",[184,78,79],"asset has usage views",{"owner_uuid":9,"owner_name":10,"source_url":186,"content_hash":148,"visibility":18,"created_at":187,"updated_at":188},"https:\u002F\u002Ftokrepo.com\u002Fen\u002Fworkflows\u002Fzoxide-smarter-cd-command-learns-your-habits-d3b64571","2026-04-12 01:28:17","2026-05-13 17:01:33",78.03795879789716,[132,133,134,135],[26,137],{"id":193,"uuid":194,"slug":195,"title":196,"description":197,"author_id":9,"author_name":10,"author_avatar":11,"token_estimate":12,"time_saved":12,"model_used":13,"fork_count":12,"vote_count":12,"view_count":198,"parent_id":12,"parent_uuid":13,"lang_type":14,"steps":199,"tags":200,"has_voted":28,"visibility":18,"share_token":13,"is_featured":12,"content_hash":202,"asset_kind":30,"target_tools":203,"install_mode":35,"entrypoint":204,"risk_profile":205,"dependencies":207,"verification":212,"agent_metadata":215,"agent_fit":227,"trust":229,"provenance":232,"created_at":234,"updated_at":235,"__relatedScore":236,"__relatedReasons":237,"__sharedTags":238},2305,"eb65b8c4-431c-11f1-9bc6-00163e2b0d79","d2-declarative-diagram-scripting-language-eb65b8c4","D2 — Declarative Diagram Scripting Language","A modern diagram scripting language that turns text into diagrams, offering a readable syntax for architecture, flow, and sequence diagrams rendered from code.",96,[],[201],{"id":24,"name":25,"slug":26,"icon":27},"2851a6609ce0c7a696ee46060bf63bd3fd30c9595d0d9ee4c3f763779c0ea314",[32,33,34],"D2 Overview",{"executes_code":28,"modifies_global_config":28,"requires_secrets":206,"uses_absolute_paths":28,"network_access":59},[],{"npm":208,"pip":209,"brew":210,"system":211},[],[],[],[],{"commands":213,"expected_files":214},[],[204],{"asset_kind":30,"target_tools":216,"install_mode":35,"entrypoint":204,"risk_profile":217,"dependencies":219,"content_hash":202,"verification":224},[32,33,34],{"executes_code":28,"modifies_global_config":28,"requires_secrets":218,"uses_absolute_paths":28,"network_access":59},[],{"npm":220,"pip":221,"brew":222,"system":223},[],[],[],[],{"commands":225,"expected_files":226},[],[204],{"target":33,"score":174,"status":175,"policy":176,"why":228,"asset_kind":30,"install_mode":35},[65,66,67,178,179,71],{"author_trust_level":73,"verified_publisher":28,"asset_signed_hash":202,"signature_status":74,"install_count":12,"report_count":12,"dangerous_capability_badges":230,"review_status":76,"signals":231},[182],[78,79],{"owner_uuid":9,"owner_name":10,"source_url":233,"content_hash":202,"visibility":18,"created_at":234,"updated_at":235},"https:\u002F\u002Ftokrepo.com\u002Fen\u002Fworkflows\u002Fd2-declarative-diagram-scripting-language-eb65b8c4","2026-04-29 00:11:31","2026-05-13 18:57:48",74.98015760139937,[132,133,134,135],[26,137],{"id":240,"uuid":241,"slug":242,"title":243,"description":244,"author_id":9,"author_name":10,"author_avatar":11,"token_estimate":245,"time_saved":12,"model_used":246,"fork_count":12,"vote_count":12,"view_count":247,"parent_id":12,"parent_uuid":13,"lang_type":14,"steps":248,"tags":249,"has_voted":28,"visibility":18,"share_token":13,"is_featured":12,"content_hash":251,"asset_kind":30,"target_tools":252,"install_mode":35,"entrypoint":243,"risk_profile":255,"dependencies":257,"verification":262,"agent_metadata":265,"agent_fit":277,"trust":279,"provenance":282,"created_at":284,"updated_at":285,"__relatedScore":286,"__relatedReasons":287,"__sharedTags":288},262,"04367306-be4a-4f46-854d-dd2b4d0d429e","chroma-open-source-vector-database-ai-04367306","Chroma — Open-Source Vector Database for AI","Chroma is the open-source vector database and data infrastructure for AI applications. 27.1K+ GitHub stars. Simple 4-function API for embedding, storing, and querying documents. Supports Python, JavaS",500,"Claude Code",182,[],[250],{"id":24,"name":25,"slug":26,"icon":27},"00605e1a63ad3ac2a280050b36e245642efe82b5532b00967d9f612e49e70ed4",[32,33,253,34,254],"cursor","windsurf",{"executes_code":28,"modifies_global_config":28,"requires_secrets":256,"uses_absolute_paths":28,"network_access":28},[],{"npm":258,"pip":259,"brew":260,"system":261},[],[],[],[],{"commands":263,"expected_files":264},[],[243],{"asset_kind":30,"target_tools":266,"install_mode":35,"entrypoint":243,"risk_profile":267,"dependencies":269,"content_hash":251,"verification":274},[32,33,253,34,254],{"executes_code":28,"modifies_global_config":28,"requires_secrets":268,"uses_absolute_paths":28,"network_access":28},[],{"npm":270,"pip":271,"brew":272,"system":273},[],[],[],[],{"commands":275,"expected_files":276},[],[243],{"target":33,"score":61,"status":62,"policy":63,"why":278,"asset_kind":30,"install_mode":35},[65,66,67,68,69,70,71],{"author_trust_level":73,"verified_publisher":28,"asset_signed_hash":251,"signature_status":74,"install_count":12,"report_count":12,"dangerous_capability_badges":280,"review_status":76,"signals":281},[],[184,78,79,80],{"owner_uuid":9,"owner_name":10,"source_url":283,"content_hash":251,"visibility":18,"created_at":284,"updated_at":285},"https:\u002F\u002Ftokrepo.com\u002Fen\u002Fworkflows\u002Fchroma-open-source-vector-database-ai-04367306","2026-03-31 20:19:12","2026-05-14 02:04:45",72.39367663459565,[133,134,135],[26,137]]