Japanese SFT/DPO data convert to speech via TTS. And audio caption data generated by Qwen3-Omni. All datasets are available for commercial use.
Ayuto Tsutsumi
Atotti
AI & ML interests
None yet
Recent Activity
liked a model about 23 hours ago
ACE-Step/ace-step-v1.5-1d-vae-stable-audio-format liked a model 13 days ago
nvidia/diar_streaming_sortformer_4spk-v2.1 liked a model about 1 month ago
neuphonic/neucodec