Yuhan SONG's picture

Yuhan SONG

QbethQ

·

AI & ML interests

None yet

Recent Activity

authored a paper 10 days ago

Beyond Transcription: Unified Audio Schema for Perception-Aware AudioLLMs

authored a paper 10 days ago

UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception

published a model 11 days ago

tencent/Universal_Audio_Tokenizer

View all activity

Organizations

authored 2 papers 10 days ago

Beyond Transcription: Unified Audio Schema for Perception-Aware AudioLLMs

Paper • 2604.12506 • Published Apr 14 • 3

UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception

Paper • 2605.31521 • Published 15 days ago • 1

authored a paper 9 months ago

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

Paper • 2509.22220 • Published Sep 26, 2025 • 66