BUT-FIT/DiCoW_v3_3_large
Automatic Speech Recognition • 2B • Updated • 397 • 1
None defined yet.
FLiP: Towards understanding and interpreting multimodal multilingual sentence embeddings
SE-DiCoW: Self-Enrolled Diarization-Conditioned Whisper