verachen

xiaozhi-esp32

C++ 0 0

Updated 2026-06-17 07:20:25 +00:00

kedia

C++ 0 0

Updated 2026-06-17 07:02:39 +00:00

livekit_agents

Python 0 0

Updated 2026-06-12 03:35:24 +00:00

livekit_token

Python 0 0

Updated 2026-05-21 02:19:54 +00:00

xiaozhi-esp32_bak

C++ 0 0

Updated 2026-05-11 05:40:37 +00:00

Qwen3-ASR

Python 0 0

Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.

Updated 2026-04-23 08:30:45 +00:00

feishu_bitable

HTML 0 0

Updated 2026-03-31 09:24:37 +00:00

Fun-ASR

Python 0 0

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

audio audio-language-model audio-understanding fun-asr multimodal-large-language-models pytorch speaker-diarization speech-recognition

Updated 2026-02-11 02:48:50 +00:00

VoxCPM

Python 0 0

Updated 2026-02-09 11:28:41 +00:00

xiaozhi-esp32-shhkAICam

C++ 0 0

Updated 2026-02-02 06:33:13 +00:00