Question 1

How to fix slow pip install of funasr due to building from source?

Accepted Answer

Upgrade to funasr v1.3.9 or later, which includes a pre-built universal wheel (`py3-none-any`). This avoids the source build step. Use: `pip install funasr>=1.3.9`. For older versions, the source build was a no-op, but the wheel eliminates any build overhead.

Question 2

How to fix CUDA error 'device-side assert triggered' when using repetition_penalty in FunASR 1.3.7 with vLLM?

Accepted Answer

This is caused by incompatibility between repetition_penalty and enable_prompt_embeds=True in vLLM. Remove repetition_penalty=1.3 from vllm_engine.generate() call. As workaround, split audio into ≤25-second chunks for inference and use a truncate_repetition() post-processing function to suppress repeats. Example truncation logic: def truncate_repetition(text, min_repeat_len=5, max_repeats=3): ... . The next FunASR version will adopt chunking and post-processing officially.

Question 3

How to perform real-time speech recognition via WebSocket when Qwen3-ASR only supports offline mode?

Accepted Answer

Qwen3-ASR does not support WebSocket real-time streaming (offline only via AutoModel). For WebSocket streaming, use the Fun-ASR-Nano model with FunASR's real-time server. Install: `pip install funasr>=1.3.5 vllm>=0.12.0` (version 1.3.5 fixes ModuleNotFoundError for `dynamic_vad` and `vllm.inputs.data` import issues). Start server: `python examples/industrial_data_pretraining/fun_asr_nano/serve_realtime_ws.py --port 10095 --language 中文`. Client: open `client_mic.html` in browser or use `client_python.py`. Docs: https://github.com/modelscope/FunASR/blob/main/docs/vllm_guide.md

Question 4

How to fix FunASR real-time serve download failure with unauthenticated Hugging Face requests?

Accepted Answer

Set a Hugging Face token to avoid rate limits and download issues. Export HF_TOKEN='your_token' before running the server, or use huggingface-cli login to cache credentials. If download still fails, manually download the model from https://huggingface.co/FunAudioLLM/Fun-ASR-Nano-2512 and point --model to the local path.

FunASR

Overview

README Preview

FAQ (4)

同类型项目

superpowers

everything-claude-code

flutter

langflow