瀏覽代碼

Merge pull request #12815 from TomBayne/enh/vad_filter

feat/Enable vad_filter to improve quality of transcription in faster-whisper
Tim Jaeryang Baek 3 周之前
父節點
當前提交
60596c362a
共有 1 個文件被更改,包括 1 次插入1 次删除
  1. 1 1
      backend/open_webui/routers/audio.py

+ 1 - 1
backend/open_webui/routers/audio.py

@@ -497,7 +497,7 @@ def transcribe(request: Request, file_path):
             )
             )
 
 
         model = request.app.state.faster_whisper_model
         model = request.app.state.faster_whisper_model
-        segments, info = model.transcribe(file_path, beam_size=5)
+        segments, info = model.transcribe(file_path, beam_size=5, vad_filter=True)
         log.info(
         log.info(
             "Detected language '%s' with probability %f"
             "Detected language '%s' with probability %f"
             % (info.language, info.language_probability)
             % (info.language, info.language_probability)