
Incompatible with latest faster-whisper #403

Open
vytskalt opened this issue Oct 2, 2024 · 5 comments
vytskalt commented Oct 2, 2024

Looks like recent changes to faster-whisper broke compatibility with stable-ts, giving errors like this:

Traceback (most recent call last):
  File "/nix/store/qcr3a5k910x6ywvkhinzqjiwv50mpvn1-stable-ts-aligner/bin/stable-ts-aligner", line 41, in <module>
    results = list(executor.map(lambda req: align(req, model), requests))
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nix/store/zs1xky7izkfmc8wxm8bhhdff5a605hfj-python3-minimal-3.11.9/lib/python3.11/concurrent/futures/_base.py", line 619, in result_iterator
    yield _result_or_cancel(fs.pop())
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nix/store/zs1xky7izkfmc8wxm8bhhdff5a605hfj-python3-minimal-3.11.9/lib/python3.11/concurrent/futures/_base.py", line 317, in _result_or_cancel
    return fut.result(timeout)
           ^^^^^^^^^^^^^^^^^^^
  File "/nix/store/zs1xky7izkfmc8wxm8bhhdff5a605hfj-python3-minimal-3.11.9/lib/python3.11/concurrent/futures/_base.py", line 456, in result
    return self.__get_result()
           ^^^^^^^^^^^^^^^^^^^
  File "/nix/store/zs1xky7izkfmc8wxm8bhhdff5a605hfj-python3-minimal-3.11.9/lib/python3.11/concurrent/futures/_base.py", line 401, in __get_result
    raise self._exception
  File "/nix/store/zs1xky7izkfmc8wxm8bhhdff5a605hfj-python3-minimal-3.11.9/lib/python3.11/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nix/store/qcr3a5k910x6ywvkhinzqjiwv50mpvn1-stable-ts-aligner/bin/stable-ts-aligner", line 41, in <lambda>
    results = list(executor.map(lambda req: align(req, model), requests))
                                            ^^^^^^^^^^^^^^^^^
  File "/nix/store/qcr3a5k910x6ywvkhinzqjiwv50mpvn1-stable-ts-aligner/bin/stable-ts-aligner", line 10, in align
    result = model.align(request['audio_file'], request['text'], language=request['language'], nonspeech_skip=None, fast_mode=True)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nix/store/il0mrip5xma96339x272izlv1mq1g5lq-python3-minimal-3.11.9-env/lib/python3.11/site-packages/stable_whisper/alignment.py", line 583, in align
    segment = timestamp_words()
              ^^^^^^^^^^^^^^^^^
  File "/nix/store/il0mrip5xma96339x272izlv1mq1g5lq-python3-minimal-3.11.9-env/lib/python3.11/site-packages/stable_whisper/alignment.py", line 309, in timestamp_words
    features = model.feature_extractor(audio_segment.cpu().numpy())
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/nix/store/il0mrip5xma96339x272izlv1mq1g5lq-python3-minimal-3.11.9-env/lib/python3.11/site-packages/faster_whisper/feature_extractor.py", line 88, in __call__
    waveform = waveform.to(torch.float32)
               ^^^^^^^^^^^
AttributeError: 'numpy.ndarray' object has no attribute 'to'
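The failure mode is mechanical: the newer faster-whisper feature extractor calls `waveform.to(torch.float32)`, which is a `torch.Tensor` method, while stable-ts still passes a NumPy array (via `.cpu().numpy()`). A minimal sketch of the mismatch, using a plain NumPy array (importing `torch` isn't needed to reproduce it):

```python
import numpy as np

# stable-ts hands the feature extractor a NumPy array...
waveform = np.zeros(16000, dtype=np.float32)

# ...but the newer extractor assumes a torch.Tensor and calls .to(),
# which NumPy arrays do not implement, hence the AttributeError.
print(hasattr(waveform, "to"))  # False

try:
    waveform.to(np.float32)
except AttributeError as e:
    print(e)  # 'numpy.ndarray' object has no attribute 'to'
```

The fix on the stable-ts side would be to convert with something like `torch.from_numpy(...)` before calling the extractor on affected faster-whisper versions.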
jianfch (Owner) commented Oct 2, 2024

The sudden switch from NumPy to PyTorch in those changes seems to have broken a lot more as well.

vytskalt (Author) commented Oct 3, 2024

Yes, very weird they would do that in a minor release.

HeidelParreno commented

I tried running stable-ts[fw] in a Jupyter notebook, and it crashed at model.transcribe. It works from the prompt though; I don't know what causes it.

jianfch (Owner) commented Oct 3, 2024

> I tried running stable-ts[fw] in a Jupyter notebook, and it crashed at model.transcribe. It works from the prompt though; I don't know what causes it.

stable-ts[fw] installs the latest Faster-Whisper version (1.0.3) on PyPI, so the aforementioned changes (which occurred after 1.0.3) do not affect it.
For Faster-Whisper models, the transcribe() method is the original Faster-Whisper transcription method. To use Stable-ts, use model.transcribe_stable() instead.
But if transcribe() is crashing, then it's likely a Faster-Whisper issue. A similar issue seems to be on their repo already: SYSTRAN/faster-whisper#820.

HeidelParreno commented

> > I tried running stable-ts[fw] in a Jupyter notebook, and it crashed at model.transcribe. It works from the prompt though; I don't know what causes it.

> stable-ts[fw] installs the latest Faster-Whisper version (1.0.3) on PyPI, so the aforementioned changes (which occurred after 1.0.3) do not affect it. For Faster-Whisper models, the transcribe() method is the original Faster-Whisper transcription method. To use Stable-ts, use model.transcribe_stable() instead. But if transcribe() is crashing, then it's likely a Faster-Whisper issue. A similar issue seems to be on their repo already: SYSTRAN/faster-whisper#820.

Thanks for this! I solved the crashing by downgrading faster-whisper to 1.0.0, switching my CUDA to 12.1, and changing my PyTorch build to cu121.
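For reference, the downgrade described above can be reproduced with pip. This is a sketch based on the comment; the exact version pins come from it, and the cu121 index URL is the standard PyTorch wheel index:

```shell
# Pin faster-whisper to a release from before the NumPy-to-PyTorch change
pip install "faster-whisper==1.0.0"

# Install a PyTorch build matching CUDA 12.1
pip install torch --index-url https://download.pytorch.org/whl/cu121
```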

jianfch added a commit that referenced this issue Oct 11, 2024
- updated `align()` and `transcribe_stable()` to be compatible with models on the latest faster-whisper commit (#403)
- added `pipeline_kwargs` to `load_hf_whisper()` for passing specific arguments to `transformers.AutoModelForSpeechSeq2Seq.pipeline()`
- added `"large-v3-turbo"` and `"turbo"` to `HF_MODELS` for loading `"openai/whisper-large-v3-turbo"` on Hugging Face