Releases · jianchang512/stt

23 Sep 18:42

jianchang512

v0.94

6e83c29

0.1 Latest

Latest

Change

Fix api获取不到上传的文件
移除 vad配置，使用默认
更新 large-v3-turbo模型
添加语言自动判断
结束状态判断时错误
修复api多个同时调用只返回第一个的bug
修复distil模型提示不存在问题
修复显示问题
增加自定义huggingface模型功能

完整包下载地址(含base模型)1.7GB/Download url:

百度网盘: https://pan.baidu.com/s/1KGX31_rurBl9UQiuwaiMqg?pwd=a48t

GitHub地址: https://github.com/jianchang512/stt/releases/download/v0.94/win-0.0.94.7z

v0.1 升级补丁包地址/96MB:

GitHub: https://github.com/jianchang512/stt/releases/download/v0.94/win-patchUpdate-0.1.7z

使用方法

解压到英文目录下，双击 start.exe, 在打开的页面中上传音频视频文件识别
extract to folder and doubleclick start.exe, wait unitl open webbrowser

ffmpeg.exe下载

如果你的计算机上没有ffmpeg，请下载这2个exe文件，和 start.exe放在同一位置

https://github.com/jianchang512/stt/releases/download/v0.93/ffmpeg.exe

https://github.com/jianchang512/stt/releases/download/v0.93/ffprobe.exe

全部模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

Assets 4

19 Feb 18:29

jianchang512

v0.93

4008c9b

stt-v0.93

Change

添加模型这次从 tiny-->large-v3
修复前端js超时问题 #29

完整包下载地址(含base模型)1.7GB/Download url:

百度网盘: https://pan.baidu.com/s/1rAKW7pBMLx4od9391wjtYw?pwd=xg4s

Huggingface: https://huggingface.co/spaces/mortimerme/s4/resolve/main/win-stt-0.0.93.7z?download=true

升级补丁包地址/30MB:

百度网盘：https://pan.baidu.com/s/1Dqd1FNC0Cwe5KqD_5bNhqg?pwd=gj65

Huggingface: https://huggingface.co/spaces/mortimerme/s4/resolve/main/win-stt-patch-0.0.93.7z?download=true

使用方法

解压到英文目录下，双击 start.exe, 在打开的页面中上传音频视频文件识别
extract to folder and doubleclick start.exe, wait unitl open webbrowser

ffmpeg.exe下载

如果你的计算机上没有ffmpeg，请下载这2个exe文件，和 start.exe放在同一位置

https://github.com/jianchang512/stt/releases/download/v0.93/ffmpeg.exe

https://github.com/jianchang512/stt/releases/download/v0.93/ffprobe.exe

全部模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

Assets 4

26 Jan 15:54

jianchang512

v0.91

51f9ca5

stt-v0.91

Change

add more param at the set.ini for whisper model

下载地址(内含base模型)/Download url:

百度网盘: https://pan.baidu.com/s/1oAeBvVcNv3HP2QWa3uiilg?pwd=efvc

GitHub: https://github.com/jianchang512/stt/releases/download/v0.91/sttv-0.91.7z

解压到英文目录下，双击 start.exe, 在打开的页面中上传音频视频文件识别
extract to folder and doubleclick start.exe, wait unitl open webbrowser

模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

Assets 3

12 Jan 14:58

jianchang512

v0.0.6

4f6fb6a

stt-v0.0.6

Change

fixed bug #7
默认使用 cpu 运算，如果确定使用英伟达显卡，并且配置好了cuda环境，请修改 set.ini 中 devtype=cpu为 devtype=cuda,并重新启动，可使用cuda加速
progressbar

下载地址(内含base模型)/Download url:

百度网盘: https://pan.baidu.com/s/1am6RNTwR9cfA3_EKwyTLxg?pwd=cvrd

GitHub: https://github.com/jianchang512/stt/releases/download/v0.0.6/sttv0.0.6.7z

解压到英文目录下，双击 start.exe, 在打开的页面中上传音频视频文件识别
extract to folder and doubleclick start.exe, wait unitl open webbrowser

模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

Assets 3

28 Dec 17:51

jianchang512

v0.0.1

d1667e9

cuBLASxx.dll和cudnn下载

如果遇到 "cublasxxx.dll不存在"或启用cuda加速后闪退问题，请下载该文件，然后将里面的dll文件复制到 C:/Windows/System32 目录下或者软件根目录下即exe所在目录下

任意文件夹地址栏输入cmd打开黑窗口，输入命令nvcc -V查看当前CUDA版本

CUDA11.x版本点击此处下载 https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA11_win_v4.7z

CUDA12.x版本点击此处下载 https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA12_win_v1.7z

Assets 2

28 Dec 16:34

jianchang512

0.0

14d1d0c

download all models-下载所有语音识别模型

faster-whisper 模型下载，适用于 stt 项目和 “pyvideotrans视频翻译配音”项目的faster-whisper模式使用，openai-whisper模型请往下拉

tiny 64MB
tiny.en 64MB

base 124MB
base.en 124MB

small 415MB
small 百度网盘
 small.en 415MB

medium 1.27G
medium.en 1.27G

large-v1 百度网盘
 large-v1 huggingface

large-v2 huggingface
large-v2 百度网盘

large-v3 huggingface
large-v3 百度网盘

large-v3-turbo 1.3G

distil-whisper-small.en 282MB

distil-whisper-medium.en 671MB
distil-medium百度网盘

distil-whisper-large-v2 1.27G
distil-large-v2 百度网盘

distil-whisper-large-v3 1.3G
distil-whisper-large-v3百度网盘

下载后解压，将压缩包内的"models--Systran--faster-xx"文件夹复制到models目录内，解压复制后 models 目录下文件夹列表如下

压缩包内容

正确放置后的 models 目录下的文件夹列表

openai-whisper模型下载，仅适用于“pyvideotrans视频翻译配音软件”的 openai-whisper模式模型下载使用

下载后将pt文件放到软件目录下的models文件夹内

tiny.pt模型
 tiny.en.pt模型

base.pt模型
 base.en.pt模型

FunASR中文模型下载

百度网盘下载(含语音降噪模型)：https://pan.baidu.com/s/1rxDMmUcGiEdiZ0LId6_tDw?pwd=6hp7

Huggingface(墙外下载地址)： https://huggingface.co/spaces/mortimerme/s4/resolve/main/FunASR-Chinese-models.7z?download=true

下载后解压会得到一个hub文件夹，将该文件夹复制到视频翻译软件的models文件夹内，覆盖已有的同名hub文件夹

cuBLASxx.dll和cudnn下载

任意文件夹地址栏输入cmd打开黑窗口，输入命令nvcc -V查看当前CUDA版本

CUDA11.x版本点击此处下载 https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA11_win_v4.7z

CUDA12.x版本点击此处下载 https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA12_win_v1.7z

uvr5模型下载

点击下载uvr5模型

下载后解压，得到一个 uvr5_weights 文件夹，将该文件夹复制到视频翻译配音软件根目录下。

ffmpeg.exe 下载

如果是windows系统，并且提示找不到 ffmpeg 命令，可下载下面这2个文件，放到软件根目录下或软件根目录下的ffmpeg文件夹内

https://github.com/jianchang512/stt/releases/download/0.0/ffmpeg.exe

https://github.com/jianchang512/stt/releases/download/0.0/ffprobe.exe

Assets 23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change

完整包下载地址(含base模型)1.7GB/Download url:

v0.1 升级补丁包地址/96MB:

使用方法

ffmpeg.exe下载

全部模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

Change

完整包下载地址(含base模型)1.7GB/Download url:

升级补丁包地址/30MB:

使用方法

ffmpeg.exe下载

全部模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

Change

下载地址(内含base模型)/Download url:

模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

Change

下载地址(内含base模型)/Download url:

模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

cuBLASxx.dll和cudnn下载

faster-whisper 模型下载，适用于 stt 项目和 “pyvideotrans视频翻译配音”项目的faster-whisper模式使用，openai-whisper模型请往下拉

openai-whisper模型下载，仅适用于“pyvideotrans视频翻译配音软件”的 openai-whisper模式模型下载使用

FunASR中文模型下载

cuBLASxx.dll和cudnn下载

uvr5模型下载

ffmpeg.exe 下载

Releases: jianchang512/stt

0.1

Change

完整包下载地址(含base模型)1.7GB/Download url:

v0.1 升级补丁包地址/96MB:

使用方法

ffmpeg.exe下载

全部模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

stt-v0.93

Change

完整包下载地址(含base模型)1.7GB/Download url:

升级补丁包地址/30MB:

使用方法

ffmpeg.exe下载

全部模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

stt-v0.91

Change

下载地址(内含base模型)/Download url:

模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

stt-v0.0.6

Change

下载地址(内含base模型)/Download url:

模型下载地址/Download models https://github.com/jianchang512/stt/releases/tag/0.0

cuBLASxx.dll和cudnn下载

cuBLASxx.dll和cudnn下载

download all models-下载所有语音识别模型

faster-whisper 模型下载，适用于 stt 项目 和 “pyvideotrans视频翻译配音”项目的faster-whisper模式使用，openai-whisper模型请往下拉

openai-whisper模型下载，仅适用于“pyvideotrans视频翻译配音软件”的 openai-whisper模式模型 下载使用

FunASR中文模型下载

cuBLASxx.dll和cudnn下载

uvr5模型下载

ffmpeg.exe 下载

faster-whisper 模型下载，适用于 stt 项目和 “pyvideotrans视频翻译配音”项目的faster-whisper模式使用，openai-whisper模型请往下拉

openai-whisper模型下载，仅适用于“pyvideotrans视频翻译配音软件”的 openai-whisper模式模型下载使用