[Feature] 🌈 BingAPI: implement the whisper-1 model interface #397

Open
Harry-zklcdc opened this issue Apr 10, 2024 · 0 comments
Labels
enhancement New feature or request

Harry-zklcdc commented Apr 10, 2024

Describe the problem related to the feature request

Implement the STT (speech-to-text) interface of the whisper-1 model by reverse engineering.

Describe the solution you'd like

Request packets

  1. Bing Copilot STT endpoint: wss://sr.bing.com/opaluqu/speech/recognition/dictation/cognitiveservices/v1
  2. Request packet structure:
Path: speech.config
X-RequestId: <UUID>
X-Timestamp: <Timestamp>
Content-Type: application/json

{"context":{"system":{"name":"SpeechSDK","version":"1.15.0-alpha.0.1","build":"JavaScript","lang":"JavaScript"},"os":{"platform":"Browser/MacIntel","name":"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/123.0.0.0 Safari/537.36","version":"5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/123.0.0.0 Safari/537.36"},"audio":{"source":{"bitspersample":16,"channelcount":1,"connectivity":"Unknown","manufacturer":"Speech SDK","model":"默认 - MacBook Pro麦克风 (Built-in)","samplerate":16000,"type":"Microphones"}}},"recognition":"conversation"}
Path: speech.context
X-RequestId: <UUID>
X-Timestamp: <Timestamp>
Content-Type: application/json

{}
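
The two text messages above share one shape: header lines, a blank line, then a JSON body. A minimal Python sketch of assembling such a message follows. The function names are hypothetical, and the dash-less UUID, the millisecond UTC timestamp, and the CRLF line endings are assumptions (the capture only shows `<UUID>` and `<Timestamp>` placeholders, and `0d0a` only in the binary messages).

```python
import json
import uuid
from datetime import datetime, timezone


def new_request_id() -> str:
    """Return a UUID without dashes (assumed format for X-RequestId)."""
    return uuid.uuid4().hex


def timestamp() -> str:
    """Return a UTC ISO 8601 timestamp with milliseconds (assumed format)."""
    now = datetime.now(timezone.utc)
    return now.strftime("%Y-%m-%dT%H:%M:%S.") + f"{now.microsecond // 1000:03d}Z"


def build_text_message(path: str, request_id: str, body: dict) -> str:
    """Assemble header lines, a blank line, then the JSON body."""
    headers = [
        f"Path: {path}",
        f"X-RequestId: {request_id}",
        f"X-Timestamp: {timestamp()}",
        "Content-Type: application/json",
    ]
    # CRLF separators are an assumption carried over from the binary capture.
    return "\r\n".join(headers) + "\r\n\r\n" + json.dumps(body, ensure_ascii=False)


if __name__ == "__main__":
    rid = new_request_id()
    # speech.context carries an empty JSON object in the capture.
    print(build_text_message("speech.context", rid, {}))
```

The same request ID would presumably be reused for every message belonging to one recognition turn, which is why it is passed in rather than generated per message.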

binary

.~Path: audio
<0d0a>
X-RequestId: <UUID>
<0d0a>
X-Timestamp: <Timestamp>
<0d0a>
Content-Type: audio/x-wav
<0d0a>
RIFF
<0000 0000>
WAVEfmt 
<1000 0000 0100 0100 80>
>
<0000 00>
}
<0000 0200 1000>
data
<0000 0000>
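
The first binary message above can be read as a 2-byte length prefix, CRLF-terminated headers, and a 44-byte WAV header with zeroed RIFF and data sizes (streaming). Below is a hedged Python sketch that reproduces those bytes. The big-endian 2-byte header-length prefix is an assumption: it is consistent with the leading `.~` in the dump (0x00 0x7e = 126) only if the request ID is 32 characters and the timestamp 24 characters. The function names are hypothetical.

```python
import struct


def wav_header(sample_rate: int = 16000, channels: int = 1, bits: int = 16) -> bytes:
    """Build a 44-byte PCM WAV header with zero RIFF/data sizes, as in the capture."""
    block_align = channels * bits // 8
    byte_rate = sample_rate * block_align
    fmt_chunk = struct.pack(
        "<IHHIIHH",
        16,           # fmt chunk size      -> 10 00 00 00
        1,            # PCM format          -> 01 00
        channels,     # channel count       -> 01 00
        sample_rate,  # 16000               -> 80 3e 00 00
        byte_rate,    # 32000               -> 00 7d 00 00
        block_align,  # bytes per frame     -> 02 00
        bits,         # bits per sample     -> 10 00
    )
    return (
        b"RIFF" + struct.pack("<I", 0) + b"WAVE"
        + b"fmt " + fmt_chunk
        + b"data" + struct.pack("<I", 0)
    )


def build_binary_message(headers: dict, payload: bytes) -> bytes:
    """Prefix the header block with its length (assumed big-endian uint16)."""
    head = "".join(f"{k}: {v}\r\n" for k, v in headers.items()).encode("ascii")
    return struct.pack(">H", len(head)) + head + payload


if __name__ == "__main__":
    headers = {
        "Path": "audio",
        "X-RequestId": "0" * 32,                    # placeholder value
        "X-Timestamp": "2024-04-10T00:00:00.000Z",  # placeholder value
        "Content-Type": "audio/x-wav",
    }
    print(build_binary_message(headers, wav_header()).hex(" "))
```

The format fields match the `audio.source` values declared in `speech.config` (16-bit, 1 channel, 16000 Hz).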

binary, each upload packet is 3296 B

audio..X-RequestId: <UUID>
<0d0a>
X-Timestamp: <Timestamp>
<0d0a>
+ wavBinaryData
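
The follow-up audio packets could be produced by slicing the PCM stream into fixed-size payloads and framing each slice with the `Path`, `X-RequestId`, and `X-Timestamp` headers. A rough Python sketch follows. The names are hypothetical, the 2-byte big-endian length prefix is an assumption, and so is the `payload_size` default of 3200 bytes (100 ms of 16 kHz 16-bit mono audio): the capture only reports 3296 B per upload packet and does not say whether that figure includes the headers.

```python
import struct
from typing import Iterator


def frame(headers: dict, payload: bytes) -> bytes:
    """Length-prefixed header block followed by the raw payload (assumed framing)."""
    head = "".join(f"{k}: {v}\r\n" for k, v in headers.items()).encode("ascii")
    return struct.pack(">H", len(head)) + head + payload


def audio_frames(
    pcm: bytes, request_id: str, ts: str, payload_size: int = 3200
) -> Iterator[bytes]:
    """Yield one binary message per slice of PCM data.

    payload_size is a guess; tune it against a real capture.
    """
    for offset in range(0, len(pcm), payload_size):
        # Follow-up packets carry no Content-Type header in the capture.
        headers = {"Path": "audio", "X-RequestId": request_id, "X-Timestamp": ts}
        yield frame(headers, pcm[offset:offset + payload_size])


if __name__ == "__main__":
    silence = b"\x00" * 7000  # stand-in for real 16-bit PCM samples
    for msg in audio_frames(silence, "0" * 32, "2024-04-10T00:00:00.000Z"):
        print(len(msg))
```

A real client would likely refresh the timestamp per packet; it is fixed here to keep the sketch deterministic.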

Describe alternatives you've considered

No response

Additional context

No response

@Harry-zklcdc Harry-zklcdc added the enhancement New feature or request label Apr 10, 2024