-
Connect to the sample web application at http://localhost:8090.
-
Optional: If you configured your deployment with NVIDIA Riva, check [X] Enable TTS output to enable the web application to read aloud the answers to your queries.
Select the desired ASR language (English (en-US) for this test), TTS language (English (en-US) for this test), and TTS voice from the dropdown menus below the checkboxes to use the voice-to-voice interaction capabilities.
-
On the Converse tab, enter "How many cores does the Grace superchip contain?" in the chat box and click Submit.
Alternatively, click on the microphone button to the right of the text box and ask the question verbally.
-
Upload the sample data to the knowledge base.
Click the Knowledge Base tab and then click Add File.
Navigate to the dataset.zip file that is located in the notebooks directory. Unzip the archive and upload the PDFs.
-
If you want to remove a file from the knowledge base, select the file name on the Knowledge Base tab and click Delete.
-
Return to the Converse tab and select [X] Use knowledge base.
-
Reenter the question: "How many cores does the Grace superchip contain?"
The default prompts are optimized for the Llama chat model. If you use a completion model, you must fine-tune the prompts.
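The "unzip the archive and upload the PDFs" step can be prepared from a terminal instead of a file manager. The following is a minimal sketch using only the Python standard library. The helper name `extract_pdfs`, the path `notebooks/dataset.zip`, and the output directory are assumptions made for illustration; the instructions above only state that the archive is in the notebooks directory.

```python
import zipfile
from pathlib import Path


def extract_pdfs(archive: Path, dest: Path) -> list[Path]:
    """Extract only the PDF members of `archive` into `dest` and return their paths."""
    dest.mkdir(parents=True, exist_ok=True)
    extracted = []
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            # Skip directory entries and anything that is not a PDF.
            if info.is_dir() or not info.filename.lower().endswith(".pdf"):
                continue
            extracted.append(Path(zf.extract(info, dest)))
    return sorted(extracted)


if __name__ == "__main__":
    # Assumed location; adjust to where the archive lives in your checkout.
    archive = Path("notebooks/dataset.zip")
    if archive.exists():
        for pdf in extract_pdfs(archive, Path("notebooks/dataset")):
            print(pdf)
    else:
        print(f"{archive} not found; adjust the path for your checkout")
```

The printed paths are the files to select after clicking Add File on the Knowledge Base tab.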
If you receive the "Media devices could not be accessed" error message when you first attempt to transcribe a voice query, perform the following steps.
-
Open another browser tab and enter chrome://flags in the location field.
-
Enter "insecure origins treated as secure" in the search field.
-
Enter http://<host-ip>:8090 in the text box and select Enabled from the menu.
-
Click Relaunch.
-
After the browser opens, grant http://<host-ip>:8090 access to your microphone.
-
Retry your request.
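The error occurs because browsers expose microphone access only to secure contexts (HTTPS or localhost), so a plain http://<host-ip>:8090 origin is blocked until it is explicitly allowed. As an alternative to the chrome://flags steps, Chrome can be started with the equivalent command-line switch. The sketch below only builds that command line. The function name, the `google-chrome` binary name, the throwaway profile directory, and the example address 192.0.2.10 are assumptions for illustration; the switch is commonly paired with a separate `--user-data-dir`, which is why one is included here.

```python
import shlex


def chrome_insecure_origin_args(
    origin: str, profile_dir: str, binary: str = "google-chrome"
) -> list[str]:
    """Build a Chrome command line that treats `origin` as a secure context."""
    if not origin.startswith("http://"):
        raise ValueError("expected a plain http:// origin such as http://<host-ip>:8090")
    return [
        binary,  # assumed binary name; differs per platform
        f"--unsafely-treat-insecure-origin-as-secure={origin}",
        f"--user-data-dir={profile_dir}",  # separate test profile (assumption)
        origin,  # open the web application directly
    ]


if __name__ == "__main__":
    # 192.0.2.10 is a documentation-only address; substitute your host IP.
    args = chrome_insecure_origin_args("http://192.0.2.10:8090", "/tmp/chrome-rag-profile")
    print(shlex.join(args))
```

Running the printed command launches Chrome with the origin allowed for that session only, so the microphone prompt in the last steps above still needs to be accepted.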