-
Notifications
You must be signed in to change notification settings - Fork 213
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FR]: Enable Opik to display additional media formats, including audio, PDF, and video. #567
Comments
Hi @pleomax0730 Is there a specific type of data you would like to us to support first ? We have gotten requests for better PDF support so could be a good candidate to start with |
@jverre Thanks for your reply to my feature request! In terms of the current community, I think the best order might be PDF, then audio, and finally video. |
Makes sense, I'll keep this ticket open. It's quite a big feature so we might not get to it straightaway |
@jverre @pleomax0730 I'd like to contribute to the Documents part of this issue. Could you please provide some guidance on the specific tasks or features that need to be implemented? Would it involve tasks like document summarization or extraction of key information? |
Hi, @AHB102 langfuse track audio might be a good reference or idea of implementing this feature. No summarization or extraction is needed. Only display the media or data as the reference to this track. |
Hi @pleomax0730 @AHB102 It's a pretty big initiative so feel free to add comments here in case we have missed anything that is relevant |
@jverre Hello 🖐️, I read the doc and think starting with the SDK changes for docs sounds like a good plan. However, I’m still relatively new to software development, so I’m a bit lost in the codebase. Any specific docs or resources you’d recommend to help me get started? |
@AHB102 This is a pretty big feature that touches the Python SDK, the backend and the UI, might be a bit tricky as a new issue I'll create a couple of issues later today and will tag them as |
@jverre Yeah agreed, I'll definitely look into good first issues. |
Proposal summary
Feature Request
Enable Opik to display additional media formats, including audio, PDF, and video.
Background
Opik currently supports only image display, which limits its flexibility for monitoring, testing, and evaluating multimodal LLM applications that may involve other data formats. Expanding support for audio, PDF, and video would allow users to fully leverage Opik’s capabilities across a broader range of use cases.
Proposed Use Cases:
Benefits:
Summary
Adding support for audio, PDF, and video display will make Opik a more versatile platform, suitable for a wide range of LLM applications beyond text and images. This enhancement will empower users to develop, evaluate, and monitor their applications seamlessly across all media types.
Motivation
Many existing solutions for LLM evaluation and monitoring are limited to text and image formats, with little to no support for other media types like audio, PDFs, or video. This lack of multimodal support forces teams to use multiple tools or rely on custom workarounds, creating friction in their workflows and hindering a comprehensive evaluation process.
By introducing audio, PDF, and video support, Opik could become the first open-source platform to offer complete multimodal monitoring and evaluation capabilities. This would make Opik highly attractive to teams working on complex applications that require seamless integration of various data formats, such as multimedia retrieval, interactive voice systems, and document processing pipelines.
Competitive Advantage:
Leading the way with these features would position Opik as a go-to solution for multimodal LLM applications, setting it apart from other evaluation and monitoring tools. This could significantly increase Opik’s user base by attracting organizations and researchers who need comprehensive, media-agnostic monitoring for their LLM projects.
The text was updated successfully, but these errors were encountered: