Description:
We want to add a feature that tracks and reports the loading state of individual AI models in our FastMLX application. This will allow users to check the status of specific models they're interested in using.
Objective:
Create a system to track and report the loading state of individual models, with the ability to query the state of a single model or all models.
Tasks:
Add a ModelState enum in fastmlx.py with states like LOADING, READY, and ERROR.
Modify the ModelProvider class to include a state attribute for each model.
Update the model loading process to set appropriate states.
Add two endpoints:
/v1/model_status to report the current state of all models.
/v1/model_status/{model_name} to report the state of a specific model.
Modify existing endpoints to check model state before processing requests.
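To make the first three tasks concrete, here is a minimal sketch of per-model state tracking. It is not the actual FastMLX code: the `ModelProvider` below is a simplified stand-in, and the `loader` callable, the `load_model` name, the lowercase state values, and the logger name are assumptions made for illustration only.

```python
import logging
import threading
from enum import Enum
from typing import Callable, Dict, Optional, Union

# Assumed logger name; use whatever the project already configures.
logger = logging.getLogger("fastmlx.model_state")


class ModelState(str, Enum):
    """Loading state of a single model."""

    LOADING = "loading"
    READY = "ready"
    ERROR = "error"


class ModelProvider:
    """Simplified stand-in that tracks one state per model name."""

    def __init__(self, loader: Callable[[str], object]) -> None:
        # `loader` is a hypothetical callable that loads a model by name.
        self._loader = loader
        self._models: Dict[str, object] = {}
        self._states: Dict[str, ModelState] = {}
        self._lock = threading.Lock()

    def _set_state(self, name: str, state: ModelState) -> None:
        with self._lock:
            self._states[name] = state
        logger.info("model %s -> %s", name, state.value)

    def load_model(self, name: str) -> None:
        """Load a model, recording LOADING, then READY or ERROR."""
        self._set_state(name, ModelState.LOADING)
        try:
            model = self._loader(name)
        except Exception:
            logger.exception("failed to load model %s", name)
            self._set_state(name, ModelState.ERROR)
            return
        with self._lock:
            self._models[name] = model
        self._set_state(name, ModelState.READY)

    def get_model_status(
        self, name: Optional[str] = None
    ) -> Union[str, Dict[str, str]]:
        """Return one model's state, or all states when name is None."""
        with self._lock:
            if name is None:
                return {n: s.value for n, s in self._states.items()}
            if name not in self._states:
                raise KeyError(name)
            return self._states[name].value
```

Keeping the states in a plain dictionary behind a lock makes both query shapes a single lookup or a single copy, which matters if models are loaded on a background thread while status requests arrive.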
Definition of Done:
ModelState enum is implemented.
ModelProvider class is updated to track individual model states.
New endpoints /v1/model_status and /v1/model_status/{model_name} are added and functional.
Existing endpoints check specific model state before processing.
Proper error handling for non-existent models is implemented.
Basic logging for state changes and queries is in place.
Code is commented and follows our style guide.
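The checks for endpoint behavior, non-existent models, and logging could be sketched as two framework-agnostic helpers. Everything here is an assumption for illustration: the function names, the response shapes, and the choice of status codes (404 for an unknown model, 503 while loading, 500 after a failed load) are not specified in this issue. Wiring the helpers into the `/v1/model_status` and `/v1/model_status/{model_name}` routes is left out.

```python
import logging
from enum import Enum
from typing import Dict, Optional, Tuple

# Assumed logger name; use whatever the project already configures.
logger = logging.getLogger("fastmlx.model_status")


class ModelState(str, Enum):
    LOADING = "loading"
    READY = "ready"
    ERROR = "error"


def model_status_response(
    states: Dict[str, ModelState], model_name: Optional[str] = None
) -> Tuple[int, dict]:
    """Build (HTTP status code, JSON body) for a status query."""
    if model_name is None:
        logger.info("status query for all models")
        return 200, {"models": {n: s.value for n, s in states.items()}}
    logger.info("status query for model %s", model_name)
    state = states.get(model_name)
    if state is None:
        return 404, {"error": f"Model '{model_name}' not found"}
    return 200, {"model": model_name, "state": state.value}


def check_model_ready(
    states: Dict[str, ModelState], model_name: str
) -> Optional[Tuple[int, dict]]:
    """Return None when the model is READY, else an error response."""
    state = states.get(model_name)
    if state is None:
        return 404, {"error": f"Model '{model_name}' not found"}
    if state is ModelState.LOADING:
        return 503, {"error": f"Model '{model_name}' is still loading"}
    if state is ModelState.ERROR:
        return 500, {"error": f"Model '{model_name}' failed to load"}
    return None
```

An existing endpoint would call `check_model_ready` first and return the error response if one comes back, so a request never reaches a model that is still loading or failed to load.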
Guidelines:
The get_model_status method should handle both single-model and all-models queries efficiently.
We're excited to see your implementation of this feature! It will provide users with more granular control and information about model availability. If you have any questions or need clarification, please don't hesitate to ask in the comments. Good luck!