fix(ci): Changes for master for 1.6.2.rc1 #1972

Merged
merged 35 commits on Nov 26, 2024
Changes from all commits
Commits
c2dbbb9
Update Changelog (#1905)
github-actions[bot] Sep 10, 2024
14a37d8
build(deps): bump mlflow from 2.16.0 to 2.16.2 in /runtimes/mlflow (#…
dependabot[bot] Sep 23, 2024
0bea0ee
build(deps): bump optimum from 1.21.4 to 1.22.0 in /runtimes/huggingf…
dependabot[bot] Sep 23, 2024
8a1477c
build(deps): bump scikit-learn from 1.5.1 to 1.5.2 in /runtimes/sklea…
dependabot[bot] Sep 23, 2024
70f17ef
build(deps): bump pandas from 2.2.2 to 2.2.3 in /runtimes/lightgbm (#…
dependabot[bot] Sep 23, 2024
4a595c2
build(deps-dev): bump pytorch-lightning from 2.3.1 to 2.4.0 (#1900)
dependabot[bot] Sep 23, 2024
42d2ca1
build(deps-dev): bump torchmetrics in /runtimes/mlflow (#1909)
dependabot[bot] Sep 24, 2024
1627842
build(deps-dev): bump datamodel-code-generator from 0.25.5 to 0.26.0 …
dependabot[bot] Sep 24, 2024
35f975b
add docs for gitbook (#1919)
sakoush Oct 7, 2024
83a9b48
build(deps): bump optimum from 1.22.0 to 1.23.1 in /runtimes/huggingf…
dependabot[bot] Oct 16, 2024
6baf1d3
build(deps): bump mlflow from 2.16.2 to 2.17.0 in /runtimes/mlflow (#…
dependabot[bot] Oct 16, 2024
abfd99b
build(deps-dev): bump torchmetrics in /runtimes/mlflow (#1923)
dependabot[bot] Oct 16, 2024
f933d14
build(deps): bump importlib-resources from 6.4.0 to 6.4.5 (#1917)
dependabot[bot] Oct 16, 2024
03ec673
build(deps): bump aiokafka from 0.10.0 to 0.11.0 (#1916)
dependabot[bot] Oct 16, 2024
8ac1d28
build(deps): bump pyspark from 3.5.2 to 3.5.3 in /runtimes/mllib (#1915)
dependabot[bot] Oct 16, 2024
1122340
relax pydantic dependencies (#1928)
lemonhead94 Oct 20, 2024
374cd72
build(deps): Upgrade fastapi and starlette (#1934)
sakoush Oct 21, 2024
64901b2
build(deps-dev): bump torchmetrics in /runtimes/mlflow (#1929)
dependabot[bot] Oct 21, 2024
f8ddfbf
build(deps): bump pillow from 10.4.0 to 11.0.0 in /runtimes/huggingfa…
dependabot[bot] Oct 21, 2024
3ec64dd
build(deps-dev): bump torchmetrics from 1.4.0.post0 to 1.5.0 (#1930)
dependabot[bot] Oct 21, 2024
d160666
build(deps): bump pandas from 2.2.2 to 2.2.3 (#1933)
dependabot[bot] Oct 21, 2024
1ce29e6
Re-generate license info (#1935)
github-actions[bot] Oct 21, 2024
f2b9941
build(deps): bump werkzeug from 3.0.3 to 3.0.6 in /runtimes/alibi-det…
dependabot[bot] Nov 11, 2024
dc45bd9
build(deps-dev): bump werkzeug from 3.0.3 to 3.0.6 (#1940)
dependabot[bot] Nov 11, 2024
0d20436
build(deps): bump werkzeug from 3.0.3 to 3.0.6 in /runtimes/huggingfa…
dependabot[bot] Nov 11, 2024
5fbf96b
build(deps): bump werkzeug from 3.0.3 to 3.0.6 in /runtimes/mlflow (#…
dependabot[bot] Nov 11, 2024
d68cc42
build(deps-dev): bump torchmetrics in /runtimes/mlflow (#1952)
dependabot[bot] Nov 11, 2024
e7ca25a
build(deps): bump optimum from 1.23.1 to 1.23.3 in /runtimes/huggingf…
dependabot[bot] Nov 11, 2024
ed53c7f
build(deps-dev): bump waitress from 3.0.0 to 3.0.1 (#1947)
dependabot[bot] Nov 11, 2024
e73ee5a
build(deps-dev): bump waitress from 3.0.0 to 3.0.1 in /runtimes/mlflo…
dependabot[bot] Nov 11, 2024
cf7fc38
build(deps): bump xgboost from 2.1.1 to 2.1.2 in /runtimes/xgboost (#…
dependabot[bot] Nov 11, 2024
a25735b
build(deps): bump mlflow from 2.17.0 to 2.17.2 in /runtimes/mlflow (#…
dependabot[bot] Nov 11, 2024
c5cc74a
Update FastAPI version constraint (#1954)
sergioave Nov 12, 2024
7063618
Forward rest parameters to model (#1921)
idlefella Nov 19, 2024
3f2c043
build(deps): bump mlflow from 2.17.2 to 2.18.0 in /runtimes/mlflow (#…
dependabot[bot] Nov 26, 2024
42 changes: 42 additions & 0 deletions CHANGELOG.md
@@ -1,6 +1,47 @@
# Changelog


<a name="1.6.1"></a>
## [1.6.1](https://github.com/SeldonIO/MLServer/releases/tag/1.6.1) - 10 Sep 2024

<!-- Release notes generated using configuration in .github/release.yml at 1.6.1 -->

## Overview

### Features
MLServer now offers an option to use pre-existing Python environments by specifying a path to the environment to be used - by [@idlefella](https://github.com/idlefella) in ([#1891](https://github.com/SeldonIO/MLServer/issues/1891))
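As a hedged illustration of this feature (the exact setting name and layout below are assumptions drawn from MLServer's model-settings conventions, not from this changelog — check the docs for your version), pointing a model at a pre-existing environment would look roughly like this in `model-settings.json`:

```json
{
  "name": "my-model",
  "implementation": "mlserver_sklearn.SKLearnModel",
  "parameters": {
    "environment_path": "/opt/envs/my-existing-env"
  }
}
```

Here `environment_path` and the filesystem path are illustrative placeholders for the environment you want MLServer to reuse instead of unpacking a tarball.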

### Releases
MLServer released catboost runtime which allows serving [catboost](https://catboost.ai/) models with MLServer - by [@sakoush](https://github.com/sakoush) in ([#1839](https://github.com/SeldonIO/MLServer/issues/1839))

### Fixes
* Kafka json byte encoding fix to match rest server by [@DerTiedemann](https://github.com/DerTiedemann) and [@sakoush](https://github.com/sakoush) in ([#1622](https://github.com/SeldonIO/MLServer/issues/1622))
* Prometheus interceptor fix for gRPC streaming by [@RobertSamoilescu](https://github.com/RobertSamoilescu) in ([#1858](https://github.com/SeldonIO/MLServer/issues/1858))


## What's Changed
* Re-generate License Info by [@github-actions](https://github.com/github-actions) in https://github.com/SeldonIO/MLServer/pull/1812
* Update CHANGELOG by [@github-actions](https://github.com/github-actions) in https://github.com/SeldonIO/MLServer/pull/1830
* Update release.yml to include catboost by [@sakoush](https://github.com/sakoush) in https://github.com/SeldonIO/MLServer/pull/1839
* Fix kafka json byte encoding to match rest server by [@DerTiedemann](https://github.com/DerTiedemann) in https://github.com/SeldonIO/MLServer/pull/1622
* Included Prometheus interceptor support for gRPC streaming by [@RobertSamoilescu](https://github.com/RobertSamoilescu) in https://github.com/SeldonIO/MLServer/pull/1858
* Run gRPC test serially by [@RobertSamoilescu](https://github.com/RobertSamoilescu) in https://github.com/SeldonIO/MLServer/pull/1872
* Re-generate License Info by [@github-actions](https://github.com/github-actions) in https://github.com/SeldonIO/MLServer/pull/1886
* Feature/support existing environments by [@idlefella](https://github.com/idlefella) in https://github.com/SeldonIO/MLServer/pull/1891
* Fix tensorflow upperbound macos by [@RobertSamoilescu](https://github.com/RobertSamoilescu) in https://github.com/SeldonIO/MLServer/pull/1901
* ci: Merge change for release 1.6.1 by [@RobertSamoilescu](https://github.com/RobertSamoilescu) in https://github.com/SeldonIO/MLServer/pull/1902
* Bump preflight to 1.10.0 by [@RobertSamoilescu](https://github.com/RobertSamoilescu) in https://github.com/SeldonIO/MLServer/pull/1903
* ci: Merge change for release 1.6.1 [2] by [@RobertSamoilescu](https://github.com/RobertSamoilescu) in https://github.com/SeldonIO/MLServer/pull/1904

## New Contributors
* [@DerTiedemann](https://github.com/DerTiedemann) made their first contribution in https://github.com/SeldonIO/MLServer/pull/1622
* [@idlefella](https://github.com/idlefella) made their first contribution in https://github.com/SeldonIO/MLServer/pull/1891

**Full Changelog**: https://github.com/SeldonIO/MLServer/compare/1.6.0...1.6.1

[Changes][1.6.1]


<a name="1.6.0"></a>
## [1.6.0](https://github.com/SeldonIO/MLServer/releases/tag/1.6.0) - 26 Jun 2024

@@ -509,6 +550,7 @@ To learn more about how to use MLServer directly from the MLflow CLI, check out
[Changes][1.1.0]


[1.6.1]: https://github.com/SeldonIO/MLServer/compare/1.6.0...1.6.1
[1.6.0]: https://github.com/SeldonIO/MLServer/compare/1.5.0...1.6.0
[1.5.0]: https://github.com/SeldonIO/MLServer/compare/1.4.0...1.5.0
[1.4.0]: https://github.com/SeldonIO/MLServer/compare/1.3.5...1.4.0
Binary file added docs-gb/.gitbook/assets/kserve-logo.png
Binary file added docs-gb/.gitbook/assets/seldon-core-logo.png
26 changes: 26 additions & 0 deletions docs-gb/PULL_REQUEST_TEMPLATE/pull_request_template.md
@@ -0,0 +1,26 @@
# Pull Request

## Description
<!-- Provide a brief description of the purpose of this pull request -->

## Changes Made
<!-- Describe the changes introduced by this pull request -->

## Related Issues
<!-- Mention any related issues or tickets that are being addressed by this pull request -->

## Screenshots (if applicable)
<!-- Add screenshots or images to visually represent the changes, if applicable -->

## Checklist
<!-- Make sure to check the items below before submitting your pull request -->

- [ ] Code follows the project's style guidelines
- [ ] All tests related to the changes pass successfully
- [ ] Documentation is updated (if necessary)
- [ ] Code is reviewed by at least one other team member
- [ ] Any breaking changes are communicated and documented

## Additional Notes
<!-- Add any additional notes or context that may be helpful for reviewers -->

151 changes: 151 additions & 0 deletions docs-gb/README.md
@@ -0,0 +1,151 @@
# MLServer

An open source inference server for your machine learning models.

[![video_play_icon](https://user-images.githubusercontent.com/10466106/151803854-75d17c32-541c-4eee-b589-d45b07ea486d.png)](https://www.youtube.com/watch?v=aZHe3z-8C_w)

## Overview

MLServer aims to provide an easy way to start serving your machine learning
models through a REST and gRPC interface, fully compliant with [KFServing's V2
Dataplane](https://docs.seldon.io/projects/seldon-core/en/latest/reference/apis/v2-protocol.html)
spec. Watch a quick video introducing the project [here](https://www.youtube.com/watch?v=aZHe3z-8C_w).

- Multi-model serving, letting users run multiple models within the same
process.
- Ability to run [inference in parallel for vertical
scaling](https://mlserver.readthedocs.io/en/latest/user-guide/parallel-inference.html)
across multiple models through a pool of inference workers.
- Support for [adaptive
batching](https://mlserver.readthedocs.io/en/latest/user-guide/adaptive-batching.html),
to group inference requests together on the fly.
- Scalability with deployment in Kubernetes native frameworks, including
[Seldon Core](https://docs.seldon.io/projects/seldon-core/en/latest/graph/protocols.html#v2-kfserving-protocol) and
[KServe (formerly known as KFServing)](https://kserve.github.io/website/modelserving/v1beta1/sklearn/v2/), where
MLServer is the core Python inference server used to serve machine learning
models.
- Support for the standard [V2 Inference Protocol](https://docs.seldon.io/projects/seldon-core/en/latest/reference/apis/v2-protocol.html) on
both the gRPC and REST flavours, which has been standardised and adopted by
various model serving frameworks.

You can read more about the goals of this project on the [initial design
document](https://docs.google.com/document/d/1C2uf4SaAtwLTlBCciOhvdiKQ2Eay4U72VxAD4bXe7iU/edit?usp=sharing).

## Usage

You can install the `mlserver` package by running:

```bash
pip install mlserver
```

Note that to use any of the optional [inference runtimes](#inference-runtimes),
you'll need to install the relevant package.
For example, to serve a `scikit-learn` model, you would need to install the
`mlserver-sklearn` package:

```bash
pip install mlserver-sklearn
```

For further information on how to use MLServer, you can check any of the
[available examples](#examples).

## Inference Runtimes

Inference runtimes allow you to define how your model should be used within
MLServer.
You can think of them as the **backend glue** between MLServer and your machine
learning framework of choice.
You can read more about [inference runtimes in their documentation
page](./docs/runtimes/index.md).

Out of the box, MLServer comes with a set of pre-packaged runtimes which let
you interact with a subset of common frameworks.
This allows you to start serving models saved in these frameworks straight
away.
However, it's also possible to **[write custom
runtimes](./docs/runtimes/custom.md)**.

MLServer currently provides support for:

| Framework | Supported | Documentation |
| ------------- | --------- | ---------------------------------------------------------------- |
| Scikit-Learn | ✅ | [MLServer SKLearn](./runtimes/sklearn) |
| XGBoost | ✅ | [MLServer XGBoost](./runtimes/xgboost) |
| Spark MLlib | ✅ | [MLServer MLlib](./runtimes/mllib) |
| LightGBM | ✅ | [MLServer LightGBM](./runtimes/lightgbm) |
| CatBoost | ✅ | [MLServer CatBoost](./runtimes/catboost) |
| Tempo | ✅ | [`github.com/SeldonIO/tempo`](https://github.com/SeldonIO/tempo) |
| MLflow | ✅ | [MLServer MLflow](./runtimes/mlflow) |
| Alibi-Detect | ✅ | [MLServer Alibi Detect](./runtimes/alibi-detect) |
| Alibi-Explain | ✅ | [MLServer Alibi Explain](./runtimes/alibi-explain) |
| HuggingFace | ✅ | [MLServer HuggingFace](./runtimes/huggingface) |

## Supported Python Versions

🔴 Unsupported

🟠 Deprecated: To be removed in a future version

🟢 Supported

🔵 Untested

| Python Version | Status |
| -------------- | ------ |
| 3.7 | 🔴 |
| 3.8 | 🔴 |
| 3.9 | 🟢 |
| 3.10 | 🟢 |
| 3.11 | 🔵 |
| 3.12 | 🔵 |

## Examples

To see MLServer in action, check out [our full list of
examples](./docs/examples/index.md).
You can find below a few selected examples showcasing how you can leverage
MLServer to start serving your machine learning models.

- [Serving a `scikit-learn` model](./docs/examples/sklearn/README.md)
- [Serving a `xgboost` model](./docs/examples/xgboost/README.md)
- [Serving a `lightgbm` model](./docs/examples/lightgbm/README.md)
- [Serving a `catboost` model](./docs/examples/catboost/README.md)
- [Serving a `tempo` pipeline](./docs/examples/tempo/README.md)
- [Serving a custom model](./docs/examples/custom/README.md)
- [Serving an `alibi-detect` model](./docs/examples/alibi-detect/README.md)
- [Serving a `HuggingFace` model](./docs/examples/huggingface/README.md)
- [Multi-Model Serving with multiple frameworks](./docs/examples/mms/README.md)
- [Loading / unloading models from a model repository](./docs/examples/model-repository/README.md)

## Developer Guide

### Versioning

Both the main `mlserver` package and the [inference runtimes
packages](./docs/runtimes/index.md) try to follow the same versioning schema.
To bump the version across all of them, you can use the
[`./hack/update-version.sh`](./hack/update-version.sh) script.

We generally keep the version set to a placeholder for the upcoming release.

For example:

```bash
./hack/update-version.sh 0.2.0.dev1
```

### Testing

To run all of the tests for MLServer and the runtimes, use:

```bash
make test
```

To run the tests for a single file, use something like:

```bash
tox -e py3 -- tests/batch_processing/test_rest.py
```
50 changes: 50 additions & 0 deletions docs-gb/SUMMARY.md
@@ -0,0 +1,50 @@
* [MLServer](README.md)
* [Getting Started](getting-started.md)
* [User Guide](user-guide/index.md)
* [Content Types (and Codecs)](user-guide/content-type.md)
* [OpenAPI Support](user-guide/openapi.md)
* [Parallel Inference](user-guide/parallel-inference.md)
* [Adaptive Batching](user-guide/adaptive-batching.md)
* [Custom Inference Runtimes](user-guide/custom.md)
* [Metrics](user-guide/metrics.md)
* [Deployment](user-guide/deployment/README.md)
* [Seldon Core](user-guide/deployment/seldon-core.md)
* [KServe](user-guide/deployment/kserve.md)
* [Streaming](user-guide/streaming.md)
* [Inference Runtimes](runtimes/README.md)
* [SKLearn](runtimes/sklearn.md)
* [XGBoost](runtimes/xgboost.md)
* [MLflow](runtimes/mlflow.md)
* [Spark MLlib](runtimes/mllib.md)
* [LightGBM](runtimes/lightgbm.md)
* [CatBoost](runtimes/catboost.md)
* [Alibi-Detect](runtimes/alibi-detect.md)
* [Alibi-Explain](runtimes/alibi-explain.md)
* [HuggingFace](runtimes/huggingface.md)
* [Custom](runtimes/custom.md)
* [Reference](reference/README.md)
* [MLServer Settings](reference/settings.md)
* [Model Settings](reference/model-settings.md)
* [MLServer CLI](reference/cli.md)
* [Python API](reference/python-api/README.md)
* [MLModel](reference/api/model.md)
* [Types](reference/api/types.md)
* [Codecs](reference/api/codecs.md)
* [Metrics](reference/api/metrics.md)
* [Examples](examples/README.md)
* [Serving Scikit-Learn models](examples/sklearn/README.md)
* [Serving XGBoost models](examples/xgboost/README.md)
* [Serving LightGBM models](examples/lightgbm/README.md)
* [Serving MLflow models](examples/mlflow/README.md)
* [Serving a custom model](examples/custom/README.md)
* [Serving Alibi-Detect models](examples/alibi-detect/README.md)
* [Serving HuggingFace Transformer Models](examples/huggingface/README.md)
* [Multi-Model Serving](examples/mms/README.md)
* [Model Repository API](examples/model-repository/README.md)
* [Content Type Decoding](examples/content-type/README.md)
* [Custom Conda environments in MLServer](examples/conda/README.md)
* [Serving a custom model with JSON serialization](examples/custom-json/README.md)
* [Serving models through Kafka](examples/kafka/README.md)
* [Streaming](examples/streaming/README.md)
* [Deploying a Custom Tensorflow Model with MLServer and Seldon Core](examples/cassava/README.md)
* [Changelog](changelog.md)