Unverified Commit 9593ada3 authored by Aleix Ramírez Baena's avatar Aleix Ramírez Baena Committed by GitHub
Browse files

F #227: First version of Dynamo appliance (#228)

This appliance includes the runtime installation of the Dynamo inference framework. Supported parameters:

- ONEAPP_DYNAMO_API_PORT: Port where the Dynamo API will be exposed
-  ONEAPP_DYNAMO_MODEL_ID: Name of the model in Hugging Face
-  ONEAPP_DYNAMO_MODEL_TOKEN: HF API token
- ONEAPP_DYNAMO_ENGINE_NAME: Name of the dynamo engine to use: mistralrs|sglang|llamacpp|vllm|trtllm|echo_full|echo_core.
-  ONEAPP_DYNAMO_ENGINE_EXTRA_ARGS_JSON: Engine extra args set in JSON format.
- ONEAPP_DYNAMO_ENGINE_EXTRA_ARGS_JSON_BASE64: Engine extra args set in JSON and encoded in base64.
parent 9a63b659
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment