mirror of
https://github.com/geoffsee/predict-otron-9001.git
synced 2025-09-08 22:46:44 +00:00
450 B
450 B
predict-otron-9000
This is an extensible axum/tokio hybrid combining embeddings-engine, inference-engine, and leptos-app.
Notes
- When
server_mode
is Standalone (default), the instance contains all components necessary for inference. - When
server_mode
is HighAvailability, automatic scaling of inference and embeddings; proxies to inference and embeddings services via dns