predict-otron-9001

mirror of https://github.com/geoffsee/predict-otron-9001.git synced 2025-09-08 22:46:44 +00:00

Files

geoffsee 315ef17605 supports small llama and gemma models

Refactor inference

dedicated crates for llama and gemma inferencing, not integrated

2025-08-29 20:00:41 -04:00

File                            Last commit   Date
cli.ts                          update docs   2025-08-28 12:54:09 -04:00
curl_chat_stream.sh                           2025-08-27 16:15:01 -04:00
curl_chat.sh                                  2025-08-27 16:15:01 -04:00
performance_test_embeddings.sh                2025-08-27 16:15:01 -04:00
performance_test_inference.sh                 2025-08-27 16:15:01 -04:00
run_llama.sh                                  2025-08-29 20:00:41 -04:00
run_server.sh                   update docs   2025-08-28 12:54:09 -04:00
test.sh                         update docs   2025-08-28 12:54:09 -04:00