Commit Graph

  • 4380ac69d3 v0.1.5 already exists master v0.1.6 geoffsee 2025-09-04 15:09:30 -04:00
  • e6f3351ebb minor version geoffsee 2025-09-04 15:08:43 -04:00
  • 3992532f15 fmt and clippy geoffsee 2025-09-04 15:07:49 -04:00
  • 3ecdd9ffa0 update deployment tooling to remove dependencies on unused metadata geoffsee 2025-09-04 15:03:17 -04:00
  • 296d4dbe7e add root dockerfile that contains binaries for all services geoffsee 2025-09-04 14:54:20 -04:00
  • fb5098eba6 fix clippy errors geoffsee 2025-09-04 13:53:00 -04:00
  • c1c583faab run cargo fmt geoffsee 2025-09-04 13:45:25 -04:00
  • 1e02b12cda fixes issue with model selection geoffsee 2025-09-04 13:42:30 -04:00
  • ff55d882c7 reorg + update docs with new paths geoffsee 2025-09-04 12:27:13 -04:00
  • 400c70f17d streaming implementation re-added to UI v0.1.5 geoffsee 2025-09-02 14:45:16 -04:00
  • bcbc6c4693 fix invalid endpoint in curl_stream_script.sh geoffsee 2025-09-02 13:58:34 -04:00
  • 21f20470de patch version v0.1.4 geoffsee 2025-09-01 22:55:59 -04:00
  • 2deecb5e51 chat client only displays available models geoffsee 2025-09-01 22:29:54 -04:00
  • 545e0c9831 make wasm32 available for all builds in ci geoffsee 2025-08-31 20:22:12 -04:00
  • eca61c51ad add build step to ci geoffsee 2025-08-31 20:08:54 -04:00
  • d1a7d5b28e fix format error geoffsee 2025-08-31 19:59:09 -04:00
  • 8d2b85b0b9 update docs geoffsee 2025-08-31 19:27:15 -04:00
  • 4570780666 release 0.1.3 geoffsee 2025-08-31 18:55:37 -04:00
  • 44e4f9e5e1 put proof in the pudding geoffsee 2025-08-31 18:54:20 -04:00
  • 64daa77c6b leptos chat ui renders geoffsee 2025-08-31 18:50:25 -04:00
  • 2b4a8a9df8 chat-ui not functional yet but builds geoffsee 2025-08-31 18:18:56 -04:00
  • 38d51722f2 Update configuration loading with Cargo.toml path and clean up .gitignore geoffsee 2025-08-31 14:06:44 -04:00
  • 7bc9479a11 fix format issues, needs precommit hook v0.1.2 geoffsee 2025-08-31 13:24:51 -04:00
  • 0580dc8c5e move cli into crates and stage for release geoffsee 2025-08-31 13:23:50 -04:00
  • 9e9aa69769 bump version in Cargo.toml v0.1.1 geoffsee 2025-08-31 11:04:31 -04:00
  • 3eb1a5329b add rust compiler optimizations at workspace level, bump minor version and publish first release geoffsee 2025-08-31 11:02:58 -04:00
  • eb1591aa5d fix fmt error geoffsee 2025-08-31 10:52:48 -04:00
  • e6c417bd83 align dependencies across inference features geoffsee 2025-08-31 10:49:04 -04:00
  • f5d2a85f2e cleanup, add ci geoffsee 2025-08-31 10:31:07 -04:00
  • 419e1c2ea7 fix Kubernetes spelling Geoff Seemueller 2025-08-30 08:24:24 -04:00
  • 06fdfcf898 clarify project intent Geoff Seemueller 2025-08-30 08:23:38 -04:00
  • 315ef17605 supports small llama and gemma models inference-overhaul geoffsee 2025-08-29 18:15:29 -04:00
  • d06b16bb12 remove confusing comments geoffsee 2025-08-28 16:09:29 -04:00
  • 62dcc8f5bb ai generated README.md geoffsee 2025-08-28 16:04:38 -04:00
  • f7001fc72b remove arbitrary keys for standalone config Geoff Seemueller 2025-08-28 13:19:48 -04:00
  • 5bce413f8f Update SERVER_CONFIG.md, replacing Local with Standalone Geoff Seemueller 2025-08-28 13:18:55 -04:00
  • d9772a67d1 update diagrams to show accurate development configuration geoffsee 2025-08-28 13:04:17 -04:00
  • 6b709b8ec5 remove weird art geoffsee 2025-08-28 12:56:07 -04:00
  • d04340d9ac update docs geoffsee 2025-08-28 12:54:09 -04:00
  • 0488bddfdb Create ARCHITECTURE.md - update stale references to old chat crate geoffsee 2025-08-28 12:22:05 -04:00
  • 770985afd2 remove stale doc geoffsee 2025-08-28 12:07:19 -04:00
  • e38a2d4512 predict-otron-9000 serves a leptos SSR frontend geoffsee 2025-08-28 12:06:22 -04:00
  • 45d7cd8819 - Introduced ServerConfig for handling deployment modes and services. - Added HighAvailability mode for proxying requests to external services. - Maintained Local mode for embedded services. - Updated README.md and included SERVER_CONFIG.md for detailed documentation. geoffsee 2025-08-28 09:55:39 -04:00
  • c96831d494 Add Docker Compose setup for Predict-O-Tron 9000 and Leptos Chat geoffsee 2025-08-28 08:46:57 -04:00
  • bfe7c04cf5 Add Rust-based Helm Chart Generator Tool geoffsee 2025-08-28 08:39:54 -04:00
  • c8b3561e36 Remove ROOT_CAUSE_ANALYSIS.md and outdated server logs geoffsee 2025-08-28 08:26:18 -04:00
  • b606adbe5d Add Docker Compose and Kubernetes metadata to Cargo.toml files geoffsee 2025-08-28 07:56:34 -04:00
  • 9d6cb62b10 Add Dockerfile for Leptos Chat deployment geoffsee 2025-08-28 07:54:46 -04:00
  • 956d00f596 Add CLEANUP.md with identified documentation and code issues. Update README files to fix repository URL, unify descriptions, and clarify Gemma model usage. geoffsee 2025-08-28 07:24:14 -04:00
  • 719beb3791 - Change default server host to localhost for improved security. - Increase default maximum tokens in CLI configuration to 256. - Refactor and reorganize CLI geoffsee 2025-08-27 21:47:24 -04:00
  • 766d41af78 - Refactored build_pipeline usage to ensure pipeline arguments are cloned. - Introduced reset_state for clearing cached state between requests. - Enhanced chat UI with model selector and dynamic model fetching. - Improved error logging and detailed debug messages for chat request flows. - Added fresh instantiation of TextGeneration to prevent tensor shape mismatches. geoffsee 2025-08-27 17:53:50 -04:00
  • f1b57866e1 remove stale files geoffsee 2025-08-27 16:36:54 -04:00
  • 9e28e259ad Add support for listing available models via CLI and HTTP endpoint geoffsee 2025-08-27 16:35:08 -04:00
  • 432c04d9df Removed legacy inference engine assets. geoffsee 2025-08-27 16:19:31 -04:00
  • 8338750beb Refactor apply_cached_repeat_penalty for optimized caching and reuse, add extensive unit tests, and integrate special handling for gemma-specific models. geoffsee 2025-08-26 01:30:26 -04:00
  • 7dd23213c9 fix image path again geoffsee 2025-08-16 20:11:15 -04:00
  • dff09dc4d0 fix image path geoffsee 2025-08-16 20:09:28 -04:00
  • 83f2a8b295 add an image to the readme geoffsee 2025-08-16 20:08:35 -04:00
  • b8ba994783 Integrate create_inference_router from inference-engine into predict-otron-9000, simplify server routing, and update dependencies to unify versions. geoffsee 2025-08-16 19:53:21 -04:00
  • 411ad78026 Remove stale reference in documentation. Geoff Seemueller 2025-08-16 19:29:11 -04:00
  • 2aa6d4cdf8 Introduce predict-otron-9000: Unified server combining embeddings and inference engines. Includes OpenAI-compatible APIs, full documentation, and example scripts. geoffsee 2025-08-16 19:11:35 -04:00