geoffsee
e6f3351ebb
minor version
2025-09-04 15:08:43 -04:00
geoffsee
3992532f15
fmt and clippy
2025-09-04 15:07:49 -04:00
geoffsee
3ecdd9ffa0
update deployment tooling to remove dependencies on unused metadata
2025-09-04 15:03:17 -04:00
geoffsee
296d4dbe7e
add root dockerfile that contains binaries for all services
2025-09-04 14:54:20 -04:00
geoffsee
fb5098eba6
fix clippy errors
2025-09-04 13:53:00 -04:00
geoffsee
c1c583faab
run cargo fmt
2025-09-04 13:45:25 -04:00
geoffsee
1e02b12cda
fixes issue with model selection
2025-09-04 13:42:30 -04:00
geoffsee
ff55d882c7
reorg + update docs with new paths
2025-09-04 12:40:59 -04:00
geoffsee
400c70f17d
streaming implementaion re-added to UI
v0.1.5
2025-09-02 14:45:16 -04:00
geoffsee
bcbc6c4693
fix invalid endpoint in curl_stream_script.sh
2025-09-02 13:58:34 -04:00
geoffsee
21f20470de
patch version
v0.1.4
2025-09-01 22:55:59 -04:00
geoffsee
2deecb5e51
chat client only displays available models
2025-09-01 22:29:54 -04:00
geoffsee
545e0c9831
make wasm32 availble for all builds in ci
2025-08-31 20:22:12 -04:00
geoffsee
eca61c51ad
add build step to ci
2025-08-31 20:08:54 -04:00
geoffsee
d1a7d5b28e
fix format error
2025-08-31 19:59:09 -04:00
geoffsee
8d2b85b0b9
update docs
2025-08-31 19:27:15 -04:00
geoffsee
4570780666
release 0.1.3
2025-08-31 18:55:37 -04:00
geoffsee
44e4f9e5e1
put proof in the pudding
2025-08-31 18:54:20 -04:00
geoffsee
64daa77c6b
leptos chat ui renders
2025-08-31 18:50:25 -04:00
geoffsee
2b4a8a9df8
chat-ui not functional yet but builds
2025-08-31 18:18:56 -04:00
geoffsee
38d51722f2
Update configuration loading with Cargo.toml path and clean up .gitignore
...
---
This commit message concisely communicates the key changes:
1. The code now builds an absolute path to the `Cargo.toml` file, enhancing clarity in configuration loading.
2. The addition of `PathBuf` usage improves type safety.
3. The removal of unnecessary entries from `.gitignore` helps maintain a clean project structure.
These updates reflect improvements in both functionality and project organization.
2025-08-31 14:06:44 -04:00
geoffsee
7bc9479a11
fix format issues, needs precommit hook
v0.1.2
2025-08-31 13:24:51 -04:00
geoffsee
0580dc8c5e
move cli into crates and stage for release
2025-08-31 13:23:50 -04:00
geoffsee
9e9aa69769
bump version in Cargo.toml
v0.1.1
2025-08-31 11:04:31 -04:00
geoffsee
3eb1a5329b
add rust compiler optimizations at workspace level, bump minor version and publish first release
2025-08-31 11:02:58 -04:00
geoffsee
eb1591aa5d
fix fmt error
2025-08-31 10:52:48 -04:00
geoffsee
e6c417bd83
align dependencies across inference features
2025-08-31 10:49:04 -04:00
geoffsee
f5d2a85f2e
cleanup, add ci
2025-08-31 10:31:20 -04:00
Geoff Seemueller
419e1c2ea7
fix Kubernetes spelling
2025-08-30 08:24:24 -04:00
Geoff Seemueller
06fdfcf898
clarify project intent
2025-08-30 08:23:38 -04:00
geoffsee
315ef17605
supports small llama and gemma models
...
Refactor inference
dedicated crates for llama and gemma inferencing, not integrated
2025-08-29 20:00:41 -04:00
geoffsee
d06b16bb12
remove confusing comments
2025-08-28 16:09:29 -04:00
geoffsee
62dcc8f5bb
ai generated README.md
2025-08-28 16:04:45 -04:00
Geoff Seemueller
f7001fc72b
remove arbitrary keys for standalone config
2025-08-28 13:19:48 -04:00
Geoff Seemueller
5bce413f8f
Update SERVER_CONFIG.md, replacing Local with Standalone
2025-08-28 13:18:55 -04:00
geoffsee
d9772a67d1
update diagrams to show accurate development configuration
2025-08-28 13:04:17 -04:00
geoffsee
6b709b8ec5
remove weird art
2025-08-28 12:56:07 -04:00
geoffsee
d04340d9ac
update docs
2025-08-28 12:54:09 -04:00
geoffsee
0488bddfdb
Create ARCHITECTURE.md - update stale references to old chat crate
2025-08-28 12:22:05 -04:00
geoffsee
770985afd2
remove stale doc
2025-08-28 12:07:19 -04:00
geoffsee
e38a2d4512
predict-otron-9000 serves a leptos SSR frontend
2025-08-28 12:06:22 -04:00
geoffsee
45d7cd8819
- Introduced ServerConfig
for handling deployment modes and services.
...
- Added HighAvailability mode for proxying requests to external services.
- Maintained Local mode for embedded services.
- Updated `README.md` and included `SERVER_CONFIG.md` for detailed documentation.
2025-08-28 09:55:39 -04:00
geoffsee
c96831d494
Add Docker Compose setup for Predict-O-Tron 9000 and Leptos Chat
2025-08-28 08:46:57 -04:00
geoffsee
bfe7c04cf5
Add Rust-based Helm Chart Generator Tool
...
- Scaffold `helm-chart-tool` with Cargo project files.
- Implement core functionality: parse Cargo.toml, extract Kubernetes metadata, and generate Helm charts.
- Include support for deployments, services, ingress, and helper templates.
- Add README with detailed usage instructions.
- Update `.gitignore` for generated Helm charts and related artifacts.
2025-08-28 08:39:54 -04:00
geoffsee
c8b3561e36
Remove ROOT_CAUSE_ANALYSIS.md and outdated server logs
2025-08-28 08:26:18 -04:00
geoffsee
b606adbe5d
Add Docker Compose and Kubernetes metadata to Cargo.toml files
2025-08-28 07:56:34 -04:00
geoffsee
9d6cb62b10
Add Dockerfile for Leptos Chat deployment
2025-08-28 07:54:57 -04:00
geoffsee
956d00f596
Add CLEANUP.md
with identified documentation and code issues. Update README files to fix repository URL, unify descriptions, and clarify Gemma model usage.
2025-08-28 07:24:14 -04:00
geoffsee
719beb3791
- Change default server host to localhost for improved security.
...
- Increase default maximum tokens in CLI configuration to 256.
- Refactor and reorganize CLI
2025-08-27 21:47:31 -04:00
geoffsee
766d41af78
- Refactored build_pipeline
usage to ensure pipeline arguments are cloned.
...
- Introduced `reset_state` for clearing cached state between requests.
- Enhanced chat UI with model selector and dynamic model fetching.
- Improved error logging and detailed debug messages for chat request flows.
- Added fresh instantiation of `TextGeneration` to prevent tensor shape mismatches.
2025-08-27 17:53:50 -04:00