Commit ed89726b2a
Author: dependabot[bot]
Message: Bump globals from 15.15.0 to 16.3.0
Bumps [globals](https://github.com/sindresorhus/globals) from 15.15.0 to 16.3.0.
- [Release notes](https://github.com/sindresorhus/globals/releases)
- [Commits](https://github.com/sindresorhus/globals/compare/v15.15.0...v16.3.0)

---
updated-dependencies:
- dependency-name: globals
  dependency-version: 16.3.0
  dependency-type: direct:development
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Date: 2025-07-02 01:24:36 +00:00
2 changed files with 24 additions and 8 deletions

@@ -1,21 +1,25 @@
# code-tokenizer
# toak
it's no joke
[![npm version](https://img.shields.io/npm/v/toak)](https://www.npmjs.com/package/toak)
![Tests](https://github.com/seemueller-io/code-tokenizer/actions/workflows/tests.yml/badge.svg)
![Tests](https://github.com/seemueller-io/toak/actions/workflows/tests.yml/badge.svg)
[![License: AGPL v3](https://img.shields.io/badge/License-AGPL_v3-blue.svg)](https://www.gnu.org/licenses/agpl-3.0.html)
## Overview
`toak` is a cli tool, named for phonetics, that processes git repository files, cleans code, redacts sensitive information, and generates a `prompt.md` with token counts using the Llama 3 tokenizer.
`toak` is an intentionally simple yet powerful tool that processes git repository files, cleans code, redacts sensitive information, and generates markdown documentation with token counts using the Llama 3 tokenizer.
```shell
$ cd your-git-repo
$ npx toak
```
![toak](https://github.com/seemueller-io/code-tokenizer/blob/471c2a359e342c0103d2074650afe1f1b2b5f71d/toak.jpg?raw=true)
![toak](https://github.com/seemueller-io/toak/blob/471c2a359e342c0103d2074650afe1f1b2b5f71d/toak.jpg?raw=true)
## Philosophy
1. _Human-first_ technologies for a better future.
2. If you don't like the name...good.
---
## Features
@@ -25,10 +29,14 @@ $ npx toak
- Redacts sensitive information (API keys, tokens, JWT, hashes)
- Counts tokens using llama3-tokenizer-js
- Supports nested .toak-ignore files
### Token Cleaning
- Removes single-line and multi-line comments
- Strips console.log statements
- Removes import statements
- Cleans up whitespace and empty lines
### Security Features
- Redacts API keys and secrets
- Masks JWT tokens
- Hides authorization tokens
@@ -37,7 +45,15 @@ $ npx toak
## Requirements
- npm/bun/yarn/pnpm
- Node.js (>=14.0.0)
- Git repository
- Bun runtime (for development)
## Installation
```bash
npm install toak
```
## Usage

@@ -63,7 +63,7 @@
"bun": "latest",
"bun-plugin-isolated-decl": "^0.1.10",
"eslint": "^9.24.0",
"globals": "^15.15.0",
"globals": "^16.3.0",
"oxc-transform": "^0.44.0",
"prettier": "^3.5.3",
"typescript": "^5.8.3"
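
For readers of the README hunk above, the "Token Cleaning" and "Security Features" bullets can be illustrated with a minimal sketch. This is not toak's actual implementation: the function names `cleanSource` and `redactSecrets`, the regular expressions, and the `[REDACTED]` placeholders are all assumptions chosen for illustration, and the patterns only handle simple single-line cases.

```typescript
// Hypothetical helpers: names and patterns are illustrative, not taken from toak.

/** Strip comments, console.log calls, import statements, and blank lines. */
function cleanSource(source: string): string {
  return source
    .replace(/\/\*[\s\S]*?\*\//g, "") // multi-line comments
    .replace(/^[ \t]*\/\/.*$/gm, "") // whole-line single-line comments
    .replace(/^[ \t]*console\.log\(.*\);?[ \t]*$/gm, "") // lines holding only a console.log call
    .replace(/^[ \t]*import\s.*$/gm, "") // single-line import statements
    .split("\n")
    .map((line) => line.trimEnd()) // drop trailing whitespace
    .filter((line) => line.trim() !== "") // drop empty lines
    .join("\n");
}

/** Mask JWT-shaped strings, bearer tokens, and quoted key/secret values. */
function redactSecrets(source: string): string {
  return source
    // Three base64url segments starting with "eyJ" look like a JWT.
    .replace(/eyJ[\w-]+\.[\w-]+\.[\w-]+/g, "[REDACTED_JWT]")
    // Authorization-style bearer tokens.
    .replace(/(Bearer\s+)[\w.~+\/-]+=*/g, "$1[REDACTED]")
    // Assignments such as apiKey = "..." or secret: '...'.
    .replace(
      /((?:api[_-]?key|secret|token)\s*[:=]\s*)(["'])[^"']+\2/gi,
      "$1$2[REDACTED]$2",
    );
}

const sample = [
  'import fs from "fs";',
  "/* setup",
  "   notes */",
  "// load the key",
  'const apiKey = "sk-12345";',
  'console.log("debug");',
  "",
  "const x = 1;",
].join("\n");

console.log(redactSecrets(cleanSource(sample)));
```

Running the redaction after the cleaning pass keeps the secret patterns from having to consider text that is about to be discarded anyway; a real tool would likely need a parser-based approach to handle trailing comments, multi-line imports, and statements that share a line with `console.log`.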