cpai is a command-line tool that concatenates multiple files into a single markdown text string, making it easy to paste the full context of an application into LLMs.
pip install git+https://github.com/sockdrawermoney/cpai.git
Run cpai:
cpai [options] [file|directory...]
Options:
--tree
or-t
: Generate a file and function tree-f [FILENAME], --file [FILENAME]
: Output to file. If FILENAME is not provided, defaults to 'output-cpai.md'-n, --noclipboard
: Don't copy to clipboard--stdout
: Output to stdout instead of clipboard-a, --all
: Include all files (including tests, configs, etc.)-c, --configs
: Include configuration files-x PATTERN [PATTERN...], --exclude PATTERN [PATTERN...]
: Additional patterns to exclude--bydir [DIR...]
: Process directories independently and output to {dir}.tree.md files. If no directories are specified, processes all non-excluded directories in current path.--overwrite
,-o
: Overwrite existing output files without confirmation
Process all files in current directory:
cpai
Generate a tree view of specific files:
cpai --tree src/main.py src/utils.py
Process multiple directories independently:
cpai --bydir src/module1 src/module2 src/module3
# Creates: module1.tree.md, module2.tree.md, module3.tree.md
Process all non-excluded directories in current path:
cpai --bydir
# Creates a .tree.md file for each directory
Process directories and overwrite existing files:
cpai --bydir --overwrite
# Creates or overwrites .tree.md files for each directory
If no files or directories are specified, cpai will process all supported files in the current directory.
Examples:
# Process src/ directory but exclude specific paths
cpai src/ -x "**/*.test.js" "docs/"
# Process multiple directories but exclude specific ones
cpai src/ lib/ -x test/ docs/ "*.spec.ts"
# Process all files except tests and specific directories
cpai -a -x tests/ documentation/ "*.md"
# Process core source files (default behavior)
cpai src/
# Process all files including tests and configs
cpai src/ -a
# Process core source files and include configs
cpai src/ -c
# Process files and output to stdout instead of clipboard
cpai src/ --stdout
# Display directory and function tree structure in stdout
cpai src/ --tree --stdout
# Copy tree structure to clipboard
cpai src/ --tree
The tool can be configured using a cpai.config.json
file in your project root. Here's an example configuration:
{
"include": ["."],
"exclude": [
"**/*.min.js",
"**/*.bundle.js",
"**/vendor/**"
],
"fileExtensions": [".js", ".py", ".ts"],
"outputFile": false,
"usePastebin": true,
"chunkSize": 90000
}
The tool uses a combination of default exclude patterns, custom exclude patterns, and include patterns to determine which files to process:
-
Default Exclude Patterns: A set of common patterns (like
node_modules
,build
,.git
, etc.) are always excluded by default. -
Custom Exclude Patterns: The
exclude
field in your config is additive - any patterns you specify are added to the default excludes. -
Include Patterns: The
include
field is the only way to override excludes. If a file matches an include pattern, it will be included even if it matches an exclude pattern.
For example, if you want to process files in a tests
directory (which is excluded by default):
{
"include": ["./tests/**/*.py"]
}
The fileExtensions
field specifies which file types to process. If not specified, a default set of common extensions is used.
You can create your own cpai.config.json to override any of these defaults. By default, cpai will:
- Include only core source files (excluding tests, configs, build files, etc.)
- Look for source files in common directories (src/, app/, pages/, components/, lib/)
- Support common file extensions for JavaScript/TypeScript, Python, Solidity, and Rust projects
Here are the default settings that cpai starts with (you can override these in your cpai.config.json):
{
"include": ["src", "lib"],
"exclude": [
"build/", "dist/", "__pycache__/", ".cache/", "coverage/", ".next/",
"out/", ".nuxt/", ".output/", "*.egg-info/",
"node_modules/", "venv/", ".env/", "virtualenv/",
"test/", "tests/", "__tests__/", "**/*.test.*", "**/*.spec.*",
".idea/", ".vscode/", ".DS_Store",
".git/", "*.log"
],
"outputFile": false,
"usePastebin": true,
"fileExtensions": [
".ts", ".js", ".py", ".rs", ".sol", ".go", ".jsx", ".tsx",
".css", ".scss", ".svelte", ".html", ".java", ".c", ".cpp",
".h", ".hpp", ".rb", ".php", ".swift", ".kt", ".scala", ".sh",
".bash", ".md", ".json", ".yaml", ".yml", ".toml"
],
"chunkSize": 90000
}
The chunkSize
parameter determines the maximum number of characters in each chunk when splitting large outputs. The default is 90,000 characters.
By default, cpai will:
- Generate a directory structure of the processed files.
- When using
--tree
:- Display a clean tree view of the directory structure and function outlines
- Skip markdown headers and code blocks for cleaner output
- Without
--tree
:- Concatenate the content of all processed files into a single markdown string
- Include directory structure at the top
- Handle output based on options:
- When using
--stdout
: Output directly to terminal - When using
-f/--file
: Write to specified file - Otherwise: Copy to clipboard (using pbcopy on macOS)
- When using
The output format is:
## Directory Structure
directory structure here
## path/to/filename.ext
```ext
file content
------ 90000 character chunk split ------
(next chunk of content)
cpai uses only Python standard library modules and should work on any system with Python 3.6+.
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License.