Skip to content
View jklaise's full-sized avatar

Block or report jklaise

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLMs

56 repositories

🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.

Python 561 50 Updated Feb 27, 2025

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Python 50,072 5,492 Updated Mar 2, 2025

A method to fix GPT-3 after deployment with user feedback, without re-training.

Python 326 13 Updated Apr 4, 2023

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 102,009 16,541 Updated Mar 1, 2025

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Jupyter Notebook 1,685 121 Updated Jan 29, 2024

A language for constraint-guided and efficient LLM programming.

Python 3,836 203 Updated Jun 3, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,770 6,523 Updated Dec 9, 2024

ChatGPT web interface using the OpenAI API

Svelte 1,947 477 Updated Feb 24, 2025

Python bindings for llama.cpp

Python 8,747 1,068 Updated Jan 29, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,244 893 Updated Feb 27, 2025

📋 A list of open LLMs available for commercial use.

11,725 809 Updated Feb 13, 2025

Seamlessly integrate LLMs into scikit-learn.

Python 3,425 279 Updated Feb 1, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 39,939 5,979 Updated Mar 3, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,573 1,765 Updated Feb 26, 2025

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

Python 3,502 231 Updated Jul 3, 2024

🤖 A PyTorch library of curated Transformer models and their composable components

Python 880 34 Updated Apr 17, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 6,743 671 Updated Feb 26, 2025

Large Language Model Text Generation Inference

Python 9,832 1,154 Updated Feb 28, 2025

Inference code for Llama models

Python 57,771 9,715 Updated Jan 26, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,266 558 Updated Oct 28, 2024

LLM fine-tuning and eval

TypeScript 344 13 Updated Mar 21, 2024

Numbers every LLM developer should know

4,185 140 Updated Jan 16, 2024

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 7,380 581 Updated Mar 3, 2025

Structured Text Generation

Python 10,875 569 Updated Feb 28, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,085 1,675 Updated Mar 2, 2025

Fast inference engine for Transformer models

C++ 3,626 324 Updated Feb 25, 2025

experiments with inference on llama

Python 104 16 Updated Jun 6, 2024

Adding guardrails to large language models.

Python 4,550 354 Updated Feb 28, 2025

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript

Go 567 20 Updated Jul 2, 2024