Warning
The project is currently under active development; please update it frequently using the following command:

```bash
pip install -U git+https://github.com/NeptuneIsTheBest/chat-with-mlx.git
```
Run LLMs on your Mac! An all-in-one chat Web UI based on the MLX framework, designed for Apple Silicon.
The idea of chat-with-mlx comes from qnguyen3/chat-with-mlx.
chat-with-mlx provides a similar but more modern experience and offers more features.
You can upload files to the chat, or even images when using a vision model.
If this project helps you, I'd be happy if you could give it a star. Thank you! ✨
Use the following commands to install and run:
```bash
python -m venv chat-with-mlx
cd chat-with-mlx
. ./bin/activate
pip install git+https://github.com/NeptuneIsTheBest/chat-with-mlx.git
chat-with-mlx
```
- Chat
- Completion
- Model Management
- RAG
- Upload files to chat (supports PDF, Word, Excel, PPT, and plain text files such as `.txt`, `.csv`, and `.md`)
- Upload pictures to chat (currently tested with the `Phi-3.5-vision-instruct` model)
- and so on...
- Install using pip:

  ```bash
  python -m venv chat-with-mlx
  cd chat-with-mlx
  . ./bin/activate
  pip install git+https://github.com/NeptuneIsTheBest/chat-with-mlx.git
  ```

- Start the server (an example invocation is shown after this list):

  ```bash
  chat-with-mlx
  ```

  - `--port`: The port on which the server will run (default is `7860`).
  - `--share`: If specified, the server will be shared publicly.

- Use in the browser: By default, a page will open at http://127.0.0.1:7860, where you can chat.
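For example, to run the server on a different port and create a publicly shared link (the port value here is only illustrative):

```bash
# Start the web UI on port 8080 instead of the default 7860,
# and expose a publicly shared link.
chat-with-mlx --port 8080 --share
```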
You no longer need to add models by manually editing configuration files; simply use the "Model Management" page to add your models.
You can add various models from mlx-community. Models will be automatically downloaded from Hugging Face.
With the following configuration file, the model files will be stored in `models/models/Ministral-8B-Instruct-2410-4bit`.
`Ministral-8B-Instruct-2410-4bit.json`:

```json
{
    "original_repo": "mistralai/Ministral-8B-Instruct-2410",
    "mlx_repo": "mlx-community/Ministral-8B-Instruct-2410-4bit",
    "model_name": "Ministral-8B-Instruct-2410-4bit",
    "quantize": "4bit",
    "default_language": "multi",
    "system_prompt": "",
    "multimodal_ability": []
}
```
- `original_repo`: The original repository where the model can be found.
- `mlx_repo`: The repository in the MLX community.
- `model_name`: The name of the model.
- `quantize`: The quantization format of the model (e.g., `4bit`).
- `default_language`: The default language setting (e.g., `multi` for multilingual support).
- `system_prompt`: The system prompt of the model.
- `multimodal_ability`: The multimodal capabilities of the model.
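For comparison, here is a sketch of what a configuration for a vision-capable model might look like, using the same fields. It is based on the Phi-3.5-vision-instruct model mentioned above; since JSON does not allow comments, note here that the exact `mlx_repo` name and the `"vision"` entry in `multimodal_ability` are illustrative assumptions rather than values confirmed by this document.

```json
{
    "original_repo": "microsoft/Phi-3.5-vision-instruct",
    "mlx_repo": "mlx-community/Phi-3.5-vision-instruct-4bit",
    "model_name": "Phi-3.5-vision-instruct-4bit",
    "quantize": "4bit",
    "default_language": "multi",
    "system_prompt": "",
    "multimodal_ability": ["vision"]
}
```

Check the mlx-community organization on Hugging Face for the actual repository name before using a configuration like this.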
If you have any questions, feel free to open an issue to discuss at any time. If you want to contribute code, please feel free to submit a PR.
Thanks to the maintainers of qnguyen3/chat-with-mlx and mlx, as well as all members of the open source community, for creating such useful libraries.
This project is licensed under the MIT License.