Mantiq is a package for dynamic knowledge caching in conversational AI. Built around cache-augmented generation (CAG), it streamlines the creation, storage, and reuse of key-value (KV) caches so that knowledge can be encoded into a language model once and reused across queries for enhanced performance.
- Efficient KV Caching: Create, save, and retrieve key-value (KV) caches for optimized performance (see the sketch after this list).
- Customizable Prompt Instructions: Tailor system and answer instructions for specific use cases.
- Seamless Integration: Built on top of HuggingFace Transformers for compatibility with a wide range of models.
- Dynamic Processing: Parse and utilize datasets in JSON format for knowledge preparation.
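For readers new to the mechanism behind the first bullet, here is a minimal sketch of KV-cache reuse in plain HuggingFace Transformers: the knowledge text is encoded once, the attention key/value states are kept, and each query is then processed against them without re-encoding the knowledge. The model name and strings are placeholders, and this only illustrates the underlying idea; Mantiq wraps this pattern behind its own API, and its internals may differ.

```python
# Minimal sketch of KV-cache reuse with plain transformers (illustrative only).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-3.2-1B-Instruct"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

# One-time pass over the knowledge; past_key_values is the reusable cache.
knowledge_ids = tokenizer(
    "Context: 55-inch TV with 4K resolution and smart features.\n",
    return_tensors="pt",
).input_ids
with torch.no_grad():
    past_key_values = model(knowledge_ids, use_cache=True).past_key_values

# Per query, only the new tokens are processed; attention still sees the
# cached knowledge through past_key_values.
query_ids = tokenizer("Q: Does it support 4K? A:", return_tensors="pt").input_ids
with torch.no_grad():
    logits = model(query_ids, past_key_values=past_key_values, use_cache=True).logits
next_id = logits[:, -1].argmax(dim=-1).item()
print(tokenizer.decode([next_id]))  # greedy next token
```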
```bash
pip install git+https://github.com/tactful-ai/mantiq.git
```
```python
from mantiq.cag import NaiiveCAG

# HuggingFace token
hf_token = "your_hf_token_here"

# Initialize NaiiveCAG
cag = NaiiveCAG(
    model_name="meta-llama/Llama-3.2-1B-Instruct",  # Replace with your model name
    hf_token=hf_token,
    quantized=True  # Enable quantization for performance
)
```
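The `quantized=True` flag presumably loads the model in reduced precision to cut memory use. For reference, the comparable setup in plain transformers is 4-bit loading via bitsandbytes, sketched below; whether `NaiiveCAG` uses exactly this configuration internally is an assumption, and the snippet requires the `bitsandbytes` package.

```python
# What quantized loading typically looks like in plain transformers
# (whether NaiiveCAG uses exactly this configuration is an assumption).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # store weights in 4-bit
    bnb_4bit_compute_dtype=torch.float16,  # run compute in fp16
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-1B-Instruct",
    quantization_config=bnb_config,
    token=hf_token,  # hf_token from the snippet above
)
```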
```python
dataset = [
    {"question": "What are the features of this TV?", "answer": "This TV has 4K resolution and smart features.", "context": "55-inch TV with advanced display technology."},
    {"question": "Does this refrigerator save energy?", "answer": "Yes, it has an A++ energy rating.", "context": "Eco-friendly refrigerator with advanced cooling technology."},
    {"question": "What makes this washing machine special?", "answer": "It has AI-powered wash cycles for better cleaning.", "context": "Front load washing machine with AI technology."}
]
```
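Because the dataset is just a list of question/answer/context records, it can also be loaded from a JSON file, matching the "datasets in JSON format" feature above (`products.json` is a hypothetical filename):

```python
import json

# Load the same structure from a file instead of defining it inline;
# the file should contain a list of {"question", "answer", "context"} objects.
with open("products.json", "r", encoding="utf-8") as f:
    dataset = json.load(f)
```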
```python
# Define instructions
prompt_instruction = "You are a Sales agent responsible for answering customer questions and convincing them to buy products."
answer_instruction = "Answer with a very short convincing response (max 50 words) to earn the customer's trust, answer only from the given context"

# Prepare KV cache
saving_path = "./cache_directory"
kv_cache, prep_time = cag.prepare_kvcache(
    dataset=dataset,
    saving_path=saving_path,
    prompt_instruction=prompt_instruction,
    answer_instruction=answer_instruction
)
print(f"KV cache prepared in {prep_time:.2f} seconds.")
```
```python
# Reload previously saved KV cache
kv_cache = cag.read_kv_cache("./cache_directory/kv_cache.pt")
```
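In a real application you would typically prepare the cache once and reload it on later runs. A small guard like the one below avoids recomputing it; the `kv_cache.pt` filename follows the example above, but whether `prepare_kvcache` always writes that exact name is an assumption.

```python
import os

# Continues the example above: build the cache once, reuse it afterwards.
cache_file = os.path.join(saving_path, "kv_cache.pt")
if os.path.exists(cache_file):
    kv_cache = cag.read_kv_cache(cache_file)
else:
    kv_cache, prep_time = cag.prepare_kvcache(
        dataset=dataset,
        saving_path=saving_path,
        prompt_instruction=prompt_instruction,
        answer_instruction=answer_instruction,
    )
```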
```python
# Customer query
input_query = "i wanna buy 4K screen, any recommendations?"

# Generate response
response = cag.generate_response(
    query=input_query,
    past_key_values=kv_cache,  # KV cache prepared earlier
    max_new_tokens=100  # Maximum number of tokens for the response
)
print(response)
# I'd be happy to help you find a 4K screen that fits your needs. Our 55-inch 4K TV is a top-of-the-line model with crystal-clear picture and HDR for an immersive viewing experience.
```
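Since the knowledge is already encoded in the cache, the same `kv_cache` can serve any number of customer queries without re-processing the dataset. This assumes `generate_response` leaves the stored knowledge cache reusable between calls, as CAG implementations commonly do by truncating the cache back to its knowledge-only length after each turn:

```python
# Reuse one cache across several turns; only each short query is processed,
# which is the main latency win of cache-augmented generation.
queries = [
    "Does the fridge really save energy?",
    "Why should I pick this washing machine?",
]
for q in queries:
    print(cag.generate_response(query=q, past_key_values=kv_cache, max_new_tokens=100))
```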
Mantiq is part of a long-term development plan by the Tactful team focused on advances in conversational AI and cache-augmented generation (CAG). This work is inspired by the research presented in the paper "Dynamic Knowledge Caching for Conversational AI," which you can access here.
The original repository that laid the foundation for CAG can be found here.
We welcome contributions! Please follow the standard GitHub workflow:
- Fork the repository
- Create a feature branch
- Commit your changes
- Submit a pull request
This project is licensed under the MIT License. See the LICENSE file for details.
For questions or support, feel free to open an issue in the repository.