# llmpebase

`llmpebase` is a unified platform for experimenting with various reasoning methods for large language models (LLMs) in problem solving. The codebase is designed to be easy to use and to ensure fair comparisons across methods. It provides modular components that make it straightforward to implement new reasoning algorithms and apply them to diverse datasets and tasks for comprehensive evaluation.
The structure of `llmpebase` is:

```
.
├── configs        # Configuration files to be used
├── examples       # Implemented examples
└── llmpebase      # The source code of `llmpebase`
    ├── datasets   # Datasets
    ├── models     # LLMs, prompting, and thought structures
    ├── extractor  # To extract the result from the output
    └── evaluator  # To evaluate by comparing the ground truth and the result
```
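Each file under `configs` pairs a dataset with an LLM and a prompting method (e.g., `configs/GSM8K/CoT_chatgpt.yml`). The sketch below is only a guess at what such a file might contain; every field name in it is an assumption, not `llmpebase`'s actual schema:

```yaml
# Hypothetical config sketch -- all field names are assumptions.
data:
  data_name: GSM8K            # dataset to load
model:
  model_name: gpt-3.5-turbo   # LLM to query
prompt:
  prompt_type: CoT            # prompting method, e.g., Standard / CoT / ZeroCoT
```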
## Supported datasets

Mathematical problems:
- GSM8K
- SVAMP
- AQUA-RAT
- MATH
- TheoremQA
- Game of 24
Multi-task reasoning:
- MMLU
- BBH
Commonsense reasoning:
- CSQA (CommonsenseQA)
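For instance, each GSM8K record pairs a grade-school word problem with a step-by-step solution whose final answer follows a `####` marker (the sample below is lightly simplified; raw GSM8K answers also carry `<<...>>` calculator annotations):

```json
{
  "question": "Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May?",
  "answer": "Natalia sold 48/2 = 24 clips in May. Natalia sold 48+24 = 72 clips altogether in April and May. #### 72"
}
```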
## Supported LLMs

- GPT
- Llama
- Llama2
- Claude
- Falcon
## Supported prompting methods

- Few-shot prompting
- Chain-of-Thought (CoT) prompting
- Zero-shot prompting
- BoT (Boosting of Thoughts) prompting [1]
- TR (Thought Rollback) prompting [2]
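To illustrate the difference between standard and zero-shot CoT prompting, here is a generic sketch; it is illustrative only and not `llmpebase`'s internal prompt format:

```python
question = (
    "Natalia sold clips to 48 of her friends in April, and then she sold "
    "half as many clips in May. How many clips did Natalia sell altogether "
    "in April and May?"
)

# Standard prompting asks for the answer directly.
standard_prompt = f"Q: {question}\nA:"

# Zero-shot CoT appends the trigger phrase from Kojima et al. (2022) so the
# model writes out intermediate reasoning steps before the final answer.
zero_shot_cot_prompt = f"Q: {question}\nA: Let's think step by step."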
## Thought structures

- Chain
- Tree
- Graph
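Each node in these structures holds one intermediate reasoning step (a "thought"): a chain links thoughts linearly, a tree branches into alternative thoughts, and a graph additionally allows edges back to earlier thoughts, as in Thought Rollback [2]. Below is a minimal, hypothetical sketch; the class and field names are assumptions, not `llmpebase`'s actual API:

```python
from dataclasses import dataclass, field

@dataclass
class ThoughtNode:
    """One intermediate reasoning step; all names here are illustrative."""
    thought: str       # text of this reasoning step
    score: float = 0.0 # optional evaluation of the step
    children: list["ThoughtNode"] = field(default_factory=list)

# A chain is the special case where every node has exactly one child;
# a tree branches; a graph may also add edges back to earlier nodes.
root = ThoughtNode("Identify the quantities: 48 clips in April, half as many in May.")
root.children.append(ThoughtNode("May sales: 48 / 2 = 24 clips."))
```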
## Extractors and evaluators

Extractors:

- LLM-based extractor
- Regex-based extractor

Evaluators:

- Regex-based evaluator
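To make the extractor and evaluator roles concrete, here is a minimal, hypothetical sketch of a regex-based extractor and evaluator for numeric answers; `llmpebase`'s actual implementations may differ:

```python
import re

def extract_result(output: str) -> str | None:
    """Regex-based extraction: take the last number appearing in the LLM output."""
    numbers = re.findall(r"-?\d+(?:\.\d+)?", output.replace(",", ""))
    return numbers[-1] if numbers else None

def evaluate(result: str | None, groundtruth: str) -> bool:
    """Regex-based evaluation: compare the extracted result with the ground truth."""
    return result is not None and float(result) == float(groundtruth)

# extract_result("So the answer is 72.")  -> "72"
# evaluate("72", "72")                    -> True
```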
## How to run

Anyone can run the examples under `examples/` by following three steps:

1. (Optional) To use the ChatGPT and Claude APIs, one needs to obtain the `OPENAI_API_KEY`, `OPENAI_ORGAN_KEY`, and `ANTHROPIC_KEY`, and set them in a `.env` file under the root directory.
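   For example, the `.env` file looks like the following; the variable names are those listed above, while the values are placeholders:

   ```
   OPENAI_API_KEY=sk-...
   OPENAI_ORGAN_KEY=org-...
   ANTHROPIC_KEY=sk-ant-...
   ```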
2. Download the code from GitHub and install `llmpebase` by running `$ pip install .` in the root directory.
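   For example (the repository URL is a placeholder, as the source does not state it):

   ```
   $ git clone https://github.com/<owner>/llmpebase.git
   $ cd llmpebase
   $ pip install .
   ```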
3. Run the examples by executing:

   ```
   $ python examples/ChainOfThought/ChainOfThought.py -c configs/GSM8K/Standard_chatgpt.yml -b LLMPEBASE
   $ python examples/ChainOfThought/ChainOfThought.py -c configs/GSM8K/CoT_chatgpt.yml -b LLMPEBASE
   $ python examples/ChainOfThought/ChainOfThought.py -c configs/GSM8K/ZeroCoT_chatgpt.yml -b LLMPEBASE
   ```
See the documents under `examples/` for more details.
## References

[1] Sijia Chen, Baochun Li, and Di Niu. "Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models." ICLR 2024. See `examples/BoTReasoning`.

[2] Sijia Chen and Baochun Li. "Toward Adaptive Reasoning in Large Language Models with Thought Rollback." ICML 2024. See `examples/ThoughtRollback`.