Skip to content

Run abstracts through a local LLM to give you some possible scores on what you are looking for.

License

Notifications You must be signed in to change notification settings

jjasghar/ai-abstract-classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ai-abstract-classifier

Scope

This is an application that takes AnythingLLM and a selection of abstracts and asks a local LLM (granite ideally) if the abstract has been written by an AI and/or is a possible sales pitch. It gives a file called overview.csv with a confidence score of up to 100 if it's been AI or too "sales-y."

You can also inject a csv into this instead of reading the API. Take a look at test_data/testing.csv as an example. Take a look at the config.toml.example for where to configure the csv.

NOTE: This is , seporated for the time being, so you'll need to remove all the , from the actual abstracts so it can be parsed correctly.

The sections that are needed the csv are as follows:

  • code
  • title
  • abstract
  • description

Configuration

Everything is configured in the config.toml file, copy it to the working directory and do something like the following:

First install AnythingLLM, here, and configure it with something along these lines of this.

Note: As of this release you will need to configure the model you want this to us via the "default" AnythingLLM configuration. It seems for now you can't programaticly change the workspace for different models, so this is the work around.

Check out testing_notes.md for some of the numbers ran with other models on the same data.

Run these following commands:

git clone [email protected]:jjasghar/pretalx-ai-validator.git
cd pretalx-ai-validator
python3.11 -m venv --upgrade-deps venv
source venv/bin/activate
pip install -r requirements.txt
cp config.toml.example config.toml
vim config.toml
python main.py -h

Utils

There is a jsons_to_markdown.py to convert the chat_primes to readable format(s).

License & Authors

If you would like to see the detailed LICENSE click here.

Copyright:: 2025- IBM, Inc

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

About

Run abstracts through a local LLM to give you some possible scores on what you are looking for.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Languages