# Whisper Transcription API

This project implements a high-performance audio transcription API using Rust, Python, and the Whisper AI model. It combines the efficiency of Rust for the web server with the power of Python's machine learning ecosystem.
## Components

- Rust Web Server (`src/main.rs`)
- Python Whisper Integration (`python/whisper_ffi.py`; sketched below)
- Benchmark Script (`benchmark_transcription.py`)
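The contents of `python/whisper_ffi.py` are not shown here, but a minimal sketch of the shape such a module might take, assuming the openai-whisper package, 16 kHz mono 16-bit PCM input, and a `transcribe_audio` entry point (all assumptions, not confirmed by this repository):

```python
# python/whisper_ffi.py -- hypothetical sketch, not the actual module
import numpy as np
import whisper

# Load the model once at import time so each request skips the load cost.
_model = whisper.load_model("base")

def transcribe_audio(audio_bytes: bytes) -> dict:
    """Decode raw 16-bit PCM audio and run Whisper on it.

    Assumes 16 kHz mono PCM; the real module may expect a different format.
    """
    # Convert raw bytes to the float32 waveform in [-1, 1] that Whisper expects.
    audio = np.frombuffer(audio_bytes, dtype=np.int16).astype(np.float32) / 32768.0
    result = _model.transcribe(audio)
    return {
        "transcription": result["text"],
        "language": result["language"],
    }
```

Note that the `confidence` field in the API response below is not produced directly by Whisper; it would have to be derived separately (for example, from per-segment log-probabilities), so it is omitted from this sketch.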
## Prerequisites

- Rust (latest stable version)
- Python 3.8+
- uv (Python package manager)
## Installation

- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/whisper-transcription-api.git
  cd whisper-transcription-api
  ```

- Install Rust dependencies:

  ```bash
  cargo build
  ```

- Set up the Python environment:

  ```bash
  uv venv
  source .venv/bin/activate  # On Windows use: .venv\Scripts\activate
  uv pip install -r requirements.txt
  ```
## Usage

Start the Rust server:

```bash
cargo run
```

The server will start on `http://localhost:3000`.
Send a POST request to `http://localhost:3000/transcribe` with the following JSON body:

```json
{
  "audio_data": [<raw audio bytes as a list of integers>]
}
```
The API will respond with:

```json
{
  "transcription": "Transcribed text",
  "language": "Detected language",
  "confidence": 0.98
}
```
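A request can be issued from Python like this (a sketch using the `requests` library; the endpoint and payload shape come from the schema above, while the file path is a placeholder and the exact byte format the server expects, headerless PCM versus a full audio file, is not specified here):

```python
import requests

# Read audio bytes from a local file (placeholder path).
with open("test_audio.wav", "rb") as f:
    audio_bytes = f.read()

response = requests.post(
    "http://localhost:3000/transcribe",
    json={"audio_data": list(audio_bytes)},  # bytes as a list of integers, per the schema above
    timeout=120,  # transcription can be slow, especially on CPU
)
response.raise_for_status()

result = response.json()
print(result["transcription"])
print(result["language"], result["confidence"])
```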
## Benchmarking

To benchmark the API performance:

- Update the `audio_file_path` in `benchmark_transcription.py` to point to your test audio file.
- Run the benchmark script:

  ```bash
  python benchmark_transcription.py
  ```

This will output performance statistics for both sequential and concurrent requests.
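The script's internals are not reproduced here, but a sequential-versus-concurrent comparison along these lines would produce that kind of statistic (a sketch, not the actual contents of `benchmark_transcription.py`; the file path and request count are placeholders):

```python
import time
from concurrent.futures import ThreadPoolExecutor

import requests

URL = "http://localhost:3000/transcribe"

def timed_request(payload: dict) -> float:
    """Send one transcription request and return its latency in seconds."""
    start = time.perf_counter()
    requests.post(URL, json=payload, timeout=120).raise_for_status()
    return time.perf_counter() - start

with open("test_audio.wav", "rb") as f:  # placeholder path
    payload = {"audio_data": list(f.read())}

# Sequential: requests issued one at a time.
seq_times = [timed_request(payload) for _ in range(5)]

# Concurrent: the same number of requests in flight at once.
with ThreadPoolExecutor(max_workers=5) as pool:
    conc_times = list(pool.map(timed_request, [payload] * 5))

print(f"sequential avg latency: {sum(seq_times) / len(seq_times):.2f}s")
print(f"concurrent avg latency: {sum(conc_times) / len(conc_times):.2f}s")
```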
## Project Structure

```
whisper-transcription-api/
├── src/
│   └── main.rs
├── python/
│   └── whisper_ffi.py
├── Cargo.toml
├── pyproject.toml
├── requirements.txt
├── benchmark_transcription.py
└── README.md
```
## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
## License

This project is licensed under the MIT License; see the LICENSE file for details.