# audio-summarize

An audio summarizer that glues together ffmpeg, whisper.cpp and BART.
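
As an illustration of the first stage, the snippet below builds an ffmpeg command that converts a media file into the format whisper.cpp documents for its input (16 kHz, mono, 16-bit PCM WAV). It is a minimal sketch, not this project's code: the helper name `build_ffmpeg_command` is hypothetical, and whether `audio-summarize.py` uses exactly these flags is an assumption.

```python
from pathlib import Path


def build_ffmpeg_command(media: Path, wav: Path) -> list[str]:
    """Build an ffmpeg call that converts a media file to 16 kHz mono 16-bit WAV.

    Hypothetical helper; the flags follow the conversion documented by whisper.cpp.
    """
    return [
        "ffmpeg",
        "-y",                 # overwrite the output file if it already exists
        "-i", str(media),     # input media file (audio or video)
        "-ar", "16000",       # resample to 16 kHz
        "-ac", "1",           # downmix to a single (mono) channel
        "-c:a", "pcm_s16le",  # encode as 16-bit signed little-endian PCM
        str(wav),
    ]


if __name__ == "__main__":
    # Only print the command here; pass the list to subprocess.run() to execute it.
    print(" ".join(build_ffmpeg_command(Path("./tmp/test.webm"), Path("./tmp/test.wav"))))
```

The resulting WAV file would then be transcribed by whisper.cpp, and the transcript summarized by BART.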

## Dependencies

- Python 3 (tested: 3.12)
- ffmpeg
- git
- make and a C/C++ compiler

## Setup

Create a virtual environment for Python and activate it:

```bash
python3 -m venv .venv
source .venv/bin/activate
```

Run `setup.sh`:

```bash
./setup.sh
```

## Run

1. Download a whisper.cpp-compatible model file (available at https://huggingface.co/ggerganov/whisper.cpp).
2. In your terminal, make sure your Python virtual environment is activated.
3. Run `audio-summarize.py`.
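
If you are unsure whether the virtual environment from step 2 is active, the interpreter can tell you. The check below is a small sketch using only the standard library; the function name `in_virtualenv` is made up for this example and is not part of the project.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the running interpreter belongs to a venv.

    Inside a venv, sys.prefix points at the environment directory while
    sys.base_prefix still points at the base Python installation.
    """
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("venv active" if in_virtualenv() else "venv NOT active")
```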

### Usage

```
audio-summarize.py -m filepath -i filepath -o filepath
                    [--summin n] [--summax n] [--segmax n]

options:
  -h, --help   show this help message and exit
  --summin n   The minimum length of a segment summary [10, min: 5]
  --summax n   The maximum length of a segment summary [90, min: 5]
  --segmax n   The maximum number of tokens per segment [375, 5 - 500]
  -m filepath  The path to a whisper.cpp-compatible model file
  -i filepath  The path to the media file
  -o filepath  Where to save the output text to
```
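
The defaults and limits listed above can be expressed with `argparse` roughly as follows. This is a sketch under assumptions, not the script's actual source: the helper `bounded_int`, the function `build_parser` and the attribute names `model`, `input` and `output` are hypothetical; only the flags, defaults and ranges are taken from the usage text.

```python
import argparse
from typing import Callable, Optional


def bounded_int(minimum: int, maximum: Optional[int] = None) -> Callable[[str], int]:
    """Return an argparse type that accepts integers in [minimum, maximum]."""
    def parse(value: str) -> int:
        number = int(value)  # a ValueError here is reported by argparse as invalid input
        if number < minimum or (maximum is not None and number > maximum):
            raise argparse.ArgumentTypeError(f"{number} is out of range")
        return number
    return parse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical reconstruction of the command line described in the usage text."""
    parser = argparse.ArgumentParser(prog="audio-summarize.py")
    parser.add_argument("--summin", metavar="n", type=bounded_int(5), default=10,
                        help="The minimum length of a segment summary")
    parser.add_argument("--summax", metavar="n", type=bounded_int(5), default=90,
                        help="The maximum length of a segment summary")
    parser.add_argument("--segmax", metavar="n", type=bounded_int(5, 500), default=375,
                        help="The maximum number of tokens per segment")
    parser.add_argument("-m", dest="model", metavar="filepath", required=True,
                        help="The path to a whisper.cpp-compatible model file")
    parser.add_argument("-i", dest="input", metavar="filepath", required=True,
                        help="The path to the media file")
    parser.add_argument("-o", dest="output", metavar="filepath", required=True,
                        help="Where to save the output text to")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```

With a parser like this, a value outside the allowed range (for example `--segmax 501`) is rejected with a usage error before any processing starts.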

Example:

```bash
./audio-summarize.py -m ./tmp/whisper_ggml-small.en-q5_1.bin -i ./tmp/test.webm -o ./tmp/output.txt
```
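
The `--segmax` option implies that the transcript is split into segments, each of which is summarized on its own. The sketch below illustrates that idea under a simplifying assumption: it counts whitespace-separated words instead of real model tokens, and the function name `split_into_segments` is hypothetical. The actual script may use a tokenizer and different splitting rules.

```python
def split_into_segments(text: str, segmax: int = 375) -> list[str]:
    """Split text into chunks of at most `segmax` whitespace-separated words.

    Words stand in for tokens here; this is an approximation for illustration.
    """
    if segmax < 1:
        raise ValueError("segmax must be at least 1")
    words = text.split()
    # Step through the word list in strides of `segmax` and rejoin each slice.
    return [" ".join(words[i:i + segmax]) for i in range(0, len(words), segmax)]


if __name__ == "__main__":
    for number, segment in enumerate(split_into_segments("one two three four five", segmax=2), 1):
        print(number, segment)
```

Each resulting segment would then be summarized separately, with `--summin` and `--summax` bounding the length of every segment summary.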