Archived

This repository was archived on 2025-09-28. You can view files and clone it, but you cannot make any changes to its state, such as pushing or creating new issues, pull requests or comments.

ChaoticByte a480fdcd34

Documentation of dependencies, setup and usage in the README

2024-08-13 21:17:07 +02:00


audio-summarize

An audio summarizer that glues together ffmpeg, whisper.cpp and BART.
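As an illustration of the first stage of such a pipeline, here is a minimal sketch of how a media file could be converted into the 16 kHz, mono, 16-bit WAV input that whisper.cpp expects. The helper name `build_ffmpeg_command` is hypothetical and not taken from this repository; the actual script may call ffmpeg differently.

```python
def build_ffmpeg_command(media_path: str, wav_path: str) -> list[str]:
    """Build an ffmpeg call that converts a media file to 16 kHz mono 16-bit PCM WAV.

    Hypothetical helper for illustration only; not the repository's own code.
    """
    return [
        "ffmpeg",
        "-y",                 # overwrite the output file if it already exists
        "-i", media_path,     # input media file (audio or video)
        "-vn",                # ignore any video stream
        "-ar", "16000",       # resample audio to 16 kHz
        "-ac", "1",           # downmix to a single (mono) channel
        "-c:a", "pcm_s16le",  # encode as 16-bit little-endian PCM
        wav_path,
    ]
```

The returned list can be passed to `subprocess.run(cmd, check=True)`, which avoids shell quoting issues with unusual file names.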

Dependencies

Python 3 (tested: 3.12)
ffmpeg
git
make and a C/C++ compiler

Setup

Create a virtual environment for Python and activate it:

python3 -m venv .venv
source .venv/bin/activate

Run setup.sh:

./setup.sh

Run

You need a whisper.cpp-compatible model file (available at https://huggingface.co/ggerganov/whisper.cpp).
In your terminal, make sure your Python venv is activated.
Run audio-summarize.py.

Usage

audio-summarize.py -m filepath -i filepath -o filepath
                    [--summin n] [--summax n] [--segmax n]

options:
  -h, --help   show this help message and exit
  --summin n   The minimum length of a segment summary [10]
  --summax n   The maximum length of a segment summary [90]
  --segmax n   The maximum number of tokens per segment [375, max: 500]
  -m filepath  The path to a whisper.cpp-compatible model file
  -i filepath  The path to the media file
  -o filepath  Where to save the output text to
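The help text above could be produced by an argparse setup roughly like the following. This is a sketch reconstructed from the usage output only: the `dest` names (`model`, `input`, `output`), the function names and the range check on `--segmax` are assumptions, not the script's actual code.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Recreate the documented command-line interface (illustrative sketch)."""
    parser = argparse.ArgumentParser(prog="audio-summarize.py")
    parser.add_argument("--summin", metavar="n", type=int, default=10,
                        help="The minimum length of a segment summary [10]")
    parser.add_argument("--summax", metavar="n", type=int, default=90,
                        help="The maximum length of a segment summary [90]")
    parser.add_argument("--segmax", metavar="n", type=int, default=375,
                        help="The maximum number of tokens per segment [375, max: 500]")
    # The usage line shows -m, -i and -o without brackets, so they are required.
    parser.add_argument("-m", dest="model", metavar="filepath", required=True,
                        help="The path to a whisper.cpp-compatible model file")
    parser.add_argument("-i", dest="input", metavar="filepath", required=True,
                        help="The path to the media file")
    parser.add_argument("-o", dest="output", metavar="filepath", required=True,
                        help="Where to save the output text to")
    return parser


def parse_args(argv=None) -> argparse.Namespace:
    """Parse arguments and enforce the documented upper bound for --segmax."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if not 0 < args.segmax <= 500:  # assumed check, based on "max: 500"
        parser.error("--segmax must be between 1 and 500")
    return args
```

Omitting the optional flags yields the defaults shown in square brackets in the help text.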

Example:

./audio-summarize.py -m ./tmp/whisper_ggml-small.en-q5_1.bin -i ./tmp/test.webm -o ./tmp/output.txt
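The --segmax option limits how many tokens go into each segment before it is summarized. One simple way to do such a split is sketched below. It uses whitespace-separated words as a stand-in for real model tokens, which is an assumption made for illustration; the function name `split_into_segments` is hypothetical, and the script itself may count tokens and choose segment boundaries differently.

```python
def split_into_segments(text: str, segmax: int = 375) -> list[str]:
    """Split text into consecutive segments of at most `segmax` words.

    Words approximate tokens here; a real tokenizer would usually produce
    more tokens than there are words.
    """
    if segmax < 1:
        raise ValueError("segmax must be at least 1")
    words = text.split()
    # Step through the word list in chunks of `segmax` words.
    return [
        " ".join(words[start:start + segmax])
        for start in range(0, len(words), segmax)
    ]
```

Each resulting segment would then be summarized separately, with --summin and --summax bounding the length of each segment summary.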
