The bash script (run_pipeline.sh, sketched below) does the following:
- Converts each audio file to 16 kHz mono WAV
- Runs whisper.cpp to generate a plain-text transcript
- Runs pyannote-whisper to add speaker labels to each segment
For every audio/<name>.wav (or .WAV) you end up with:
- audio/<name>.txt (plain transcript)
- audio/<name>_diarized.txt (speaker-labeled transcript)
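The repository's run_pipeline.sh is the authoritative implementation; the loop below is only a minimal sketch of that flow. The ffmpeg flags, the whisper.cpp binary and model paths, and the language-code argument are assumptions based on the setup steps that follow, and the pyannote-whisper call is left as a placeholder rather than a real invocation.

```bash
#!/bin/bash
# Minimal sketch of the per-file loop in run_pipeline.sh (not the actual script).
LANG_CODE="${1:-en}"   # language code argument, e.g. en (assumed)

for input in audio/*.wav audio/*.WAV; do
  [ -e "$input" ] || continue
  base="${input%.*}"

  # 1. Convert to 16 kHz mono 16-bit PCM WAV, the format whisper.cpp expects
  ffmpeg -y -i "$input" -ar 16000 -ac 1 -c:a pcm_s16le "${base}_16k.wav"

  # 2. Transcribe with whisper.cpp; -otxt/-of write audio/<name>.txt
  #    (older whisper.cpp builds name the binary "main", newer ones "whisper-cli")
  ./whisper.cpp/main -m whisper.cpp/models/ggml-large-v1.bin \
    -l "$LANG_CODE" -f "${base}_16k.wav" -otxt -of "$base"

  # 3. Run pyannote-whisper on the same file to produce audio/<name>_diarized.txt
  #    (invocation omitted here; see the real run_pipeline.sh)
done
```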
- Clone the repo:

```bash
#!/bin/bash
git clone https://github.com/your-org/whisper-transcription.git
cd whisper-transcription
```
- Build whisper.cpp and download the large-v1 model:

```bash
#!/bin/bash
cd whisper.cpp
make
./models/download-ggml-model.sh large-v1
cd ..
```
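Optionally, you can sanity-check the build on the sample clip that ships with whisper.cpp. The binary name and paths below are assumptions about a stock whisper.cpp checkout, not part of the original steps:

```bash
# Optional: transcribe the bundled sample to confirm the build and model download worked
./whisper.cpp/main -m whisper.cpp/models/ggml-large-v1.bin -f whisper.cpp/samples/jfk.wav
```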
- Install the Python dependencies:

```bash
#!/bin/bash
python3 -m venv venv
source venv/bin/activate
pip install pywhispercpp pyannote-audio pyannote-whisper
```
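As a quick optional check that the environment resolved correctly (not part of the original steps):

```bash
# Should print "ok" if the core packages import cleanly inside the venv
python -c "import pywhispercpp, pyannote.audio; print('ok')"
```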
- Copy your audio files (.wav or .WAV) into the audio directory, e.g. with cp as shown below.
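The source path here is only a placeholder for wherever your recordings live:

```bash
# Copy your recordings into the repo's audio directory (source path is a placeholder)
cp /path/to/recordings/*.wav audio/
```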
- Run the pipeline:

```bash
#!/bin/bash
chmod +x run_pipeline.sh
./run_pipeline.sh en
```
Each WAV file under the audio directory should now have a corresponding .txt transcript and a _diarized.txt file next to it.
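For example, for a hypothetical recording named meeting.wav:

```bash
ls audio/
# meeting.wav
# meeting.txt             <- plain transcript from whisper.cpp
# meeting_diarized.txt    <- speaker-labeled transcript from pyannote-whisper
```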