Video to Text Converter

This project converts audio from MP4 video files to text using OpenAI's Whisper speech recognition model. It can process a single video file or recursively process all MP4 files in a directory and its subdirectories.

Features

Extracts audio from MP4 video files
Converts audio to text using OpenAI's Whisper model
Supports different Whisper model sizes for balancing speed and accuracy
Processes multiple video files recursively in a directory
Displays progress bar during processing
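
The core transcription step might look roughly like the sketch below. It is not taken from the project's script: the helper names load_whisper_model and transcribe_video are hypothetical, and it assumes the openai-whisper package and ffmpeg are installed. Whisper decodes the audio track through ffmpeg, so this sketch passes the MP4 path straight to transcribe(); the actual script may extract the audio as a separate step.

```python
def load_whisper_model(model_size="base"):
    """Load a Whisper model by size name (hypothetical helper).

    The import is kept inside the function so the module can be
    imported even when openai-whisper is not installed.
    """
    import whisper  # provided by the openai-whisper package

    return whisper.load_model(model_size)


def transcribe_video(model, video_path):
    """Return the transcript of one video file as plain text.

    `model` is any object with a Whisper-style transcribe() method
    that returns a dict containing a "text" entry.
    """
    result = model.transcribe(str(video_path))
    return result["text"].strip()
```

Loading the model once and reusing it for every file avoids paying the model start-up cost per video, for example: model = load_whisper_model("base") followed by transcribe_video(model, "talk.mp4").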

Installation

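Clone the repository:
    git clone https://github.com/larsdpeder/mp4-to-text.git
Navigate to the project directory:
    cd mp4-to-text
Install the required dependencies:
    pip install -r requirements.txt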

Usage

Open the mp4-to-text.py file and set the following variables:

INPUT_DIR: Path to the directory containing the MP4 video files.
OUTPUT_DIR: Path to the directory where the extracted text files will be saved.
MODEL_SIZE: Whisper model size to use ('tiny', 'base', 'small', 'medium', 'large').
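
As an illustration, the three settings could be defined and sanity-checked as in the sketch below. The example values and the check_settings helper are assumptions made for this example, not code from the script.

```python
from pathlib import Path

# Example values only; point these at your own folders.
INPUT_DIR = "videos"
OUTPUT_DIR = "transcripts"
MODEL_SIZE = "base"

# The model sizes listed in this README.
VALID_MODEL_SIZES = ("tiny", "base", "small", "medium", "large")


def check_settings(input_dir, output_dir, model_size):
    """Validate the settings and return (input_path, output_path).

    Hypothetical helper: raises early on a bad model size or a missing
    input directory, and creates the output directory if needed.
    """
    if model_size not in VALID_MODEL_SIZES:
        raise ValueError(
            f"MODEL_SIZE must be one of {VALID_MODEL_SIZES}, got {model_size!r}"
        )
    input_path = Path(input_dir)
    if not input_path.is_dir():
        raise FileNotFoundError(f"INPUT_DIR does not exist: {input_path}")
    output_path = Path(output_dir)
    output_path.mkdir(parents=True, exist_ok=True)
    return input_path, output_path
```

Checking the settings before loading a model means a typo in MODEL_SIZE fails immediately rather than after a long download.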

Run the script:
    python mp4-to-text.py

The script will process all the MP4 files in the specified input directory and its subdirectories. The extracted text will be saved as individual text files in the specified output directory.
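
The recursive processing described above can be sketched with pathlib. The function names here are hypothetical, and the choice to mirror the input subdirectory layout inside the output directory is an assumption; the actual script may instead write all text files into a single folder. The transcribe argument is any callable that takes a video path and returns its text.

```python
from pathlib import Path


def find_mp4_files(input_dir):
    """Return every .mp4 file under input_dir, including subdirectories."""
    return sorted(
        p
        for p in Path(input_dir).rglob("*")
        if p.is_file() and p.suffix.lower() == ".mp4"
    )


def output_path_for(video_path, input_dir, output_dir):
    """Map <input_dir>/a/b.mp4 to <output_dir>/a/b.txt (assumed layout)."""
    relative = Path(video_path).relative_to(input_dir)
    return Path(output_dir) / relative.with_suffix(".txt")


def process_directory(input_dir, output_dir, transcribe):
    """Transcribe each video and write one text file per video.

    Returns the list of text files that were written.
    """
    written = []
    for video in find_mp4_files(input_dir):
        target = output_path_for(video, input_dir, output_dir)
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(transcribe(video), encoding="utf-8")
        written.append(target)
    return written
```

Passing the transcription function in as an argument keeps the directory walk independent of Whisper, so the file handling can be tried out with a stand-in function before running a real model.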

License

This project is licensed under the MIT License.
