Txtify is a free open-source web application that transcribes and translates audio from YouTube videos or uploaded media files. It now runs on Docker for easier deployment and includes monitoring capabilities. Leveraging the stable-ts
library and the whisper
models, Txtify offers enhanced transcription accuracy and performance.
Txtify is designed to simplify the process of converting audio and video content into text. Whether you're looking to transcribe a YouTube video or your own audio/video files, Txtify offers an easy-to-use interface and powerful AI models to ensure accuracy and speed. The application supports multiple output formats including .txt
, .srt
, .vtt
, and .sbv
.
Before you begin, ensure you have met the following requirements:
- Python 3.10 installed on your machine
- Docker (containerized deployment)
- An API key for DeepL if you want to enable translation (in case you need to use this tool for translation)
To install and run Txtify using Docker, follow these steps:
- Clone the repository:
git clone https://github.com/lkmeta/txtify.git
cd txtify
- Set Up Environment Variables
cp .env.example .env
Edit the .env file and add your DeepL API key for translation, and any other necessary environment variables.
- Run the Docker
docker-compose up --build -d
Note: The -d flag runs the container in detached mode.
- Stop the Docker Container
docker-compose down
If you want to use the pre-built Docker image available on Docker Hub, follow these steps:
-
Pull the Docker Imgae:
docker pull lkmeta/txtify:latest
-
Run the Docker Container
docker run -d -p 8010:8010 lkmeta/txtify:latest
Open your web browser and navigate to http://localhost:8010
to access Txtify.
To monitor the application and the transcription processes:
-
Ensure you have completed the installation steps above.
-
You can view the logs of the running Docker container to monitor the application output.
docker logs -f txtify_container
Note: The -f option follows the log output in real-time.
To understand how Txtify works, you can use the online simulation demo. Visit Txtify Website and follow the instructions to upload your media or enter a YouTube URL for a simulated transcription process.
- Basic transcription functionality
- Support for multiple output formats
- Integration with DeepL for translations
- Improved UI/UX
- Containerized the application
- Enhance performance and scalability
- Web browser Whisper option
If you encounter any issues, bugs, or have suggestions for improvements, please report them using one of the following methods:
- Contact Form: Visit our Contact Page and submit your feedback or issue.
- GitHub Issues: Open an issue on the repository's issue tracker. Please provide detailed information to help us address the problem effectively.
Your feedback is valuable and helps us improve Txtify!
Feel free to contribute by opening issues, suggesting improvements, or submitting pull requests. Your feedback is highly appreciated!
This project is licensed under Apache 2.0.