text2vid

Exploration of text to video automated content creation

The goal of this project was to create automated video creation flow.

Example output

showcase0001-0510.mp4

Video Creation Pipeline

-> Script Creation from GPT-4
-> Script to TTS
-> Audio to timestamped subtitles
-> Audio to 3D facial animation
-> 3D facial animation to video via Blender

This project was meant to explore the feasability of such a tool. It is not meant to be a production ready tool. More work could've been put into making the videos feel more natural but I decided to move on to different projects.

Models used

TTS model used: https://github.com/neonbjb/tortoise-tts
3d facial animation model used: https://github.com/Doubiiu/CodeTalker
Frame interpolation model used: https://github.com/megvii-research/ECCV2022-RIFE
Audio to Timestamp Subtitle model: https://github.com/linto-ai/whisper-timestamped

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
news		news
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
create_video.py		create_video.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

text2vid

Example output

Video Creation Pipeline

Models used

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

text2vid

Example output

Video Creation Pipeline

Models used

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages