Skip to content

Open Source Version Of Generating BrainRot Videos from reddit url

License

Notifications You must be signed in to change notification settings

harvestingmoon/OBrainRot

Repository files navigation

Open Brain Rot

Logo <Created with Imagen 3>

Updates:

Thank you everyone for the support so far albeit the r/learnmachinelearning && r/machinelearning community! In short here are some future updates I would be doing overtime

  1. Readjusting the text and font ✅
  2. Adding k-ranking sentiments using VADER ✅
  3. Adding parallel compute (Very much in the future)
  4. Add support for generative image and video in the future
  5. Create CLI interface or fully functional website
  6. Update readme for LLM scraping feature!

New Updates!!:

I have made a series of new updates in particular:

  1. ** I integrated a feature that allows an LLM of your choice to choose the thread for you using VADAR and sentiment analysis (The LLM used is llama-3.3 70b-versatile via Groq but I do intend on adding llama cpp support in the future!) VADAR is used to cut down on the threads due to context window for llama 3.3

  2. I have fixed the text so now texts are reproduced in the middle instead!

  3. Fixed certain bugs whereby in pre-processing stages it is unable to read file due to encoding issues

So i got extremely bored over the holidays and decided to just make a fun project to see if its possible to automate the kind of content i was seeing on TikTok

A simple brain rot generator just by inserting your reddit url

Example

Watch the video

How it works

High Level Diagram

Diagram

Scraping

Simple webscraping using Reddit's open source API, to collect the title and story based off the reddit website

Voice Translation:

Using Coqui's xTTSv2 (which is super lightweight , portable and accurate), I converted the text into audio. Coqui's TTS audio also allows you to use sample audios, so I used the common-man's TikTok audio.

Pre-processing:

Removal of certain punctuations, special characters via RegEx before we carry out force alignment.

On Force Alignment:

The most important step to generate the video was the alignment between audio and text in order to get the subtitle. This was achieved using forced alignment. In this, we used wav2vec2 and base it all on Motu Hira's tutorial on Forced alignment with Wav2Vec2. It uses a frame-wise label probality from the audio (that is the voice that we generated), creates a trellis matrix representing the probability of labels aligned per time step before using the most likely path from the trellis matrix.

ffpmeg Magic:

Once we got the audio, video sample as well as the timestamp text (which is in .ass format btw), we can then generate the video using some simple ffmpeg magic. This subprocess can be viewed under

video_generator.py

How to run it

There are mainly 6 important scripts within this, each deliberately separated so that it can be easier to include any upgrades in the future (and what not)

Pre-requisites:

  1. PyTorch (with CUDA 12.4)
  2. Coqui TTS (Link)
  3. FFMpeg (Link)

Afterwards, just run the script

python main.py

and you are good to go!

Others:

Why is there no requirements.txt?

I installed the script on my personal venv so if i were to make a pip it would be very messy (and I am too lazy to do that) ~ in general just follow the instructs on how to run it above and it should be good

Are there any future updates?

So far , yes there are but it is to hopefully create website or gradio to make this more user friendly, and to hopefully create more brain rot videos in the future (i am looking at OpenSora but no plans as of now)

Thanks:

I would like to thank Motu Hira for creating this tutorial on Forced Alignment using Wav2Vec2. Without this, the subtitles would not be able to work (the original plan was to use CMUSphinx but the lack of community support made it difficult for me to work with)

Here is the original Tutorial if anyone is interested:

Link

About

Open Source Version Of Generating BrainRot Videos from reddit url

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages