Skip to content

Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.

License

Notifications You must be signed in to change notification settings

SwagLyrics/autosynch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

autosynch

Build Status codecov

Check out my blog on my progress and process throughout GSoC 2019!

about

Given an audio file of some recognizable song, autosynch will try to align its lyrics to their temporal location in the song. The song lyrics must be available on Genius.

This project is still in its early stages and is inaccurate in many cases. Optimization is a work in progress, but feel free to try it out, modify it, or contribute!

Developed during Google Summer of Code 2019 with CCExtractor.

installation

To install, do the following:

git clone
cd autosynch
pip install -r requirements.txt

Note: autosynch is supported only on Python 3.6+.

dependencies

Using autosynch requires a trained model for vocal isolation as well as PortAudio. For mp3 support, SoX is required. On MacOS/Linux, get everything by executing:

chmod a+x setup.sh
./setup.sh

download model

If you would like to download the weights manually or get a different version, check here:

  • DOI for weights trained on MedleyDB V1
  • DOI for weights trained on MedleyDB V2

Weights must be placed into autosynch/mad_twinnet/outputs/states.

install portaudio

On Mac:

brew install portaudio

On Linux:

sudo apt-get update
sudo apt-get install portaudio19-dev

install sox

Note: Installing SoX is optional and only required for processing mp3 files.

On Mac:

brew install ffmpeg

On Linux:

sudo apt install ffmpeg

usage

To play a song with its lyrics displayed at its calculated position:

python autosynch/playback.py [audio_file.wav] [artist] [song_title]

It will take a few minutes to perform the alignment process. To save the alignment data to eliminate processing time in future plays of the same audio, add the flag -s SAVE_DIR, where SAVE_DIR is the directory you want to save the alignment data.

If you have already generated and saved an alignment data file:

python autosynch/playback.py [audio_file.wav] -f [align_file.yml]

If you would like to process an mp3 file, see this section. Running with an mp3 will automatically generate a wav file in the same directory.

Note: If you did not use setup.sh, first make sure you set your Python environment correctly with export PYTHONPATH=$PYTHONPATH:./ from the outer autosynch directory.

demos

Bruno Mars - Finesse

Finesse demo

(https://www.youtube.com/watch?v=csBDM14ssts)

The last chorus lags behind a bit, but for the most part sections and lines are nicely aligned.

Fun. - We Are Young

We Are Young demo

(https://www.youtube.com/watch?v=Z-yTGKd3ji8)

The instrumental at the beginning throws off the first verse, but everything catches up in by line 4.

references

About

Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published