✨ Winner of RevolutionUC 2024 Best Implementation of an LLM or Other Generative AI Model (CincyTech) and Best Product Market Fit (Center for Entrepreneurship) ✨
Deploy the webapp on your local machine using:
conda env create -f envinronment.yml
python3 app.py
The server should be up and running on localhost address 127.0.0.1:5000. Open this url in a browser to access the web application.
Cat Luong: Spearheaded AI-powered backend architectural design. Performed prompt engineering on Gemini Pro. Integrated Google Gemini Pro, Meta Audiocraft, and AWS Polly into a pipeline that accepts story text and returns an audiobook.
Sparsh Priyadarshi: Scoping study for generative AI based audio synthesis and AWS Polly. Defined Product-Market fit for the MVP. Integrated UI and audiobook generation pipeline, contributed to business logic for audio processing, design and documentation.
Vivek Mehra: Designed & developed the UI frontend & backend. Contributed to application backend & audio pipeline integration debugging, contributed to system architecture design discussions.
Ziddi Mohammad: Integrated Tembo's PostgreSQL into the system and developed a couple of APIs to store and retrieve generated audios data based on userIds and contributed to system design.
@inproceedings{copet2023simple, title={Simple and Controllable Music Generation}, author={Jade Copet and Felix Kreuk and Itai Gat and Tal Remez and David Kant and Gabriel Synnaeve and Yossi Adi and Alexandre Défossez}, booktitle={Thirty-seventh Conference on Neural Information Processing Systems}, year={2023}, }