MuSE is a custom music generation app for your short-form videos, based on scene analysis.
MuSE was inspired by the idea of a tool that harnesses the power of generative artificial intelligence to produce music relevant to a short-form video's scenes and ambience.
MuSE is an AI-powered app that produces relevant music for your videos based on their scenes and ambience. It automatically generates custom music for each video by analyzing its scenes and selecting an appropriate music type, instruments, and mood.
Web App Development: For the web app, we used Streamlit, an open-source Python framework for building interactive data apps. We relied on Streamlit's file uploader, text input, and session state components to build the app.
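A minimal sketch of how the Streamlit front end can be wired together; the widget labels and the generate_music_for_video helper are illustrative placeholders, not our exact code:

```python
import streamlit as st

st.title("MuSE")

# Upload widget for the short-form video and an optional text hint
video_file = st.file_uploader("Upload a short video", type=["mp4", "mov"])
user_hint = st.text_input("Optional: describe the vibe you want")

# Session state keeps the generated audio across reruns of the script
if "generated_audio" not in st.session_state:
    st.session_state.generated_audio = None

if video_file is not None and st.button("Generate music"):
    # generate_music_for_video is a hypothetical wrapper around the
    # frame extraction -> scene analysis -> prompt -> MusicGen pipeline
    st.session_state.generated_audio = generate_music_for_video(video_file, user_hint)

if st.session_state.generated_audio is not None:
    st.audio(st.session_state.generated_audio, format="audio/wav")
```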
Frame Extraction: We use OpenCV, a popular computer vision library, to process the video and extract frames from it. Six frames, sampled at regular intervals across the clip, are extracted for scene analysis.
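A rough sketch of this step with OpenCV, assuming the video is available as a file on disk (error handling omitted):

```python
import cv2

def extract_frames(video_path: str, num_frames: int = 6):
    """Grab num_frames frames sampled at regular intervals across the video."""
    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    # Evenly spaced frame indices across the whole clip
    indices = [int(i * total / num_frames) for i in range(num_frames)]
    frames = []
    for idx in indices:
        cap.set(cv2.CAP_PROP_POS_FRAMES, idx)
        ok, frame = cap.read()
        if ok:
            frames.append(frame)  # BGR numpy array
    cap.release()
    return frames
```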
Scene Analysis: For scene analysis we leverage MIT's PlacesCNN model, a high-performing model for scene recognition and deep scene feature extraction. The model takes the extracted video frames as input and outputs the ambience (indoor or outdoor), scene categories, and scene attributes.
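A simplified sketch of the scene category part of this step, assuming the ResNet18 Places365 checkpoint and category list have been downloaded from the MIT Places project; the indoor/outdoor and attribute outputs come from additional mapping files not shown here:

```python
import torch
import torchvision.models as models
from torchvision import transforms
from PIL import Image

# Places365-trained ResNet18; checkpoint keys carry a "module." prefix
model = models.resnet18(num_classes=365)
checkpoint = torch.load("resnet18_places365.pth.tar", map_location="cpu")
state_dict = {k.replace("module.", ""): v for k, v in checkpoint["state_dict"].items()}
model.load_state_dict(state_dict)
model.eval()

preprocess = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
])

# categories_places365.txt lines look like "/a/airfield 0"
categories = [line.strip().split(" ")[0][3:] for line in open("categories_places365.txt")]

def top_scene_categories(frame: Image.Image, k: int = 5):
    """Return the top-k Places365 scene categories for one frame.
    OpenCV frames would first need BGR->RGB conversion and Image.fromarray."""
    with torch.no_grad():
        logits = model(preprocess(frame).unsqueeze(0))
        probs = torch.nn.functional.softmax(logits, dim=1).squeeze(0)
    top_probs, top_idx = probs.topk(k)
    return [(categories[int(i)], float(p)) for p, i in zip(top_probs, top_idx)]
```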
Prompt Creation: After scene analysis, we use the detected scene categories and attributes to select a relevant music type, the instruments that should be present in the piece, and the intended audience impression, i.e. how the listener should feel after hearing the music. We use a clustering-based association for these choices and compose a prompt from the final music type, instruments, and impression.
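The sketch below shows the shape of this association with a small hand-written lookup table; our actual clustering logic covers far more scene categories and attributes, so the table and fallback values here are purely illustrative:

```python
# Illustrative association table: scene category -> musical attributes
SCENE_TO_MUSIC = {
    "beach":   {"genre": "tropical house", "instruments": ["steel drums", "acoustic guitar"], "impression": "relaxed"},
    "forest":  {"genre": "ambient",        "instruments": ["flute", "soft strings"],          "impression": "calm"},
    "stadium": {"genre": "electronic",     "instruments": ["synth", "drums"],                 "impression": "energetic"},
}

def build_prompt(scene_categories, ambience):
    """Pick attributes for the first matching scene and compose a MusicGen prompt."""
    match = next((SCENE_TO_MUSIC[c] for c in scene_categories if c in SCENE_TO_MUSIC),
                 {"genre": "lo-fi", "instruments": ["piano"], "impression": "neutral"})
    return (f"{match['genre']} track featuring {', '.join(match['instruments'])}, "
            f"for an {ambience} scene, making the listener feel {match['impression']}")
```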
Music Generation: The prompt is passed to Meta's MusicGen, a text-to-music model, for music generation. We use the pre-trained musicgen-small checkpoint to generate the audio.
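A minimal sketch of calling the pre-trained musicgen-small checkpoint through the Hugging Face transformers API, one common way to run MusicGen; the prompt string here is just an example output of the previous step:

```python
import scipy.io.wavfile
from transformers import AutoProcessor, MusicgenForConditionalGeneration

processor = AutoProcessor.from_pretrained("facebook/musicgen-small")
model = MusicgenForConditionalGeneration.from_pretrained("facebook/musicgen-small")

prompt = "ambient track featuring flute, soft strings, for an outdoor scene, making the listener feel calm"
inputs = processor(text=[prompt], padding=True, return_tensors="pt")

# ~256 new tokens corresponds to roughly five seconds of audio for musicgen-small
audio = model.generate(**inputs, max_new_tokens=256)

sampling_rate = model.config.audio_encoder.sampling_rate
scipy.io.wavfile.write("generated.wav", rate=sampling_rate, data=audio[0, 0].numpy())
```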
We faced challenges designing an association algorithm that could reliably map scene categories and attributes to a relevant music type, instruments, and audience impression. We also struggled to deploy the demo app on platforms like Streamlit Cloud, Heroku, and PythonAnywhere because of the memory-intensive operations in the backend.
We successfully integrated the various components of the app, namely frame extraction, scene analysis, prompt creation, and music generation, into a working end-to-end workflow, which was edifying.
As computer science graduates, we learned a lot and gained deep insight into scene recognition and music generation. We also learned how to integrate complex AI models into a cohesive application.
Going forward, we can develop a more advanced clustering algorithm for better association of musical attributes, which in turn would produce better prompts. We can leverage natural language processing to produce more effective prompts. We can also adopt a more robust music generation model for higher-quality audio.