Current video editing methods rely predominantly on text-driven approaches, altering content through textual descriptions. However, these methods often struggle with semantic homogeneity and temporal inconsistency, particularly when editing video scenescapes. To address these challenges, we introduce "AudioScenic", an audio-driven framework for video scenescape editing. Unlike existing methods, AudioScenic leverages three distinctive properties of audio (semantic diversity, magnitude control, and frequency alignment) to guide the editing process while preserving foreground content. Our framework employs audio semantic embeddings to edit video scenescapes and introduces a mask blending module that restricts the audio embedding's influence to the scenescape regions of a video. We further introduce an audio magnitude-aware module that controls the strength of the editing effect, and an audio frequency fusion module that temporally aligns the edited scenescape with the audio condition, improving the temporal coherence of the synthesized results. Our approach enhances visual diversity while maintaining temporal consistency. Extensive experiments demonstrate the effectiveness of our method, showing significant improvements over existing text-driven and audio-driven models in video scenescape editing.
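No code is released in this repository, but two of the modules described above lend themselves to a compact illustration. Below is a minimal PyTorch sketch of how mask blending (confining audio-driven edits to scenescape regions) and magnitude-aware conditioning (scaling editing strength with audio loudness) could work. All function names, tensor shapes, the `alpha` gain, and the RMS-loudness proxy are assumptions made for illustration, not the authors' implementation.

```python
import torch

def mask_blend(edited: torch.Tensor,
               source: torch.Tensor,
               scene_mask: torch.Tensor) -> torch.Tensor:
    """Restrict audio-driven edits to the scenescape region.

    edited:     (B, C, T, H, W) frames/latents after audio-guided editing
    source:     (B, C, T, H, W) original frames/latents
    scene_mask: (B, 1, T, H, W) soft mask; 1 = scenescape, 0 = foreground
    """
    # Foreground pixels are copied from the source; only scenescape
    # pixels are taken from the audio-conditioned edit.
    return scene_mask * edited + (1.0 - scene_mask) * source

def magnitude_scaled_embedding(audio_embed: torch.Tensor,
                               waveform: torch.Tensor,
                               alpha: float = 1.0) -> torch.Tensor:
    """Scale the audio semantic embedding by the clip's loudness so that
    louder audio produces a stronger editing effect.

    audio_embed: (B, D) semantic embedding of the audio clip
    waveform:    (B, N) raw audio samples
    alpha:       hypothetical gain controlling how much magnitude matters
    """
    # RMS energy as a simple per-clip magnitude proxy (an assumption here).
    rms = waveform.pow(2).mean(dim=-1, keepdim=True).sqrt()  # (B, 1)
    return audio_embed * (1.0 + alpha * rms)

if __name__ == "__main__":
    b, c, t, h, w, d, n = 1, 4, 8, 32, 32, 512, 16000
    edited = torch.rand(b, c, t, h, w)
    source = torch.rand(b, c, t, h, w)
    mask = (torch.rand(b, 1, t, h, w) > 0.5).float()
    blended = mask_blend(edited, source, mask)

    embed = torch.randn(b, d)
    wave = torch.randn(b, n) * 0.1
    cond = magnitude_scaled_embedding(embed, wave, alpha=2.0)
    print(blended.shape, cond.shape)
```

In this sketch the blend is applied per frame, so the foreground is preserved verbatim from the source video, while `alpha` stands in for the editing-strength control that the abstract attributes to the magnitude-aware module.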
Repository: jiajiaxiaoskx/AudioScenic, generated from eliahuhorwitz/Academic-project-page-template.