From 0247f2cc176c7d31549405db48ea4ca763405585 Mon Sep 17 00:00:00 2001 From: Phil Wang Date: Tue, 14 Feb 2023 09:22:04 -0800 Subject: [PATCH] product management --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index e23e9af..22e5e0e 100644 --- a/README.md +++ b/README.md @@ -145,6 +145,7 @@ music = musiclm(['the crystalline sounds of the piano in a ballroom']) # torch.T - [x] give dynamic positional bias to self attention in AST - [ ] add a version of mulan to open clip +- [ ] support variable lengthed audio with masking in audio transformer, then implement MusicLM generating multiple samples and selecting top match with MuLaN - [ ] set all the proper spectrogram hyperparameters ## Citations