From 2de8bb00c175471ccedcd9717bbed3e5de2ca819 Mon Sep 17 00:00:00 2001 From: Phil Wang Date: Thu, 2 Feb 2023 11:13:10 -0800 Subject: [PATCH] product and project management --- README.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 5eb0621..e02845e 100644 --- a/README.md +++ b/README.md @@ -126,12 +126,12 @@ music = musiclm(['the crystalline sounds of the piano in a ballroom']) # torch.T - [x] mulan seems to be using decoupled contrastive learning, offer that as an option - [x] wrap mulan with mulan wrapper and quantize the output, project to audiolm dimensions +- [x] modify audiolm to accept conditioning embeddings, optionally take care of different dimensions through a separate projection +- [x] audiolm and mulan goes into musiclm and generate, filter with mulan -- [ ] modify audiolm to accept conditioning embeddings, optionally take care of different dimensions through a separate projection -- [ ] audiolm and mulan goes into musiclm and generate, filter with mulan +- [ ] improvise a bit and give the audio transformer a position generating module before each attention layer - [ ] add a version of mulan to open clip - [ ] set all the proper spectrogram hyperparameters -- [ ] improvise a bit and give the audio transformer a position generating module before each attention layer ## Appreciation