Adding draft extension for host-provided scratch memory #423

jatinchowdhury18 · 2024-10-15T01:42:49Z

No description provided.


…-audio#422)
* Add a description of the expectation of request_callback timing
  Without making a requirement, indicate the intent of the timing.
* Add an apostrophe
* Add host can starve feedback from alex
* more review feedback
* notjusthosts

abique · 2024-10-15T08:25:21Z

Hi @jatinchowdhury18

I have some questions about the interface.

Is this scratch memory thread-local?
Can you get the scratch memory pointer from the thread pool? If so, is it shared amongst all the tasks, or will each one use the current thread-local one?


defiantnerd · 2024-10-15T18:28:54Z

> Hi @jatinchowdhury18
>
> I have some questions about the interface.
>
> Is this scratch memory thread-local?
>
> Can you get the scratch memory pointer from the thread pool? If so, is it shared amongst all the tasks, or will each one use the current thread-local one?

The only constraint is that the memory is valid for a plugin during its process call, like the event queue or the audio buffers. No further lifetime guarantees are given.

abique · 2024-10-15T18:37:26Z

> > Hi @jatinchowdhury18
> > I have some questions about the interface.
> >
> > Is this scratch memory thread-local?
> >
> > Can you get the scratch memory pointer from the thread pool? If so, is it shared amongst all the tasks, or will each one use the current thread-local one?
>
> The only constraint is that the memory is valid for a plugin during its process call, like the event queue or the audio buffers. No further lifetime guarantees are given.

Then consider this:

- each voice needs 10K of scratch buffer
- you process 32 voices in parallel

If you have a single scratch buffer for process(), you can't process your voices in parallel, or you need 32 * 10K.
If you have a thread-local scratch buffer that you can query from the thread pool, then you can process your voices in parallel with a scratch buffer of 10K per thread.

I believe this needs to be clarified in the spec.
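To make the two sizes concrete, here is a minimal arithmetic sketch. The function names are hypothetical, and "10K" is assumed to mean 10 KiB; only the figures (10K per voice, 32 voices) come from the comment above.

```cpp
#include <cassert>
#include <cstdint>

// Figures from the comment above; "10K" is assumed here to mean 10 KiB.
constexpr std::uint64_t kScratchPerVoice = 10 * 1024;
constexpr std::uint64_t kNumVoices = 32;

// One buffer for the whole process() call: voices running in parallel
// each need their own slice, so the request grows with the voice count.
std::uint64_t shared_buffer_request(std::uint64_t per_voice, std::uint64_t voices) {
    assert(voices > 0);
    return per_voice * voices;
}

// One buffer per worker thread: a thread runs one voice at a time and
// reuses its buffer, so the request is independent of the voice count.
std::uint64_t thread_local_request(std::uint64_t per_voice) {
    return per_voice;
}
```

With these figures the shared-buffer request is 327,680 bytes, against 10,240 bytes for each thread-local buffer.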

defiantnerd · 2024-10-15T19:20:53Z

> Then consider this:
>
> - each voice needs 10K of scratch buffer
> - you process 32 voices in parallel
>
> If you have a single scratch buffer for process(), you can't process your voices in parallel, or you need 32 * 10K. If you have a thread-local scratch buffer that you can query from the thread pool, then you can process your voices in parallel with a scratch buffer of 10K per thread.
>
> I believe this needs to be clarified in the spec.

You can only request/register one buffer per instance, with a predefined size, during activation. The return value of request_size tells you whether the host will let you access that buffer.

In the process call you just call auto *scratchmem = ext_scratch_memory->access(host); and if scratchmem is not null you can use it for the duration of the process call.

But there is only one buffer, and its size does not change as long as the plugin is active.
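The flow described above (reserve once during activation, fetch the pointer inside process, null-check before use) can be sketched with a mock host. MockHost, MockPlugin, and the 1 MiB limit are hypothetical stand-ins for illustration only; they are not the CLAP header or any host's actual policy.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the host side of the draft extension.
struct MockHost {
    std::uint32_t max_bytes = 1u << 20;  // assumed host-imposed limit
    std::vector<unsigned char> buffer;   // allocated outside the audio callback
    bool reserved = false;

    // Called by the plugin while it is being activated.
    // Returns false if the host refuses the requested size.
    bool request_size(std::uint32_t bytes) {
        reserved = bytes <= max_bytes;
        if (reserved) buffer.assign(bytes, 0);
        else buffer.clear();
        return reserved;
    }

    // Called by the plugin inside process(); null when nothing is reserved.
    void* access() { return reserved ? buffer.data() : nullptr; }
};

struct MockPlugin {
    MockHost* host;
    std::uint32_t needed_bytes;
    bool have_scratch = false;

    void activate() {
        assert(host != nullptr);
        have_scratch = host->request_size(needed_bytes);
    }

    // Returns true if host scratch memory was used for this block.
    bool process() {
        void* scratch = host->access();
        if (scratch == nullptr) return false;  // fall back to plugin-owned memory
        static_cast<unsigned char*>(scratch)[0] = 1;  // only valid during this call
        return true;
    }
};
```

The pointer is fetched again on every process call rather than cached, since no lifetime guarantee beyond the call is given.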

baconpaul · 2024-10-15T20:40:56Z

> > Then consider this:
> >
> > - each voice needs 10K of scratch buffer
> > - you process 32 voices in parallel
> >
> > If you have a single scratch buffer for process(), you can't process your voices in parallel, or you need 32 * 10K. If you have a thread-local scratch buffer that you can query from the thread pool, then you can process your voices in parallel with a scratch buffer of 10K per thread.
> > I believe this needs to be clarified in the spec.
>
> You can only request/register one buffer per instance, with a predefined size, during activation. The return value of request_size tells you whether the host will let you access that buffer.
>
> In the process call you just call auto *scratchmem = ext_scratch_memory->access(host); and if scratchmem is not null you can use it for the duration of the process call.
>
> But there is only one buffer, and its size does not change as long as the plugin is active.

I think Alex's point is that if you use the thread pool extension to schedule jobs, those jobs will effectively run in parallel, under process.

Can a thread-pool extension job access the scratch memory? If so, is it distinct per thread, or is it a single memory location?

My guess is: the thread pool and scratch memory extensions need some careful co-consideration. And the patterns where people use the scratch memory will, in those cases, also require scratch-per-thread-voice, not scratch-per-process-block.

jatinchowdhury18 · 2024-10-15T21:37:46Z

Yeah, thanks for bringing this up, I hadn't considered the inter-operation of scratch-memory and the thread-pool extension.

I think the simplest solution would be to have the plugin request the total amount of scratch memory that it needs across all possible threads. Since thread_pool.request_exec is blocking, I think it should be safe for the thread pool jobs to use the scratch memory, provided the plugin makes sure that each thread is using an independent "chunk" of the memory. So a simple example would look like:

struct My_Plugin
{
    size_t scratch_mem_per_voice = 10'000;
    size_t num_voices = 32;
    char* scratch_memory_data = nullptr;

    void activate(...) {
        // Reserve enough scratch memory for every voice at once
        scratch_memory_ext.pre_reserve(host, scratch_mem_per_voice * num_voices);
    }

    void process(...) {
        // Get all the scratch memory here (access() returns void*, so cast it)
        scratch_memory_data = static_cast<char*>(scratch_memory_ext.access(host));
        // request_exec blocks until every task has run
        thread_pool_ext.request_exec(host, num_voices);
    }

    void thread_pool_callback(uint32_t task_index) {
        // Get a partition of the scratch memory for this voice to use
        char* this_voice_scratch_mem = scratch_memory_data + scratch_mem_per_voice * task_index;
        // do the actual DSP work here...
    }
};

However, I can see a few reasons why this might not be an ideal solution... For example, if the host thread pool only has 8 threads, then reserving enough scratch memory for 32 voices to be processed in parallel is a bit wasteful. Maybe the best solution is to have two scratch memory mechanisms... one scratch buffer that is intended to be accessed during the process() callback, and another to be accessed during the thread_pool_exec() callback?

typedef struct clap_host_scratch_memory {
    bool(CLAP_ABI *pre_reserve_process_scratch)(const clap_host_t *host, size_t scratch_size_bytes);
    void*(CLAP_ABI *access_process_scratch)(const clap_host_t *host);

    bool(CLAP_ABI *pre_reserve_thread_pool_scratch)(const clap_host_t *host, size_t scratch_size_per_thread_bytes);
    void*(CLAP_ABI *access_thread_pool_scratch)(const clap_host_t *host, uint32_t task_index);
} clap_host_scratch_memory_t;

All that said, I don't have much experience with the thread-pool extension, so I'll defer to your more informed opinions :).
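For the first approach in this comment (one large block split into per-voice chunks), the offset computation can be isolated into a small, bounds-checked helper. This is only a sketch: chunk_for_task is a hypothetical name, and the choice to return null for an out-of-range task index is an assumption, not something the draft specifies.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Returns the slice of a shared scratch block that belongs to one task,
// or nullptr if the block is missing or too small for that task index.
char* chunk_for_task(char* base, std::size_t total_bytes,
                     std::size_t bytes_per_task, std::uint32_t task_index) {
    assert(bytes_per_task > 0);
    if (base == nullptr) return nullptr;
    // Number of whole chunks that fit in the reserved block.
    const std::size_t num_chunks = total_bytes / bytes_per_task;
    if (task_index >= num_chunks) return nullptr;
    return base + bytes_per_task * task_index;
}
```

Each task index maps to a disjoint range, so tasks running in parallel never touch each other's chunk.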

abique · 2024-10-16T10:43:11Z

Here is my proposal:

The scratch is a thread local pointer, so if you retrieve it from the process call, you'll get a pointer that you can share with all the jobs. If you retrieve it from the thread pool, you get a pointer that is only for the current job.

I think this is the correct direction because it corresponds to how the host will implement this feature: each audio thread will have a single scratch buffer (thread-local) whose size is greater than or equal to the max requested size of all plugin instances.

The total scratch memory is: num_threads * max_scratch_size.
You definitely don't want max_scratch_size = max(plug_num_voices * plug_voice_scratch_size); instead we want max_scratch_size = max(plug_voice_scratch_size).
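The host-side sizing rule above can be written out as a short sketch. The function names are hypothetical, and the 8-thread figure is borrowed from the earlier comment about an 8-thread pool; the formula itself (num_threads * max_scratch_size) is the one stated above.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Largest request across all plugin instances (0 if there are none).
std::uint64_t max_scratch_size(const std::vector<std::uint64_t>& requests) {
    return requests.empty() ? 0 : *std::max_element(requests.begin(), requests.end());
}

// Every audio thread owns one buffer big enough for the largest request.
std::uint64_t total_scratch_bytes(std::uint64_t num_threads,
                                  const std::vector<std::uint64_t>& requests) {
    assert(num_threads > 0);
    return num_threads * max_scratch_size(requests);
}
```

With 8 threads, a plugin asking for 10,240 bytes per voice costs the host 81,920 bytes in total, while the same plugin asking for 32 voices' worth (327,680 bytes) costs 2,621,440 bytes, which is why the per-voice request is the one to prefer.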


Trinitou · 2024-10-16T21:31:01Z

include/clap/ext/draft/scratch-memory.h

+   // host when the plugin is de-activated.
+   //
+   // [main-thread & being-activated]
+   bool(CLAP_ABI *pre_reserve)(const clap_host_t *host, size_t scratch_size_bytes);


Do we ever use size_t in any other extension? -> uint?_t ?

abique · 2024-10-16

Good point, uint32_t would do the job I think.

baconpaul · 2024-10-17

Why not use uint64_t, which is size_t on most of our systems? The host has the option to return false for values that are out of bounds.

jatinchowdhury18 · 2024-10-17

I don't have a preference for uint32_t vs uint64_t... but Trinitou is right that CLAP doesn't use size_t anywhere else, so I don't think we should use it here.

abique · 2024-10-17

A scratch size bigger than 4 GB would be problematic I suppose; remember nthreads * max_scratch_size.
Anyway, regardless of the type, many hosts will likely have their own threshold.
uint32_t seems sufficient to me, but I'm happy with uint64_t as well.
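As a rough illustration of a host-side threshold, the sketch below takes a 32-bit request, multiplies it by the thread count in 64-bit arithmetic so the product cannot wrap, and compares it against a budget. host_accepts and the budget parameter are hypothetical; the draft does not prescribe any particular policy.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical host policy: accept a request only if the total across all
// audio threads stays within the host's own budget.
bool host_accepts(std::uint32_t scratch_size_bytes, std::uint32_t num_threads,
                  std::uint64_t budget_bytes) {
    assert(num_threads > 0);
    // Widen before multiplying: a 32-bit by 32-bit product always fits in 64 bits.
    const std::uint64_t total =
        static_cast<std::uint64_t>(scratch_size_bytes) * num_threads;
    return total <= budget_bytes;
}
```

Even with a 32-bit request size, a near-4 GB request multiplied by a handful of threads would exceed any realistic budget, so the host rejects it regardless of the parameter type.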


jatinchowdhury18 · 2024-10-17T04:23:16Z

> The scratch is a thread local pointer, so if you retrieve it from the process call, you'll get a pointer that you can share with all the jobs. If you retrieve it from the thread pool, you get a pointer that is only for the current job.

This makes sense! I've added some comments intended to clarify this point, but please let me know if there are ways I can improve my explanation :).


Trinitou

It looks pretty good to me now. Any further details can be discussed on next IMO.


NatureIsFrequency and others added 10 commits September 2, 2024 16:06

Fix: for issue: free-audio#390 Add .DS_Store to .gitignore

757a927

Add .DS_Store to .gitignore (MacOS specific) Fixes: free-audio#390

Expand the thread-doc to clarify and expand realtime

ebfe262

Summarizing the conversation in discussion free-audio#414, this PR adds a more complete description of when the realtime constraint must be met, by which party, and how it interacts with the thread safe tag.

Adjust latency extension requirements

3527890

- allow plugin_latency->get to be called during plugin->activate
- require host_latency->changed to be called during plugin->activate

undo: redesign the interface

5e2bd97

- the plugin interfaces have been separated into 2 independent ones
- the plugin interfaces are optional
- simplification of the design

Update ChangeLog

86c1681

CLAP 1.2.2

961be0b

Very rough draft scratch-memory extension

a5790e1

Name changes and documenting de-allocation pattern

efa821a

Fixing typedef declaration

48c1ba8

abique requested changes Oct 15, 2024

baconpaul reviewed Oct 15, 2024

Some changes based on PR feedback

a65ba57

abique reviewed Oct 16, 2024

Trinitou reviewed Oct 16, 2024

Trinitou reviewed Oct 17, 2024

Attempt to clarify usage with thread-pool extension and use uint32_t over size_t

1b1a68e

Clarify access pattern from the plugin side

f2a84b9

Trinitou reviewed Oct 17, 2024

Trinitou suggested changes Oct 17, 2024

jatinchowdhury18 and others added 2 commits October 18, 2024 18:05

Update include/clap/ext/draft/scratch-memory.h

06ccabd

Co-authored-by: Trinitou <[email protected]>

Clarify plugins calling reserve() multiple times

5bf4e59

Clarify: if final reserve() call returns false, then no scratch memory

635045c

Trinitou approved these changes Oct 21, 2024

abique reviewed Oct 21, 2024

abique reviewed Oct 21, 2024

abique reviewed Oct 21, 2024

p--b reviewed Oct 22, 2024

p--b reviewed Oct 22, 2024

p--b reviewed Oct 22, 2024

jatinchowdhury18 added 3 commits October 23, 2024 01:53

Tweaks based on PR feedback

5006ea2

Document being-activated

8df28a3

Plugin most reserve again on re-activation

b7968b2

abique force-pushed the next branch from d0f0373 to 16ea91a Compare November 1, 2024 13:40

abique merged commit 70c2e33 into free-audio:next Nov 1, 2024
4 checks passed

jatinchowdhury18 deleted the ext/scratch-memory branch November 3, 2024 21:00
