chore: Better abstract secondary storage #182
base: main
Conversation
…ondary insertions
I like the refactor! I think there are ways to make the API for the secondary struct easier to understand and use, though! I'm not a big fan of async APIs. See my reasoning here: https://www.notion.so/eigen-labs/Bls-Agg-Service-2-0-synchronous-API-98abef46040a48fc8d044e7de0781839
Unit tests can be run by invoking `make test`. Please make sure all test containers are downloaded locally before running, via:
```
docker pull redis
docker pull minio
```
is this really needed? I would think testcontainer would still pull the images when attempting to run them if they are not present locally?
server/load_store.go
Outdated
```go
fallbacks := populateTargets(cfg.EigenDAConfig.FallbackTargets, s3Store, redisStore)
caches := populateTargets(cfg.EigenDAConfig.CacheTargets, s3Store, redisStore)
secondary, err := store.NewSecondaryRouter(log, caches, fallbacks)
if err != nil {
	return nil, err
}
```
wrap error
```go
// primary storage backends
eigenda GeneratedKeyStore  // ALT DA commitment type for OP mode && simple commitment mode for standard /client
s3      PrecomputedKeyStore // OP commitment mode && keccak256 commitment type
```

Suggested change:

```go
// primary storage backends:
// GeneratedKeyStore is used for simple commitment mode && generic OP commitment mode
// PrecomputedKeyStore is used for keccak256 OP commitment mode
eigenda GeneratedKeyStore
s3      PrecomputedKeyStore
```
Not sure if this is correct. I tried to make your comments more precise (I was confused by them) but might have misunderstood them.
store/secondary.go
Outdated
```go
// NOTE: multi-target set writes are done at once to avoid re-invocation of the same write function at the same
// caller step for different target sets vs. reading which is done conditionally to segment between a cached read type
// vs a fallback read type
```
can you rephrase? very long sentence, I don't understand it.
store/secondary.go
Outdated
```go
for _, src := range sources {
	err := src.Put(ctx, key, value)
	if err != nil {
		r.log.Warn("Failed to write to redundant target", "backend", src.BackendType(), "err", err)
	} else {
		successes++
	}
}

if successes == 0 {
	return errors.New("failed to write blob to any redundant targets")
}
```
should we use an errgroup here instead? If there are a bunch of sources, we should prob write to them in parallel?
store/secondary.go
Outdated
```go
Ingress() chan<- PutNotif
CachingEnabled() bool
FallbackEnabled() bool
HandleRedundantWrites(ctx context.Context, commitment []byte, value []byte) error
MultiSourceRead(context.Context, []byte, bool, func([]byte, []byte) error) ([]byte, error)
StreamProcess(context.Context)
```
add comments to explain the interface. Not sure how this works from first reading. Ingress, StreamProcess, etc. are not super descriptive. Might want to use longer more descriptive names?
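One possible shape for the documented interface. The doc comments below are guesses at intent (to be corrected by the author), and the `SecondaryManager`/`noopSecondary` names are hypothetical:

```go
package main

import (
	"context"
	"fmt"
)

// PutNotif mirrors the PR's notification payload (hypothetical shape).
type PutNotif struct {
	Commitment []byte
	Value      []byte
}

// SecondaryManager sketches how the interface could be documented.
type SecondaryManager interface {
	// Ingress returns the write side of the queue that the request path
	// pushes successful primary writes onto for async replication.
	Ingress() chan<- PutNotif
	// CachingEnabled reports whether any cache targets are configured.
	CachingEnabled() bool
	// FallbackEnabled reports whether any fallback targets are configured.
	FallbackEnabled() bool
	// HandleRedundantWrites synchronously writes the commitment/value pair
	// to every configured cache and fallback target.
	HandleRedundantWrites(ctx context.Context, commitment []byte, value []byte) error
	// MultiSourceRead reads from cache targets (or fallback targets when the
	// bool is true), verifying each candidate with the supplied function.
	MultiSourceRead(context.Context, []byte, bool, func([]byte, []byte) error) ([]byte, error)
	// StreamProcess consumes queued notifications until ctx is done.
	StreamProcess(context.Context)
}

// noopSecondary is a trivial implementation proving the interface compiles.
type noopSecondary struct{ ch chan PutNotif }

func (n *noopSecondary) Ingress() chan<- PutNotif { return n.ch }
func (n *noopSecondary) CachingEnabled() bool     { return false }
func (n *noopSecondary) FallbackEnabled() bool    { return false }
func (n *noopSecondary) HandleRedundantWrites(ctx context.Context, c, v []byte) error {
	return nil
}
func (n *noopSecondary) MultiSourceRead(ctx context.Context, key []byte, fallback bool, verify func([]byte, []byte) error) ([]byte, error) {
	return nil, nil
}
func (n *noopSecondary) StreamProcess(ctx context.Context) {}

var _ SecondaryManager = (*noopSecondary)(nil)

func main() {
	s := &noopSecondary{ch: make(chan PutNotif, 1)}
	fmt.Println(s.CachingEnabled()) // → false
}
```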
server/load_store.go
Outdated
```go
for i := 0; i < 10; i++ {
	go secondary.StreamProcess(ctx)
}
```
am I understanding correctly that you're spinning up 10 workers to pull from the queue? Are we sure that 1 is not enough? Why is 10 sufficient? Can we make it a config parameter instead. Also probably change the name StreamProcess to RequestProcessor or something more explicit?
Also, maybe if you want to go this route, expose an `init()` function or a `startWorkers(numWorkers uint64)` function which spins up the goroutines?
```go
if r.secondary.FallbackEnabled() {
	data, err = r.secondary.MultiSourceRead(ctx, key, true, r.eigenda.Verify)
```
Thinking through fallbacks some more. Do we really need them? I can't think of a use case where I would want a fallback instead of a cache. Why would I only want to read after eigenDA, instead of always reading before eigenDA? It would greatly simplify the code if we could just get rid of fallbacks. But I might be missing something here still..
…via metrics - cleanups
…via metrics - refactors and lints
…via metrics - ensure thread safety for secondary stores
Fixes Issue
Related to #164
First PR of likely a few. Initially I want to refactor so that secondary storage can be concurrently decoupled from the request processing flow. E.g., with 2 secondaries the dispersal latency could increase by a few seconds. With EigenDAV2 we expect average request latency to be around 30 seconds, but someone overlaying secondaries could increase this to e.g. 32-45 seconds, which would amount to a difference of 100-200 kb/s (post-compression && serial submission strategy).
Changes proposed
Screenshots (Optional)
Note to reviewers