Enforce whitelist filtering #22

lsfera · 2024-06-22T10:07:38Z

closes #13
closes #10
Different specs for Pub/Sub
Allow message table customization


oskardudycz

@lsfera thank you for the next set of changes.

For the future, it'd be great if you could split the changes into a few Pull Requests and discuss them before sending a big PR. Such a big scope of changes makes it hard to review and to consider in terms of the general API. In this case, the changes could be split into, e.g.:

adding publication filter,
adding table descriptor,
ensuring ConfigureAwait is applied correctly,
etc.

I'd appreciate it if you also discussed bigger changes upfront.

I'm taking this PR as it is, as changes are good, and I want to review soon the current API and reshape it a bit in the follow-up set of changes.

lsfera · 2024-07-03T08:19:48Z

> @lsfera thank you for the next set of changes.
>
> For the future, it'd be great if you could split the changes into a few Pull Requests and discuss them before sending a big PR. Such a big scope of changes makes it hard to review and to consider in terms of the general API. In this case, the changes could be split into, e.g.:
>
> adding publication filter,
> adding table descriptor,
> ensuring ConfigureAwait is applied correctly,
> etc.
>
> I'd appreciate it if you also discussed bigger changes upfront.
>
> I'm taking this PR as it is, as changes are good, and I want to review soon the current API and reshape it a bit in the follow-up set of changes.

Agreed. I’m still in POC mode, as I want to add features to the project to verify how far we can go with it.
But a more disciplined approach should definitely help.
I’d like to have an open discussion about how to approach design choices.
E.g. #14. We could add mime_type support along with a bytea column for JSON too, giving a unified approach regardless of the serialisation mechanism, but losing native PostgreSQL advanced query support and compression-optimised storage. Or we could keep jsonb and bytea as separate use cases.
What are your thoughts?
Supporting consumption of objects, bytes, and text is one thing. What about validating input data? IMHO this is something we should care about by design. Do you agree?
Introducing DI. On EC2 I implemented a background service worker per consumer type, wrapping each with a back-off retry policy to deal with service bus and networking issues. Is this something we are interested in here? Could a feature branch help with reasoning about it?

oskardudycz · 2024-07-03T09:17:39Z

@lsfera about different MIME types, that sounds fair, but I'll need to do more research on how others are doing it (e.g. Debezium). EventStoreDB stores the MIME type and always stores bytes. Then you can select your serialiser based on that.
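
To make the "store bytes plus MIME type, pick the serialiser later" idea concrete, here is a minimal sketch. `StoredMessage` and `Decoders` are hypothetical names made up for illustration; they are not Blumchen or EventStoreDB types, and the set of supported content types is an assumption.

```csharp
using System;
using System.Collections.Generic;
using System.Text;
using System.Text.Json;

// Hypothetical storage shape: the payload is always bytes, the content type says how to read it.
public sealed record StoredMessage(string ContentType, byte[] Data);

public static class Decoders
{
    // One decoder per MIME type; the list of types here is illustrative only.
    private static readonly Dictionary<string, Func<byte[], object?>> ByContentType = new()
    {
        ["application/json"] = bytes => JsonSerializer.Deserialize<JsonElement>(bytes),
        ["text/plain"] = bytes => Encoding.UTF8.GetString(bytes),
        ["application/octet-stream"] = bytes => bytes,
    };

    // Selects the decoder from the stored content type and fails loudly on unknown types.
    public static object? Decode(StoredMessage message) =>
        ByContentType.TryGetValue(message.ContentType, out var decode)
            ? decode(message.Data)
            : throw new NotSupportedException($"No decoder for '{message.ContentType}'.");
}
```

The trade-off raised earlier in the thread still applies: with a bytes-only column, this dispatch happens in the application, so jsonb querying in PostgreSQL is not available for those payloads.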

What I'd definitely like is to not enforce deserialisation of messages. This would, e.g., enable forwarding messages as they are to other messaging systems, or even streaming them through, e.g., web sockets.

For that, we'll probably need to split this into:

raw async enumerable,
on top of that async enumerable with deserialised messages,
probably some consumer abstractions for single and batched handling.
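
A rough sketch of how those three layers could stack, assuming JSON payloads. All names (`RawMessage`, `MessageStreams`, `Raw`, `Deserialised`, `ConsumeInBatches`) are hypothetical and not part of the current Blumchen API; an in-memory list stands in for the replication stream.

```csharp
using System;
using System.Collections.Generic;
using System.Runtime.CompilerServices;
using System.Text.Json;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical envelope, not a Blumchen type: the payload stays as bytes.
public sealed record RawMessage(string MessageType, byte[] Payload);

public static class MessageStreams
{
    // Layer 1: raw messages, suitable for forwarding as-is.
    // An in-memory sequence stands in for the real replication stream.
    public static async IAsyncEnumerable<RawMessage> Raw(
        IEnumerable<RawMessage> source,
        [EnumeratorCancellation] CancellationToken ct = default)
    {
        foreach (var message in source)
        {
            ct.ThrowIfCancellationRequested();
            await Task.Yield();
            yield return message;
        }
    }

    // Layer 2: opt-in deserialisation on top of the raw stream (JSON assumed).
    public static async IAsyncEnumerable<T> Deserialised<T>(
        this IAsyncEnumerable<RawMessage> raw,
        [EnumeratorCancellation] CancellationToken ct = default)
    {
        await foreach (var message in raw.WithCancellation(ct).ConfigureAwait(false))
        {
            var value = Decode<T>(message);
            if (value is not null) yield return value;
        }
    }

    // Layer 3: a batched consumer; single-message handling is the batchSize == 1 case.
    public static async Task ConsumeInBatches<T>(
        this IAsyncEnumerable<T> messages,
        int batchSize,
        Func<IReadOnlyList<T>, Task> handle,
        CancellationToken ct = default)
    {
        var batch = new List<T>(batchSize);
        await foreach (var message in messages.WithCancellation(ct).ConfigureAwait(false))
        {
            batch.Add(message);
            if (batch.Count < batchSize) continue;
            await handle(batch.ToArray()).ConfigureAwait(false);
            batch.Clear();
        }
        // Flush whatever is left when the stream ends.
        if (batch.Count > 0) await handle(batch.ToArray()).ConfigureAwait(false);
    }

    private static T? Decode<T>(RawMessage message) =>
        JsonSerializer.Deserialize<T>(message.Payload);
}
```

A caller that only forwards messages would stop at `Raw(...)`; one that needs typed messages would chain `Raw(...).Deserialised<T>().ConsumeInBatches(...)`.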

There may be other layers, but this split would enable different use cases: other tools such as Brighter or MassTransit could use Blümchen internally, and regular applications could use it for business needs. I'll need to sleep on that. If I have a draft, I'll definitely tag you in the PR.

I think that we should not be doing any validation or manipulation of the data we're getting from the users. Even from the legal audit perspective it's better for the user to be sure that what they store is what they get in the database.

> Introducing DI. On EC2 I implemented a background service worker per consumer type, wrapping each with a back-off retry policy to deal with service bus and networking issues. Is this something we are interested in here? Could a feature branch help with reasoning about it?

Yup, this would definitely help people having the capability to just do AddBlumchen and configure the default options.
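
For reference, the back-off retry wrapper mentioned above could look roughly like this. It is a sketch under assumptions: `Backoff.RunWithRetry` is a made-up helper, the exponential schedule is one possible policy, and the hosting and DI wiring (e.g. calling it from a hosted background service's execute loop) is left out to keep the example self-contained.

```csharp
using System;
using System.Threading;
using System.Threading.Tasks;

public static class Backoff
{
    // Runs the consumer and retries with exponentially growing delays
    // (baseDelay, 2x, 4x, ...) until it succeeds or maxAttempts is reached.
    public static async Task RunWithRetry(
        Func<CancellationToken, Task> consume,
        int maxAttempts,
        TimeSpan baseDelay,
        CancellationToken ct = default)
    {
        for (var attempt = 1; ; attempt++)
        {
            try
            {
                await consume(ct).ConfigureAwait(false);
                return;
            }
            // The filter lets the last failure (or a cancellation) propagate to the caller.
            catch (Exception) when (attempt < maxAttempts && !ct.IsCancellationRequested)
            {
                var delay = TimeSpan.FromMilliseconds(
                    baseDelay.TotalMilliseconds * Math.Pow(2, attempt - 1));
                await Task.Delay(delay, ct).ConfigureAwait(false);
            }
        }
    }
}
```

A real policy would likely add jitter and only retry on transient errors such as network failures; both are omitted here.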

Also, later on I'd like to add built-in WebHooks support. That could be useful for running Blumchen as a serverless service, e.g. on Fargate, that would just forward events over HTTP to SQS or other services.

Enforce provided whitelist at publication level

040273a

lsfera marked this pull request as draft June 22, 2024 14:54

move from initializer to getter

b72c81a

lsfera marked this pull request as ready for review June 22, 2024 15:43

lsfera marked this pull request as draft June 25, 2024 20:23

Some heavy lifting on library ergonomics aimed at simplifying usage.

04e4b2e

Enabled filters on snapshot reading

lsfera marked this pull request as ready for review June 25, 2024 20:58

lsfera added 16 commits June 25, 2024 23:21

updated test deps

c4c239a

Marked assembly as private

1476e44

updated assembly version

a4c83d6

test case renamed

4256afa

cannot be null - not needed

51eb9f8

renamed variable

0410268

moved demo apps under 'demo' folder

cf404b4

tests renamed

933e504

verify subscriber does not receive messages different from the ones specified in subscriptions

10a6eec

mark CreatePublication internal to expose to testing suite

7ecef64

to explicit yield

d4a9dc6

narrowed method scope

6a211f0

expose testoutputhelper to base class

df63f29

inject error handler to trace poison messages

ed4a262

enforce required attributes on deserialization to catch invalid data

0a98cf8

Allowed message table customization: table name, column names and length of the varchar column

3e14955

lsfera changed the title ~~Enforce provided whitelist at publication level~~ Enforce whitelist filtering Jul 2, 2024

oskardudycz reviewed Jul 3, 2024


src/Blumchen/Subscriptions/Subscription.cs (resolved)

oskardudycz approved these changes Jul 3, 2024


oskardudycz merged commit 1f4aa7a into event-driven-io:master Jul 3, 2024
1 check passed
