Routine Load for Iceberg tables #49956

Samrose-Ahmed · 2024-08-19T06:40:27Z

Feature request

Support routine load to load data to iceberg tables.

Is your feature request related to a problem? Please describe.

Use starrocks directly to write to starrocks from kafka without having to use kafka connect or separate write fleet.

Describe the solution you'd like

Reuse routine load infra, adapt for Iceberg tables.

Describe alternatives you've considered

Additional context

jaogoy · 2024-08-19T09:44:56Z

It'd be better to be implemented.
But, if every batch is too small, then the versions will be too much, thereforce the query performance on Iceberg tables will not be good, IMO.

And, can you share with me about your scenarios? Do you just want datalake analytics, and the query performance is not so much restricted to second level?

Samrose-Ahmed · 2024-08-19T09:52:24Z

Yes you need to not commit excessively. I think around 1min-5min intervals are reasonable (that's often used with Flink/Iceberg as the checkpoint interval).

Second level is not necessary and would generate too many files with iceberg. In general, data/metadata gets compacted away so a few new files don't really affect performance too much as long as commit interval is reasonable.

github-actions · 2025-02-17T11:00:44Z

We have marked this issue as stale because it has been inactive for 6 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to StarRocks!

maver1ck · 2025-02-26T07:54:16Z

Hi,
Do we know what's the exact reason this is not working ?

Samrose-Ahmed · 2025-02-27T19:58:57Z

This was never implemented, this is just an issue to track

Samrose-Ahmed added the type/feature-request label Aug 19, 2024

github-actions bot added the no-issue-activity label Feb 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Routine Load for Iceberg tables #49956

Routine Load for Iceberg tables #49956

Samrose-Ahmed commented Aug 19, 2024

jaogoy commented Aug 19, 2024

Samrose-Ahmed commented Aug 19, 2024

github-actions bot commented Feb 17, 2025

maver1ck commented Feb 26, 2025

Samrose-Ahmed commented Feb 27, 2025

Routine Load for Iceberg tables #49956

Routine Load for Iceberg tables #49956

Comments

Samrose-Ahmed commented Aug 19, 2024

Feature request

jaogoy commented Aug 19, 2024

Samrose-Ahmed commented Aug 19, 2024

github-actions bot commented Feb 17, 2025

maver1ck commented Feb 26, 2025

Samrose-Ahmed commented Feb 27, 2025