Decentralized Data Ideas #42
Replies: 6 comments
-
Chatted with some folks working on Subsquid. They're doing interesting things on the decentralized data lake area. This is more or less what I understood about how the Subsquid Archive works.
Subsquid Labs maintains public Archive endpoints and offers batch access via the Squid SDK free of charge. Questions
|
Beta Was this translation helpful? Give feedback.
-
Adding a small note that Dagster is already relying on "hashes" to check when runs are needed! A step closer to fully content addresses workflows. |
Beta Was this translation helpful? Give feedback.
-
You can |
Beta Was this translation helpful? Give feedback.
-
Having a decentralized data lake paired with some good standards and incentives, could result in a universal repository of high-quality structured data ™️. A public data warehouse. Similar to Wikipedia, but for public data. This might be possible now as the technology and theory is there. We have L2s, token based incentive systems, modular and composable data stacks, fast peer to peer connections, and data pipelines are virtually free. |
Beta Was this translation helpful? Give feedback.
-
Working with Parquet on top of IPFS or other decentralized file system is tricky as Parquet doesn't have deterministic encodings and changing only one element might cause the entire file to be "reuploaded". Adding here a few notes that I've collected in the past:
|
Beta Was this translation helpful? Give feedback.
-
Open Data via AT ProtocolThe AT Protocol is an open, decentralized network for building social applications. What if we tapped into the protocol primitives to create an open directory of (potentially interlinked) datasets? Some ideas:
Quite hand wavy for now but looking at the success Bluesky is having inspired me to think more about this one. |
Beta Was this translation helpful? Give feedback.
-
Random thoughts around decentralized and permissionless data lakes.
Also from datonic/datadex#22 (comment).
Beta Was this translation helpful? Give feedback.
All reactions