Section highlighting datalad-fuse and dandisets? #899

Open
yarikoptic opened this issue Dec 19, 2022 · 3 comments


@yarikoptic
Contributor

@asmacdo showed interest in participating in the ongoing handbook hackathon, and I thought it would be great to have a use case showcasing dandisets (superdataset at https://github.com/dandi/dandisets, individual datasets at https://github.com/dandisets, asyncio code that updates them from the archive under tools/ of dandisets) and the https://github.com/datalad/datalad-fuse/ extension. Dandisets are "special" in that their files are typically large, but access to metadata etc. requires only a small portion of the data. In datalad-fuse we use https://github.com/fsspec/filesystem_spec/ with local caching to provide efficient sparse access to remote annexed files that have an http* URL associated with them.
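
To make the "sparse access with caching" idea concrete for the use case, here is a minimal, self-contained sketch. It is not the datalad-fuse or fsspec implementation: `SparseCachedReader`, `fake_http_range`, and the tiny `BLOCK_SIZE` are hypothetical names and values made up for illustration, the "remote" file is an in-memory byte string standing in for an HTTP server that honors range requests, and the cache lives in memory rather than on local disk.

```python
from typing import Callable, Dict

# Tiny on purpose so the demo is easy to follow; a real cache would use far larger blocks.
BLOCK_SIZE = 4


class SparseCachedReader:
    """Read byte ranges of a remote file, fetching and caching only the blocks touched."""

    def __init__(self, fetch_range: Callable[[int, int], bytes], size: int,
                 block_size: int = BLOCK_SIZE) -> None:
        # fetch_range is a hypothetical callable: (start, end) -> bytes, end exclusive.
        self._fetch_range = fetch_range
        self._size = size
        self._block_size = block_size
        self._cache: Dict[int, bytes] = {}
        self.bytes_fetched = 0  # how much was actually transferred from the "remote"

    def _block(self, index: int) -> bytes:
        # Fetch a block only on first use; later reads are served from the cache.
        if index not in self._cache:
            start = index * self._block_size
            end = min(start + self._block_size, self._size)
            data = self._fetch_range(start, end)
            self.bytes_fetched += len(data)
            self._cache[index] = data
        return self._cache[index]

    def read(self, offset: int, length: int) -> bytes:
        end = min(offset + length, self._size)
        if offset >= end:
            return b""
        first = offset // self._block_size
        last = (end - 1) // self._block_size
        data = b"".join(self._block(i) for i in range(first, last + 1))
        skip = offset - first * self._block_size
        return data[skip:skip + (end - offset)]


# Stand-in for a large remote annexed file (assumption: the server supports range requests).
REMOTE = bytes(range(100))


def fake_http_range(start: int, end: int) -> bytes:
    return REMOTE[start:end]


reader = SparseCachedReader(fake_http_range, size=len(REMOTE))
header = reader.read(0, 6)   # touches blocks 0 and 1 only
again = reader.read(2, 4)    # same blocks, served from the cache
```

Reading the 6-byte "header" transfers only two blocks of the 100-byte file, and the second read transfers nothing new. That is the property that makes metadata access to large dandiset files cheap.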

In datalad core we had a request for streaming (datalad/datalad#4003), so it might be useful to highlight how streaming could be implemented, either via the fsspec interface within datalad-fuse or directly via its FUSE filesystem.

WDYT datalad-handbook folks about such a section? (attn @adswa @mih)

@adswa
Contributor

adswa commented Dec 19, 2022

Sure, sounds like a cool use case to me. Use cases usually follow a rough structure: http://handbook.datalad.org/en/latest/contributing.html#use-cases

@yarikoptic
Contributor Author

more specific target for the use case could be

@adswa
Contributor

adswa commented Dec 20, 2022

Just ping me if you need any info. You should add a new file in docs/usecases and place it somewhere in the docs/usecases/intro.rst toctree. Use cases do not need to have code that is executed and captured, so you can go with .. code-block::s instead of .. runrecords::. Looking forward to it!
