Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix: Large number of dataframe columns (H5py Attr Limit) #1147

Merged
merged 12 commits into from
Aug 8, 2024

Conversation

selmanozleyen
Copy link
Member

Hi @ivirshup @brainfo,

This PR solves #874. I had a chat with h5py people and this was the simplest workaround they recommended. They also have no idea why it works. See h5py/h5py#2311 (comment).

Done:

@codecov
Copy link

codecov bot commented Sep 25, 2023

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 84.18%. Comparing base (0302f23) to head (c2db00a).
Report is 73 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1147      +/-   ##
==========================================
- Coverage   86.68%   84.18%   -2.50%     
==========================================
  Files          37       37              
  Lines        5932     5937       +5     
==========================================
- Hits         5142     4998     -144     
- Misses        790      939     +149     
Files with missing lines Coverage Δ
src/anndata/_io/specs/methods.py 87.96% <100.00%> (-0.44%) ⬇️
src/anndata/compat/__init__.py 81.81% <100.00%> (-2.58%) ⬇️

... and 6 files with indirect coverage changes

@ilan-gold ilan-gold self-assigned this Aug 8, 2024
@ilan-gold ilan-gold added this to the 0.10.9 milestone Aug 8, 2024
# actually 64kb is the limit, but this should be a conservative estimate
if name in f:
# TODO: One possible solution is to delete the group and recreate it
raise ValueError(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

write_elem is idempotent for the same key

elif k in store:
del store[k]
so I think I will remove this.

@ilan-gold ilan-gold enabled auto-merge (squash) August 8, 2024 21:49
@ilan-gold ilan-gold merged commit 6e918a4 into scverse:main Aug 8, 2024
12 of 13 checks passed
meeseeksmachine pushed a commit to meeseeksmachine/anndata that referenced this pull request Aug 8, 2024
ilan-gold pushed a commit that referenced this pull request Aug 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
2 participants