Add cupy support + CI #1066

Zethson · 2023-07-21T11:04:53Z

@ivirshup says:

TODO:

Figure out if pytest logic can be simplified: Add cupy support + CI #1066 (comment)
Concatenation
IO (at least writing, probably via CPU memory)
Indexing
Views
Release note
~~- [ ] Consider how much can be done with array_api~~
Benchmark concatenation to be sure CPU stuff didn't get slower

Signed-off-by: zethson <[email protected]>

for more information, see https://pre-commit.ci

codecov · 2023-07-21T11:28:13Z

Codecov Report

Merging #1066 (c476a5d) into main (0c4c0b0) will decrease coverage by 1.72%.
The diff coverage is 41.29%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1066      +/-   ##
==========================================
- Coverage   84.28%   82.57%   -1.72%     
==========================================
  Files          35       35              
  Lines        4932     5112     +180     
==========================================
+ Hits         4157     4221      +64     
- Misses        775      891     +116

Files Changed	Coverage Δ
anndata/_core/raw.py	`79.56% <25.00%> (-1.04%)`	⬇️
anndata/tests/helpers.py	`86.15% <25.00%> (-9.86%)`	⬇️
anndata/_core/merge.py	`82.50% <28.88%> (-10.76%)`	⬇️
anndata/_core/views.py	`83.41% <51.85%> (-4.86%)`	⬇️
anndata/utils.py	`84.39% <71.42%> (-0.55%)`	⬇️
anndata/compat/__init__.py	`80.44% <75.00%> (-0.69%)`	⬇️
anndata/_io/specs/methods.py	`87.59% <90.90%> (-0.22%)`	⬇️
anndata/_core/anndata.py	`82.79% <100.00%> (+0.04%)`	⬆️

... and 1 file with indirect coverage changes

Signed-off-by: zethson <[email protected]>

.cirun.yml

Signed-off-by: zethson <[email protected]>

.github/workflows/build_gpu.yml

Co-authored-by: Isaac Virshup <[email protected]>

Signed-off-by: zethson <[email protected]>

.github/workflows/build_gpu.yml

ivirshup · 2023-07-24T15:50:44Z

So, it runs. But I see some issues:

micromamba list does not report the names of pip installed packages. I do not actually know if this feature is supported.
pip install is trying to overwrite the micromamba installed numpy. This is probably bad, but does not seem to be causing an import issue...

flying-sheep · 2023-07-25T09:03:48Z

micromamba list does not report the names of pip installed packages. I do not actually know if this feature is supported.

That’s mamba-org/mamba#2059

pip install is trying to overwrite the micromamba installed numpy. This is probably bad, but does not seem to be causing an import issue...

Why is it bad?

Both are dependency resolvers. The ideal way to use package managers would be to pick a single one, and let it install everything in a single step. Since we don’t do that (and probably can’t?), what pip does is correct. Note that it avoids upgrading packages when --upgrade/-U is not specified, unless that’s necessary:

Controlling what gets installed

[…]

the “default” upgrade strategy when --upgrade is not set [is that] packages are not upgraded (not even direct requirements) unless the currently installed version fails to satisfy a requirement (either explicitly specified or a dependency).

The fact that pip updates numpy therefore means that something requires a minimum numpy version greater than the one that micromamba installed.

ivirshup · 2023-07-25T11:26:36Z

Because pip can't actually uninstall conda packages.

Intron7 · 2023-07-26T12:13:44Z

I tested the preprocessing from rapids-singlecell with cupy anndata. There are no issues expect the way views interact with .X replacements. This happens when i want to put a dense matrix in after regress_out.
In anycase it works super well so far.

Intron7 · 2023-07-27T14:48:46Z

So I have some more small Ideas that I think would be good. I can also start implementing some of them:

Check if .nnz for cpx is less than 2^31-1 since cupy only supports int32 indptr
make a .todevice() like torch. To transform X or .layers from and to GPU. Maybe add an all option to dump everything into RAM.
add a flag/property for .X and layers if its in RAM or VRAM

ivirshup · 2023-07-27T16:56:44Z

I think this is getting pretty close to mergable, so I think I'd leave extra features on the actual cupy support out for now. I've opened #1080 to discuss follow up PRs.

ivirshup · 2023-07-27T18:02:49Z

@Intron7, you had mentioned some rules for controlling when this CI was run. Do you think you could link out to this/ maybe help set this up?

Intron7 · 2023-07-27T18:09:39Z

So far I only know that cuML uses that solution. I didnt check out how this works. But I will investigate this.

Intron7 · 2023-07-28T08:57:32Z

If you try to set a dense array for a sparse matrix in a view cuda this happens. Can we handle this a bit more gracefully?

Right now I have to copy before the function or use _init_as_actual. It would be amazing to have a method that would update adata in place to actual.

---------------------------------------------------------------------------
MemoryError                               Traceback (most recent call last)
File cupy/cuda/memory.pyx:742, in cupy.cuda.memory.alloc()

File ~/miniconda3/envs/anndata_test/lib/python3.10/site-packages/rmm/allocators/cupy.py:37, in rmm_cupy_allocator(nbytes)
     34     raise ModuleNotFoundError("No module named 'cupy'")
     36 stream = Stream(obj=cupy.cuda.get_current_stream())
---> 37 buf = librmm.device_buffer.DeviceBuffer(size=nbytes, stream=stream)
     38 dev_id = -1 if buf.ptr else cupy.cuda.device.get_device_id()
     39 mem = cupy.cuda.UnownedMemory(
     40     ptr=buf.ptr, size=buf.size, owner=buf, device_id=dev_id
     41 )

File device_buffer.pyx:85, in rmm._lib.device_buffer.DeviceBuffer.__cinit__()

MemoryError: std::bad_alloc: out_of_memory: CUDA error at: /home/sdicks/miniconda3/envs/anndata_test/include/rmm/mr/device/cuda_memory_resource.hpp:70: cudaErrorMemoryAllocation out of memory
Exception ignored in: 'cupy.cuda.thrust.cupy_malloc'
Traceback (most recent call last):
  File "cupy/cuda/memory.pyx", line 742, in cupy.cuda.memory.alloc
  File "/home/sdicks/miniconda3/envs/anndata_test/lib/python3.10/site-packages/rmm/allocators/cupy.py", line 37, in rmm_cupy_allocator
    buf = librmm.device_buffer.DeviceBuffer(size=nbytes, stream=stream)
  File "device_buffer.pyx", line 85, in rmm._lib.device_buffer.DeviceBuffer.__cinit__
MemoryError: std::bad_alloc: out_of_memory: CUDA error at: /home/sdicks/miniconda3/envs/anndata_test/include/rmm/mr/device/cuda_memory_resource.hpp:70: cudaErrorMemoryAllocation out of memory
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
File <timed eval>:1

File ~/miniconda3/envs/anndata_test/lib/python3.10/site-packages/rapids_singlecell/cunnData_funcs/_regress_out.py:115, in regress_out(cudata, keys, layer, inplace, batchsize, verbose)
    113         cudata.layers[layer] = outputs
    114     else:
--> 115         cudata.X = outputs
    116 else:
    117     return outputs

File ~/git/anndata/anndata/_core/anndata.py:682, in AnnData.X(self, value)
    678     if sparse.issparse(self._adata_ref._X) and isinstance(
    679         value, np.ndarray
    680     ):
    681         value = sparse.coo_matrix(value)
--> 682     self._adata_ref._X[oidx, vidx] = value
    683 else:
    684     self._X = value

File ~/miniconda3/envs/anndata_test/lib/python3.10/site-packages/cupyx/scipy/sparse/_index.py:446, in IndexMixin.__setitem__(self, key, x)
    444     return
    445 x = x.reshape(i.shape)
--> 446 self._set_arrayXarray(i, j, x)

File ~/miniconda3/envs/anndata_test/lib/python3.10/site-packages/cupyx/scipy/sparse/_compressed.py:480, in _compressed_sparse_matrix._set_arrayXarray(self, row, col, x)
    478 def _set_arrayXarray(self, row, col, x):
    479     i, j = self._swap(row, col)
--> 480     self._set_many(i, j, x)

File ~/miniconda3/envs/anndata_test/lib/python3.10/site-packages/cupyx/scipy/sparse/_compressed.py:557, in _compressed_sparse_matrix._set_many(self, i, j, x)
    555 j = j[mask]
    556 j[j < 0] += N
--> 557 self._insert_many(i, j, x[mask])

File ~/miniconda3/envs/anndata_test/lib/python3.10/site-packages/cupyx/scipy/sparse/_compressed.py:616, in _compressed_sparse_matrix._insert_many(self, i, j, x)
    607 def _insert_many(self, i, j, x):
    608     """Inserts new nonzero at each (i, j) with value x
    609     Here (i,j) index major and minor respectively.
    610     i, j and x must be non-empty, 1d arrays.
   (...)
    613     Modifies i, j, x in place.
    614     """
--> 616     order = cupy.argsort(i)  # stable for duplicates
    617     i = i.take(order)
    618     j = j.take(order)

File ~/miniconda3/envs/anndata_test/lib/python3.10/site-packages/cupy/_sorting/sort.py:116, in argsort(a, axis, kind)
    114 if kind is not None and kind != 'stable':
    115     raise ValueError("kind can only be None or 'stable'")
--> 116 return a.argsort(axis=axis)

File cupy/_core/core.pyx:879, in cupy._core.core._ndarray_base.argsort()

File cupy/_core/core.pyx:896, in cupy._core.core._ndarray_base.argsort()

File cupy/_core/_routines_sorting.pyx:96, in cupy._core._routines_sorting._ndarray_argsort()

File cupy/cuda/thrust.pyx:117, in cupy.cuda.thrust.argsort()

RuntimeError: transform: failed to synchronize: cudaErrorIllegalAddress: an illegal memory access was encountered

ivirshup · 2023-07-28T10:25:22Z

Could you share some code that throws this?

for more information, see https://pre-commit.ci

Intron7 · 2023-07-28T11:12:40Z

Could you share some code that throws this?

from anndata import AnnData
from cupyx.scipy import sparse as cpsparse
from scipy import sparse
import cupy as cp
import numpy as np

rand = sparse.random(100000, 20000, density=0.05,dtype=np.float32, format="csr")
adata = AnnData(X= cpsparse.csr_matrix(rand))
adata = adata[:,:5000]
X = cp.random.rand(100000,5000, dtype= cp.float32)
adata.X = X

This works though

adata = AnnData(X= cpsparse.csr_matrix(rand))
adata = adata[:,:5000].copy()
X = cp.random.rand(100000,5000, dtype= cp.float32)
adata.X = X

ivirshup · 2023-07-28T12:06:05Z

In this instance I think you can do:

rand = sparse.random(100000, 20000, density=0.05,dtype=np.float32, format="csr")
adata = AnnData(X= cpsparse.csr_matrix(rand))
adata = adata[:,:5000]
X = cp.random.rand(100000,5000, dtype= cp.float32)

del adata.X
adata.X = X

But yeah the behavior is weird, but not a bug. I'm not really sure what a more graceful way to handle this would be here.

I think I could be up for a inplace conversion to actual though. Can you open an issue for this?

Intron7 · 2023-07-28T12:14:32Z

For very small matrices it works but is super slow.

Open an issue for the feature #1082

conftest.py

flying-sheep

Looking good! Once the TODOs are done, I think we’re good to go!

anndata/tests/test_gpu.py

Zethson and others added 2 commits July 21, 2023 13:04

Add GPU CI

d9a4f6c

Signed-off-by: zethson <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

b616478

for more information, see https://pre-commit.ci

Merge branch 'main' into feature/gpu_ci

01f6aa8

ivirshup added this to the 0.10.0 milestone Jul 21, 2023

ivirshup mentioned this pull request Jul 21, 2023

Document how GPU CI is setup and works scverse/governance#57

Closed

Zethson added 3 commits July 21, 2023 14:37

Add draft of Test action

eb7dc1d

Signed-off-by: zethson <[email protected]>

Add draft of Test action

7f52b2b

Signed-off-by: zethson <[email protected]>

Remove python specification

6c8e17b

Signed-off-by: zethson <[email protected]>

ivirshup reviewed Jul 21, 2023

View reviewed changes

.cirun.yml Outdated Show resolved Hide resolved

Switch to mamba

e9ddb3f

Signed-off-by: zethson <[email protected]>

ivirshup reviewed Jul 21, 2023

View reviewed changes

.github/workflows/build_gpu.yml Outdated Show resolved Hide resolved

Zethson and others added 3 commits July 21, 2023 15:33

Add shell check

f7376bc

Co-authored-by: Isaac Virshup <[email protected]>

Switch to mamba

73e5e7e

Signed-off-by: zethson <[email protected]>

micromamba list

12a1a9e

Signed-off-by: zethson <[email protected]>

ivirshup mentioned this pull request Jul 21, 2023

Set up GPU CI #1067

Closed

6 tasks

Zethson and others added 5 commits July 21, 2023 16:03

Add shell

f90f3dc

Signed-off-by: zethson <[email protected]>

Add environment-name

0a2d0d7

Signed-off-by: zethson <[email protected]>

rename environment-name

e8604f5

Signed-off-by: zethson <[email protected]>

specify python

c4a89e2

Remove env name

2e5d718

flying-sheep reviewed Jul 22, 2023

View reviewed changes

.github/workflows/build_gpu.yml Outdated Show resolved Hide resolved

ivirshup added 3 commits July 24, 2023 17:13

Don't make a shell

8f7ac30

add env name

a0bd5f7

Get git info so version is specified right

18c1d79

Zethson and others added 2 commits July 25, 2023 13:27

proper cirun label

f497401

Add gpu mark, --only-gpu argument

d4d81a4

Better gpu test

9c41aed

ivirshup added 4 commits July 26, 2023 12:43

Deduplicate some test params

5d69e2d

Support IO

bac0111

Fixes related to cupy/cupy#7757

2075a70

coverage

2c27bdb

ivirshup mentioned this pull request Jul 27, 2023

Cupy support in anndata #1080

Closed

2 tasks

ivirshup and others added 2 commits July 28, 2023 12:50

Cancel jobs if new commits are pushed + whitespace to trigger precomit

66ca927

[pre-commit.ci] auto fixes from pre-commit.com hooks

c0fdf9d

for more information, see https://pre-commit.ci

Update GPU CI name + paralellize GPU CI

1ba8ad1

Simplify pytest setup

d39c5d3

flying-sheep requested changes Jul 28, 2023

View reviewed changes

conftest.py Outdated Show resolved Hide resolved

Fix typo

03e38f9

flying-sheep approved these changes Jul 31, 2023

View reviewed changes

ivirshup added 2 commits July 31, 2023 09:18

Release note

7cda91f

Change run rules for GPU CI

c476a5d

Zethson commented Jul 31, 2023

View reviewed changes

anndata/tests/test_gpu.py Show resolved Hide resolved

ivirshup added the topic: gpu label Jul 31, 2023

ivirshup merged commit 8b1a7e4 into main Jul 31, 2023

ivirshup deleted the feature/gpu_ci branch July 31, 2023 10:18

ktravaglini mentioned this pull request Sep 25, 2023

Support for upcoming AnnData 0.10.0rc1 for compatibility with the latest rapids_singlecell scverse/scvi-tools#2268

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add cupy support + CI #1066

Add cupy support + CI #1066

Zethson commented Jul 21, 2023 •

edited by ivirshup

Loading

codecov bot commented Jul 21, 2023 •

edited

Loading

ivirshup commented Jul 24, 2023

flying-sheep commented Jul 25, 2023 •

edited

Loading

Controlling what gets installed

ivirshup commented Jul 25, 2023

Intron7 commented Jul 26, 2023

Intron7 commented Jul 27, 2023 •

edited

Loading

ivirshup commented Jul 27, 2023

ivirshup commented Jul 27, 2023

Intron7 commented Jul 27, 2023

Intron7 commented Jul 28, 2023 •

edited

Loading

ivirshup commented Jul 28, 2023

Intron7 commented Jul 28, 2023

ivirshup commented Jul 28, 2023

Intron7 commented Jul 28, 2023

flying-sheep left a comment

Add cupy support + CI #1066

Add cupy support + CI #1066

Conversation

Zethson commented Jul 21, 2023 • edited by ivirshup Loading

codecov bot commented Jul 21, 2023 • edited Loading

Codecov Report

ivirshup commented Jul 24, 2023

flying-sheep commented Jul 25, 2023 • edited Loading

Controlling what gets installed

ivirshup commented Jul 25, 2023

Intron7 commented Jul 26, 2023

Intron7 commented Jul 27, 2023 • edited Loading

ivirshup commented Jul 27, 2023

ivirshup commented Jul 27, 2023

Intron7 commented Jul 27, 2023

Intron7 commented Jul 28, 2023 • edited Loading

ivirshup commented Jul 28, 2023

Intron7 commented Jul 28, 2023

ivirshup commented Jul 28, 2023

Intron7 commented Jul 28, 2023

flying-sheep left a comment

Choose a reason for hiding this comment

Zethson commented Jul 21, 2023 •

edited by ivirshup

Loading

codecov bot commented Jul 21, 2023 •

edited

Loading

flying-sheep commented Jul 25, 2023 •

edited

Loading

Intron7 commented Jul 27, 2023 •

edited

Loading

Intron7 commented Jul 28, 2023 •

edited

Loading