Initial Release #1

avik-pal · 2023-11-17T06:25:15Z

avik-pal · 2023-11-19T18:34:10Z

For LinearSolve.jl, LU and QR should just work, with a fallback for `\` that loops over each batch. To make this efficient for GPU arrays, we need to call the batched solvers directly. Surprisingly, CUBLAS and CUSOLVER have APIs for direct batched linear solves, but they don't provide APIs that reuse the batched QR (and similar factorizations) to do the linear solve efficiently.
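As a rough illustration of the per-batch fallback described above, here is a minimal sketch in plain Julia using only the `LinearAlgebra` standard library. The helper name `batched_solve_loop` and the raw 3-D array layout (one system per slice along the last dimension) are assumptions made for this example; they are not the BatchedArrays.jl API.

```julia
using LinearAlgebra

# Hypothetical helper (not part of BatchedArrays.jl): solve
# A[:, :, k] * x[:, k] = b[:, k] for every batch index k, one slice at a time.
function batched_solve_loop(A::AbstractArray{T, 3}, b::AbstractMatrix{T}) where {T}
    n, m, nbatch = size(A)
    @assert size(b) == (n, nbatch) "batch sizes of A and b must match"
    x = similar(b, m, nbatch)
    for k in 1:nbatch
        # Each slice is copied out and solved independently with `\`.
        x[:, k] = A[:, :, k] \ b[:, k]
    end
    return x
end

A = rand(4, 4, 16)
for k in 1:16
    A[:, :, k] += 4I        # shift the diagonal so every slice is well conditioned
end
b = rand(4, 16)
x = batched_solve_loop(A, b)
```

The loop is simple but performs one factorization per slice, which is why a GPU implementation would prefer a single batched kernel call.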

Currently I assume that the batch sizes of A and b must match, but this is easy to generalize. @ChrisRackauckas, the case you mentioned, with 1 A and N bs, can be handled on our end by using trsm instead of trsv. So all users need to do is:

LinearProblem(BatchedArray(rand(4, 4, 1)), BatchedArray(rand(4, 16)))  # Single `A` but 16 `b`s
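The single-`A`, many-`b` case can be pictured without any batching package: factorize the shared matrix once and solve against all right-hand sides stored as columns. This is only a sketch with the `LinearAlgebra` standard library; it shows the idea behind the matrix versus vector right-hand side distinction and makes no claim about which BLAS/LAPACK routine is dispatched internally.

```julia
using LinearAlgebra

A = rand(4, 4) + 4I     # one shared coefficient matrix (diagonal shifted for conditioning)
B = rand(4, 16)         # 16 right-hand sides stored as columns

F = lu(A)               # factorize once
X = F \ B               # solve for all 16 columns together (the matrix right-hand side case)

x1 = F \ B[:, 1]        # a single right-hand side (the vector case) reuses the same factorization
```

Reusing `F` avoids repeating the factorization for each right-hand side; only the cheaper triangular solves are repeated.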

codecov · 2023-11-21T01:10:58Z

Welcome to Codecov 🎉

Once merged to your default branch, Codecov will compare your coverage reports and display the results in this comment.

Thanks for integrating Codecov - We've got you covered ☂️

avik-pal · 2023-11-28T03:27:58Z

SimpleNonlinearSolve.jl example usage

using BatchedArrays, SimpleNonlinearSolve

u0 = BatchedArray(rand(3, 5))

prob1 = NonlinearProblem((u, p) -> u .^ 2 .- p, u0, 2.0)

solve(prob1, SimpleBroyden())

solve(prob1, SimpleDFSane())

solve(prob1, SimpleLimitedMemoryBroyden(; threshold = 2))

solve(prob1, SimpleNewtonRaphson())

solve(prob1, SimpleKlement())

solve(prob1, SimpleHalley())

I am leaving out TR (trust region) for now since there is a potential correctness issue that needs careful investigation. In summary, branching is almost impossible to handle nicely if there are conditional computations inside the branch.
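To make the branching difficulty concrete, here is a small plain-Julia illustration. It is an assumption-laden sketch, not code from this PR: each column stands for one batch element, and the condition differs per element, so a scalar `if` no longer applies and both branches must be evaluated and then masked.

```julia
# Hypothetical illustration in plain Julia, not BatchedArrays.jl code.
# Each column is one batch element.
u = [1.0 -2.0  3.0;
     0.5  1.0 -1.0]

# One Bool per batch element: here, "is the column sum positive?"
cond = vec(sum(u; dims = 1)) .> 0

# A scalar `if cond ... else ... end` cannot take a vector of Bools, so both
# branches are computed for every batch element and selected with a mask.
branch_a = u .* 2
branch_b = u .- 1
result = ifelse.(reshape(cond, 1, :), branch_a, branch_b)
```

Masking works when both branches are cheap and free of side effects. When a branch itself contains further conditional computation or updates state, every batch element pays for every path, and keeping the results correct gets much harder.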

Fun part: methods that use a Jacobian will be much faster with BatchedArrays, since we can automatically color and propagate all the batch duals together. So the current SimpleNewtonRaphson with BatchedArrays is faster than the pre-1.0 BatchedSimpleNewtonRaphson.
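The reason batching helps here is that, when the function acts on each batch element independently, the full Jacobian is block diagonal, so the same input row can be perturbed in every batch element at once. The sketch below shows that idea with forward finite differences instead of dual numbers (a deliberate substitution to keep it dependency-free). The helper `batched_jacobian_fd` is hypothetical and assumes `f` maps each column independently.

```julia
# Residual from the example above with p = 2.0; acts on each column independently.
f(u) = u .^ 2 .- 2.0

# Hypothetical helper: returns J with J[:, :, k] the Jacobian of batch element k.
# Needs only n + 1 evaluations of f, regardless of the number of batch elements.
function batched_jacobian_fd(f, u::AbstractMatrix; h = 1e-6)
    n, nbatch = size(u)
    fu = f(u)
    J = zeros(n, n, nbatch)
    for j in 1:n
        up = copy(u)
        up[j, :] .+= h                        # perturb row j in every batch element at once
        J[:, j, :] .= (f(up) .- fu) ./ h      # column j of every per-batch Jacobian
    end
    return J
end

u = rand(3, 5)
J = batched_jacobian_fd(f, u)
```

For this `f`, each block should be approximately diagonal with entries `2 .* u[:, k]`.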

avik-pal force-pushed the ap/basic_impl branch 2 times, most recently from ce02274 to ecd53e7 Compare November 18, 2023 08:18

avik-pal force-pushed the ap/basic_impl branch 2 times, most recently from a8477e1 to 95cbb1c Compare November 21, 2023 01:09

avik-pal added 16 commits November 20, 2023 21:12

Initial Structure

88dba1d

Add LU and QR Factorizations

64baebf

Add some of the GPU implementations

6104301

Make broadcasting type stable

52bd13a

setindex fix

a58f44b

Moving over from NNlib

468c810

Support ForwardDiff.jl

746feae

Some progress on batched matmul

de1818e

Finish a version of the matmul implementation

fd1dcc1

NewtonRaphson is working 🎉

2576321

Finalize the matrix multiplication

ed3a356

CUDA LU + Fixed Broadcasting

9b69088

Add more matmul routines

aaa4eca

Add a note

bfc0236

Cuda LU use batched kernels

7d6c14e

Add some tests

3d191fd

avik-pal force-pushed the ap/basic_impl branch 3 times, most recently from 64b5930 to 6704c42 Compare November 21, 2023 02:35

Resolve Ambiguities

bbeadbe

avik-pal force-pushed the ap/basic_impl branch from 6704c42 to bbeadbe Compare November 21, 2023 02:37

avik-pal added 3 commits November 20, 2023 22:05

Add CUDA support for long rectangular matrices

d4e27d4

Proper QR batched solve

bbe90a6

Reuse more code

e24bb6b

Make the common solvers work

305c4bd

avik-pal added 6 commits November 27, 2023 23:01

Make Halley and Klement work

f026f9d

Create a special BatchedScalar type

d1dcb9f

Formatting

f91af02

More consistent BatchedArray --> BatchedScalar

4b32448

Modify the semantics of Conditionals

09dde11

Resolve cholesky ambiguity

1ff2549

avik-pal closed this Mar 13, 2024
