[CI] Update meson.yml #180

amontoison · 2024-01-11T02:05:19Z

No description provided.

amontoison · 2024-01-11T02:35:52Z

@mjacobse
Can you help us to fix this error?
https://github.com/ralna/spral/actions/runs/7483445621/job/20368703491?pr=180#step:10:320

jfowkes · 2024-01-11T09:44:28Z

@amontoison I would suggest that you do what the Makefile build system currently does and only use nvcc/nvc++ for building the CUDA sources as opposed to all of SPRAL.

mjacobse · 2024-01-11T10:00:44Z

We are talking about

/opt/nvidia/hpc_sdk/Linux_x86_64/23.11/compilers/share/llvm/bin/llc: error: /opt/nvidia/hpc_sdk/Linux_x86_64/23.11/compilers/share/llvm/bin/llc: /tmp/nvc++ew2moH1KAcc.ll:10314:148: error: use of undefined value '%dblk'
        invoke void  @_ZN5spral5ssids3cpu17ldlt_app_internal5BlockIdLi32ENS1_14BuddyAllocatorIiSaIdEEEE6backupINS2_10CopyBackupIdNS4_IdS5_EEEEEEvRT_ (ptr %dblk, ptr  %22) mustprogress

right? This looks like a compiler bug to me. The error is raised by llc which compiles LLVMs intermediate representation to assembly, so long after dealing with the C++ code itself. Since the error appears to be about the hidden this argument for the member function Block<...>::backup<...>, perhaps using a free-function instead would be a workaround. But ideally I think this should be fixed in the compiler.

jfowkes · 2024-01-11T10:04:32Z

Thanks @mjacobse, agreed let's just use nvcc for the CUDA until this gets fixed.

jfowkes · 2024-01-11T16:50:43Z

Further to the above, Nick gets an NVFORTRAN-F-0000-Internal compiler error when compiling src/ssids/gpu/factor.f90. Googling this suggests that Nvidia have quite a lot of work to do to bring nvfortran up to spec (it's based on the old pgfortran compiler).

amontoison · 2024-01-11T18:45:55Z

We are talking about
/opt/nvidia/hpc_sdk/Linux_x86_64/23.11/compilers/share/llvm/bin/llc: error: /opt/nvidia/hpc_sdk/Linux_x86_64/23.11/compilers/share/llvm/bin/llc: /tmp/nvc++ew2moH1KAcc.ll:10314:148: error: use of undefined value '%dblk'
        invoke void  @_ZN5spral5ssids3cpu17ldlt_app_internal5BlockIdLi32ENS1_14BuddyAllocatorIiSaIdEEEE6backupINS2_10CopyBackupIdNS4_IdS5_EEEEEEvRT_ (ptr %dblk, ptr  %22) mustprogress
right? This looks like a compiler bug to me. The error is raised by llc which compiles LLVMs intermediate representation to assembly, so long after dealing with the C++ code itself. Since the error appears to be about the hidden this argument for the member function Block<...>::backup<...>, perhaps using a free-function instead would be a workaround. But ideally I think this should be fixed in the compiler.

We also have an internal error in the same file for the icpc compiler on all platform.
icpx will not be released on Mac so it will never be fixed in the compiler.
If it's not a lot of work we should try to use the free-function.

jfowkes · 2024-01-12T08:20:48Z

Again that's an issue with icpc which has been deprecated and dropped from all the latest Intel compiler releases. There is no point doing lots of work just to support a deprecated compiler. Intel is no longer an option on macs unfortunately.

amontoison · 2024-01-12T20:04:48Z

@jfowkes I commented the failing platforms for CI.
If one day something is fixed upstream, we will be able to easily test the new version of the compilers.

I also added the OpenMP flag -mp for NVIDIA compilers in the main meson.build.

jfowkes · 2024-01-15T09:17:53Z

@amontoison looks great, are you happy for this to be merged? I am.

amontoison · 2024-01-15T16:36:50Z

@jfowkes Yes, we can merge the PR.

jfowkes · 2024-01-15T16:47:52Z

@amontoison great could you rebase master onto your branch just so we can see if it works with the other changes that you've made to meson.build in the other PR?

amontoison · 2024-01-15T17:46:23Z

done 👍

amontoison assigned jfowkes Jan 11, 2024

amontoison force-pushed the nvidia-hpc branch from d35b451 to 10b38c1 Compare January 11, 2024 02:43

jfowkes added the ci label Jan 11, 2024

jfowkes assigned amontoison Jan 11, 2024

jfowkes self-requested a review January 11, 2024 09:45

amontoison force-pushed the nvidia-hpc branch from 57fe0d9 to 6768690 Compare January 12, 2024 06:27

amontoison mentioned this pull request Jan 12, 2024

Update continuous integration ralna/GALAHAD#224

Merged

amontoison force-pushed the nvidia-hpc branch 3 times, most recently from b68450e to 60b5817 Compare January 12, 2024 20:02

amontoison changed the title ~~[CI] Add NVIDIA-HPC compilers~~ [CI] Update meson.yml Jan 12, 2024

jfowkes added build-system and removed ci labels Jan 15, 2024

jfowkes removed their assignment Jan 15, 2024

[CI] Update meson.yml

78691a2

amontoison force-pushed the nvidia-hpc branch from 60b5817 to 78691a2 Compare January 15, 2024 17:30

jfowkes approved these changes Jan 16, 2024

View reviewed changes

jfowkes merged commit 8187c9e into ralna:master Jan 16, 2024
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI] Update meson.yml #180

[CI] Update meson.yml #180

amontoison commented Jan 11, 2024

amontoison commented Jan 11, 2024

jfowkes commented Jan 11, 2024

mjacobse commented Jan 11, 2024

jfowkes commented Jan 11, 2024

jfowkes commented Jan 11, 2024

amontoison commented Jan 11, 2024 •

edited

Loading

jfowkes commented Jan 12, 2024

amontoison commented Jan 12, 2024

jfowkes commented Jan 15, 2024

amontoison commented Jan 15, 2024

jfowkes commented Jan 15, 2024 •

edited

Loading

amontoison commented Jan 15, 2024

[CI] Update meson.yml #180

[CI] Update meson.yml #180

Conversation

amontoison commented Jan 11, 2024

amontoison commented Jan 11, 2024

jfowkes commented Jan 11, 2024

mjacobse commented Jan 11, 2024

jfowkes commented Jan 11, 2024

jfowkes commented Jan 11, 2024

amontoison commented Jan 11, 2024 • edited Loading

jfowkes commented Jan 12, 2024

amontoison commented Jan 12, 2024

jfowkes commented Jan 15, 2024

amontoison commented Jan 15, 2024

jfowkes commented Jan 15, 2024 • edited Loading

amontoison commented Jan 15, 2024

amontoison commented Jan 11, 2024 •

edited

Loading

jfowkes commented Jan 15, 2024 •

edited

Loading