
Kokkos interface #4

Open — wants to merge 17 commits into base: metatensor

Conversation

@frostedoyster frostedoyster commented Oct 24, 2024

This PR implements an experimental kokkos-metatensor interface.

Most of the accelerated code is written in torch rather than Kokkos, since torch wraps specialized CUDA kernels that are, most often, faster than the code generated by Kokkos.
Only the version with neighbor remapping is implemented (for now): the interface itself is relatively fast, so in most cases skipping the remapping would slow down the model far more than the remapping slows down the interface.

The exact consistency among the original interface, the Kokkos::Cuda interface and the Kokkos::OpenMP interface can be verified by running the metatensor and metatensor-kokkos examples.

I also added a README in the examples folder to document the whole build-and-run procedure. I think this would be a very good place to point people to so they can compile and run the code; we can remove it again in the future if needed.

I tried to format with clang-format, but I got way too many changes and I think I'm doing something wrong, so I didn't format in the end.

@frostedoyster frostedoyster marked this pull request as ready for review October 24, 2024 11:57
@Luthaf Luthaf (Member) left a comment


There is a lot of duplicated code with the non-kokkos version, which makes it harder to see what is actually different. I agree with you that we don't want to inherit from the standard pair_style directly, but it might make sense to have a separate base class that handles all the common things, leaving only the kokkos-specific parts (load data from kokkos, create the systems, store data back in kokkos) as custom code.

I can try to give this a go myself!


Regarding the device, I did not find code that checks that the kokkos device matches the torch one. What should we do if they don't match? Currently the code will crash or segfault, since it uses kokkos pointers as device pointers without checking.

cmake/Modules/Packages/ML-METATENSOR.cmake (outdated, resolved)
examples/PACKAGES/metatensor/in.metatensor (outdated, resolved)
examples/PACKAGES/metatensor/in.metatensor (outdated, resolved)
examples/PACKAGES/metatensor/nickel-lj.pt (outdated, resolved)
Comment on lines +31 to +80
template<class DeviceType>
struct MetatensorSystemOptionsKokkos {
// Mapping from LAMMPS types to metatensor types
const int32_t* types_mapping;
const Kokkos::View<int32_t*, Kokkos::LayoutRight, DeviceType> types_mapping_kokkos;
// interaction range of the model, in LAMMPS units
double interaction_range;
// should we run extra checks on the neighbor lists?
bool check_consistency;
};

// data for metatensor neighbors lists
template<class DeviceType>
struct MetatensorNeighborsDataKokkos {
// single neighbors sample containing [i, j, S_a, S_b, S_c]
using sample_t = std::array<int32_t, 5>;

struct SampleHasher {
static void hash_combine(std::size_t& seed, const int32_t& v) {
seed ^= std::hash<int32_t>()(v) + 0x9e3779b9 + (seed<<6) + (seed>>2);
}

size_t operator()(const sample_t& s) const {
size_t hash = 0;
hash_combine(hash, s[0]);
hash_combine(hash, s[1]);
hash_combine(hash, s[2]);
hash_combine(hash, s[3]);
hash_combine(hash, s[4]);
return hash;
}
};

// cutoff for this NL in LAMMPS units
double cutoff;
// options of the NL as requested by the model
metatensor_torch::NeighborListOptions options;

// Below are cached allocations for the LAMMPS -> metatensor NL translation
// TODO: report memory usage for these?

// we keep the set of samples twice: once in `known_samples` to remove
// duplicated pairs, and once in `samples` in a format that can be
// used to create a torch::Tensor.
std::unordered_set<sample_t, SampleHasher> known_samples;
std::vector<sample_t> samples;
// pairs distances vectors
std::vector<std::array<double, 3>> distances_f64;
std::vector<std::array<float, 3>> distances_f32;
};
Member

This looks identical to the non-kokkos one, why is it duplicated here?

@frostedoyster frostedoyster (Author) Oct 30, 2024

True for the second class in this snippet, but not for the first one. I think this is one of the very few parts of the code that can be re-used from the original interface, but I'll leave this re-organization up to you

src/KOKKOS/metatensor_system_kokkos.cpp (outdated, resolved)

NeighListKokkos<DeviceType>* list_kk = static_cast<NeighListKokkos<DeviceType>*>(this->list_);

auto numneigh_kk = list_kk->d_numneigh;
Member

are we sure this is always an int32_t, even when LAMMPS is compiled with different integer sizes? https://docs.lammps.org/Build_settings.html#size-of-lammps-integer-types-and-size-limits

@frostedoyster frostedoyster (Author) Oct 30, 2024

Yikes, you're right. Since we're converting these ints to torch, either we make up a type-conversion system or we error out if someone tries to compile these files while LAMMPS's integers are not int32. I would go for the second option.


// distance mask
auto interatomic_vectors = positions_tensor.index_select(0, neighbors_tensor_full_or_half) - positions_tensor.index_select(0, centers_tensor_full_or_half);
auto distance_mask = torch::sum(interatomic_vectors.pow(2), 1) < cache.cutoff*cache.cutoff;
Member

torch.vector_norm might be a bit faster here

@frostedoyster frostedoyster (Author) Oct 30, 2024

I leave this up to you (I don't think this is a bottleneck for now)

src/KOKKOS/metatensor_system_kokkos.cpp (outdated, resolved)
atom_types_metatensor_kokkos.data(),
{total_n_atoms},
torch::TensorOptions().dtype(torch::kInt32).device(device)
).clone(); // clone because the original memory belongs to Kokkos and will be deallocated
Member

instead of the clone, we could use the trick of moving the data inside a lambda and registering the lambda as a custom Tensor destructor. If the copy is fast/small enough, then it might not be worth it

Author

This array should be slim enough that a copy is irrelevant, but it might be worth it somewhere else (the positions and/or force arrays, which are the largest ones we manipulate between torch and Kokkos). In general, however, my profiling shows that this sort of copy is very far down the list of things we should try to optimize

@Luthaf Luthaf (Member) commented Oct 30, 2024

I also added a readme in the examples folder to document the whole building and running procedure. I think this would be a very good place to point people to so they can compile and run the code. We can then remove it again in the future if needed

The documentation about how to compile the code lives here for now: https://docs.metatensor.org/latest/atomistic/engines/lammps.html#how-to-install-the-code

For the future merge of this back in LAMMPS, the documentation for compilation should be there: https://github.com/metatensor/lammps/blob/metatensor/doc/src/Build_extras.rst#ml-metatensor-package

Comment on lines 342 to 351
// Handle potential mismatch between Kokkos and model devices
if (std::is_same<DeviceType, Kokkos::Cuda>::value) {
if (!mts_data->device.is_cuda()) {
throw std::runtime_error("Kokkos is running on a GPU, but the model is not on a GPU");
}
} else {
if (!mts_data->device.is_cpu()) {
throw std::runtime_error("Kokkos is running on CPU, but the model is not on CPU");
}
}
Author

This is for the device check

Member

Ah, thanks!

Should we also change the default device? I.e., if the user does not give a device, pick the same one kokkos is using instead of the first one in the model capabilities?

Also, this check would fail for alternative kokkos devices (HIP, OpenCL, …), so I would rather check explicitly for the CPU device instead of defaulting to it.

Author

I agree with the second part (fixed in the latest commit).

Regarding the first part, I think users of this interface will be relatively advanced: they should be able to understand the error message and pick the correct device, and this lets us re-use the device selection code from the other interface. That said, if you want to change the device selection code instead of sharing it with the base interface, I have no objections

@frostedoyster frostedoyster (Author)

The review items should be fixed, except for those related to code sharing between the two interfaces. I'd rather leave that up to you, because you know best what kind of code organization would be acceptable in the LAMMPS main repository
