Interleaved computation with communication in halo exchange #881

lslusarczyk · 2025-01-21T05:28:10Z

No description provided.

… into count

Count

lslusarczyk

Cyclic halos seems to be right decision. Two backends - not. Don't we just need a flag in segments (as you did)? Twoi backends seems to be not needed change.

Cyclic halos + special for_each (as you did) should be enough I think.

lslusarczyk · 2025-01-21T05:34:10Z

include/dr/mp/algorithms/for_each.hpp

+    return;
+  }
+
+  assert(aligned(dr));


makes no sense checking this for one range, not aligned can be two or more ranges, one is always aligned

lslusarczyk · 2025-01-21T05:36:10Z

include/dr/mp/algorithms/for_each.hpp

+
+  assert(aligned(dr));
+
+  for (auto &s : dr::ranges::segments(dr)) {


probably you wanted local_segments

lslusarczyk · 2025-01-21T05:37:57Z

include/dr/mp/containers/distributed_vector.hpp

-  void fence() { backend.fence(); }
+  void fence() { backend_.fence(); }
+
+  backend_type& backend(const std::size_t segment_index) { return backend_; }


add __attribute__((unused)) to these functoins, but I'm unsure this function is really needed

lslusarczyk · 2025-01-21T05:40:26Z

include/dr/mp/containers/dual_distributed_vector.hpp

+
+static constexpr std::size_t DUAL_SEGMENTS_PER_PROC = 2;
+
+class DualMpiBackend {


what is the difference between DualMPiBackend and MpiBackend types? if none, please use one type

It's a leftover from when I was experimenting with changing some code in the backend, thanks for pointing it out

lslusarczyk · 2025-01-21T05:54:10Z

include/dr/mp/containers/dual_distributed_vector.hpp

+
+  distribution distribution_;
+  std::size_t size_;
+  std::vector<dual_dv_segment<dual_distributed_vector>> segments_;


if these 2 lines are the only differences between ordinary and dual vector, then let's pass different template parameters and make segment type a template paramter

init() is different, also there are multiple pointers to local segments (std::vector<T *> datas_) and there are additional member functions that return the local segment currently suited for computation. Although I agree that the two classes might be merged, as long as it's still not fully developed I'd rather keep it split and merge them a bit later when it fully works.

lslusarczyk · 2025-01-21T05:58:33Z

include/dr/mp/containers/dual_distributed_vector.hpp

+    for (std::size_t i = 0; i < DUAL_SEGMENTS_PER_PROC; i++) {
+      if (size_ > 0) {
+        datas_.push_back(static_cast<T *>(
+          backends_[i].allocate(data_size_ * sizeof(value_type))));


do I understand correctly that you allocate twice as many memory as normal distributed_vector of the same size?
if so - this is unacceptable

I think this isn't the case -- note the added multiplication by DUAL_SEGMENTS_PER_PROC:

std::size_t segment_count = comm_size * DUAL_SEGMENTS_PER_PROC;

This results in segment_size_ being smaller, so the memory footprint should only come from more halos.

Filip Głębocki and others added 30 commits July 31, 2024 10:36

added count to mhp algorithms

25243a9

Merge branch 'main' of https://github.com/oneapi-src/distributed-ranges…

c950ea3

… into count

minor fix

7eec868

minor fixes

6090fdc

code review fixes

167702d

more code review fixes

755c896

removed redundant conditional

e98de3b

fixes according to pre-commit checks

f31b80c

Merge branch 'main' of https://github.com/oneapi-src/distributed-ranges…

b847d04

… into count

Merge branch 'main' of https://github.com/oneapi-src/distributed-ranges…

06cc78b

… into count

added cyclic_halo_impl and distributed_vector_dual

5511751

Merge pull request #1 from quazuo/count

b26665e

Count

added dual_segment and refined dual_distributed_vector

e1e9910

progress

bdecda7

Merge remote-tracking branch 'upstream/main'

9e81fd7

tiny fix

811307c

Merge branch 'main' of https://github.com/quazuo/distributed-ranges

a5fdcf5

prog

5353689

prog

6701c41

prog

f79fe45

prog

b126f8b

prog

e1d50c9

prog

0da1f2c

prog

871bd58

prog

2e3c96d

prog

dff502a

prog

89f9c18

prog

2e4cc88

prog

9c37e12

prog

78cbd29

Filip Głębocki added 28 commits December 27, 2024 16:45

prog

3d7a9a8

prog

2603a6c

prog

cdce405

prog

46f6ade

prog

e4eafa2

prog

ad460d8

prog

bd1e8ed

prog

8d0f5be

prog

046b7e4

prog

5c32e9e

prog

6e95e27

prog

46dff8e

prog

b842cc7

prog

d43b9d3

prog

1b6b21b

prog

d333564

prog

8690c17

prog

3860947

prog

6dcd2f4

prog

df55977

prog

4b0f293

prog

0fe709c

prog

05fa8f6

prog

0d94948

prog

33a1d4f

prog

8dd8a00

prog

d04461d

prog

497eb8c

lslusarczyk commented Jan 21, 2025

View reviewed changes

prog

29759f1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interleaved computation with communication in halo exchange #881

Interleaved computation with communication in halo exchange #881

lslusarczyk commented Jan 21, 2025

lslusarczyk left a comment

lslusarczyk Jan 21, 2025

lslusarczyk Jan 21, 2025

lslusarczyk Jan 21, 2025

lslusarczyk Jan 21, 2025

quazuo Jan 21, 2025

lslusarczyk Jan 21, 2025

quazuo Jan 21, 2025

lslusarczyk Jan 21, 2025

quazuo Jan 21, 2025


		assert(aligned(dr));

		for (auto &s : dr::ranges::segments(dr)) {


		static constexpr std::size_t DUAL_SEGMENTS_PER_PROC = 2;

		class DualMpiBackend {

Interleaved computation with communication in halo exchange #881

Are you sure you want to change the base?

Interleaved computation with communication in halo exchange #881

Conversation

lslusarczyk commented Jan 21, 2025

lslusarczyk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment