
Concurrent execution of workflow steps #544

Merged
159 commits merged into develop from parallel-steps on Dec 11, 2023
Conversation

calum-chamberlain
Member

@calum-chamberlain calum-chamberlain commented Apr 12, 2023

What does this PR do?

This PR implements concurrent processing of the main steps in the .detect workflow. Alongside this there are several other major changes including moving all the matched-filter components out of core.matched_filter.matched_filter to focus on the tribe.detect method. The matched_filter function has been retained, but it simply creates a Tribe internally and runs the tribe.detect method.

Fundamentally this PR breaks the workflow down into the following steps:
  0. Downloading data (if running .client_detect, which I recommend)
  1. Processing data (now possible in parallel from a Process thanks to #540, Convert preprocessing functions to multithreaded with GIL-released)
  2. Prepare data for correlation
  3. Correlate and detect peaks
  4. Convert peaks to Detections
  5. Convert detections to Party
  6. Collect all parties into a final Party

Each step as listed runs in its own process, except steps 0 and 1, which run together when using .client_detect (step 0 is not run when using .detect). Communication between steps is via multiprocessing Queues: each step waits until data are available on its incoming queue, works on those data, then puts the result on its output queue, which forms the input queue for the following step. Steps loop until they get None from the input queue, which signifies the end of processing.
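The chained-queue pattern described above can be sketched generically as follows. This is a simplified illustration, not EQcorrscan code: it uses threading and queue.Queue for brevity, whereas the PR uses multiprocessing Processes and Queues, but the loop-until-None sentinel logic is the same.

```python
import queue
import threading


def stage(work, input_queue, output_queue):
    """Generic pipeline step: consume from the input queue until the None
    sentinel arrives, then pass the sentinel on so downstream steps also
    terminate."""
    while True:
        item = input_queue.get()
        if item is None:
            output_queue.put(None)  # Propagate the end-of-processing signal
            break
        output_queue.put(work(item))


q_in, q_mid, q_out = queue.Queue(), queue.Queue(), queue.Queue()
# Two chained steps standing in for e.g. "process data" then "correlate"
threading.Thread(target=stage, args=(lambda x: x * 2, q_in, q_mid)).start()
threading.Thread(target=stage, args=(lambda x: x + 1, q_mid, q_out)).start()

for item in [1, 2, 3]:
    q_in.put(item)
q_in.put(None)  # End of data

results = []
while (item := q_out.get()) is not None:
    results.append(item)
print(results)  # [3, 5, 7]
```

Because each stage blocks on get(), bounded queues (maxsize) also provide natural back-pressure between steps.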

.detect supports passing a queue as input, or a Stream. This means that if users do not want to use a client to get their data, they can provide a simple queue that might look something like:

import glob

from typing import Iterable
from multiprocessing import Queue, Process

from obspy import read

from eqcorrscan import Tribe


def reader(files: Iterable, output_queue: Queue):
    for file in files:
        output_queue.put(read(file))
    # Signal the end of the data with the None sentinel
    output_queue.put(None)
    return


def main():
    tribe = Tribe().read("Some_tribe.tgz")
    files = glob.glob("somewhere-with-some-waveforms")
    files.sort()
    stream_queue = Queue(maxsize=1)
    stream_getter = Process(
        target=reader, kwargs={"files": files, "output_queue": stream_queue})

    stream_getter.start()
    # Pass the queue itself to .detect
    party = tribe.detect(st=stream_queue, ...)
    stream_getter.join()

As currently implemented, only step 3 (correlate and detect peaks) runs in the MainProcess. This enables that step to make use of multiprocessing if needed, and provides more safety for openmp threading in the underlying C functions. I attempted to move peak finding into its own Process, but found that putting the large arrays of cross-correlation sums into the Queue was prohibitively slow, with a strong risk of exceeding the queue size. I also tried saving these to disk and reading them back in another process, but again, saving them was too slow.
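The queue bottleneck comes from serialisation: every object put on a multiprocessing Queue is pickled, piped, and unpickled, so a large cross-correlation array incurs a full copy per transfer. A rough way to see the cost (the array shape here is purely illustrative, not what EQcorrscan produces):

```python
import pickle
import time

import numpy as np

# Illustrative size only: 100 "templates" x 86400 "samples", float32
ccc = np.random.randn(100, 86_400).astype(np.float32)

tic = time.perf_counter()
payload = pickle.dumps(ccc, protocol=pickle.HIGHEST_PROTOCOL)
toc = time.perf_counter()
# Everything put on a multiprocessing.Queue pays (at least) this
# serialisation cost, plus a pipe write and a matching unpickle in the
# consumer; keeping the correlation sums in the MainProcess avoids the
# transfer entirely.
print(f"Pickled {ccc.nbytes / 1e6:.0f} MB in {toc - tic:.3f} s")
```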

Minor changes:

  • _spike_test is now threaded for speed.
  • Detection objects provide a more helpful error when the correlation value exceeds the number of channels. I have only ever seen this happen for fmf correlations.
  • Party addition has been sped up.
  • multi_find_peaks now uses multithreading rather than openmp C threading. I found this was faster and safer on my machine (see #543, pytest fixtures in correlate_test alter omp threads).
  • Many minor structural changes to the correlation functions to allow data to be prepared in advance in the form the correlation functions require.
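The multithreaded peak-finding approach can be sketched as mapping a per-array peak finder over a thread pool. This is a simplified stand-in, not the real multi_find_peaks implementation; find_peaks_simple here is a toy helper. Because NumPy releases the GIL during array operations, the threads genuinely run concurrently.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def find_peaks_simple(arr: np.ndarray, threshold: float) -> np.ndarray:
    """Toy peak finder: indices of strict local maxima above threshold.
    NumPy does the heavy lifting with the GIL released, so threads scale."""
    above = arr > threshold
    local_max = np.r_[
        False, (arr[1:-1] > arr[:-2]) & (arr[1:-1] > arr[2:]), False]
    return np.flatnonzero(above & local_max)


# One array per "channel"; each sine has five crests above the threshold
arrays = [np.sin(np.linspace(0, 10 * np.pi, 10_000)) for _ in range(8)]
with ThreadPoolExecutor(max_workers=4) as executor:
    all_peaks = list(executor.map(
        lambda a: find_peaks_simple(a, threshold=0.5), arrays))
print([len(p) for p in all_peaks])
```

Thread pools also avoid the pickling cost of multiprocessing discussed above, since the worker threads share memory with the caller.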

Why was it initiated? Any relevant Issues?

Concurrent processing should enable us to more heavily load systems, which should please HPC admins... This PR speeds up EQcorrscan for large datasets, and should enable much better GPU utilisation when using the FMF backend.

PR Checklist

  • develop base branch selected?
  • This PR is not directly related to an existing issue (which has no PR yet).
  • All tests still pass.
  • Any new features or fixed regressions are covered by new tests.
  • Any new or changed features are fully documented.
  • Significant changes have been added to CHANGES.md.
  • First time contributors have added their name to CONTRIBUTORS.md.

TODO:

  • Benchmarks - compare to develop and master
  • Tutorial, particularly on making use of clients - highlight obsplus for local client emulation
  • Fix any bugs that crop up in more major testing on large scale, long-duration datasets (currently applying to 11k templates over 10 years).
  • Clean code:
    • Standardise queue naming (input_xx_queue, output_yy_queue)
    • docstrings
    • type-hints
    • clean logic and make sure comments are appropriate
    • Clean out stream pickle files once done (.streams/xxxx.pkl), but do not remove the .streams directory.
  • Meet coverage requirements.

@calum-chamberlain calum-chamberlain merged commit 7231e1a into develop Dec 11, 2023
18 of 20 checks passed
@calum-chamberlain calum-chamberlain deleted the parallel-steps branch December 11, 2023 03:25