
Onboard ML inference ingest processor #172

Merged

9 commits merged on Jun 11, 2024
Conversation

ohltyler
Member

@ohltyler commented Jun 10, 2024

Description

This PR onboards the generic ML inference ingest processor into the plugin. Specifically:

  • adds the MLInferenceProcessor interface used when constructing the workflow template
  • updates the MLTransformer under components/ (the interfaces for the ReactFlow drag-and-drop components)
  • adds and refactors configs under configs/ (the interfaces/classes used for the WorkflowInputs form) to support the ML inference processor. Specifies the base ml_processor config at the top-level, as the form inputs will be identical across the underlying ingest / search request / search response processors
  • adds a map config field type and associated interfaces to represent a map as a list of k/v pairs (note this is the format expected in the ml inference processors)
  • updates and simplifies functions in workflow_to_template_utils, and removes the logic around pretrained models and model provisioning, as that is out of scope for the initial release
  • updates the pre-defined semantic search template to use the generic ml processor instead of the explicit text embedding processor
  • adds a new MapField for processing a dynamic list of k/v pairs to represent a mapping - includes the components, Formik form integration, and yup schema validation
  • (unrelated change) updates the parsing logic of the search workflows response, since the original issue has been resolved; after rebasing, the old parsing logic was throwing NPE errors
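The map-as-list-of-k/v-pairs representation described above could be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: the type and function names (`MapEntry`, `mapEntriesToRecord`, `validateMapEntries`) are hypothetical, and the validation mirrors only what a yup schema for such a field might plausibly enforce (non-empty keys/values, unique keys).

```typescript
// Hypothetical shape for one entry in the dynamic k/v list rendered by a MapField.
interface MapEntry {
  key: string;
  value: string;
}

// Collapse the form's list-of-pairs representation into a flat map,
// e.g. for building a processor's input/output field mappings.
function mapEntriesToRecord(entries: MapEntry[]): Record<string, string> {
  return entries.reduce((acc, { key, value }) => {
    acc[key] = value;
    return acc;
  }, {} as Record<string, string>);
}

// Example validation a schema for this field might apply:
// every entry needs a non-empty key and value, and keys must be unique.
function validateMapEntries(entries: MapEntry[]): boolean {
  const keys = entries.map((e) => e.key);
  return (
    entries.every((e) => e.key.length > 0 && e.value.length > 0) &&
    new Set(keys).size === keys.length
  );
}
```

Keeping the list-of-pairs form in the config (rather than a plain object) preserves entry order and makes the Formik field-array integration straightforward; the flattening step only happens when constructing the final template.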

Demo video showing the dynamic list of mappings for the input and output maps, persistence in the config, form validation, and actual execution. (Errors during execution can be ignored for now; this work is in progress. I have confirmed that the ingest pipeline, the default pipeline on the index, and execution of the ingest are all working as expected, as the same errors are reproducible by performing ingest directly against the created index.)
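For context, an ingest pipeline built from this config might look roughly like the following. This is an illustrative sketch only: the model ID and field names are placeholders, and the exact input_map/output_map semantics should be checked against the OpenSearch ml_inference processor documentation rather than taken from here.

```json
{
  "processors": [
    {
      "ml_inference": {
        "model_id": "<deployed-model-id>",
        "input_map": [
          { "text_docs": "my_text_field" }
        ],
        "output_map": [
          { "my_embedding_field": "data.*.embedding" }
        ]
      }
    }
  ]
}
```

The same generic processor shape is what lets the form inputs stay identical across the ingest, search request, and search response variants mentioned above.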

screen-capture.39.webm

Issues Resolved

Makes progress on #23

Check List

  • Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@ohltyler ohltyler marked this pull request as ready for review June 11, 2024 21:49
@ohltyler ohltyler merged commit f077c82 into opensearch-project:main Jun 11, 2024
6 checks passed
@ohltyler ohltyler deleted the ml-inference-processor branch June 11, 2024 22:34
opensearch-trigger-bot bot pushed a commit that referenced this pull request Jun 11, 2024
Signed-off-by: Tyler Ohlsen <[email protected]>
(cherry picked from commit f077c82)
ohltyler added a commit that referenced this pull request Jun 11, 2024
Signed-off-by: Tyler Ohlsen <[email protected]>
(cherry picked from commit f077c82)

Co-authored-by: Tyler Ohlsen <[email protected]>