Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adds preset contentRegistry for IngestProcessors #3281

Merged

Conversation

brianf-aws
Copy link
Contributor

@brianf-aws brianf-aws commented Dec 16, 2024

Background

Currently local models that use the parameters map within the payload to create a request can not create objects to be used for local model prediction, when using the MLInferenceIngestProcessor.
e.g.

## This is a cutom Asymmetric model prediction this passes
POST {{ _.domain }}/_plugins/_ml/_predict/text_embedding/{{ _.model_id }}
{
  "parameters": {
    "content_type": "query"
  },
  "text_docs": ["What day is it today?"],
  "target_response": ["sentence_embedding"]
}

## Ingest Pipeline configuration (Using the MLInferenceIngest Processor) that fails to create the MLInput needed for model prediciton

PUT {{ _.domain }}/_ingest/pipeline/asymmetric_embedding_ingest_pipeline
{
	"description": "ingest passage text and generate a embedding using an asymmetric model",
	"processors": [
		{
			"ml_inference": {

				"model_input": "{\"text_docs\":[\"${input_map.text_docs}\"],\"target_response\":[\"sentence_embedding\"],\"parameters\":{\"content_type\":\"query\"}}",
				"function_name": "text_embedding",
				"model_id": "{{ _.model_id }}",
				"input_map": [
					{
						"text_docs": "description"
					}
				],
				"output_map": [
					{
						"fact_embedding": "$.inference_results.*.output.*.data",
						"embedding_size": "$.inference_results.*.output.*.shape[0]"
					}
				]
			}
		}
	]
}

This requires a opensearch core change because it needs the contentRegistry,however given there is not much dependency on the registry (currently) we can give it the preset registry given in the MachineLearningPlugin class via the getNamedXContent() class to temporarily unblock this use case while a OpenSearch Core fix gives a proper change.

Context

Please see this issue #3276.

Related Issues

Temporarily resolves #3276 while waiting for a OpenSearch core fix.

Check List

  • New functionality includes manual testing.
  • New functionality has been documented.
  • API changes companion pull request created.
  • Commits are signed per the DCO using --signoff.
  • Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

…d local models

Curently local models that use the parameters map within the payload to create a request can not create objects to be used for local model prediction. This requires a opensearch core change because it needs the contentRegistry,however given there is not much dependency on the registry (currently) we can give it the preset registry given in the MachineLearningPlugin class vai the getNamedXContent() class

Signed-off-by: Brian Flores <[email protected]>
@austintlee
Copy link
Collaborator

Can you create a test (maybe an IT test) that fails w/o this fix?

@brianf-aws
Copy link
Contributor Author

brianf-aws commented Dec 17, 2024

Can you create a test (maybe an IT test) that fails w/o this fix?

Hey @austintlee, Im a bit confused do you mean create a test that shows this change works? If I create a test that fails. i.e. one that uses the parameters: { ... } it will pass since it now can create the MLInput properly. Also Im thinking if we should actually use the asymmetric model as IT its 283 mb

Instead of a IT i added a UT, as the crux of the problem is in the ingestProcessor not necessarily specific models. I hope you can understand my judgment

@brianf-aws brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval December 17, 2024 21:34 — with GitHub Actions Failure
@brianf-aws brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval December 17, 2024 21:34 — with GitHub Actions Failure
Signed-off-by: Brian Flores <[email protected]>
@brianf-aws brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval December 17, 2024 21:41 — with GitHub Actions Failure
@brianf-aws brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval December 17, 2024 21:41 — with GitHub Actions Failure
@mingshl
Copy link
Collaborator

mingshl commented Dec 17, 2024

@brianf-aws can you look up the usage of the parameters field and check what are the local model types that are impacted by this change? we know that asymmetric model type is impacted, wondering what other model types are impacted?

then we can test the changes to all impacted local model scenarios.

@mingshl
Copy link
Collaborator

mingshl commented Dec 17, 2024

I added the backport labels to make sure this bug fix can fix all the way back to when the local model is supported in ingest processors on June 11, #2508 in 2.15, please make sure the bwc tests passed in the backport PRs.

Also, would be nice if a second set of eyes can help double check the versions @brianf-aws https://github.com/opensearch-project/ml-commons/commits/2.15

@brianf-aws
Copy link
Contributor Author

@brianf-aws can you look up the usage of the parameters field and check what are the local model types that are impacted by this change? we know that asymmetric model type is impacted, wondering what other model types are impacted?

Good call out Im seeing a pattern where the machineLearning plugin gets invoked by the Node class and gets getNamedXContent() e.g. here.

        NamedXContentRegistry xContentRegistry = new NamedXContentRegistry(
            Stream.of(
                NetworkModule.getNamedXContents().stream(),
                IndicesModule.getNamedXContents().stream(),
                searchModule.getNamedXContents().stream(),
                pluginsService.filterPlugins(Plugin.class).stream().flatMap(p -> p.getNamedXContent().stream()),
                ClusterModule.getNamedXWriteables().stream()
            ).flatMap(Function.identity()).collect(toList())
        );

So it will call the registries as specified here

public List<NamedXContentRegistry.Entry> getNamedXContent() {
return ImmutableList
.of(
KMeansParams.XCONTENT_REGISTRY,
LinearRegressionParams.XCONTENT_REGISTRY,
AnomalyDetectionLibSVMParams.XCONTENT_REGISTRY,
SampleAlgoParams.XCONTENT_REGISTRY,
FitRCFParams.XCONTENT_REGISTRY,
BatchRCFParams.XCONTENT_REGISTRY,
LocalSampleCalculatorInput.XCONTENT_REGISTRY,
MetricsCorrelationInput.XCONTENT_REGISTRY,
AnomalyLocalizationInput.XCONTENT_REGISTRY_ENTRY,
RCFSummarizeParams.XCONTENT_REGISTRY,
LogisticRegressionParams.XCONTENT_REGISTRY,
TextEmbeddingModelConfig.XCONTENT_REGISTRY,
AsymmetricTextEmbeddingParameters.XCONTENT_REGISTRY
);
}

If you click through every registry it will invoke a method that parses and collects fields from the parameter map.

Here is an example for k-means [Albeit not what we think of when we think of a model], What this means is that the ML Processor has the ability to run any models.

while (parser.nextToken() != XContentParser.Token.END_OBJECT) {
String fieldName = parser.currentName();
parser.nextToken();
switch (fieldName) {
case CENTROIDS_FIELD:
k = parser.intValue(false);
break;
case ITERATIONS_FIELD:
iterations = parser.intValue(false);
break;
case DISTANCE_TYPE_FIELD:
distanceType = DistanceType.from(parser.text());
break;
default:
parser.skipChildren();
break;
}
}

@dhrubo-os
Copy link
Collaborator

RestBedRockInferenceIT > test_bedrock_multimodal_model FAILED
    org.opensearch.client.ResponseException: method [POST], host [http://127.0.0.1:39501/], URI [/_plugins/_ml/models/_register], status line [HTTP/1.1 500 Internal Server Error]
    {"error":{"root_cause":[{"type":"null_pointer_exception","reason":"Cannot invoke \"org.opensearch.cluster.metadata.MappingMetadata.getSourceAsMap()\" because the return value of \"org.opensearch.cluster.metadata.IndexMetadata.mapping()\" is null"}],"type":"null_pointer_exception","reason":"Cannot invoke \"org.opensearch.cluster.metadata.MappingMetadata.getSourceAsMap()\" because the return value of \"org.opensearch.cluster.metadata.IndexMetadata.mapping()\" is null"},"status":500}

@brianf-aws brianf-aws temporarily deployed to ml-commons-cicd-env-require-approval December 17, 2024 23:49 — with GitHub Actions Inactive
@brianf-aws brianf-aws had a problem deploying to ml-commons-cicd-env-require-approval December 18, 2024 00:32 — with GitHub Actions Failure
false,
localModelInput
);
try {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

// Act & Assert: Verify NullPointerException and its message
    NullPointerException exception = assertThrows(
        NullPointerException.class,
        () -> processor.execute(ingestDocument, handler),
        "Expected NullPointerException due to null xContentRegistry"
    );

    assertTrue(exception.getMessage().contains("Cannot invoke"),
               "Exception message should indicate a failure due to null mlInput");

What do you think about this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like this, but the problem is that the exception is passed by the handler its not done by the method itself. So this wouldn't be possible thats the reason why this class and more specifically this method has a catch to make sure that an exception is not possible. i.e. the handler passes an exception only.

…ssor

We needed to make sure that a IT existed so that the preset content registry on the processor could work with parametized local models. By providing an IT that uses the asymetric embedding model its proven that the content registry is needed to create the embeddings. In this specific test case I used a ingest pipeline to convert passage embeddings, by simulating the pipeline to save test time.

Signed-off-by: Brian Flores <[email protected]>
@brianf-aws brianf-aws temporarily deployed to ml-commons-cicd-env-require-approval December 23, 2024 19:28 — with GitHub Actions Inactive
@brianf-aws brianf-aws temporarily deployed to ml-commons-cicd-env-require-approval December 23, 2024 19:28 — with GitHub Actions Inactive
@brianf-aws brianf-aws temporarily deployed to ml-commons-cicd-env-require-approval December 23, 2024 21:05 — with GitHub Actions Inactive
@ylwu-amzn ylwu-amzn merged commit df1b1ef into opensearch-project:main Dec 31, 2024
7 checks passed
@opensearch-trigger-bot
Copy link
Contributor

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.x 2.x
# Navigate to the new working tree
cd .worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-3281-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 df1b1ef19856e36e71af3481eaabfec1a33c98cc
# Push it to GitHub
git push --set-upstream origin backport/backport-3281-to-2.x
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-3281-to-2.x.

@opensearch-trigger-bot
Copy link
Contributor

The backport to 2.15 failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.15 2.15
# Navigate to the new working tree
cd .worktrees/backport-2.15
# Create a new branch
git switch --create backport/backport-3281-to-2.15
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 df1b1ef19856e36e71af3481eaabfec1a33c98cc
# Push it to GitHub
git push --set-upstream origin backport/backport-3281-to-2.15
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.15

Then, create a pull request where the base branch is 2.15 and the compare/head branch is backport/backport-3281-to-2.15.

@opensearch-trigger-bot
Copy link
Contributor

The backport to 2.16 failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.16 2.16
# Navigate to the new working tree
cd .worktrees/backport-2.16
# Create a new branch
git switch --create backport/backport-3281-to-2.16
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 df1b1ef19856e36e71af3481eaabfec1a33c98cc
# Push it to GitHub
git push --set-upstream origin backport/backport-3281-to-2.16
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.16

Then, create a pull request where the base branch is 2.16 and the compare/head branch is backport/backport-3281-to-2.16.

@opensearch-trigger-bot
Copy link
Contributor

The backport to 2.17 failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.17 2.17
# Navigate to the new working tree
cd .worktrees/backport-2.17
# Create a new branch
git switch --create backport/backport-3281-to-2.17
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 df1b1ef19856e36e71af3481eaabfec1a33c98cc
# Push it to GitHub
git push --set-upstream origin backport/backport-3281-to-2.17
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.17

Then, create a pull request where the base branch is 2.17 and the compare/head branch is backport/backport-3281-to-2.17.

@opensearch-trigger-bot
Copy link
Contributor

The backport to 2.18 failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.18 2.18
# Navigate to the new working tree
cd .worktrees/backport-2.18
# Create a new branch
git switch --create backport/backport-3281-to-2.18
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 df1b1ef19856e36e71af3481eaabfec1a33c98cc
# Push it to GitHub
git push --set-upstream origin backport/backport-3281-to-2.18
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.18

Then, create a pull request where the base branch is 2.18 and the compare/head branch is backport/backport-3281-to-2.18.

brianf-aws added a commit to brianf-aws/ml-commons that referenced this pull request Dec 31, 2024
…3281)

* add preset xContentRegistry to ingestProcessors for custom parametized local models

Curently local models that use the parameters map within the payload to create a request can not create objects to be used for local model prediction. This requires a opensearch core change because it needs the contentRegistry,however given there is not much dependency on the registry (currently) we can give it the preset registry given in the MachineLearningPlugin class vai the getNamedXContent() class

Signed-off-by: Brian Flores <[email protected]>

* Adds UT for proving models depend on xContentRegistry for prediction

Signed-off-by: Brian Flores <[email protected]>

* apply spotless

Signed-off-by: Brian Flores <[email protected]>

* Adds IT for Asymmetric Embedding scenario with MLInferenceIngestProcessor

We needed to make sure that a IT existed so that the preset content registry on the processor could work with parametized local models. By providing an IT that uses the asymetric embedding model its proven that the content registry is needed to create the embeddings. In this specific test case I used a ingest pipeline to convert passage embeddings, by simulating the pipeline to save test time.

Signed-off-by: Brian Flores <[email protected]>

---------

Signed-off-by: Brian Flores <[email protected]>
(cherry picked from commit df1b1ef)
brianf-aws added a commit to brianf-aws/ml-commons that referenced this pull request Dec 31, 2024
…3281)

* add preset xContentRegistry to ingestProcessors for custom parametized local models

Curently local models that use the parameters map within the payload to create a request can not create objects to be used for local model prediction. This requires a opensearch core change because it needs the contentRegistry,however given there is not much dependency on the registry (currently) we can give it the preset registry given in the MachineLearningPlugin class vai the getNamedXContent() class

Signed-off-by: Brian Flores <[email protected]>

* Adds UT for proving models depend on xContentRegistry for prediction

Signed-off-by: Brian Flores <[email protected]>

* apply spotless

Signed-off-by: Brian Flores <[email protected]>

* Adds IT for Asymmetric Embedding scenario with MLInferenceIngestProcessor

We needed to make sure that a IT existed so that the preset content registry on the processor could work with parametized local models. By providing an IT that uses the asymetric embedding model its proven that the content registry is needed to create the embeddings. In this specific test case I used a ingest pipeline to convert passage embeddings, by simulating the pipeline to save test time.

Signed-off-by: Brian Flores <[email protected]>

---------

Signed-off-by: Brian Flores <[email protected]>
(cherry picked from commit df1b1ef)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] MLInferenceIngestProcessor has xContentRegistry as null
5 participants