[BUG] error on complex types list type field [category] has empty string, cannot process it
#678
Comments
@zane-neo can you look into this issue? |
@toyaokeke From your example, I see two different cases:
Can you confirm which case is your real production case? |
I did find an issue in the code, but the error message seems to differ from yours. I created this ticket to track it: opensearch-project/ml-commons#2309. I am still trying to understand your case to see whether there are other issues here. |
Hi @zane-neo and thank you for looking into this. The
|
I also see the error from the ticket you created when I run the simulator: opensearch-project/ml-commons#2309
The main issue with both is that I am creating embeddings on |
@toyaokeke , would like to double confirm on the
NOT
The reason for asking is that a list type would need a much more complex fix, whereas a map type would be easier to fix. I created this issue to track the list-of-map case: #686 |
Correct, it is a map type. |
@toyaokeke the new you can try using I am still working on the tutorial for using ML inference processors with neural search queries, and will post here once I have it. |
Hi @mingshl thank you for directing me to this! If I understand correctly this I am using AWS Managed Service, and it currently only supports up to 2.11. I would be more than happy to test that processor once AWS releases support for that version 🙏🏿 |
@mingshl considering what you shared, is this a bug that will still be fixed for the |
Hi @zane-neo, just checking in to see if this bug has been resolved and can be closed? |
@zane-neo can you please validate and confirm whether the bug has been fixed? Thanks! |
As part of the fix, I was also wondering if more detail could be provided in the error message, such as which field within a nested attribute is causing the error. For example:
{
  "failures": {
    "index": "neural-search-index-v2",
    "id": "5302821",
    "cause": {
      "type": "illegal_argument_exception",
      "reason": "list type field [category] has empty string, cannot process it"
    },
    "status": 400
  },
  ...
}
I do not know which field within
{
  "failures": {
    "index": "neural-search-index-v2",
    "id": "5302821",
    "cause": {
      "type": "illegal_argument_exception",
      "reason": "[name] field within [category] within ... [rootEntity] entity has empty string, cannot process it"
    },
    "status": 400
  },
  ...
} |
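To make the request above concrete, here is a minimal Python sketch of how a validator could report the full path of an offending field. This is not the plugin's code (neural-search is written in Java); the function names `find_empty_strings` and `describe_failure`, the dotted-path format, and the sample document are all assumptions made for illustration.

```python
from typing import Any, List, Optional


def find_empty_strings(value: Any, path: str = "") -> List[str]:
    """Return the path of every empty or blank string found under `value`."""
    if isinstance(value, str):
        return [path] if not value.strip() else []
    if isinstance(value, dict):
        # Map type: extend the path with the key name.
        children = [(f"{path}.{k}" if path else str(k), v) for k, v in value.items()]
    elif isinstance(value, list):
        # List type: extend the path with the element index.
        children = [(f"{path}[{i}]", v) for i, v in enumerate(value)]
    else:
        return []
    found: List[str] = []
    for child_path, child in children:
        found.extend(find_empty_strings(child, child_path))
    return found


def describe_failure(doc: dict) -> Optional[str]:
    """Build an error reason that names each offending field, or None if clean."""
    paths = find_empty_strings(doc)
    if not paths:
        return None
    return "field(s) [" + ", ".join(paths) + "] have empty string, cannot process"


# Hypothetical document: `category` is a map whose `id` is an empty string.
doc = {"category": {"name": "Shoes", "id": ""}}
reason = describe_failure(doc)
```

With a path such as `category.id` in the reason, the reader no longer has to guess which sub-field of `category` triggered the rejection.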
cc : @model-collapse |
@toyaokeke Sorry, I missed updating this issue. This is already fixed in PR #687. The root cause was that, when validating a map type field, fields not listed in the processor configuration were also validated. The fix removed the validation on those non-embedding fields, so you should no longer see this error and do not need to worry about which field name caused it. |
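The behaviour described in the comment above can be sketched as follows. This is a simplified Python illustration under stated assumptions, not the change made in PR #687: the function name `validate_embedding_fields`, the nested `field_map` shape, the target name `category_name_vector`, and the sample documents are hypothetical.

```python
def validate_embedding_fields(source: dict, field_map: dict, path: str = "") -> None:
    """Raise ValueError if a field listed in `field_map` holds a blank string.

    Fields that are absent from `field_map` are never inspected.
    """
    for key, target in field_map.items():
        if key not in source:
            continue
        value = source[key]
        full_path = f"{path}.{key}" if path else key
        if isinstance(target, dict):
            # Nested mapping: descend only into the configured sub-fields.
            if isinstance(value, dict):
                validate_embedding_fields(value, target, full_path)
        elif isinstance(value, str) and not value.strip():
            raise ValueError(f"field [{full_path}] has empty string, cannot process it")


# Hypothetical configuration: only `category.name` is embedded.
field_map = {"category": {"name": "category_name_vector"}}

# `id` is empty but is not configured for embedding, so validation passes.
validate_embedding_fields({"category": {"name": "Shoes", "id": ""}}, field_map)

# An empty `name` is still rejected, because `name` is configured for embedding.
try:
    validate_embedding_fields({"category": {"name": "", "id": "7"}}, field_map)
    rejected = False
except ValueError as err:
    rejected = True
    message = str(err)
```

The design choice is to walk the configuration rather than the document, so sibling fields such as `id` cannot cause a failure.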
Initial bug reported in opensearch-project/ml-commons#2303
What is the bug?
I am creating a text embedding processor that creates vectors on a nested field. However, I receive illegal_argument_exception because not all the fields in the object meet the requirements.
Here is the explanation from the AWS support specialist
However, I am not creating vectors on id, so I don't understand why it must follow these requirements. Is this expected behaviour or is this a bug?
How can one reproduce the bug?
Steps to reproduce the behavior:
What is the expected behavior?
The processor should create vectors on the category name.
What is your host/environment?
Do you have any screenshots?
Do you have any additional context?
invalid doc
valid doc