You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When building a product or full-text search based application in OpenSearch, careful consideration and thought needs to be put into building the index based on the data being ingested. Often, a document to be indexed is derived from several different sources (e.g. databases, tables, or files). In OpenSearch, the best practice is to denormalize this data into a single document. This proposal is to build tooling that inspects source data and generates a suggested OpenSearch index mapping file so that search application builders do not need to start from scratch and also do not need to default to dynamic mapping.
General Use Case
Upload a CSV, JSON document, or SQL query against a source
Optionally specify a primary or unique key
Mapping file generator then:
4. sorts the data based on the key provided
5. loops through X number of rows to generate a suggestion for mapping file including nested documents, datatypes,
analyzers, and autosuggest.
Search Application Builder can take the mapping file and tweak/correct the mappings.
Questions
Is there an existing component, like data-prepper, that can be used to support this case?
Many organizations reuse datatypes, so we would want to learn from existing mappings. How can this be done?
The text was updated successfully, but these errors were encountered:
Summary
When building a product or full-text search based application in OpenSearch, careful consideration and thought needs to be put into building the index based on the data being ingested. Often, a document to be indexed is derived from several different sources (e.g. databases, tables, or files). In OpenSearch, the best practice is to denormalize this data into a single document. This proposal is to build tooling that inspects source data and generates a suggested OpenSearch index mapping file so that search application builders do not need to start from scratch and also do not need to default to dynamic mapping.
General Use Case
4. sorts the data based on the key provided
5. loops through X number of rows to generate a suggestion for mapping file including nested documents, datatypes,
analyzers, and autosuggest.
Questions
The text was updated successfully, but these errors were encountered: