Add an example for ingesting CloudWatch logs into OpenSearch using OpenSearch Ingestion pipeline. #1056

sb2k16 · 2024-07-02T14:43:00Z

This pull request provides an example of how an OpenSearch Ingestion (OSI) pipeline can be used to ingest CloudWatch logs into OpenSearch using a CloudWatch subscription filter with a Lambda function.

This example creates the following resources:

An OpenSearch Serverless collection in a VPC, to which the OpenSearch Ingestion pipeline sink eventually writes the logs.
An OpenSearch Ingestion pipeline.
A CloudWatch subscription filter with a Lambda function that calls the OSI pipeline endpoint to push log data received from an incoming log stream.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

…bscription filter.

Signed-off-by: Souvik Bose <[email protected]>

dlvenable · 2024-07-02T17:25:18Z

typescript/opensearch/cwlogs_ingestion/resources/lambda/cw_subscription_filter/handler.py

+    for logEvent in logEvents:
+        request = {}
+        request['@id'] = logEvent['id']
+        request['@timestamp'] = str(datetime.now().year) + '0' + str(datetime.now().month) + '0' + str(datetime.now().day)


Each log event has a timestamp property. Why not use that?
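A minimal sketch of this suggestion, assuming each log event's `timestamp` is milliseconds since the Unix epoch, as CloudWatch Logs subscription payloads report it. The helper name `event_timestamp_iso` and the sample event are hypothetical and not part of the PR. Note that the concatenation in the diff also inserts a literal '0' before every month and day, so two-digit months or days would be formatted incorrectly.

```python
from datetime import datetime, timezone


def event_timestamp_iso(log_event):
    """Return the log event's own timestamp as an ISO 8601 UTC string."""
    # Assumption: `timestamp` is epoch milliseconds.
    millis = log_event["timestamp"]
    moment = datetime.fromtimestamp(millis / 1000, tz=timezone.utc)
    return moment.isoformat(timespec="milliseconds")


# Hypothetical log event, shaped like an entry of cwLogs['logEvents'].
sample = {"id": "1", "timestamp": 1719931380000, "message": "Hello world"}
print(event_timestamp_iso(sample))
```

Using the event's own timestamp keeps the indexed time tied to when the log line was produced rather than when the Lambda function happened to run.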

dlvenable · 2024-07-02T17:25:57Z

typescript/opensearch/cwlogs_ingestion/resources/lambda/cw_subscription_filter/handler.py

+    logEvents = cwLogs['logEvents']
+    for logEvent in logEvents:
+        request = {}
+        request['@id'] = logEvent['id']


Why did you include @ in here? I think this may be confusing and require additional processing to remove. I think we would be fine without the @.

Maybe keep it on @timestamp if anything.

dlvenable · 2024-07-02T17:28:29Z

typescript/opensearch/cwlogs_ingestion/resources/lambda/cw_subscription_filter/handler.py

+def cw_subscription_handler(event, context):
+
+    """Extract the data from the event"""
+    data = jmespath.search("awslogs.data", event)


Is there any limit to the amount of data in each call? OSI only accepts payloads of 10 MB or less by default.

dlvenable · 2024-07-02T17:31:39Z

typescript/opensearch/cwlogs_ingestion/lib/os_setup_stack.ts

+
+        // Create a dashboard access role
+        const dashboardAccessRole = new Role(this, `${this.STACK_RESOURCE_NAMING_PREFIX}DashboardAccessRole`, {
+          assumedBy: new ServicePrincipal('ec2.amazonaws.com'),


Why is this role assumable by EC2? Wouldn't we have the account be the one to assume it?

dlvenable · 2024-07-02T17:36:13Z

typescript/opensearch/cwlogs_ingestion/resources/lambda/log_emitter/handler.py

+    id = str(randrange(10000))
+    source['id'] = id
+    source['timestamp'] = str(datetime.now())
+    source['message'] = 'Hello world'


It may be nice to make this more interesting and look like some application log.
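As an illustration of what a more application-like log could look like, the sketch below emits a line in the style of an Apache common access log. The function name `make_log_line`, the paths, and the value ranges are all hypothetical choices, not part of the PR.

```python
import random
from datetime import datetime, timezone

# Hypothetical sample values for a fake web application.
METHODS = ["GET", "POST", "PUT", "DELETE"]
PATHS = ["/orders", "/cart", "/login", "/products/42"]
STATUSES = [200, 201, 404, 500]


def make_log_line(rng=random, now=None):
    """Build one access-log-style line with randomized request details."""
    now = now or datetime.now(timezone.utc)
    ip = ".".join(str(rng.randrange(1, 255)) for _ in range(4))
    ts = now.strftime("%d/%b/%Y:%H:%M:%S %z")
    method = rng.choice(METHODS)
    path = rng.choice(PATHS)
    status = rng.choice(STATUSES)
    size = rng.randrange(100, 5000)
    return f'{ip} - - [{ts}] "{method} {path} HTTP/1.1" {status} {size}'


print(make_log_line())
```

A structured line like this gives the downstream pipeline something meaningful to parse, instead of a constant 'Hello world' message.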

dlvenable · 2024-07-02T17:37:11Z

typescript/opensearch/cwlogs_ingestion/resources/pipeline/configuration.yaml

+  source:
+    http:
+      path: /logs/ingest
+  sink:


I think this pipeline example would be more helpful if we ran grok. See my comment above about application logs. We could possibly have a log that you could grok here.
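Grok patterns are essentially named regular expressions. To show the kind of fields such a step could extract, the sketch below uses a hand-written Python regex as a stand-in; it is not the pipeline's grok processor, and the field names and sample line are assumptions chosen for illustration.

```python
import re

# Hand-written stand-in for a grok pattern over an access-log-style line.
LOG_PATTERN = re.compile(
    r'(?P<clientip>\S+) \S+ \S+ \[(?P<timestamp>[^\]]+)\] '
    r'"(?P<verb>\S+) (?P<request>\S+) HTTP/(?P<httpversion>[\d.]+)" '
    r'(?P<response>\d{3}) (?P<bytes>\d+|-)'
)


def parse_log_line(line):
    """Return the named fields as a dict, or None if the line does not match."""
    match = LOG_PATTERN.fullmatch(line)
    return match.groupdict() if match else None


# Hypothetical application log line.
sample = '10.0.0.1 - - [02/Jul/2024:14:43:00 +0000] "GET /orders HTTP/1.1" 200 512'
print(parse_log_line(sample))
```

In the pipeline itself, the equivalent extraction would live in the processor section of the configuration rather than in the Lambda function.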

sbose2k21 and others added 2 commits June 27, 2024 17:55

Example CDK project for CloudWatch logs ingestion using OSI and CW su…

3d435dd

…bscription filter.

Rename the resources and add README

9e93944

Signed-off-by: Souvik Bose <[email protected]>

dlvenable suggested changes Jul 2, 2024


kaiz-io added 3 commits September 17, 2024 13:17

Merge branch 'main' into osi-cwlogs-ingest-impl

4032042

Merge branch 'main' into osi-cwlogs-ingest-impl

dba385f

Merge branch 'main' into osi-cwlogs-ingest-impl

34636fa
