You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The AWS Transcribe Streaming SDK C++ implementation is consuming excessive CPU resources when processing audio streams. Each individual stream consumes approximately 100% CPU usage, scaling linearly with multiple streams (e.g., 3 streams = 300% CPU usage). This appears inefficient for an operation that should primarily be handling audio data transmission to AWS Transcribe service.
I have tested using the CRT-HTTP version also and I get similar results. Will follow up with CRT-HTTP Docker version if requested.
It will slightly fluctuate on CPU usage but will mostly stick around 100%. I have tested on Macbook M1 running docker and then multiple Linux EC2 instance types and had the same results.
Is this performance intended/expected?
Regression Issue
Select this option if this issue appears to be a regression.
Expected Behavior
Minimal CPU usage for streaming audio to AWS Transcribe service
Efficient handling of multiple concurrent streams without linear CPU scaling
CPU usage should primarily be focused on audio data transmission rather than processing
Describe the bug
The AWS Transcribe Streaming SDK C++ implementation is consuming excessive CPU resources when processing audio streams. Each individual stream consumes approximately 100% CPU usage, scaling linearly with multiple streams (e.g., 3 streams = 300% CPU usage). This appears inefficient for an operation that should primarily be handling audio data transmission to AWS Transcribe service.
I have tested using the CRT-HTTP version also and I get similar results. Will follow up with CRT-HTTP Docker version if requested.
It will slightly fluctuate on CPU usage but will mostly stick around 100%. I have tested on Macbook M1 running docker and then multiple Linux EC2 instance types and had the same results.
Is this performance intended/expected?
Regression Issue
Expected Behavior
Current Behavior
Reproduction Steps
Here is the minimal reproduction steps in a single Dockerfile using the sample code.
Dockerfile
Please note:
Steps:
docker build -t transcribe-cpu-test-example .
docker exec -it transcribe-container bash /aws-doc-sdk-examples/cpp/example_code/transcribe-streaming/build/get_transcript
Repeat step 4 in additional terminals to observe CPU scaling with multiple streams
You will notice high cpu usage.
Possible Solution
Potential memory leaks or inefficient resource handling in the streaming implementation.
Additional Information/Context
AWS CPP SDK version used
Latest
Compiler and Version used
gcc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0
Operating System and version
Ubuntu 22.04 LTS (running in Docker container)
The text was updated successfully, but these errors were encountered: