
Performance issues with onnx_tf #353

Open
Terizian opened this issue Jan 16, 2019 · 5 comments


@Terizian

I developed a model in MATLAB and exported it to the ONNX format. The model is 25 MB and its opset version is 6. I am now trying to move this model to production using the supported Python libraries. My code does the following:

import onnx
from onnx_tf.backend import prepare

model = onnx.load(onnx_path)
tf_rep = prepare(model)

This takes 25.71 seconds to run, which is far too slow for production use.

Additionally, when running the predictions:

output = tf_rep.run(x)

Each prediction takes 4 seconds on average. My target is 100 predictions per second, which currently seems impossible with this framework. What can I try to speed it up?

@tjingrant
Collaborator

Try this: #271.

That issue and its related patch make tf_rep.run significantly faster. As for prepare, you should only run it once and cache the resulting TensorFlow representation for the lifetime of your production application.
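
As a minimal sketch of the caching approach (the module layout and the onnx_path default below are illustrative, not part of onnx-tf itself):

import onnx
from onnx_tf.backend import prepare

_TF_REP = None

def get_tf_rep(onnx_path):
    # Pay the ~25 s prepare() cost once; reuse the cached representation afterwards.
    global _TF_REP
    if _TF_REP is None:
        _TF_REP = prepare(onnx.load(onnx_path))
    return _TF_REP

def predict(x, onnx_path="model.onnx"):
    # Every call after the first reuses the same TensorFlow representation.
    return get_tf_rep(onnx_path).run(x)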

@Terizian
Author

I have looked into the solution suggested in that thread, but I'm not certain where to include the following changes:

tf_rep.sess = tf.Session(graph=tf_rep.graph)

Currently you use batch size 1: float[1,5,224,224], which is inefficient for tf.

It would be really great if you could guide me on this.

@tjingrant
Collaborator

That comment was written because inference without batching is inefficient (due to the lack of parallelism). People usually export models with an explicit batch size (often 1), and this can be a performance bottleneck.
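
As an illustration only: if the model were re-exported with a dynamic batch dimension, requests could be stacked into a single run call instead of 100 separate ones (the shapes below follow the float[1,5,224,224] example from the quoted comment):

import numpy as np

# 100 single-sample inputs of shape [1, 5, 224, 224], stacked into one batch.
# This assumes the model accepts a dynamic batch dimension; a model exported
# with a fixed batch size of 1 would need to be re-exported first.
inputs = [np.random.rand(1, 5, 224, 224).astype(np.float32) for _ in range(100)]
batch = np.concatenate(inputs, axis=0)  # shape [100, 5, 224, 224]
outputs = tf_rep.run(batch)             # one backend call instead of 100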

Moreover, session creation is also very time-consuming, which is what the related PR was trying to resolve. That PR has become a bit outdated and may not work out of the box. @fumihwh, could you try updating your patch (#273)? We should probably merge it ASAP since it's quite useful.
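
For what it's worth, a minimal sketch of where that line would go, assuming the patched backend from #271/#273 in which run() reuses tf_rep.sess instead of opening a new tf.Session on every call:

import onnx
import tensorflow as tf
from onnx_tf.backend import prepare

model = onnx.load(onnx_path)
tf_rep = prepare(model)
# Create the session once, right after prepare(); only the patched
# backend will pick it up and reuse it across run() calls.
tf_rep.sess = tf.Session(graph=tf_rep.graph)

output = tf_rep.run(x)  # no per-call session construction with the patch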

@anuar12

anuar12 commented Mar 8, 2019

It would be great if this PR were merged; I am facing the exact problem it would solve.

@vibhuagrawal14

@Terizian Can you try exporting the graph (using tf_rep.export_graph) and running it directly? If I remember correctly, that improved performance significantly.
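
A rough sketch of that approach under TF 1.x; the file name and the "input:0"/"output:0" tensor names are illustrative and depend on the actual model:

import tensorflow as tf

tf_rep.export_graph("model.pb")  # one-time export of the frozen GraphDef

graph_def = tf.GraphDef()
with tf.gfile.GFile("model.pb", "rb") as f:
    graph_def.ParseFromString(f.read())

graph = tf.Graph()
with graph.as_default():
    tf.import_graph_def(graph_def, name="")

sess = tf.Session(graph=graph)  # persistent session, created once
output = sess.run("output:0", feed_dict={"input:0": x})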
