WIP Allow Python backend to directly write Numpy arrays to SHM #264

Draft: wants to merge 6 commits into base r23.05

Conversation

@asos-danielbunting

No description provided.

@Tabrizian (Member) left a comment:

@asos-danielbunting thanks for the PR. What is the use case this PR is trying to address? Is the idea to pre-allocate buffers in shared memory and work with them directly to speed up the inference process?

Could you please share more details about the places where this becomes useful?

@asos-danielbunting (Author) replied:
Hi @Tabrizian, I'm looking at speeding up the transfer of a large tensor between a Python BLS model that does preprocessing and a TensorFlow inference model.

As you say, the idea is to allocate the buffer in shared memory and write my data into it directly from the Python side, avoiding an extra allocation and copy. I've run a couple of tests, and for my use case this can reduce inference time by a decent amount; e.g. for a 100000 x 200 float32 tensor the saving was about 30 ms.
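For context, a minimal sketch of how this could be used from a BLS model's model.py. The new_shm_tensor name matches the binding added in this PR, but the Python-side argument list, the INPUT0/OUTPUT0 tensor names, and the assumption that as_numpy() returns a writable view over the shared-memory buffer are illustrative only, not a settled API:

import numpy as np
import triton_python_backend_utils as pb_utils

class TritonPythonModel:
    def execute(self, requests):
        responses = []
        for request in requests:
            in_arr = pb_utils.get_input_tensor_by_name(request, "INPUT0").as_numpy()
            # Hypothetical API from this PR: allocate the output tensor directly
            # in shared memory instead of building a separate numpy array first.
            out_tensor = pb_utils.new_shm_tensor("OUTPUT0", [100000, 200], np.float32)
            # Write the preprocessed data straight into the shared-memory buffer,
            # skipping the extra host allocation and copy the current path incurs.
            np.copyto(out_tensor.as_numpy(), in_arr.astype(np.float32, copy=False))
            responses.append(
                pb_utils.InferenceResponse(output_tensors=[out_tensor]))
        return responses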

@@ -0,0 +1,31 @@
FROM asnpdsacr.azurecr.io/public/tritonserver:23.05-tf2-python-py3
@Tabrizian (Member):

Remove

@@ -431,8 +431,12 @@ Stub::StubSetup()
py::setattr(
@Tabrizian (Member):

Remove all the changes except the ones in the src directory.


c_python_backend_utils.attr("shared_memory") = py::cast(shm_pool_.get());
python_backend_utils.attr("shared_memory") = py::cast(shm_pool_.get());
@Tabrizian (Member):

This is not needed.

@@ -494,6 +498,7 @@ Stub::Initialize(bi::managed_external_buffer::handle_t map_handle)
python_backend_utils, "InferenceResponse",
c_python_backend_utils.attr("InferenceResponse"));
c_python_backend_utils.attr("shared_memory") = py::cast(shm_pool_.get());
python_backend_utils.attr("shared_memory") = py::cast(shm_pool_.get());
@Tabrizian (Member):

Not required.

@@ -1603,6 +1608,8 @@ PYBIND11_EMBEDDED_MODULE(c_python_backend_utils, module)

py::register_exception<PythonBackendException>(
module, "TritonModelException");

module.def("new_shm_tensor", &PbTensor::CreateInSHM, "Creates a new Tensor directly into shared memory");
@Tabrizian (Member):

Can we rename this to pb.Tensor.new(shape, dtype, device='cpu')?
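If renamed along those lines, the call site might look like the following. The pb_utils alias and the exact spelling of the constructor are assumptions; the arguments simply mirror the reviewer's pb.Tensor.new(shape, dtype, device='cpu') suggestion:

import numpy as np
import triton_python_backend_utils as pb_utils

# Hypothetical spelling of the suggested constructor; not a confirmed API.
tensor = pb_utils.Tensor.new([100000, 200], np.float32, device='cpu')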

reinterpret_cast<char*>(tensor_shm_ptr) + pb_memory_offset,
shm_handle + pb_memory_offset, false);
tensor_shm_ptr->memory = 0;
std::cout << "Offset is - " << pb_memory_offset<< "\n";
@Tabrizian (Member):

Remove print statement.

{

// Input params of tensor
//std::vector<int64_t> dims = std::vector<int64_t>({10, 10});
@Tabrizian (Member):

Remove comment.
