You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Microsoft Fabric is a SaaS Data Analytics platform from Microsoft. It allows you to work with Data Warehouses and perform Data Engineering and Data Science tasks in a managed platform. The Fabric product is more closely related to other business focused data analysis tools like Power BI rather than Azure Cloud Data Science platforms like Azure ML.
RAPIDS Alignment
Microsoft Fabric has two services which could leverage RAPIDS.
It looks like Synapse is being overhauled as part of Fabric, so perhaps this support may return at some point in the future.
Python Notebooks
Microsoft Fabric also has a managed Python Notebooks service. This allows users to execute arbitrary Python code within a Fabric environment and comes with standard PyData libraries like pandas and scikit-learn out of the box. This would be a good candidate for accelerating via RAPIDS libraries like cudf and cuml and could support zero-code-change acceleration with cudf.pandas. It's also likely libraries like polars are being used in these environments, which support GPU acceleration via RAPIDS today.
There is currently no way to configure the hardware of the underlying VM so it isn't possible to install and leverage the RAPIDS libraries in this environment. For more information see #503.
Other distributed frameworks
Given that it is possible to launch Spark clusters it is feasible to run other distributed frameworks like Dask, Ray or Legate on this infrastructure. However we would need GPU hardware support before we could explore this further.
Next steps
In order to enable RAPIDS usage on Microsoft Fabric we first need GPUs to be made available in both the Spark and Python Notebooks environments.
Support for Spark RAPIDS on Microsoft Fabric Spark clusters will be explored by the Spark RAPIDS team if GPU hardwar becomes available.
If and when GPUs become available in the Python Notebooks environment the RAPIDS Cloud Deployment Team can investigate the best practice methods to install RAPIDS libraries into those environments.
We can then build out workflow examples showing how to read data and perform GPU accelerated Data Analytics in Microsoft Fabric.
The text was updated successfully, but these errors were encountered:
Microsoft Fabric is a SaaS Data Analytics platform from Microsoft. It allows you to work with Data Warehouses and perform Data Engineering and Data Science tasks in a managed platform. The Fabric product is more closely related to other business focused data analysis tools like Power BI rather than Azure Cloud Data Science platforms like Azure ML.
RAPIDS Alignment
Microsoft Fabric has two services which could leverage RAPIDS.
Spark
Fabric has a close history with Microsoft Synapse which is a managed Spark platform. The Spark RAPIDS team has a documentation page on using Synapse, however accorsing to the Microsoft documentation GPU support on Synapse has been deprecated.
It looks like Synapse is being overhauled as part of Fabric, so perhaps this support may return at some point in the future.
Python Notebooks
Microsoft Fabric also has a managed Python Notebooks service. This allows users to execute arbitrary Python code within a Fabric environment and comes with standard PyData libraries like
pandas
andscikit-learn
out of the box. This would be a good candidate for accelerating via RAPIDS libraries likecudf
andcuml
and could support zero-code-change acceleration withcudf.pandas
. It's also likely libraries likepolars
are being used in these environments, which support GPU acceleration via RAPIDS today.There is currently no way to configure the hardware of the underlying VM so it isn't possible to install and leverage the RAPIDS libraries in this environment. For more information see #503.
Other distributed frameworks
Given that it is possible to launch Spark clusters it is feasible to run other distributed frameworks like Dask, Ray or Legate on this infrastructure. However we would need GPU hardware support before we could explore this further.
Next steps
In order to enable RAPIDS usage on Microsoft Fabric we first need GPUs to be made available in both the Spark and Python Notebooks environments.
Support for Spark RAPIDS on Microsoft Fabric Spark clusters will be explored by the Spark RAPIDS team if GPU hardwar becomes available.
If and when GPUs become available in the Python Notebooks environment the RAPIDS Cloud Deployment Team can investigate the best practice methods to install RAPIDS libraries into those environments.
We can then build out workflow examples showing how to read data and perform GPU accelerated Data Analytics in Microsoft Fabric.
The text was updated successfully, but these errors were encountered: