Paper: AI Model Share - subtitle: An Integrated Toolkit for Collaborative Machine Learning Model Development, Provenance Tracking, and Deployment in Python #911
Conversation
@hp2500 Hi Heinrich, thanks for your submission! I have a few questions about the supporting material you submitted. It seems like the main manuscript doesn't refer to the supporting material, and I am wondering what its purpose is. It also seems that the supporting material includes screenshots of your software, and I would appreciate an explanation of what value they add to the manuscript. We would like the authors to be judicious about the use of supporting material because it is an extra burden for everyone, including reviewers and proceedings chairs. If you can answer these questions, I would really appreciate it!
Hi @hongsupshin, thanks for checking in! We are referring to the SI in the manuscript. However, we had to manually include the references because the \ref{} links didn't seem to work across documents. I am happy to add additional references wherever it might be useful. The screenshots are indeed examples of the different functionalities of our software. We felt that a pure description might be a bit dry, so we tried to give the reader a better idea of what the different components look like. If you feel strongly about this, I can remove the SI or attach the most important pieces to the main document. Alternatively, I can upload the SI to a separate repo and simply provide a link for those who are curious. Your guidance is much appreciated. Maybe the reviewers want to chime in on this, too? On a different note, can I continue to update the PR as we go, or should I refrain from making changes before the reviewers' feedback is in?
@hp2500 Thanks for the prompt reply! (I am moving this conversation into a formal review.)
Regarding the supporting material in general, I went to your website and was wondering whether your team has online examples of the figures you shared in the supporting material. If so, you can probably just cite the online material, which might be more useful than static figures (because it would be interactive, especially since you have many tutorials).
If this already exists, you can consider this suggestion. But if it doesn't and you would have to create additional material, let's just keep the supporting material with figures as is. I made a comment about one of the figures in the SI, so take a look and see if it makes sense. Thank you!
We haven't assigned the reviewers yet, and it's likely that we will continue accepting papers until the end of next week, so please keep updating your PR if needed!
We have reviewers assigned to this paper. @hp2500 We haven't heard from you for the past 3 weeks. Can you give us an update on the status of the paper?
My apologies, @hongsupshin - I didn't realize you were holding off on reviews because of my comment. The paper has been ready since my latest commit.
Hi everyone, I am one of the reviewers. I have had a first look at the paper, and I believe it is already in very good shape. I have only a few minor comments as outlined below:
@ankurankan Hi, would you mind leaving the comments directly on the corresponding file? You can press the + button to leave a comment, and this will start a formal review.
papers/heinrich_peters/main.tex
Outdated
\end{abstract}

\section{Introduction}
Machine learning (ML) is revolutionizing a wide range of research areas and industries, providing data-driven solutions to important societal problems. However, researchers and practitioners lack easy-to-use, structured pathways to collaboratively develop and rapidly deploy ML models. Traditionally, researchers have been using version-control systems like GitHub in combination with custom model evaluation and benchmarking experiments to ensure reproducibility and to compare models. However, larger-scale collaboration and crowd-sourcing are severely limited in the absence of standardized tasks and standardized processes for model sharing and evaluation. Additionally, most models developed by data scientists do not progress past the proof-of-concept stage and are never deployed \citep{davenport_is_2022, siegel_models_2022}, preventing the wider audience from participating in the promise of applied ML research. While the recent rise of platforms and tools like Hugging Face Hub \citep{noauthor_hugging_2023}, TensorFlow Hub \citep{noauthor_tensorflow_2023}, and MLflow \citep{noauthor_mlflow_2023,chen_developments_2020, zaharia_accelerating_2018}, illustrates the demand for open-source model repositories and MLOps solutions, barriers of entry are still high for researchers, educators, and practitioners from non-technical disciplines. Model Share AI (AIMS) addresses this problem by providing a lightweight, easy-to-use alternative. In a few lines of code, users can create Model Playgrounds - standardized ML project spaces that offer an all-in-one MLOps toolkit for collaborative model improvement, experiment tracking, model metadata analytics, and instant model deployment, allowing researchers and other data scientists to rapidly share, improve, and learn from ML models in one streamlined workflow.
This paragraph mentions that the existing tools are difficult to use and have a high barrier to entry, but it does not provide any reasoning for why this is the case or what exactly the main pain points are. I think adding the reasons explicitly (or a table of features) would make the argument stronger and also help in highlighting the features of AIMS.
Thank you for this suggestion. We have adjusted the related work section to explain how AIMS uniquely fits into the ecosystem of existing solutions.
papers/heinrich_peters/main.tex
Outdated
\subsection{Key Functions}
\paragraph{Collaborative Model Development} A key feature of AIMS is its focus on collaborative model development and crowd-sourced model improvement, enabling teams to iterate quickly by allowing collaborators to build on each other's progress, even across libraries. For supervised learning tasks, users can collaboratively submit models into Experiments or Competitions associated with a Model Playground project in order to track model performance and rank submissions in standardized leaderboards Experiments and Competitions are set up by providing evaluation data against which the predictions of submitted models are evaluated. Standardized model evaluations allow collaborators to track the testing performance of their models along with a wide range of model metadata that are automatically extracted from submitted models and added to the model registry (see section below). Out of the box, AIMS calculates accuracy, f1-score, precision, and recall for classification tasks, and mean squared error, root mean squared error, mean absolute error, and $R^{2}$-scores for regression tasks, but users can submit custom evaluation functions for more flexibility. The main difference between Experiments and Competitions is that a proportion of the evaluation data is kept secret for Competitions, preventing participants from deliberately overfitting on evaluation data. Being able to submit models into shared Experiments enables ML teams to standardize tasks, rigorously track their progress, and build on each other's success, while Competitions facilitate crowd-sourced solutions for any ML task. Both Experiments and Competitions can be either public (any AIMS user can submit) or private (only designated team members can submit). Users can deploy any model from an Experiment or Competition into the REST API associated with their Model Playground with a single line of code.
"standardized leaderboards Experiments and Competitions." It seems there might be a missing period.
Thank you for catching this. Fixed.
papers/heinrich_peters/main.tex
Outdated
\end{figure}

\subsection{Architecture}
AIMS consists of three main components: an open-source Python library, user-owned cloud backend resources, and the AIMS website. The AIMS Python library is the main interface allowing users to set up Model Playground pages (including Experiments and Competitions), submit and deploy models, analyze model metadata, and reproduce model artifacts. It provides an accessible layer that facilitates the creation of the cloud backend resources that power REST APIs, as well as model evaluations and model metadata extraction. The ModelPlayground() class acts as a local representation of a Model Playground page and its associated REST API. It provides a range of methods to configure, change, and query Model Playground resources. A detailed overview of the Python library is provided below (AIMS Workflow).
Writing class and function names such as ModelPlayground, input_type, etc., in italics would improve readability.
This is a good suggestion. We have changed the manuscript accordingly.
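For readers following the thread, here is a minimal sketch of how the ModelPlayground class discussed above might be used to set up a project space and deploy a model. Only the ModelPlayground class and the input_type argument are named in the paper and comments; the import path, the remaining arguments, and the create and deploy_model method names are assumptions and may differ from the actual AIMS API.

```python
# Hypothetical sketch -- argument names/values and the create()/deploy_model()
# calls are illustrative assumptions, not the documented AIMS API.
from aimodelshare import ModelPlayground  # assumed import path

# Example evaluation labels for a toy classification task
y_eval = [0, 1, 1, 0, 1]

# Local representation of a Model Playground page and its associated REST API
playground = ModelPlayground(
    input_type="tabular",        # assumed value
    task_type="classification",  # assumed argument
    private=False,               # assumed argument: anyone can submit
)

# Provision the user-owned cloud backend resources and the REST API
# (assumed method name and arguments)
playground.create(eval_data=y_eval)

# Promote a leaderboard submission to the playground's REST API in one line
# (assumed method name and argument)
playground.deploy_model(model_version=1)
```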
@AmadiGabriel Good to meet you at SciPy. I am inviting you to review this paper. You were sent an invitation from GitHub to be a collaborator on this repository. Please accept the invitation. Your review should be in the form of GitHub review comments: https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/reviewing-changes-in-pull-requests/commenting-on-a-pull-request
In a bid to democratize the ML ecosystem and expand it to non-technical disciplines, AIMS is a laudable initiative that introduces a promising, easy-to-use toolkit for collaborative ML model development and deployment.
I would like to see more details on the support for Experiments. Can you store certain metrics?
@cbcunc The authors have made further improvements to the paper based on my comments. Kindly proceed with the subsequent stages of the review process.
The metrics are stored and can be queried by users through the Python library or on the AIMS website. If you mean additional metrics, it is possible to submit a dictionary of key-value pairs to the "custom_metadata" argument of the "submit_model" method. We now mention this in the "AIMS Workflow" section.
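To illustrate the reply above, here is a minimal sketch of attaching additional metrics to a submission, continuing the playground sketch earlier in the thread. Only the submit_model method and its custom_metadata argument are taken from the reply; the other argument names and the placeholder model are assumptions.

```python
from sklearn.linear_model import LogisticRegression

# Tiny placeholder model and predictions, for illustration only
X_eval = [[0.1], [0.9], [0.8], [0.2], [0.7]]
y_eval = [0, 1, 1, 0, 1]
fitted_model = LogisticRegression().fit(X_eval, y_eval)
predicted_labels = fitted_model.predict(X_eval).tolist()

# Only submit_model and custom_metadata are mentioned in the reply;
# the remaining arguments are illustrative assumptions.
playground.submit_model(
    model=fitted_model,                      # assumed: the trained model object
    prediction_submission=predicted_labels,  # assumed: predictions on the evaluation data
    custom_metadata={                        # free-form key-value pairs, as described above
        "training_time_seconds": 184.2,
        "feature_set": "v2",
    },
)
```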
If you are creating this PR in order to submit a draft of your paper, please name your PR with "Paper: <title>". An editor will then add a "paper" label and GitHub Actions will be run to check and build your paper. See the project readme for more information.
Editor: Charles Lindsey @cdlindsey
Reviewers: