[Serve] multiple submodels serve #298

cyber-pioneer · 2024-12-26T12:00:37Z

Type

New Feature

Description

multiple submodels serve
Given many submodules. Simply configure the keywords of the submodules, including resource allocation and interdependencies among the submodels.
multiple nodes serve
Given many well-connected nodes. Simply configure the keywords of the nodes, including resource type and address messages.
ci support serve case
Serve test case contains 2 steps: serve and call.

aoyulong · 2025-01-24T06:48:08Z

flagscale/runner/runner_serve.py

-        if with_test:
-            f.write(f'bash -c "$cmd; sync" \n')
+        # TODO: need a option to control whether to append or overwrite the output file
+        # Now, it always appends to the output file


Yes, if there is an option available, users will be satisfied.

aoyulong · 2025-01-24T06:51:17Z

flagscale/runner/runner_serve.py

        if self.command_line_mode:
            self.user_script = "flagscale/serve/run_vllm.py"
+        elif isinstance(entrypoint, str) and entrypoint.endswith(".py"):
+            self.user_script = entrypoint
+        elif entrypoint is None:


Should we require users to provide the entrypoint?

This is an option. Models pipeline can be built via user's entrypoint file or yaml config.

aoyulong · 2025-01-24T06:52:59Z

flagscale/serve/core/dag.py

+from flagscale.logger import logger
+
+
+class Builder:


Builder for what? I believe the name should be more specific.

aoyulong · 2025-01-24T06:54:42Z

flagscale/serve/core/dag.py

+    def check_and_get_port(self, target_port=None, host="0.0.0.0"):
+        """
+        Check if a specific port is free; if not, allocate a free port.
+        :param target_port: The port number to check, default is None.
+        :param host: The host address to check, default is "0.0.0.0".
+        :return: A tuple (is_free, port), where `is_free` indicates if the target port is free,
+                and `port` is the allocated port (target_port if free, or a new free port).
+        """


The code should be placed in the utilities so that it can be reused.

aoyulong

LGTM

cyber-pioneer requested a review from a team as a code owner December 26, 2024 12:00

cyber-pioneer changed the title ~~[Debug]Serve] expected serve~~ [Debug][Serve] expected serve Dec 26, 2024

cyber-pioneer force-pushed the final_serve branch 4 times, most recently from 61007c2 to 7b89e05 Compare December 27, 2024 08:34

cyber-pioneer changed the title ~~[Debug][Serve] expected serve~~ [Serve] expected serve Jan 20, 2025

cyber-pioneer force-pushed the final_serve branch from 3f00d5d to 3b4e708 Compare January 22, 2025 03:22

cyber-pioneer added 22 commits January 22, 2025 11:26

add manual emu infer serve

6e727c5

add case to build dag

f16fa78

fix code

616c0cd

add serve dag

f018be0

better design and demo

97ca7ad

support launch task from upper api

abac6d6

serve emu dev1

ff6c82c

polish check logic

b92bd37

serve add router

2425590

finish whole router process

2437951

polish code

2865f0e

polish code

0a16a1b

polish code

2cef2d1

fix progress

d76eea5

debug1

56d4b37

use origin remote

87d30f1

support LLM instance in remote

46688ff

polish code

a56339f

polish request keys

6c510f5

fix

f7c3201

fix code

61b34a5

fix

c39def0

cyber-pioneer added 4 commits January 23, 2025 11:05

fix code

Loading
Loading status checks…

e73ae9a

add model

779ec05

polish ci

Loading
Loading status checks…

0482b0b

polish code

Loading
Loading status checks…

9ab5b39

cyber-pioneer force-pushed the final_serve branch from fb6204f to 9ab5b39 Compare January 23, 2025 04:59

cyber-pioneer added 9 commits January 23, 2025 15:31

polish code

Loading
Loading status checks…

19044f5

polish code

Loading
Loading status checks…

6d52c1d

debug ci

Loading
Loading status checks…

8425cd0

debug ci

Loading
Loading status checks…

c5cce08

debug ci

Loading
Loading status checks…

a9021d6

debug ci

Loading
Loading status checks…

f92463d

polish code

Loading
Loading status checks…

6f60e87

polish code

Loading
Loading status checks…

0f483e3

polish log print

b1404fa

cyber-pioneer force-pushed the final_serve branch from 38b60e3 to 01529bc Compare January 24, 2025 02:23

add log details

Loading
Loading status checks…

06152f7

cyber-pioneer force-pushed the final_serve branch from 01529bc to 06152f7 Compare January 24, 2025 02:35

polish log

5da2eca

aoyulong reviewed Jan 24, 2025

View reviewed changes

cyber-pioneer added 9 commits January 24, 2025 18:10

polish code

Loading
Loading status checks…

38ae360

polish code

Loading
Loading status checks…

529ffb0

polish code

Loading
Loading status checks…

c87d66d

polish code

Loading
Loading status checks…

bfd87dc

polish code

Loading
Loading status checks…

bf045eb

polish name

Loading
Loading status checks…

a378c5f

polish config

Loading
Loading status checks…

4874027

polish config

Loading
Loading status checks…

5834f3b

polish config

Loading
Loading status checks…

9d8bc59

aoyulong approved these changes Jan 24, 2025

View reviewed changes

aoyulong merged commit b139a54 into FlagOpen:main Jan 24, 2025
24 of 25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Serve] multiple submodels serve #298

[Serve] multiple submodels serve #298

cyber-pioneer commented Dec 26, 2024 •

edited

Loading

aoyulong Jan 24, 2025

aoyulong Jan 24, 2025

cyber-pioneer Jan 24, 2025

aoyulong Jan 24, 2025

cyber-pioneer Jan 24, 2025

aoyulong Jan 24, 2025

cyber-pioneer Jan 24, 2025

aoyulong left a comment

[Serve] multiple submodels serve #298

[Serve] multiple submodels serve #298

Conversation

cyber-pioneer commented Dec 26, 2024 • edited Loading

Type

Description

aoyulong Jan 24, 2025

Choose a reason for hiding this comment

aoyulong Jan 24, 2025

Choose a reason for hiding this comment

cyber-pioneer Jan 24, 2025

Choose a reason for hiding this comment

aoyulong Jan 24, 2025

Choose a reason for hiding this comment

cyber-pioneer Jan 24, 2025

Choose a reason for hiding this comment

aoyulong Jan 24, 2025

Choose a reason for hiding this comment

cyber-pioneer Jan 24, 2025

Choose a reason for hiding this comment

aoyulong left a comment

Choose a reason for hiding this comment

cyber-pioneer commented Dec 26, 2024 •

edited

Loading