sample_concatenate_permutation ineffective in user.conf #2028
Comments
@psyhtest For running custom open models, we can still use the official model name in
Is there a requirement to use the
This works for all other parameters we tried e.g.
Using a fixed official name for every unofficial model may lead to confusion.
Yes @psyhtest - https://github.com/mlcommons/inference/blob/master/loadgen/test_settings_internal.cc#L700
Why would that be confusing? For example, all open sparse model submissions from NeuralMagic were done using the official model name
Nvidia is using NeuralMagic is using the If there is a reason to allow
It would be rather weird to have to use, for example, Then, if we use a generic name for several workloads, how do we distinguish between results for different workloads in the results table?
Also, what if we want to explore, say, one or more sets of ultra low latency constraints (lower than "low latency")?
The name is only for use in "user.conf". In the "models" directory under results, measurements etc., custom names can be used, as done here
Currently you can change the latency constraints by using the user.conf file and the official model name, right? If you think a "custom model name" must be allowed in the user.conf file, we can discuss that in tomorrow's WG meeting. The problem is that anyone could then override anything, and the submission checker would need to catch all invalid configurations.
Hi @psyhtest currently only the below flags are restricted to "only" mlperf.conf file. Anything else, you should be able to load via user.conf.
I have added a PR to allow Other parameters in (only in) mlperf.conf are fine, right?
Thanks @arjunsuresh! Yes, the random number generator seeds should definitely be read-only for each round. And changing
Thank you @psyhtest for confirming.
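The restriction discussed in this thread can be pictured with a small Python sketch. It is not LoadGen's implementation (that lives in the C++ file linked above); it only models the behaviour as reported here: lines of the form model.scenario.key = value are read from both files, and a key on a restricted list is silently skipped when it comes from user.conf. The function names, the contents of RESTRICTED_KEYS, and the sample values are all assumptions made for illustration.

```python
# Hypothetical model of the two-file lookup; NOT LoadGen's actual code.
# Keys assumed (for illustration) to be honoured only in mlperf.conf.
RESTRICTED_KEYS = {
    "qsl_rng_seed",
    "sample_index_rng_seed",
    "schedule_rng_seed",
    "sample_concatenate_permutation",  # restricted at the time of this issue
}


def parse_conf(text, model, scenario, is_mlperf_conf):
    """Collect key/value pairs from lines that match model and scenario."""
    settings = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or "=" not in line:
            continue
        lhs, value = (part.strip() for part in line.split("=", 1))
        parts = lhs.split(".")
        if len(parts) != 3:
            continue  # expect model.scenario.key
        line_model, line_scenario, key = parts
        if line_model not in ("*", model) or line_scenario not in ("*", scenario):
            continue
        if key in RESTRICTED_KEYS and not is_mlperf_conf:
            continue  # restricted key in user.conf: silently ignored
        settings[key] = value
    return settings


def effective_settings(mlperf_text, user_text, model, scenario):
    """mlperf.conf is applied first, then user.conf overrides what it may."""
    merged = parse_conf(mlperf_text, model, scenario, is_mlperf_conf=True)
    merged.update(parse_conf(user_text, model, scenario, is_mlperf_conf=False))
    return merged


mlperf_conf = """
llama2-70b.*.sample_concatenate_permutation = 1
*.Server.target_latency = 10
"""
user_conf = """
custom.*.sample_concatenate_permutation = 1
custom.Server.target_latency = 5
"""
custom = effective_settings(mlperf_conf, user_conf, "custom", "Server")
official = effective_settings(mlperf_conf, user_conf, "llama2-70b", "Server")
```

In this model the custom name picks up the latency override from user.conf but never sees sample_concatenate_permutation, while the official name inherits it from mlperf.conf. That mirrors the trade-off raised above: either reuse the official name, or relax the restricted list for that key.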
We would like to run a custom model under the same rules as llama2-70b, i.e. to visit each sample of the OpenOrca dataset at least once.

However, when we place the following line in user.conf (custom is the name of the model):

it doesn't affect the execution, with mlperf_log_detail.txt confirming that sample_concatenate_permutation is still disabled:

Only when we place this line in mlperf.conf and rebuild LoadGen, we can see it taking effect, with mlperf_log_detail.txt confirming that sample_concatenate_permutation is now enabled:

But then we get an error:

Since mlperf.conf should now be static, we consider this behaviour a bug in LoadGen.
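
A quick way to check which value was actually applied is to scan mlperf_log_detail.txt for the relevant entry. The Python sketch below rests on two assumptions that should be verified against a real log: that detail entries are lines starting with ":::MLLOG " followed by a JSON object with "key" and "value" fields, and that the setting is logged under a key such as effective_sample_concatenate_permutation. The sample log text is made up for the example and is not output from an actual run.

```python
import json

# Assumed line prefix for structured entries in mlperf_log_detail.txt.
MLLOG_PREFIX = ":::MLLOG "


def effective_value(log_text, key):
    """Return the last logged value for `key`, or None if it never appears."""
    value = None
    for line in log_text.splitlines():
        line = line.strip()
        if not line.startswith(MLLOG_PREFIX):
            continue
        try:
            record = json.loads(line[len(MLLOG_PREFIX):])
        except json.JSONDecodeError:
            continue  # skip malformed entries rather than abort the scan
        if isinstance(record, dict) and record.get("key") == key:
            value = record.get("value")
    return value


# Made-up sample in the assumed format, for illustration only.
sample_log = (
    ':::MLLOG {"key": "effective_scenario", "value": "Offline"}\n'
    ':::MLLOG {"key": "effective_sample_concatenate_permutation", "value": false}\n'
    "some unstructured line\n"
)
applied = effective_value(sample_log, "effective_sample_concatenate_permutation")
```

With a real log, the text would come from reading the file, for example `open("mlperf_log_detail.txt").read()`, and the key name would be adjusted to whatever the log actually contains.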