-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug report:config eval dataset with mme, raise errors:LocalTokenNotFoundError #303
Comments
in additions, this error can be solved by login to 2025-01-17 17:24:11.499 | INFO | llmc.eval.eval_vqa:eval:82 - Setting random seed to 0 | Setting numpy seed to 1234 | Setting torch manual seed to 1234
2025-01-17 17:24:19.947 | INFO | llmc.models.internvl2:__init__:436 - Using 1 devices with tensor parallelism
2025-01-17 17:24:19.947 | WARNING | llmc.eval.eval_vqa:_adjust_config:148 - Overwriting default num_fewshot of mme from None to 0
2025-01-17 17:24:19.947 | WARNING | llmc.eval.eval_vqa:_adjust_config:148 - Overwriting default num_fewshot of mme from None to 0
2025-01-17 17:24:19.947 | INFO | lmms_eval.evaluator_utils:from_taskdict:91 - No metadata found in task config for mme, using default n_shot=0
2025-01-17 17:24:19.948 | INFO | lmms_eval.api.task:build_all_requests:425 - Building contexts for mme on rank 0...
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2374/2374 [00:00<00:00, 167526.59it/s]
2025-01-17 17:24:46.659 | INFO | lmms_eval.evaluator:evaluate:446 - Running generate_until requests
Model Responding: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2374/2374 [07:29<00:00, 5.28it/s]
Postprocessing: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2374/2374 [00:20<00:00, 114.93it/s]
2025-01-17 17:32:36.888 | INFO | utils:mme_aggregate_results:124 - code_reasoning: 95.00
2025-01-17 17:32:36.888 | INFO | utils:mme_aggregate_results:124 - numerical_calculation: 40.00
2025-01-17 17:32:36.888 | INFO | utils:mme_aggregate_results:124 - text_translation: 155.00
2025-01-17 17:32:36.888 | INFO | utils:mme_aggregate_results:124 - commonsense_reasoning: 112.86
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - artwork: 142.25
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - celebrity: 115.88
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - count: 133.33
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - color: 153.33
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - position: 155.00
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - OCR: 87.50
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - landmark: 152.00
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - scene: 155.50
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - existence: 195.00
2025-01-17 17:32:36.889 | INFO | utils:mme_aggregate_results:124 - posters: 120.41
2025-01-17 17:32:37.007 | INFO | llmc.eval.utils:eval_model:90 - EVAL: vqa on mme is
|Tasks|Version|Filter|n-shot| Metric | | Value | |Stderr|
|-----|-------|------|-----:|--------------------|---|--------:|---|------|
|mme |Yaml |none | 0|mme_cognition_score |↑ | 402.8571|± | N/A|
|mme |Yaml |none | 0|mme_perception_score|↑ |1410.2072|± | N/A| |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
backend
i'm trying to quantizing
InternVL2-2B
configeval dataset
withmme
, but met the error named:LocalTokenNotFoundError
, it. seems like try to download dataset, but the dataset is in local.analysis
llmc/eval/eval_vqa.py:98 task_dict = get_task_dict(tasks, task_manager)
, this line tried to download dataset, which requires thetoken
;/home/chenxiaotao03/Reposities/llmc/resource/data/llm_dataset/text/eval/MME
, the codeself.eval_dataset_path
inllmc/eval/eval_vqa.py
are not used. maybe it caused the bug.llm eval
, please help me to solve the problem, thanks.config
crash stack
The text was updated successfully, but these errors were encountered: