small simple bug-fix: update lost infos in function create_dataset function #144

im-Kitsch · 2023-09-10T21:20:51Z

Description

This PR fixes a few simple bugs in create_dataset_from_collector_env().

Add docstrings for the parameters.
Add the missing metadata attributes to the dataset: metadata such as the algorithm name, author, and author email is not saved even though it is passed into the function.

For the function create_dataset_from_collector_env():
The ref_min/ref_max score is not saved to the metadata; the problem is easy to spot in the code.

I think these are clearly bugs. Please take a look.

Type of change

Bug fix (non-breaking change which fixes an issue)

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have run pytest -v and no errors are present.
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I solved any possible warnings that pytest -v has generated that are related to my code to the best of my knowledge.
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

balisujohn · 2023-09-11T05:27:13Z

Good catches. Thanks for your contribution! All these fixes look correct to me. I think it would be good to make the values for author_name, author_email, code_permalink only be added if the corresponding argument is not equal to None. Once that's done I will merge this PR after a final review. As for minari version as an argument, I'm wondering if we should make it non-optional, but we can sort that out in a future PR.
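
A minimal sketch of the suggestion above, assuming the metadata is assembled as a plain dict before it is written to the dataset file. The helper name `build_optional_metadata` and the exact key names are hypothetical and only illustrate the "skip when None" rule; they are not taken from the Minari code base.

```python
from typing import Any, Dict, Optional


def build_optional_metadata(
    algorithm_name: Optional[str] = None,
    author: Optional[str] = None,
    author_email: Optional[str] = None,
    code_permalink: Optional[str] = None,
) -> Dict[str, Any]:
    """Return only the optional metadata entries that were actually provided."""
    # Hypothetical key names, chosen for illustration only.
    candidates = {
        "algorithm_name": algorithm_name,
        "author": author,
        "author_email": author_email,
        "code_permalink": code_permalink,
    }
    # Drop None values so that an omitted argument leaves no key behind.
    return {key: value for key, value in candidates.items() if value is not None}


metadata = build_optional_metadata(author="WillDudley", code_permalink=None)
print(metadata)  # {'author': 'WillDudley'}
```

With this shape, code that reads the metadata later can tell "not provided" (key absent) from a value that was really stored.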

im-Kitsch · 2023-09-11T20:00:44Z

Hi,

I pushed an update that saves author_name, author_email, code_permalink, and the algorithm name only when they are not None. Additionally, I changed the combine-dataset code so that it still works when these optional parameters are not saved.

I agree that we need to consider whether the minari version should be optional or mandatory.

Furthermore, I think the dataset's structure should be clearly specified and kept stable.

E.g. here in combine_dataset, if author or author_email is not saved, it will throw an error.

Minari/minari/utils.py

Lines 277 to 290 in ea40978

    
    combined_data_file.attrs.modify(
        "total_episodes", last_episode_id + dataset.total_episodes
    )
    combined_data_file.attrs.modify(
        "total_steps",
        combined_data_file.attrs["total_steps"] + dataset.spec.total_steps,
    )
    # TODO: list of authors, and emails
    with h5py.File(dataset.spec.data_path, "r") as dataset_file:
        combined_data_file.attrs.modify("author", dataset_file.attrs["author"])
        combined_data_file.attrs.modify(
            "author_email", dataset_file.attrs["author_email"]
        )

There are probably still small errors like this one. I think they arise because attributes have been added over time without a stable specification.
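
To make the failure mode concrete, here is a small sketch of copying optional attributes defensively. Plain dicts stand in for the HDF5 attribute collections (an assumption made so the example runs without h5py), and both `OPTIONAL_KEYS` and `copy_optional_attrs` are hypothetical names rather than anything defined in Minari.

```python
from typing import Any, Iterable, Mapping, MutableMapping

# Assumed set of optional keys; the real list in the code base may differ.
OPTIONAL_KEYS = ("author", "author_email", "algorithm_name", "code_permalink")


def copy_optional_attrs(
    source: Mapping[str, Any],
    target: MutableMapping[str, Any],
    keys: Iterable[str] = OPTIONAL_KEYS,
) -> None:
    """Copy each optional attribute that exists in source and skip the rest."""
    for key in keys:
        # Checking membership first avoids the KeyError that source[key]
        # raises when the attribute was never saved.
        if key in source:
            target[key] = source[key]


source_attrs = {"total_steps": 100, "author": "WillDudley"}
combined_attrs = {"total_steps": 100}
copy_optional_attrs(source_attrs, combined_attrs)
print(combined_attrs)  # {'total_steps': 100, 'author': 'WillDudley'}
```

An unconditional `source["author_email"]` on the same input would raise `KeyError`, which is the error described above.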

balisujohn

The code seems almost ready to merge. Please remove the commented-out lines, and add a basic test for the behavior when the optional dataset_metadata keys modified in this PR are provided when creating an instance of MinariDataset.

balisujohn · 2023-09-16T07:31:21Z

minari/utils.py

+                        combined_data_file.attrs.modify(
+                            optional_parameter, dataset_file.attrs[optional_parameter]
+                        )
+                # combined_data_file.attrs.modify("author", dataset_file.attrs["author"])


Please remove the commented-out lines.

im-Kitsch · 2023-09-26T16:04:46Z

Hi,
I added one test case for combining datasets and modified the dataset creation test case. Could you take a look?

I think these tests should be revised again in the future, since this additional information currently cannot be obtained directly from a MinariDataset instance. It has to be read manually from the HDF5 file with h5py.

minari/utils.py

tests/utils/test_dataset_combine.py

tests/utils/test_dataset_creation.py

younik · 2023-10-09T15:08:52Z

tests/utils/test_dataset_combine.py

+        assert dt_file.attrs["code_permalink"] == _final_code_link
+        assert dt_file.attrs["author"] == "WillDudley" + str(n_data - 1)
+        assert dt_file.attrs["author_email"] == "[email protected]" + str(n_data - 1)


By the way, this behavior in combine dataset is a little weird (you only save the attributes of the last dataset).
Anyway, it is fixed in #133, which we will merge afterwards, so it is okay for now.

Ah, yes, I don't think saving only the attributes of the last dataset is a good choice, but I did it to stay consistent with the current combine_dataset() implementation. It will definitely be changed in the future, but this PR is only meant to save the forgotten metadata code_permalink and algorithm_name.
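
The "last one wins" behavior discussed here can be sketched as follows. Plain dicts again stand in for the per-dataset attributes, and both function names are hypothetical. `combine_collect_all` is only one possible alternative, in the spirit of the "list of authors, and emails" TODO; it is not necessarily what #133 implements.

```python
from typing import Any, Dict, List, Mapping, Sequence


def combine_last_wins(all_attrs: Sequence[Mapping[str, Any]], key: str) -> Any:
    """Mimic the current behavior: each dataset overwrites the previous value."""
    combined: Dict[str, Any] = {}
    for attrs in all_attrs:
        if key in attrs:
            combined[key] = attrs[key]
    return combined.get(key)


def combine_collect_all(all_attrs: Sequence[Mapping[str, Any]], key: str) -> List[Any]:
    """Hypothetical alternative: keep every distinct value in first-seen order."""
    values: List[Any] = []
    for attrs in all_attrs:
        if key in attrs and attrs[key] not in values:
            values.append(attrs[key])
    return values


datasets = [{"author": "WillDudley0"}, {"author": "WillDudley1"}, {"author": "WillDudley2"}]
print(combine_last_wins(datasets, "author"))    # WillDudley2
print(combine_collect_all(datasets, "author"))  # ['WillDudley0', 'WillDudley1', 'WillDudley2']
```

The first result matches what the new test asserts: the combined file carries the author of dataset `n_data - 1`.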

younik

Just fix the pre-commit, and then looks good to me

im-Kitsch · 2023-10-09T22:12:45Z

Just fix the pre-commit, and then looks good to me

Hi, I think the pre-commit is fixed now.

take over

im-Kitsch added 2 commits September 10, 2023 22:22

update lost infos

d57ec25

update

f916d8e

small spell error

c33d8b7

balisujohn previously requested changes Sep 16, 2023

im-Kitsch added 2 commits September 26, 2023 17:49

update

fa5ee96

update

a6f308e

try to fix pre-commit

2264c00

im-Kitsch requested a review from balisujohn October 4, 2023 14:43

younik requested changes Oct 9, 2023

update according to comment

40b5b5e

younik approved these changes Oct 9, 2023

fix pre-commit reformatt

dad4694

younik merged commit c43a612 into Farama-Foundation:main Oct 10, 2023
10 checks passed
