Quid solved #428

Collins-Webdev · 2024-10-18T18:59:17Z

Contributor checklist

This pull request is on a separate branch and not the main branch
I have tested my code with the pytest command as directed in the testing section of the contributing guide

Description

Related issue

#ISSUE_NUMBER

* Overview This PR addresses issue scribe-org#423 by implementing error handling for missing QID values in the `language_metadata.json` file. The changes focus on enhancing the robustness of the `cli_utils.py` module, particularly in scenarios where language entries lack a QID. ** Changes 1. Modified the `language_to_qid` dictionary creation process in `cli_utils.py`: - Implemented a try-except block to catch potential KeyErrors when accessing QID values. - Added a warning message for languages with missing QIDs. 2. Updated the `validate_language_and_data_type` function: - Enhanced error handling to accommodate languages without QIDs. - Improved the validation process to prevent crashes due to missing QID data. 3. Refactored related code sections for consistency and maintainability. * Technical Details - Utilized the `dict.get()` method with a default value of `None` to safely access potentially missing QID keys. - Implemented a logging mechanism to warn about missing QIDs without halting execution. - Adjusted the validation logic to gracefully handle languages with missing QIDs, allowing the CLI to continue functioning for valid entries. ** Testing - Conducted thorough testing by removing QIDs from various language entries in `language_metadata.json`. - Verified that the CLI continues to function correctly for languages with valid QIDs. - Confirmed that appropriate warnings are logged for languages with missing QIDs. - Tested edge cases, including scenarios with multiple missing QIDs and mixed valid/invalid entries. ** Impact These changes significantly improve the resilience of the Scribe-Data CLI, ensuring it can operate effectively even when faced with incomplete language metadata. This enhancement aligns with our goal of creating a more robust and user-friendly tool. ** Next Steps - Consider implementing a more comprehensive logging system for better traceability of warnings and errors. - Explore the possibility of adding unit tests specifically for QID error handling scenarios. - Evaluate the need for a data validation step during the metadata file loading process to preemptively identify and report missing or malformed entries.

github-actions · 2024-10-18T18:59:44Z

Thank you for the pull request!

The Scribe team will do our best to address your contribution as soon as we can. The following is a checklist for maintainers to make sure this process goes as well as possible. Feel free to address the points below yourself in further commits if you realize that actions are needed :)

If you're not already a member of our public Matrix community, please consider joining! We'd suggest using Element as your Matrix client, and definitely join the General and Data rooms once you're in. Also consider joining our bi-weekly Saturday dev syncs. It'd be great to have you!

Maintainer checklist

The linting and formatting workflow within the PR checks do not indicate new errors in the files changed
The CHANGELOG has been updated with a description of the changes for the upcoming release and the corresponding issue (if necessary)

…with sub-language support

catreedle · 2024-10-19T04:33:21Z

src/scribe_data/cli/cli_utils.py

+for lang in language_metadata["languages"]:
+    lang_lower = lang["language"].lower()
+    qid = lang.get("qid")
+
+    if qid is None:
+        print(f"Warning: 'qid' missing for language {lang['language']}")
+    else:
+        language_map[lang_lower] = lang
+        language_to_qid[lang_lower] = qid


Thank you for your work on this, @Collins-Webdev! 😊

I believe we don't need to change the previous method of retrieving languages using language_metadata.items(), as the latest version of language_metadata.json no longer includes the languages key, trying to access language_metadata["languages"] will cause an error.

I also think we should check for sub_languages first before attempting to retrieve the language qid. Languages like Norwegian and Chinese, which have sub-languages, naturally don't have their own qid.

- Remove assumption of 'languages' key in language_metadata - Handle sub-languages correctly - Improve warning messages for missing qids

Collins-Webdev · 2024-10-19T11:41:46Z

Hello @catreedle 👋🏼,
Thank you for your feedback 🙏🏽.
I've made the following changes based on your comments:

Removed the assumption of a 'languages' key in the language_metadata dictionary. The code now iterates directly over the language_metadata items.
Added handling for sub-languages. The code now checks for the presence of 'sub_languages' before processing the main language qid.
Improved the warning messages to differentiate between missing qids for main languages and sub-languages.

Please review the changes and let me know if any further modifications are needed.

andrewtavis · 2024-10-19T14:21:19Z

src/scribe_data/cli/cli_utils.py

@@ -27,8 +27,6 @@

 from scribe_data.utils import DEFAULT_JSON_EXPORT_DIR

-# MARK: CLI Variables


Please don't remove marks from the code in the future @Collins-Webdev as these are meant to make the files more manageable to navigate.

andrewtavis · 2024-10-19T14:23:26Z

50d4c30 further added back in some single quotes that were removed and were causing the tests to fail :)

andrewtavis

Thanks for the changes here, @Collins-Webdev! And also appreciate the review, @catreedle! Made mine much easier 😊 Would be great to get continued support on checking PRs!

andrewtavis added the hacktoberfest-accepted Accepted as a part of Hacktoberfest label Oct 18, 2024

andrewtavis self-requested a review October 18, 2024 19:20

Resolve merge conflict in cli_utils.py, combining QID error handling …

8725acb

…with sub-language support

Collins-Webdev mentioned this pull request Oct 18, 2024

language_metadata.json qid error handling #423

Closed

2 tasks

catreedle reviewed Oct 19, 2024

View reviewed changes

Refactor language metadata processing in cli_utils.py

8f4287d

- Remove assumption of 'languages' key in language_metadata - Handle sub-languages correctly - Improve warning messages for missing qids

Collins-Webdev force-pushed the quid-solved branch from 5fdd991 to 8f4287d Compare October 19, 2024 11:35

Collins-Webdev and others added 2 commits October 19, 2024 12:36

Refactor language metadata processing in cli_utils.py

c356f5d

- Remove assumption of 'languages' key in language_metadata - Handle sub-languages correctly - Improve warning messages for missing qids

Merge branch 'main' into quid-solved

4ae0f1a

andrewtavis added 3 commits October 19, 2024 16:12

Push main version of all Ukrainian queries

6c78475

Merge branch 'main' into quid-solved

3114e57

Re-hoise for loop and add spacing

50fa02e

andrewtavis reviewed Oct 19, 2024

View reviewed changes

Add quotes back in to fix tests

50d4c30

andrewtavis approved these changes Oct 19, 2024

View reviewed changes

andrewtavis merged commit 8321dc3 into scribe-org:main Oct 19, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quid solved #428

Quid solved #428

Collins-Webdev commented Oct 18, 2024

github-actions bot commented Oct 18, 2024 •

edited by andrewtavis

Loading

catreedle Oct 19, 2024 •

edited

Loading

catreedle Oct 19, 2024

Collins-Webdev commented Oct 19, 2024

andrewtavis Oct 19, 2024

andrewtavis commented Oct 19, 2024

andrewtavis left a comment

		@@ -27,8 +27,6 @@

		from scribe_data.utils import DEFAULT_JSON_EXPORT_DIR

		# MARK: CLI Variables

Quid solved #428

Quid solved #428

Conversation

Collins-Webdev commented Oct 18, 2024

Contributor checklist

Description

Related issue

github-actions bot commented Oct 18, 2024 • edited by andrewtavis Loading

Thank you for the pull request!

Maintainer checklist

catreedle Oct 19, 2024 • edited Loading

Choose a reason for hiding this comment

catreedle Oct 19, 2024

Choose a reason for hiding this comment

Collins-Webdev commented Oct 19, 2024

andrewtavis Oct 19, 2024

Choose a reason for hiding this comment

andrewtavis commented Oct 19, 2024

andrewtavis left a comment

Choose a reason for hiding this comment

github-actions bot commented Oct 18, 2024 •

edited by andrewtavis

Loading

catreedle Oct 19, 2024 •

edited

Loading