docs : docstrings examples - dtypes #1121

anopsy · 2024-10-03T14:05:07Z

Just a quick check: considering I'll have to use cast to get some of the datatypes, maybe I should wrap the from_native into a func after all?

What type of PR is this? (check all applicable)

Related issues

Related issue docs: add docstring examples for dtypes #1077
Closes #

Checklist

Code follows style guide (ruff)
Tests added
Documented the changes

If you have comments or can explain your changes, please do so below.

MarcoGorelli

thanks @anopsy !

for Categorical, it looks like casting from string is unsupported in old pyarrow versions* - but you can create the categorical directly:

>>> ca = pa.chunked_array(pa.array(['a', 'b'], type=pa.dictionary(pa.uint32(), pa.string())))
>>> nw.from_native(ca, series_only=True)
┌─────────────────────────────────────────┐
| Narwhals Series                         |
| Use `.to_native()` to see native output |
└─────────────────────────────────────────┘
>>> nw.from_native(ca, series_only=True).dtype
Categorical

For Struct / List / Array, there is a way to do that in pandas too, using pd.ArrowDtype:

In [11]: pd.Series(data, dtype=pd.ArrowDtype(pa.large_list(pa.large_string())))
Out[11]:
0      ['narwhal' 'orca']
1    ['beluga' 'vaquita']
dtype: large_list<item: large_string>[pyarrow]

Similarly, for the struct example, I think you could use

pa.struct({'a': pa.int64(), 'b': pa.large_list(pa.large_string())})
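
To make the two pandas suggestions above concrete, here is a minimal sketch that builds one pyarrow-backed List series and one Struct series. The helper name `arrow_backed_examples` and the sample rows are hypothetical, chosen to match the output quoted above; the snippet assumes a pandas version that provides `pd.ArrowDtype` and a pyarrow version that accepts a mapping in `pa.struct`.

```python
def arrow_backed_examples():
    """Build pandas Series with pyarrow-backed List and Struct dtypes."""
    # Imported lazily: pandas and pyarrow are optional dependencies here.
    import pandas as pd
    import pyarrow as pa

    # Hypothetical sample data, modelled on the List output quoted above.
    list_data = [["narwhal", "orca"], ["beluga", "vaquita"]]
    list_ser = pd.Series(
        list_data, dtype=pd.ArrowDtype(pa.large_list(pa.large_string()))
    )

    # The struct type suggested in the review comment.
    struct_type = pa.struct(
        {"a": pa.int64(), "b": pa.large_list(pa.large_string())}
    )
    # Hypothetical rows: one dict per element, keys matching the struct fields.
    struct_data = [
        {"a": 1, "b": ["narwhal", "orca"]},
        {"a": 2, "b": ["beluga", "vaquita"]},
    ]
    struct_ser = pd.Series(struct_data, dtype=pd.ArrowDtype(struct_type))
    return list_ser, struct_ser
```

Either series could then be passed to nw.from_native(..., series_only=True) and inspected via .dtype, as in the Categorical example earlier in this comment.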

*maybe we should only run doctests on the latest versions...that would save us a lot of # doctest: +SKIPs which would be a good thing
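
As a rough illustration of what each # doctest: +SKIP costs, the sketch below runs a docstring with one checked and one skipped example using only the standard library. `dtype_label` and `run_docstring` are hypothetical names invented for this sketch and are not part of narwhals.

```python
import doctest


def dtype_label(values):
    """Return a toy dtype label for a list of Python values.

    >>> dtype_label([1, 2])
    'Int64'
    >>> dtype_label(["a", "b"])  # doctest: +SKIP
    'Categorical'
    """
    return "Int64" if all(isinstance(v, int) for v in values) else "String"


def run_docstring(func):
    """Run the doctests in ``func.__doc__`` and return the result counts."""
    parser = doctest.DocTestParser()
    test = parser.get_doctest(
        func.__doc__, {func.__name__: func}, func.__name__, None, 0
    )
    runner = doctest.DocTestRunner(verbose=False)
    return runner.run(test)


results = run_docstring(dtype_label)
# The skipped example would fail if it ran ('String' != 'Categorical'),
# but +SKIP means it is never executed, so no failure is reported.
print(results.failed)
```

The skipped example is wrong on purpose: a skipped doctest is never verified, which is why fewer skips would be an improvement.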

MarcoGorelli

thanks for updating!

MarcoGorelli · 2024-11-13T09:55:54Z

tests/stable_api_test.py

@@ -12,6 +12,33 @@
 from tests.utils import Constructor
 from tests.utils import assert_equal_data

+DTYPES = {


just out of interest, why add this?

hey Marco, I wasn't updating, I was just trying to resolve some conflicts and update my fork/local repo. I was working on the dtypes file and also had to adjust the tests, and then you made some changes to the test file (among them adding this DTYPES = {....} thing; I think you did it during one of the live streams).

thanks for resolving merge conflicts then 🙌

i think it might not be necessary any more to have DTYPES here then, could you try removing it from this file?

Yes, I think I first merged your changes and then committed them back, in the process of updating my local repo. I can see now that you refactored this test 2 weeks ago or so. Yes, I'll try to get it right!

MarcoGorelli · 2024-11-13T09:57:36Z

narwhals/dtypes.py

+        >>> ser_pl = pl.Series(data)
+        >>> ser_pa = pa.chunked_array([data])
+
+        >>> nw.from_native(ser_pl, series_only=True).dtype
+        List(String)
+        >>> nw.from_native(ser_pa, series_only=True).dtype
+        List(String)


if you want a pandas example we could add

pd.Series(data, dtype=pd.ArrowDtype(pa.large_list(pa.large_string())))

Okay, it's done, although I did a --force push after rebasing; I hope I didn't break anything. If everything is fine, could we just merge this PR? I will continue from a new branch, because I butchered this one so much 😅

anopsy changed the title ~~[docs] : docstrings examples for Int64 and Float64 dtypes~~ docs : docstrings examples for Int64 and Float64 dtypes Oct 3, 2024

github-actions bot added the documentation Improvements or additions to documentation label Oct 3, 2024

anopsy changed the title ~~docs : docstrings examples for Int64 and Float64 dtypes~~ docs : docstrings examples - dtypes Oct 5, 2024

MarcoGorelli reviewed Oct 16, 2024

MarcoGorelli reviewed Nov 13, 2024

anopsy added 2 commits November 15, 2024 08:54

docstrings examples for Int64 and Float64 dtypes rufformatted

d4e8d29

add pd example to List

bdcda9a

anopsy force-pushed the dtypes branch from 14b9937 to bdcda9a Compare November 15, 2024 08:47
