Release [NIGHTLY] v25.04.00 · rapidsai/cudf

🔗 Links

🚨 Breaking Changes

Remove deprecated rolling window functionality (#17993) @wence-
Remove deprecated nvtext::minhash_permuted APIs (#17939) @davidwendt
Remove dataframe protocol (#17909) @vyasr
Use new rapids-logger library (#17899) @vyasr
Added Multi-input & Scalar Support for Transform UDFs (#17881) @lamarrr
Fixed incorrect PTX parsing of ret instruction after branch label (#17859) @lamarrr
Use KvikIO to enable file's fast host read and host write (#17764) @kingcrimsontianyu

🐛 Bug Fixes

Compatibility with dask.dataframe's is_scalar (#18030) @TomAugspurger
Fix the build error due to KvikIO update (#18025) @kingcrimsontianyu
Fix failing ibis test (#18022) @Matt711
Skip failing polars tests (#18015) @Matt711
Fix to_arrow to return consistent pandas-metadata (#18009) @galipremsagar
Compatibility with Dask main (#17992) @TomAugspurger
[Bug] Fix Parquet-metadata sampling in cudf-polars (#17991) @rjzamora
Add missing include for calling std::iota() (#17983) @davidwendt
Fix pickle and unpickling for all objects (#17980) @galipremsagar
Install duckdb the default backend for ibis in the cudf.pandas integration tests (#17972) @Matt711
Check null count too in sum aggregation (#17964) @Matt711
Raise NotImplementedError for groupby.agg if duplicate columns would be created (#17956) @mroeschke
Ensure disabling the module accelerator is thread-safe (#17955) @vyasr
Fix DataFrame/Series.rank for int and null data in mode.pandas_compatible (#17954) @mroeschke
Limit buffer size in reallocation policy in JSON reader (#17940) @shrshi
Make cudf.pandas proxy array picklable (#17929) @Matt711
Add missing standard includes (#17928) @miscco
Fix torch integration test (#17923) @Matt711
Fix to_pandas writable bug for datetime and timedelta types (#17913) @galipremsagar
Raise NotImplementedError if .merge(suffixes=) introduces duplicate labels (#17905) @mroeschke
Fix groupby scans with int and NA data in mode.pandas_compatible (#17895) @mroeschke
Patch __init__ of cudf constructors to parse through cudf.pandas proxy objects (#17878) @galipremsagar
Fixed incorrect PTX parsing of ret instruction after branch label (#17859) @lamarrr

🚀 New Features

Make most cudf-polars Node objects pickleable (#17998) @rjzamora
Add Column.serialize to cudf-polars (#17990) @rjzamora
Add catboost integration tests (#17931) @Matt711
Added Multi-input & Scalar Support for Transform UDFs (#17881) @lamarrr
Run spark-rapids-jni CI (#17781) @KyleFromNVIDIA
Add multi-partition Shuffle operation to cuDF Polars (#17744) @rjzamora
Add support for unary negation operator (#17560) @Matt711
Add multi-partition Join support to cuDF-Polars (#17518) @rjzamora

🛠️ Improvements

Update spark-rapids-jni CI image version to cuda12.8.0 (#18024) @pxLi
consolidate more conda solves in CI (#18014) @jameslamb
Replace cub::Int2Type with cuda::std::integral_constant (#18013) @miscco
Pass dtype objects to Column.astype (#18008) @mroeschke
Move cudf::lists::detail::make_empty_lists_column to public API (#17996) @davidwendt
Create Conda CI test env in one step (#17995) @KyleFromNVIDIA
Add seed parameter to cudf hash_character_ngrams (#17994) @davidwendt
Remove deprecated rolling window functionality (#17993) @wence-
Continue on failures in cudf.pandas integration tests CI job (#17987) @Matt711
Avoid cudf.dtype calls in build_column/column_empty/.where (#17979) @mroeschke
Use Conda XGBoost (#17959) @jakirkham
Refactor predicate pushdown to reuse row group pruning in experimental PQ reader (#17946) @mhaseeb123
Use shared-workflows branch-25.04 (#17943) @bdice
Get rid of the deprecated thrust::identity (#17942) @PointKernel
Remove deprecated nvtext::minhash_permuted APIs (#17939) @davidwendt
Enable third party library integration tests in CI with cudf.pandas (#17936) @galipremsagar
Add build_type input field for test.yaml (#17925) @gforsyth
Use rapids-pip-retry in CI jobs that might need retries (#17920) @gforsyth
More avoid cudf.dtype internally in favor of pre-defined, supported types (#17918) @mroeschke
Initialize inout parameter (#17911) @miscco
Remove dataframe protocol (#17909) @vyasr
Rename PascalCase functions and types to to snake_case to improve consistency (#17908) @vuule
Use new rapids-logger library (#17899) @vyasr
Add pylibcudf.Scalar.from_py for construction from Python strings, bool, int, float (#17898) @mroeschke
Remove cudf.Scalar from factorize (#17897) @mroeschke
disallow fallback to Make in Python builds (#17894) @jameslamb
Remove orc::gpu namespace (#17891) @vuule
Only run Auto Assign PR workflow if PR is not merged (#17888) @mroeschke
Update pre-commit-hooks to version 0.6.0 (#17887) @KyleFromNVIDIA
Forward-merge branch-25.02 to branch-25.04 (#17885) @bdice
Add script to run pylibcudf tests (#17882) @bdice
Migrate to NVKS for amd64 CI runners (#17877) @bdice
Fix merge conflict for branch-25.02 into branch-25.04 (#17874) @davidwendt
Remove decimal32/64 to decimal128 conversion in Parquet writer (#17869) @mhaseeb123
Expose JSON reader options to builder in pylibcudf (#17866) @shrshi
Remove cudf.Scalar from .dt timedelta properties (#17863) @mroeschke
Added support for custom types in PTX parser (#17861) @lamarrr
Remove cudf.Scalar from date_range/to_datetime (#17860) @mroeschke
Avoid cudf.dtype internally in favor of pre-defined, supported types (#17839) @mroeschke
Fixing auto-merge branch-25.02 into branch-25.04 (#17828) @davidwendt
Include more information in error messages in the nvcomp adapter (#17814) @vuule
Use KvikIO to enable file's fast host read and host write (#17764) @kingcrimsontianyu
Remove cudf._lib.column in favor of pylibcudf. (#17760) @mroeschke
Replaced std::string with std::string_view and removed excessive copies in cudf::io (#17734) @lamarrr

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NIGHTLY] v25.04.00

🔗 Links

🚨 Breaking Changes

🐛 Bug Fixes

🚀 New Features

🛠️ Improvements

Contributors