[NIGHTLY] v25.04.00
Pre-release
🔗 Links
🚨 Breaking Changes
- Remove deprecated rolling window functionality (#17993) @wence-
- Remove deprecated nvtext::minhash_permuted APIs (#17939) @davidwendt
- Remove dataframe protocol (#17909) @vyasr
- Use new rapids-logger library (#17899) @vyasr
- Added Multi-input & Scalar Support for Transform UDFs (#17881) @lamarrr
- Fixed incorrect PTX parsing of `ret` instruction after branch label (#17859) @lamarrr
- Use KvikIO to enable file's fast host read and host write (#17764) @kingcrimsontianyu
🐛 Bug Fixes
- Compatibility with dask.dataframe's `is_scalar` (#18030) @TomAugspurger
- Fix the build error due to KvikIO update (#18025) @kingcrimsontianyu
- Fix failing ibis test (#18022) @Matt711
- Skip failing polars tests (#18015) @Matt711
- Fix `to_arrow` to return consistent pandas-metadata (#18009) @galipremsagar
- Compatibility with Dask `main` (#17992) @TomAugspurger
- [Bug] Fix Parquet-metadata sampling in cudf-polars (#17991) @rjzamora
- Add missing include for calling std::iota() (#17983) @davidwendt
- Fix pickle and unpickling for all objects (#17980) @galipremsagar
- Install duckdb, the default backend for ibis, in the cudf.pandas integration tests (#17972) @Matt711
- Check null count too in sum aggregation (#17964) @Matt711
- Raise NotImplementedError for groupby.agg if duplicate columns would be created (#17956) @mroeschke
- Ensure disabling the module accelerator is thread-safe (#17955) @vyasr
- Fix DataFrame/Series.rank for int and null data in mode.pandas_compatible (#17954) @mroeschke
- Limit buffer size in reallocation policy in JSON reader (#17940) @shrshi
- Make `cudf.pandas` proxy array picklable (#17929) @Matt711
- Add missing standard includes (#17928) @miscco
- Fix torch integration test (#17923) @Matt711
- Fix `to_pandas` writable bug for `datetime` and `timedelta` types (#17913) @galipremsagar
- Raise NotImplementedError if `.merge(suffixes=)` introduces duplicate labels (#17905) @mroeschke
- Fix groupby scans with int and NA data in mode.pandas_compatible (#17895) @mroeschke
- Patch `__init__` of `cudf` constructors to parse through `cudf.pandas` proxy objects (#17878) @galipremsagar
- Fixed incorrect PTX parsing of `ret` instruction after branch label (#17859) @lamarrr
🚀 New Features
- Make most cudf-polars `Node` objects pickleable (#17998) @rjzamora
- Add `Column.serialize` to cudf-polars (#17990) @rjzamora
- Add catboost integration tests (#17931) @Matt711
- Added Multi-input & Scalar Support for Transform UDFs (#17881) @lamarrr
- Run spark-rapids-jni CI (#17781) @KyleFromNVIDIA
- Add multi-partition `Shuffle` operation to cuDF Polars (#17744) @rjzamora
- Add support for unary negation operator (#17560) @Matt711
- Add multi-partition `Join` support to cuDF-Polars (#17518) @rjzamora
🛠️ Improvements
- Update spark-rapids-jni CI image version to cuda12.8.0 (#18024) @pxLi
- consolidate more conda solves in CI (#18014) @jameslamb
- Replace `cub::Int2Type` with `cuda::std::integral_constant` (#18013) @miscco
- Pass dtype objects to Column.astype (#18008) @mroeschke
- Move cudf::lists::detail::make_empty_lists_column to public API (#17996) @davidwendt
- Create Conda CI test env in one step (#17995) @KyleFromNVIDIA
- Add seed parameter to cudf hash_character_ngrams (#17994) @davidwendt
- Remove deprecated rolling window functionality (#17993) @wence-
- Continue on failures in cudf.pandas integration tests CI job (#17987) @Matt711
- Avoid cudf.dtype calls in build_column/column_empty/.where (#17979) @mroeschke
- Use Conda XGBoost (#17959) @jakirkham
- Refactor predicate pushdown to reuse row group pruning in experimental PQ reader (#17946) @mhaseeb123
- Use shared-workflows branch-25.04 (#17943) @bdice
- Get rid of the deprecated `thrust::identity` (#17942) @PointKernel
- Remove deprecated nvtext::minhash_permuted APIs (#17939) @davidwendt
- Enable third party library integration tests in CI with `cudf.pandas` (#17936) @galipremsagar
- Add build_type input field for `test.yaml` (#17925) @gforsyth
- Use `rapids-pip-retry` in CI jobs that might need retries (#17920) @gforsyth
- More avoid cudf.dtype internally in favor of pre-defined, supported types (#17918) @mroeschke
- Initialize inout parameter (#17911) @miscco
- Remove dataframe protocol (#17909) @vyasr
- Rename PascalCase functions and types to snake_case to improve consistency (#17908) @vuule
- Use new rapids-logger library (#17899) @vyasr
- Add `pylibcudf.Scalar.from_py` for construction from Python strings, bool, int, float (#17898) @mroeschke
- Remove cudf.Scalar from factorize (#17897) @mroeschke
- disallow fallback to Make in Python builds (#17894) @jameslamb
- Remove `orc::gpu` namespace (#17891) @vuule
- Only run Auto Assign PR workflow if PR is not merged (#17888) @mroeschke
- Update pre-commit-hooks to version 0.6.0 (#17887) @KyleFromNVIDIA
- Forward-merge branch-25.02 to branch-25.04 (#17885) @bdice
- Add script to run pylibcudf tests (#17882) @bdice
- Migrate to NVKS for amd64 CI runners (#17877) @bdice
- Fix merge conflict for branch-25.02 into branch-25.04 (#17874) @davidwendt
- Remove decimal32/64 to decimal128 conversion in Parquet writer (#17869) @mhaseeb123
- Expose JSON reader options to builder in pylibcudf (#17866) @shrshi
- Remove cudf.Scalar from .dt timedelta properties (#17863) @mroeschke
- Added support for custom types in PTX parser (#17861) @lamarrr
- Remove cudf.Scalar from date_range/to_datetime (#17860) @mroeschke
- Avoid `cudf.dtype` internally in favor of pre-defined, supported types (#17839) @mroeschke
- Fixing auto-merge branch-25.02 into branch-25.04 (#17828) @davidwendt
- Include more information in error messages in the nvcomp adapter (#17814) @vuule
- Use KvikIO to enable file's fast host read and host write (#17764) @kingcrimsontianyu
- Remove cudf._lib.column in favor of pylibcudf. (#17760) @mroeschke
- Replaced std::string with std::string_view and removed excessive copies in cudf::io (#17734) @lamarrr