Update dependency io.openlineage:openlineage-java to v1.26.0 #3002

Open: renovate[bot] wants to merge 1 commit into main
@renovate renovate bot commented Jan 1, 2025

This PR contains the following updates:

Package | Change | Age | Adoption | Passing | Confidence
io.openlineage:openlineage-java | 1.23.0 -> 1.26.0 | age | adoption | passing | confidence

Release Notes

OpenLineage/OpenLineage (io.openlineage:openlineage-java)

v1.26.0

Compare Source

Added
  • dbt: Consume dbt structured logs and report progress in real time. #3314 @​MassyB
    If the --consume-structured-logs flag is set, the dbt integration consumes dbt structured logs and reports execution progress in real time.
  • Java: Add transform transport to allow event modification. #3301 @​pawel-big-lebowski
    The new transport type allows events to be modified by a specified transformer class.
  • Java: Parallel event emitting for composite transport. #3305 @​pawel-big-lebowski
    Emits events in parallel for the composite transport. Parallel emission is the default behaviour when continueOnFailure is set to true. The default value of continueOnFailure has changed from false to true.
  • Spark: Collect ScanReport and CommitReport in OpenLineage events when dealing with Iceberg tables. #3256 @​pawel-big-lebowski
    Collects additional Iceberg metrics for datasets read or written through the library. Visit Dataset Metrics docs for more details.
  • dbt: add support for duckdb adapter #3280 @​mobuchowski
    Adds support for the duckdb adapter to the dbt integration.
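
The composite transport note above can be illustrated with a small, self-contained Java sketch. This is not the openlineage-java implementation: the class name ParallelCompositeSketch, the emit method, and the use of Consumer<String> as a stand-in for a transport are all hypothetical. It also assumes one reading of the note, namely that continueOnFailure=true means every transport is tried concurrently and failures are tolerated, while continueOnFailure=false means sequential emission that stops at the first error.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.function.Consumer;

// Illustrative only: these names are hypothetical and are not openlineage-java APIs.
final class ParallelCompositeSketch {

    // Sends one event to every transport and returns the failures that were tolerated.
    static List<Throwable> emit(String event,
                                List<Consumer<String>> transports,
                                boolean continueOnFailure) {
        if (!continueOnFailure) {
            // Assumed fail-fast mode: emit sequentially and let the first exception propagate.
            for (Consumer<String> transport : transports) {
                transport.accept(event);
            }
            return Collections.emptyList();
        }

        // Tolerant mode: start every emit concurrently, capturing each outcome
        // (null on success, the thrown error otherwise) instead of rethrowing.
        List<CompletableFuture<Throwable>> pending = new ArrayList<>();
        for (Consumer<String> transport : transports) {
            pending.add(CompletableFuture
                    .runAsync(() -> transport.accept(event))
                    .handle((ignored, error) -> error));
        }

        // Wait for all transports to finish and collect whatever failed.
        List<Throwable> failures = new ArrayList<>();
        for (CompletableFuture<Throwable> outcome : pending) {
            Throwable error = outcome.join();
            if (error != null) failures.add(error);
        }
        return failures;
    }

    public static void main(String[] args) {
        List<String> delivered = Collections.synchronizedList(new ArrayList<>());
        List<Consumer<String>> transports = new ArrayList<>();
        transports.add(e -> delivered.add("first"));
        transports.add(e -> { throw new IllegalStateException("transport down"); });
        transports.add(e -> delivered.add("third"));

        List<Throwable> failures = emit("run-event", transports, true);
        System.out.println("delivered=" + delivered.size() + ", failed=" + failures.size());
    }
}
```

Because the default of continueOnFailure changed from false to true, a setup that relied on the old default to surface the first transport error would, under this reading, need to set the flag to false explicitly after upgrading.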
Changed
  • Spark: Add DatasetFactory to support Dataset creation. #3207 @​pawel-big-lebowski
    Adds DatasetFactory to support Dataset creation. This class is used to create Dataset instances for DatasetFactory.
Fixed

v1.25.0

Compare Source

Added
  • dbt: Add support for Column-Level Lineage in dbt integration. #3264 @​mayurmadnani
    The dbt integration now uses the SQL parser to add the column-level lineage information it collects.
  • Spark: Add input and output statistics about datasets read and written. #3240 #3263 @​pawel-big-lebowski
    Fixes issues in the existing output statistics collection mechanism and adds collection of input statistics. Output statistics now contain the number of files written, the size in bytes, and the number of records written. Input statistics contain the size in bytes and the number of files read; the record count is collected only for DataSourceV2 sources.
  • Introduced InputStatisticsInputDatasetFacet #3238 @​pawel-big-lebowski
    Extends the spec with a new facet, InputStatisticsInputDatasetFacet, modelled after the similar OutputStatisticsOutputDatasetFacet, to hold statistics about an input dataset read by a job.
Changed
  • Spark: Exclude META-INF/*TransportBuilder from Spark Extension Interfaces #3244 @​tnazarew
    Excludes META-INF/*TransportBuilder to avoid version conflicts
  • Spark: enables building input/output facets through DatasetFactory #3207 @​pawel-big-lebowski
    Adds extra capabilities to the DatasetFactory class and marks some public developer API methods as deprecated.
Fixed
  • dbt: fix compatibility with dbt v1.8 #3228 @​NJA010
    The dbt integration now takes the modified test_metadata field into account.
  • Spark: enabled Delta 3.x version compatibility #3253 @​Jorricks
    Takes the modified initialSnapshot name into account.

v1.24.2

Compare Source

Added
  • Spark: Add Dataproc run facet to include jobType property #3167 @​codelixir
    Updates the GCP Dataproc run facet to include jobType property
  • Add EnvironmentVariablesRunFacet to core spec #3186 @​JDarDagran
    Use EnvironmentVariablesRunFacet in Python client
  • Add assertions for format in test events #3221 @​JDarDagran
  • Spark: Add integration tests for EMR #3142 @​arturowczarek
    The Spark integration now has integration tests for EMR.
Changed
  • Move Kinesis to separate module, migrate HTTP transport to httpclient5 #3205 @​mobuchowski
    Moves Kinesis integration to a separate module and updates HTTP transport to use HttpClient 5.x
  • Docs: Upgrade docusaurus to 3.6 #3219 @​arturowczarek
  • Spark: Limit the Seq size in RddPathUtils::extract() #3148 @​codelixir
    Adds a flag to limit the logs in RddPathUtils::extract() to avoid OutOfMemoryError for large jobs.
Fixed

v1.24.1

Compare Source

v1.24.0

Compare Source


Configuration

📅 Schedule: Branch creation - "every 3 months on the first day of the month" (UTC), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR is behind base branch, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

Signed-off-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>

netlify bot commented Jan 1, 2025

Deploy Preview for peppy-sprite-186812 canceled.

Name | Link
🔨 Latest commit | d415db2
🔍 Latest deploy log | https://app.netlify.com/sites/peppy-sprite-186812/deploys/67748c4fc603c800088951ff
