Nov 2024 schema #32

fivetran-reneeli · 2025-01-31T17:33:02Z

PR Overview

This PR will address the following Issue/Feature: #28

This PR will result in the following new package version: v0.5.0

Schema changes from Nov 2024

Please provide the finalized CHANGELOG entry which details the relevant changes included in this PR:

to be completed

PR Checklist

Basic Validation

Please acknowledge that you have successfully performed the following commands locally:

dbt run --full-refresh && dbt test
dbt run (if incremental models are present) && dbt test

Before marking this PR as "ready for review" the following have been applied:

The appropriate issue has been linked, tagged, and properly assigned
All necessary documentation and version upgrades have been applied
docs were regenerated (unless this PR does not include any code or yml updates)
BuildKite integration tests are passing
Detailed validation steps have been provided below

Detailed Validation

Please share any and all of your validation steps:

If you had to summarize this PR in an emoji, which would it be?

💃

fivetran-reneeli · 2025-01-31T18:13:16Z

models/apple_store__app_version_report.sql

-        distinct *
-    from reporting_grain_combined
+-- pre-reporting grain: unions all unique dimension values
+pre_reporting_grain as (


For context: I know we discussed using full outer joins, but after looking into it I realized they risk losing records depending on which table is used in the join, as explained here. The alternative is to coalesce the join keys, but that gets clunky, especially with more than three CTEs being joined. I therefore went with a union all followed by a dedupe. The old version of these models used the same method.
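To make the approach concrete, here is a minimal sketch of the union all then dedupe pattern, run against an in-memory SQLite database so it can be tried outside dbt. The table and column names mirror the CTEs in this PR, but the sample rows are invented, `source_type` and `source_relation` are left out for brevity, and each metric table is assumed to already hold one row per key (the real models aggregate first).

```python
import sqlite3

# In-memory database with three metric tables that only partly share keys.
# All sample rows are made up for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table app_crashes (date_day text, app_id integer, app_version text, crashes integer);
    create table install_deletions (date_day text, app_id integer, app_version text, installations integer, deletions integer);
    create table sessions_activity (date_day text, app_id integer, app_version text, sessions integer);

    insert into app_crashes values ('2024-11-01', 1, '1.0', 3);
    insert into install_deletions values ('2024-11-01', 1, '1.1', 10, 2);
    insert into sessions_activity values ('2024-11-01', 1, '1.0', 25);
    insert into sessions_activity values ('2024-11-02', 1, '1.0', 40);
""")

QUERY = """
with pre_reporting_grain as (
    -- union all: collect every key seen in any metric table (duplicates allowed)
    select date_day, app_id, app_version from app_crashes
    union all
    select date_day, app_id, app_version from install_deletions
    union all
    select date_day, app_id, app_version from sessions_activity
),

reporting_grain as (
    -- dedupe: one row per unique key
    select distinct
        date_day,
        app_id,
        app_version
    from pre_reporting_grain
)

select
    rg.date_day,
    rg.app_id,
    rg.app_version,
    coalesce(ac.crashes, 0) as crashes,
    coalesce(id.deletions, 0) as deletions,
    coalesce(id.installations, 0) as installations,
    coalesce(sa.sessions, 0) as sessions
from reporting_grain as rg
left join app_crashes as ac
    on rg.date_day = ac.date_day
    and rg.app_id = ac.app_id
    and rg.app_version = ac.app_version
left join install_deletions as id
    on rg.date_day = id.date_day
    and rg.app_id = id.app_id
    and rg.app_version = id.app_version
left join sessions_activity as sa
    on rg.date_day = sa.date_day
    and rg.app_id = sa.app_id
    and rg.app_version = sa.app_version
order by rg.date_day, rg.app_version
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)
```

Because every left join starts from the deduped grain, a key that appears in only one metric table still produces a row, and no join key needs to be coalesced across CTEs.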

fivetran-joemarkiewicz

@fivetran-reneeli thanks for this PR. I've left a few comments below following my review.

fivetran-joemarkiewicz · 2025-02-03T21:37:58Z

models/apple_store__app_version_report.sql

+    select date_day, app_id, app_version, source_type, source_relation from app_crashes
+    union all
+    select date_day, app_id, app_version, source_type, source_relation from install_deletions
+    union all
+    select date_day, app_id, app_version, source_type, source_relation from sessions_activity


Request to format this in our usual manner instead of having all fields on one line.

Same request wherever this format is used in this PR

fivetran-joemarkiewicz · 2025-02-03T21:38:47Z

models/apple_store__app_version_report.sql

+        coalesce(id.deletions, 0) as deletions,
+        coalesce(id.installations, 0) as installations,
+        coalesce(sa.sessions, 0) as sessions
+    from reporting_grain rg


Please use the format below, which we typically use in our other data models, for consistency.

Suggested change

-    from reporting_grain rg
+    from reporting_grain as rg

Please make these same updates in the other models where this format is used.

fivetran-joemarkiewicz · 2025-02-03T21:45:05Z

models/apple_store__app_version_report.sql

+app_crashes as (
+    select
+        app_id,
+        app_version,
+        date_day,
+        cast(null as {{ dbt.type_string() }}) as source_type,
+        source_relation,
+        sum(crashes) as crashes
+    from {{ var('app_crash_daily') }}
+    group by 1,2,3,4,5


What was the reasoning for making these CTEs rather than ephemeral models, as they were in the previous version?

Same question for all the other cases of this in this PR.

As discussed live, since these CTEs are unique per model due to the differences in grain, there's no advantage to modularizing them into their own separate ephemeral models.

fivetran-reneeli · 2025-02-05T20:45:28Z

models/intermediate/int_apple_store__date_spine.sql

+
+{% set first_date_query %}
+
+    select min(date_day) as min_date_day


If this makes sense, I will add the subscription staging models too. Just wanted to have a proof of concept first before spending time adding the subscription logic.

fivetran-reneeli added 7 commits January 30, 2025 23:10

update source type report

2f1fdc1

update app version report

046da4c

updates

632def4

seed file deletions and changes and additions

7084f63

update version and configs

c3a9b78

model revamps

5559078

subscription report

1ddd8c0

fivetran-reneeli commented Jan 31, 2025

fivetran-reneeli added 12 commits January 31, 2025 16:23

rm int models

9b0b948

end model revisions

6746c2b

new int models

dd7458d

rename references

2bf9dfe

update yml with new fields, rm old fields, update uniqueness

c7df6f7

readme update and version bump

f941c5f

new int models

f61eafe

rm old int models

5c26fa4

updated reports, changelog, docs

77fd1f0

docs

5555efc

deps

105389c

schema

aeb2564

fivetran-reneeli requested a review from fivetran-joemarkiewicz February 1, 2025 06:33

fivetran-reneeli self-assigned this Feb 1, 2025

rm integrity territory report since staging model is removed, fix consistency test, update changelog

e3e45db

fivetran-joemarkiewicz requested changes Feb 3, 2025

fivetran-reneeli added 5 commits February 4, 2025 10:26

seed updates

9fc03a1

style updates and regen docs

90753f0

rm active devices l30 days and update decision log. add date spine and update models

43d4f81

docs

cfafe69

switch from cross join to left join

4dc1890

fivetran-reneeli commented Feb 5, 2025

fivetran-reneeli added 5 commits February 5, 2025 16:29

fix download def and switch null to empty to correctly join

a2bbfeb

rm subcription from union all and fix comma

fd64d93

schema

43fb57b

rm empty value for source_type

75bfc3d

schema

661da32

fivetran-reneeli requested a review from fivetran-joemarkiewicz February 6, 2025 16:04

fivetran-reneeli added 4 commits February 6, 2025 15:42

make date spine a table

729bef4

update date spine and docs

c387358

make prerelease a1

4e4d751

docs

25ffcac
