snowplow_unified_sessions fails to run on Snowflake due to duplicates #90
Labels
status:needs_triage
Needs maintainer triage.
type:bug
Bugs or weaknesses. The issue has to contain steps to reproduce.
Describe the bug
We receive an error when running the
snowplow_unified_sessions
model in Production on Snowflake. The error is as follows:This error seems to be due to be linked to the
mobile_language_name
column that was added and the fact that deduplication is only done forpostgres
andspark
but notsnowflake
.Steps to reproduce
Set the dbt_project.yml to the following:
Execute
dbt run --select package:snowplow_unified
.Expected results
Expected no errors from the
snowplow_unified_sessions
model.Actual results
Receiving an error:
Screenshots and log output
System information
The contents of your
packages.yml
file:Which database are you using dbt with?
The output of
dbt --version
:The operating system you're using:
macOS 15.0.1 (24A348)
OR
Debian Bookworm (python:3.9-slim Docker image)
The output of
python --version
:Python 3.9.19
Additional context
Possible problem with line of code here:
dbt-snowplow-unified/models/sessions/scratch/snowplow_unified_sessions_this_run.sql
Line 465 in 1223a5f
and here:
dbt-snowplow-unified/models/sessions/scratch/snowplow_unified_sessions_this_run.sql
Line 485 in 1223a5f
Are you interested in contributing towards the fix?
Unfortunately, I have no time to do it directly.
The text was updated successfully, but these errors were encountered: