test: add test for impure function correlation behavior #9014

NickCrews · 2024-04-18T19:47:55Z

Related to #8921,
trying to write down exactly what the expected behavior is.

I figure we can use this PR to hash out exaclty what we want the semantics to be, and then the other discussions might be easier because our goal is written down precisely somewhere. Please let me know if you agree or disagree with this behavior, or if there are other tests we should add.

Need to fix the UDF test case in a followup. Also wasn't sure where to put these tests, I put them in their own little file but if you point me elsewhere Iwill move them.

NickCrews · 2024-04-18T19:56:41Z

oooh, these CI failures are revealing differences in the backend behaviors... wheeee into the 🐰 🕳️ we go!

ibis/backends/tests/test_impure.py

NickCrews · 2024-04-19T00:41:58Z

I can think of two reasons a user might care about the semantics here, and I hope we can support both of their needs:

impureness/correlatedness. If a function is impure, they may want it to be executed once, or many times, depending on if they want them to be correlated. See this example
performance. If some computation is slow, they only want it to happen a single time.

So I think this means that we as ibis authors can't assume what the goals of the user is, and how many times they want an expression executed. Therefore, we shouldn't do any clever rewrites or mergings of selects. I think we need to keep a more 1:1 correspondence between what the user writes and the SQL we produce: Every time the user does a .select(), .mutate(), etc, (except for simple column renamings, and maybe a few other cases) that leads to exactly one more with select .... as ... in the generated SQL.

IDK, what do you think of this train of reasoning? I think I'm fairly convinced that those two use cases are the requirements for success, but perhaps there is a different/better way of accomplishing that goal

kszucs · 2024-04-22T15:58:41Z

@NickCrews please rebase to test with #9023 change included

cpcloud · 2024-04-22T16:32:50Z

@NickCrews My only objection would be that

performance. If some computation is slow, they only want it to happen a single time.

Is not something we can enforce even if we never merge any select statements. This kind of guarantee is at the level of the query engine.

NickCrews · 2024-04-23T07:11:47Z

@cpcloud yup you are right with guarantees, "suggesting" to the backend is the best we can do.

What do you think in general of my proposal of "one CTE per .select"? I'm not sure if you're skeptical of the whole thing or just the performance claims... Thanks!

NickCrews · 2024-04-23T07:17:35Z

I'm trying to decide what is higher priority:

An implementation that is faster 90% of the time, but does clever things and therefore isn't able to be tuned by the user that 10% of the time they need it

Vs

An implementation that is a bit slower in the majority of cases, but is always fine tunable to get the perf you need in the edge cases.

NickCrews · 2024-04-23T19:22:12Z

slowly going through the backends and adding the correct marks for each kind of failure...

NickCrews · 2024-05-07T01:48:44Z

@cpcloud @kszucs I think this is ready for review whenever you get the chance! I think this is the groundwork for defining the current state, and after we get this in then we can start talking about what we think ideal behavior should be, and how to get there.

NickCrews · 2024-05-21T21:13:54Z

Anything I can do here to help move this forward?

cpcloud · 2024-06-13T11:19:40Z

@NickCrews Can you write a docstring for each of these tests explaining what's being tested in each? I know that's atypical, but this is a very hairy problem with lots of specifics that are important to understand, and words like "impure" and "correlation" need to be precisely defined so that everyone is on the same page about exactly what is being tested.

NickCrews · 2024-06-13T14:41:41Z

I think that is a good idea. Will do.

NickCrews · 2024-06-29T18:58:12Z

oops, I just wrote the notes as comments, not docstrings, I assume that is still OK? I think this is ready to review now, thanks!

Related to ibis-project#8921, trying to write down exactly what the expected behavior is.

…tions

cpcloud · 2024-10-08T14:41:44Z

It seems like this behavior is intentional across our backends, and that writing a query as a CTE in no way guarantees how many times it will be evaluated. It could be one, or it could be as many as there are references to the CTE.

NickCrews · 2024-10-08T16:28:56Z

Darn it. Here are specifically which backends.

Looks like DuckDB is aware of this, and they want to fix it, but not currently feasible for them to fix.

NickCrews · 2024-10-08T16:35:48Z

ughh, what do we want to do? In order to get the self-join behavior that the user wants, I think the backend would have to materialize a temporary table. But I really don't think we want to do that automatically for them. So the best we could do is to detect this happening, and then error? That sounds quite hairy.

cpcloud · 2024-10-08T16:58:23Z

I think the best we can do is to recommend calling .cache() in these cases. That's the only thing that will guarantee the desired semantics.

NickCrews · 2024-10-08T17:25:50Z

Do we do any sort of check to ensure people don't footgun themselves, or do we only just let them get weird results and then have to start searching the issues/docs? I can't think of where this should go in the docs, maybe we want a "common pitfalls" or "troubleshooting" page? I'm not sure if there are other footguns like this that we should call out that would also fit in there

cpcloud · 2024-10-09T13:29:39Z

Do we do any sort of check to ensure people don't footgun themselves, or do we only just let them get weird results and then have to start searching the issues/docs? I can't think of where this should go in the docs, maybe we want a "common pitfalls" or "troubleshooting" page? I'm not sure if there are other footguns like this that we should call out that would also fit in there

We don't have a check for this and I don't think it's worth adding because of the complexity of such a check.

I'm not sure if there are other footguns like this that we should call out that would also fit in there.

There are almost certainly are, but let's try to stay focused on this issue before moving on to other things.

For when someone finds these tests in 6 months, they will have some idea of what they can do about them, and where to go looking next.

NickCrews · 2024-10-09T16:58:00Z

OK, sounds good to me to just get flink passing and then get this PR merged and call it a day, at least the behavior is written down somewhere. How does this sound as a plan?

I added a few comments to the tests so someone sees what our conclusions were.

NickCrews force-pushed the test-correlation branch from 0992a3f to 48d4a51 Compare April 18, 2024 19:51

cpcloud reviewed Apr 18, 2024

View reviewed changes

ibis/backends/tests/test_impure.py Outdated Show resolved Hide resolved

NickCrews force-pushed the test-correlation branch from 48d4a51 to 1fea07f Compare April 19, 2024 00:27

NickCrews mentioned this pull request Apr 19, 2024

bug: inlining expressions leads to wrong results for non-pure functions #8921

Open

1 task

NickCrews force-pushed the test-correlation branch 2 times, most recently from 5d9a1ec to 21b180c Compare April 19, 2024 20:22

NickCrews force-pushed the test-correlation branch 2 times, most recently from 7700941 to d14384c Compare April 23, 2024 19:21

NickCrews force-pushed the test-correlation branch 4 times, most recently from ee0ae0c to 37ae5dd Compare April 23, 2024 22:47

NickCrews enabled auto-merge (rebase) April 23, 2024 23:13

NickCrews mentioned this pull request Apr 25, 2024

bug: merging selections combines filters in incorrect way #9058

Closed

1 task

NickCrews requested review from cpcloud and kszucs May 7, 2024 01:48

NickCrews force-pushed the test-correlation branch 2 times, most recently from ee68083 to 3834829 Compare June 29, 2024 18:57

NickCrews and others added 16 commits October 8, 2024 09:43

test: add test for impure function correlation behavior

ab2ae3b

Related to ibis-project#8921, trying to write down exactly what the expected behavior is.

fix(udf): make udfs impure and avoid merging selects with impure func…

eab4206

…tions

fix(postgres): dedent in udfs

c577d6f

fix(duckdb): thread udf parameters through

0f30227

chore: split expr and execution

674b92b

chore: give nows a shape

751f2a1

chore: split expr and execution

e15cb02

test: skip on non backend tests

8339160

chore: fix pyspark

73e8e35

chore: fix timestampnow decompilation

e187200

chore(flink): fix timestamp now

f4c2989

chore: remove unused error

78506b1

chore: run impure tests all in the same process

117d15b

test(ctes): add test for joining with uuid-generated keys

4180104

test: remove use of old pandas and dask markers

de49350

test: add test for impurity of self join

552c2cf

cpcloud force-pushed the test-correlation branch from 0d2406b to 552c2cf Compare October 8, 2024 13:46

NickCrews mentioned this pull request Oct 8, 2024

CTE column value changes every time the table is used duckdb/duckdb#12875

Open

2 tasks

cpcloud force-pushed the test-correlation branch from 93a42fa to e09d66d Compare October 8, 2024 17:04

chore: fix test

8b51625

cpcloud force-pushed the test-correlation branch from e09d66d to 8b51625 Compare October 8, 2024 17:05

chore: add workaround notes and discussion conclusion to test comments

e877e0d

For when someone finds these tests in 6 months, they will have some idea of what they can do about them, and where to go looking next.

chore: revert flink compiler change

055da77

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: add test for impure function correlation behavior #9014

test: add test for impure function correlation behavior #9014

NickCrews commented Apr 18, 2024 •

edited

Loading

NickCrews commented Apr 18, 2024

NickCrews commented Apr 19, 2024

kszucs commented Apr 22, 2024

cpcloud commented Apr 22, 2024

NickCrews commented Apr 23, 2024

NickCrews commented Apr 23, 2024

NickCrews commented Apr 23, 2024

NickCrews commented May 7, 2024

NickCrews commented May 21, 2024

cpcloud commented Jun 13, 2024

NickCrews commented Jun 13, 2024

NickCrews commented Jun 29, 2024

cpcloud commented Oct 8, 2024 •

edited

Loading

NickCrews commented Oct 8, 2024 •

edited

Loading

NickCrews commented Oct 8, 2024

cpcloud commented Oct 8, 2024

NickCrews commented Oct 8, 2024

cpcloud commented Oct 9, 2024

NickCrews commented Oct 9, 2024

test: add test for impure function correlation behavior #9014

Are you sure you want to change the base?

test: add test for impure function correlation behavior #9014

Conversation

NickCrews commented Apr 18, 2024 • edited Loading

NickCrews commented Apr 18, 2024

NickCrews commented Apr 19, 2024

kszucs commented Apr 22, 2024

cpcloud commented Apr 22, 2024

NickCrews commented Apr 23, 2024

NickCrews commented Apr 23, 2024

NickCrews commented Apr 23, 2024

NickCrews commented May 7, 2024

NickCrews commented May 21, 2024

cpcloud commented Jun 13, 2024

NickCrews commented Jun 13, 2024

NickCrews commented Jun 29, 2024

cpcloud commented Oct 8, 2024 • edited Loading

NickCrews commented Oct 8, 2024 • edited Loading

NickCrews commented Oct 8, 2024

cpcloud commented Oct 8, 2024

NickCrews commented Oct 8, 2024

cpcloud commented Oct 9, 2024

NickCrews commented Oct 9, 2024

NickCrews commented Apr 18, 2024 •

edited

Loading

cpcloud commented Oct 8, 2024 •

edited

Loading

NickCrews commented Oct 8, 2024 •

edited

Loading