Skip to content

Conversation

fsdvh
Copy link
Collaborator

@fsdvh fsdvh commented May 8, 2025

Which issue does this PR close?

Closes #.

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

tlm365 and others added 30 commits August 18, 2024 06:49
* udwf sig

Signed-off-by: jayzhan211 <[email protected]>

* add coerce_types

Signed-off-by: jayzhan211 <[email protected]>

* add doc

Signed-off-by: jayzhan211 <[email protected]>

---------

Signed-off-by: jayzhan211 <[email protected]>
* fix: sqllogictest

* Revert "fix: sqllogictest"

This reverts commit 4957a1d.

* fix: sqllogictest

* remove any type signature

* coerce type from null  to date32

* fmt

* slt

* Revert "coerce type from null  to date32"

This reverts commit bccdc2e.

* replace type coerce by `coerce_types` method

* fmt

* fix underscored param
* do not add redundant subquery ordering into plan

* format code

* add license

* fix test cases with sort plan removing

* fix comment

* keep sorting on ordering mode test cases

* protect test intentions with order + limit

* protect test intentions with order + limit

* Tmp

* Minor changes

* Minor changes

* Minor changes

* Implement top down recursion with delete check

* Minor changes

* Minor changes

* initialize fetch() api for execution plan
remove unnecessary limit plans when used with sort + fetch
add test case for Sort and Limit with offset
push down limit even if a child with no fetch appears when the child supports push down

* Address reviews

* Update comments

* Minor changes

* Make test deterministic

* add supports limit push down to union exec

* support limit push down with multi children cases

* fix typos

Co-authored-by: Mehmet Ozan Kabak <[email protected]>

* Add fetch info to the statistics

* optimize tpch test plans

* Enforce distribution use inexact count estimate also.

* Minor changes

* Minor changes

* merge with apache main
add pushes_global_limit_into_multiple_fetch_plans test case
change limit_pushdown.rs as manual top down operator and simplify algorithm by supporting most parent node remove and other pushdown cases

* format code

* fix doc paths

* fix doc paths

* remove redundant code block

* if partition count is 1 put GlobalLimitExec

* fix test cases

* Apply suggestions from code review

* fix syntax errors

* Simplify branches

* remove redundant limit plans from merge

---------

Co-authored-by: Mustafa Akur <[email protected]>
Co-authored-by: Mehmet Ozan Kabak <[email protected]>
…ependency Projections (apache#12060)

* Fix exprlist bug

* Update datafusion/expr/src/utils.rs

Co-authored-by: Mehmet Ozan Kabak <[email protected]>

* Update utils.rs

* Update joins.slt

---------

Co-authored-by: Mehmet Ozan Kabak <[email protected]>
Update async-trait version in CLI and catalog crates and pin to the
version defined in the root Cargo file where possible.
* fix: support min/max for Float16 type

* minor: uncomment arrow_typeof float16 in sqllocigtests
* draft implement utf8_view for replace

* add function signature

* Add sql test

* move macro util to replace function

* remove unused import

* rust format

* change return type from utf8view to utf8

---------

Co-authored-by: Andrew Lamb <[email protected]>
…reated by doc example (apache#12034)

* Remove checked in file that is created by doc example

* revert changes to .gitignore

* Update example to use tmpdir
* Minor: update sqllogictest to treat Utf8View as text

* fmt
* Planner: support `LATERAL` subqueries

* Planner: use `DFSchema::merge` in `create_relation_subquery`

In order to compute the `set_outer_from_schema` argument we currently
use `DFSchema::join`. When we combine the current outer FROM schema with
the current outer query schema columns from the latter should override
columns from the first, so the correct way is to use `DFSchema::merge`.

To witness the fix, note that the query in the fixed test case isn't
planned as expected without the accompanying changes.

* Update plans

---------

Co-authored-by: Andrew Lamb <[email protected]>
…nction, add `AggregateUDFImpl::is_null` (apache#11989)

* schema assertion and fix the mismatch from logical and physical

Signed-off-by: jayzhan211 <[email protected]>

* add more msg

Signed-off-by: jayzhan211 <[email protected]>

* cleanup

Signed-off-by: jayzhan211 <[email protected]>

* rm test1

Signed-off-by: jayzhan211 <[email protected]>

* nullable for scalar func

Signed-off-by: jayzhan211 <[email protected]>

* nullable

Signed-off-by: jayzhan211 <[email protected]>

* rm field

Signed-off-by: jayzhan211 <[email protected]>

* rm unsafe block and use internal error

Signed-off-by: jayzhan211 <[email protected]>

* rm func_name

Signed-off-by: jayzhan211 <[email protected]>

* rm nullable option

Signed-off-by: jayzhan211 <[email protected]>

* add test

Signed-off-by: jayzhan211 <[email protected]>

* add more msg

Signed-off-by: jayzhan211 <[email protected]>

* fix test

Signed-off-by: jayzhan211 <[email protected]>

* rm row number

Signed-off-by: jayzhan211 <[email protected]>

* Update datafusion/expr/src/udaf.rs

Co-authored-by: Andrew Lamb <[email protected]>

* Update datafusion/expr/src/udaf.rs

Co-authored-by: Andrew Lamb <[email protected]>

* fix failed test from apache#12050

Signed-off-by: jayzhan211 <[email protected]>

* cleanup

Signed-off-by: jayzhan211 <[email protected]>

* add doc

Signed-off-by: jayzhan211 <[email protected]>

---------

Signed-off-by: jayzhan211 <[email protected]>
Co-authored-by: Andrew Lamb <[email protected]>
* remove redundant loop

* remove redundant loop

* remove unused methods
More consistency and less opportunity for column name mismatch.
…2026)

* Add new user doc to translate logical plan to physical plan

apache#7306

* prettier

* Run doc examples as part of cargo --doc

* Update first example to run

* Fix next example

* fix last example

* prettier

* clarify table source

* prettier

* Revert changes

---------

Co-authored-by: Andrew Lamb <[email protected]>
* Improve rpad udf by using a GenericStringBuilder

* fix format

* refine code
apache#12076)

* fix: Panic non-integer for nth_value function

* chore: Display actual value

* Update datafusion/physical-plan/src/windows/mod.rs

Co-authored-by: Marco Neumann <[email protected]>

* chore

---------

Co-authored-by: Marco Neumann <[email protected]>
* Remove vestigal `datafusion-docs` module compilation

* fix build
* Add Utf8View support to STRPOS function

* fix type inconsistency

* fix type inconsistency

* refactor tests
* Update itertools requirement from 0.12 to 0.13

Updates the requirements on [itertools](https://github.com/rust-itertools/itertools) to permit the latest version.
- [Changelog](https://github.com/rust-itertools/itertools/blob/master/CHANGELOG.md)
- [Commits](rust-itertools/itertools@v0.12.0...v0.13.0)

---
updated-dependencies:
- dependency-name: itertools
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <[email protected]>

* Update Cargo.lock

* Avoid deprecated API

* nested-functions: workspace version of itertools

---------

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Andrew Lamb <[email protected]>
Co-authored-by: Eduard Karacharov <[email protected]>
* fix the wildcard expand for filter plan

* expand the wildcard for the error message

* add the tests

* fix recompute_schema

* fix clippy

* cargo fmt

* change the check for having clause

* rename the function and moving the tests

* fix check

* expand the schema for aggregate plan

* reduce the time to expand wildcard

* clean the testing table after tested

* fmt and address review

* stop expand wildcard and add more check for group-by and selects

* simplify the having check
richox and others added 16 commits April 2, 2025 11:32
* fix case_column_or_null with nullable when conditions

* improve sqllogictests for case_column_or_null

---------

Co-authored-by: zhangli20 <[email protected]>
…#14245)

* fix: LimitPushdown rule uncorrect remove some GlobalLimitExec

* Fix some logic for maybe fetch

* Fix test

* Address comments

* Address comments

* Add comments

* Address comments
* fix: Limits are not applied correctly

* Add easy fix

* Add fix

* Add slt testing

* Address comments
* fix: FULL OUTER JOIN and LIMIT produces wrong results

* Fix minor slt testing

* fix test
* fix: Limit together with pushdown_filters

* Fix format

* Address new comments

* Fix testing case to hit the problem
* Add fix for segfault in ByteGroupValueBuilder

* spelling
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.