Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
502 commits
Select commit Hold shift + click to select a range
e30ca2e
chore(deps): bump taiki-e/install-action from 2.62.60 to 2.62.61 (#19…
dependabot[bot] Dec 3, 2025
09fe307
chore(deps): bump actions/stale from 10.1.0 to 10.1.1 (#19061)
dependabot[bot] Dec 3, 2025
2217c2b
chore: merge make_array and spark array (#19006)
jizezhang Dec 3, 2025
b402dff
chore(deps): bump actions/checkout from 6.0.0 to 6.0.1 (#19060)
dependabot[bot] Dec 3, 2025
11c9789
Add documentation example for `PartitionPruningStatistics` (#19020)
alamb Dec 3, 2025
9af6858
Add `force_filter_selections` to restore `pushdown_filters` behavior …
alamb Dec 3, 2025
91e623b
chore: upgrade expr and execution crates to rust 2024 edition (#19047)
timsaucer Dec 3, 2025
d4820d1
fix: log metadata differences when comparing physical and logical sch…
erratic-pattern Dec 3, 2025
eed97cf
Implement timezone-aware handling for to_timestamp functions.
kosiew Oct 12, 2025
bb579d5
Merge branch 'apache:main' into timestamp-17998
Omega359 Dec 3, 2025
7c00e07
Implement timezone-aware handling for to_timestamp functions.
Omega359 Dec 3, 2025
0c6b654
Implement FFI task context and task context provider (#18918)
timsaucer Dec 3, 2025
6751f44
refactor: Refactor spark make_interval signature away from user defin…
kumarUjjawal Dec 4, 2025
434a23b
Fix: Align sort_merge_join filter output with join schema to fix righ…
kumarUjjawal Dec 4, 2025
90aa47e
Support Substrait Round-Trip of `EmptyRelation` Including `produce_on…
kosiew Dec 4, 2025
00e7952
chore(deps): bump taiki-e/install-action from 2.62.61 to 2.62.62 (#19…
dependabot[bot] Dec 4, 2025
71fdad0
chore: enforce clippy::allow_attributes for datasource crates (#19068)
chakkk309 Dec 4, 2025
63a8c65
common: Add hashing support for REE arrays (#18981)
brancz Dec 4, 2025
f22a3f3
Use `tpchgen-cli` to generate tpch data in bench.sh (#19035)
alamb Dec 4, 2025
5c4eaf7
Updates from code review.
Omega359 Dec 4, 2025
71fe6d3
Merge remote-tracking branch 'origin/timestamp-17998' into timestamp-…
Omega359 Dec 4, 2025
8dac8f1
Minor: fix link errors in docs (#19088)
alamb Dec 4, 2025
388db0e
Update aggregate probe to be locked only if skipping aggregation (#18…
hareshkh Dec 4, 2025
d150af7
Adds memory-bound DefaultListFilesCache (#18855)
BlakeOrth Dec 4, 2025
107cb5e
Fix function doc CI check (#19093)
alamb Dec 4, 2025
a3a020f
Fix Schema Duplication Errors in Self‑Referential INTERSECT/EXCEPT by…
kosiew Dec 5, 2025
e01fa0c
Cut `Parquet` over to PhysicalExprAdapter, remove `SchemaAdapter` (#1…
adriangb Dec 5, 2025
a33ae5e
run cargo fmt to fix after #18998 (#19102)
adriangb Dec 5, 2025
a5fc3c7
bench: set test_util as required feature for aggregate_vectorized (#1…
rluvaton Dec 5, 2025
15096a6
Extract parsing of UTC to a static.
Omega359 Dec 5, 2025
1844c88
Update Committer / PMC list (#19105)
alamb Dec 5, 2025
03c1a8d
use ProjectionExprs:project_statistics in FileScanConfig (#19094)
adriangb Dec 5, 2025
f04a0eb
Temporarily ignore test_cache_with_ttl_and_lru test (#19115)
alamb Dec 5, 2025
482c6b8
refactor: move human readable display utilities to `datafusion-common…
2010YOUY01 Dec 5, 2025
2b05b09
feat: Add builder API for CreateExternalTable to reduce verbosity (#1…
AryanBagade Dec 5, 2025
1800d15
Always remove unecessary software from github runners for all jobs (f…
alamb Dec 5, 2025
c293854
fix: fix panic when lo is greater than hi (#19099)
tshauck Dec 5, 2025
340a28c
fix: escape underscores when simplifying `starts_with` (#19077)
willemv Dec 5, 2025
cf9d078
[datafusion-spark]: Refactor make_dt_interval's signature away from u…
codetyri0n Dec 5, 2025
6746007
feat(spark): Implement Spark functions `url_encode`, `url_decode` and…
anhvdq Dec 6, 2025
c479dee
fix deprecation notes with incorrect versions from #13083 (#19135)
adriangb Dec 7, 2025
944f7f2
Run the examples in the new format (#18946)
cj-zhukov Dec 7, 2025
cb2f3d2
Add constant expression evaluator to physical expression simplifier (…
adriangb Dec 7, 2025
fc6d0a4
Fix shuffle function to report nullability correctly (#19184)
harshitsaini17 Dec 7, 2025
41f7137
chore: enforce clippy::allow_attributes for physical crates (#19185)
carlosahs Dec 8, 2025
3a0064d
fix: custom nullability for length (#19175) (#19182)
skushagra Dec 8, 2025
998f534
Update 5 crates to rust 2024 edition (#19091)
timsaucer Dec 8, 2025
215cdff
Coalesce batches inside hash join, reuse indices buffer (#18972)
Dandandan Dec 8, 2025
dd35f7c
slt test coverage for `CASE` exprs with constant value lookup tables…
alamb Dec 8, 2025
4c3e3c1
Fix fmt after logical conflict (#19208)
alamb Dec 8, 2025
e8384fb
chore: Add TPCDS benchmarks (#19138)
comphead Dec 8, 2025
adaed42
Arc partition values in TableSchema (#19137)
adriangb Dec 8, 2025
0812d27
feat: Move DefaultMetadataCache into its own module (#19125)
AryanBagade Dec 8, 2025
662a3ba
fix: inverted null_percent logic in in_list benchmark (#19204)
geoffreyclaude Dec 8, 2025
cde6dfa
Add sorted data benchmark. (#19042)
zhuqi-lucas Dec 8, 2025
f2b476f
Refactor PhysicalExprSimplfier to &self instead of &mut self (#19212)
adriangb Dec 8, 2025
3fabee7
Revert adding PhysicalOptimizerRule::optimize_plan (#19186)
adriangb Dec 8, 2025
4504d94
chore(deps): bump uuid from 1.18.1 to 1.19.0 (#19199)
dependabot[bot] Dec 8, 2025
af3ab52
chore(deps): bump async-compression from 0.4.34 to 0.4.35 (#19201)
dependabot[bot] Dec 8, 2025
dcc09c3
chore(deps): bump taiki-e/install-action from 2.62.62 to 2.62.63 (#19…
dependabot[bot] Dec 8, 2025
7d958b0
chore(deps): bump tracing-subscriber from 0.3.20 to 0.3.22 (#19200)
dependabot[bot] Dec 8, 2025
71a6982
chore(deps): bump wasm-bindgen-test from 0.3.55 to 0.3.56 (#19202)
dependabot[bot] Dec 8, 2025
55a38d4
feat: Add `remove_optimizer_rule` to `SessionContext` (#19209)
nuno-faria Dec 8, 2025
ab7fe0e
bench: add dedicated Utf8View benchmarks for InList (#19211)
geoffreyclaude Dec 8, 2025
83736ef
Fix PruningPredicate interaction with DynamicFilterPhysicalExpr that …
adriangb Dec 9, 2025
ad9b779
Implement physical and logical codecs in FFI (#19079)
timsaucer Dec 9, 2025
4ddee14
refactor: Refactor spark width bucket signature away from user define…
kumarUjjawal Dec 9, 2025
2626fc2
fix: Ensure column names do not change with `expand_views_at_output` …
nuno-faria Dec 9, 2025
4fb36b2
fix: bitmap_count should report nullability correctly (#19195)
harshitsaini17 Dec 9, 2025
1a6df66
fix: bit_count function to report nullability correctly (#19197)
harshitsaini17 Dec 9, 2025
ea9f354
fix: derive custom nullability for spark `bit_shift` (#19222)
kumarUjjawal Dec 9, 2025
7ea5066
Sort Merge Join: Reduce batch concatenation, use `BatchCoalescer`, ne…
mbutrovich Dec 9, 2025
a30cf37
Add relation planner extension support to customize SQL planning (#17…
geoffreyclaude Dec 9, 2025
20870c1
feat: integrate batch coalescer with repartition exec (#19002)
jizezhang Dec 9, 2025
c0e8bb5
Push down InList or hash table references from HashJoinExec depending…
adriangb Dec 9, 2025
21a16e4
Add additional tests for InListExpr (#19050)
adriangb Dec 9, 2025
7ac40d7
chore(deps): bump taiki-e/install-action from 2.62.63 to 2.62.64 (#19…
dependabot[bot] Dec 9, 2025
dc78613
Move partition handling out of PhysicalExprAdapter (#19128)
adriangb Dec 9, 2025
cd160de
fix: spark elt custom nullability (#19207)
EeshanBembi Dec 9, 2025
dc6a712
Use strum in the examples (#19126) (#19205)
cj-zhukov Dec 9, 2025
d85595e
perf: optimize CASE WHEN lookup table (2.5-22.5 times faster) (#18183)
rluvaton Dec 9, 2025
e8a0829
Allow Logical expression ScalarVariable to represent an extension typ…
batmnnn Dec 9, 2025
dc4e3ab
feat: Implement the `statistics_cache` function (#19054)
nuno-faria Dec 9, 2025
ca67edc
[Proto]: Serialization support for `AsyncFuncExec` (#19118)
mach-kernel Dec 10, 2025
26af02a
chore: add test case for decimal overflow (#19255)
Jefffrey Dec 10, 2025
79a2f5e
chore(deps): bump taiki-e/install-action from 2.62.64 to 2.62.65 (#19…
dependabot[bot] Dec 10, 2025
2a08013
chore: update 6 crates to rust edition 2024 (#19196)
timsaucer Dec 10, 2025
021188e
Push down projection expressions into ParquetOpener (#19111)
adriangb Dec 10, 2025
321401c
fix: `array_remove`/`array_remove_n`/`array_remove_all` not using the…
rluvaton Dec 10, 2025
c1aa1b5
Track column sizes in Statistics; propagate through projections (#19113)
adriangb Dec 10, 2025
39a1f75
Implement FFI_Session (#19223)
timsaucer Dec 11, 2025
89a9c9d
Feat: Add an option for fast tests by gating slow tests to extended_t…
Yuvraj-cyborg Dec 11, 2025
5496c30
fix: typo in sql/ddl (#19276)
mag1c1an1 Dec 11, 2025
e914935
chore: enforce clippy::allow_attributes for 7 crates (#19133)
chakkk309 Dec 11, 2025
bde1608
feat: Preserve File Partitioning From File Scans (#19124)
gene-bordegaray Dec 11, 2025
58345f9
Improve ProjectionExpr documentation and comments (#19263)
alamb Dec 11, 2025
85d8a88
add specialized InList implementations for common scalar types (#18832)
adriangb Dec 11, 2025
c8add5c
Update README occording to the new examples (#18529) (#19257)
cj-zhukov Dec 11, 2025
044a4a7
Add make_time function (#19183)
Omega359 Dec 11, 2025
33b3f02
dev: Add CI doc prettier check to local `rust_lint.sh` (#19254)
2010YOUY01 Dec 11, 2025
a3b3eb5
bug: Eliminate dead round-robin insertion in enforce distribution (#1…
gene-bordegaray Dec 11, 2025
f9c030a
Automatically download tpcds benchmark data to the right place (#19244)
alamb Dec 11, 2025
8601151
fix: flaky cache test (#19140)
xonx4l Dec 11, 2025
10c933c
[datafusion-spark]: Refactor hex's signature away from user_defined (…
codetyri0n Dec 12, 2025
9b3e417
Merge remote-tracking branch 'upstream/main' into timestamp-17998
Omega359 Dec 12, 2025
7dded42
Updates from merge.
Omega359 Dec 12, 2025
1ea061c
remove unused functions.
Omega359 Dec 12, 2025
06f27e9
fix : correct nullability propagation for spark.bitwise_not (#19224)
shifluxxc Dec 12, 2025
aebae99
fix: Add custom nullability for Spark ILIKE function (#19206)
Eshaan-byte Dec 12, 2025
96ddd55
fix: derive custom nullability for spark `map_from_arrays` (#19275)
kumarUjjawal Dec 12, 2025
18e4a0c
fix: derive custom nullability for spark map_from_entries (#19274)
kumarUjjawal Dec 12, 2025
2c6f6d3
fix: derive custom nullable for spark `make_dt_interval` (#19236)
kumarUjjawal Dec 12, 2025
c4ca946
fix: derive custome nullable for the spark last_day (#19232)
kumarUjjawal Dec 12, 2025
49cfee0
fix: derive custom nullable for spark `date_sub` (#19225)
kumarUjjawal Dec 12, 2025
8458946
added custom nullability for char (#19268)
skushagra Dec 12, 2025
5a01e68
Update to_date udf function to support a consistent set of argument t…
Omega359 Dec 12, 2025
d61f1a7
replace HashTableLookupExpr with lit(true) in proto serialization (#1…
adriangb Dec 13, 2025
fedddbc
chore: fix return_field_from_args doc (#19307)
xumingming Dec 13, 2025
4088192
chore: enforce clippy::allow_attributes for spark,sql,sustrait (#19309)
kumarUjjawal Dec 14, 2025
94d9c8e
Simplify make_date & fix null handling (#19296)
Jefffrey Dec 14, 2025
033116e
fix: Fix a few minor issues with join metrics (#19283)
linhr Dec 14, 2025
9163fee
fix: derive nullability for spark `bit_get` (#19220)
kumarUjjawal Dec 14, 2025
16fa35f
Allow base64 encoding of fixedsizebinary arrays (#18950)
maxburke Dec 14, 2025
befaf93
chore: update 11 crates to Rust 2024 edition (#19258)
timsaucer Dec 14, 2025
ec11f42
feat: Add constant column extraction and rewriting for projections in…
Weijun-H Dec 14, 2025
dbf9265
fix: pow() with integer base and negative float exponent returns erro…
adriangb Dec 15, 2025
2bea796
Minor: remove unnecessary unit tests for fixed size binary (#19318)
alamb Dec 15, 2025
899a762
Populate partition column statistics for PartitionedFile (#19284)
adriangb Dec 15, 2025
7122aef
Merge branch 'main' into timestamp-17998
Omega359 Dec 15, 2025
7c05b20
refactor: move metrics module to `datafusion-common` crate (#19247)
2010YOUY01 Dec 15, 2025
9d4fe15
Add library user guide for extending SQL syntax (#19265)
geoffreyclaude Dec 15, 2025
b828fbf
chore(deps): bump taiki-e/install-action from 2.62.65 to 2.62.67 (#19…
dependabot[bot] Dec 15, 2025
58377bf
chore(deps): bump ctor from 0.6.1 to 0.6.3 (#19328)
dependabot[bot] Dec 15, 2025
fc88240
Refactor `power()` signature away from user defined (#18968)
Jefffrey Dec 16, 2025
933657e
feat: Support sliding window queries for MedianAccumulator by impleme…
petern48 Dec 16, 2025
02c647a
chore: enforce `clippy::allow_attributes` for optimizer and macros (#…
kumarUjjawal Dec 16, 2025
3a67d86
fix(concat): correct nullability inference (nullable only if all argu…
ujjwaltwri Dec 16, 2025
266a7d0
chore(deps): bump taiki-e/install-action from 2.62.67 to 2.63.3 (#19349)
dependabot[bot] Dec 16, 2025
ea8087d
chore(deps): bump clap from 4.5.50 to 4.5.53 (#19326)
dependabot[bot] Dec 16, 2025
199cf06
chore(deps): bump insta from 1.43.2 to 1.44.3 (#19327)
dependabot[bot] Dec 16, 2025
79cfe8e
Add runtime config options for `list_files_cache_limit` and `list_fil…
delamarch3 Dec 16, 2025
89af694
remove repartition exec from coalesce batches optimizer (#19239)
jizezhang Dec 16, 2025
be0cf05
minor: cleanup unnecessary config in `decimal.slt` (#19352)
Jefffrey Dec 16, 2025
c53a448
Fix panic for `GROUPING SETS(())` and handle empty-grouping aggregate…
kosiew Dec 16, 2025
775277a
Update datafusion-core crate to Rust 2024 edition (#19332)
timsaucer Dec 16, 2025
efd793b
Minor: clean up titles and links n extending operators and optimizer …
alamb Dec 16, 2025
1f26716
Update 4 crates to rust 2024 edition (#19357)
timsaucer Dec 16, 2025
50d20dd
preserve Field metadata in first_value/last_value (#19335)
adriangb Dec 16, 2025
3a41cc6
Establish the high level API for sort pushdown and the optimizer rule…
zhuqi-lucas Dec 17, 2025
b806117
Fix flaky SpillPool channel test by synchronizing reader and writer t…
kosiew Dec 17, 2025
2b67ba8
[minor] Upgrade rust version (#19363)
Dandandan Dec 17, 2025
401345b
Minor: fix cargo fmt (#19368)
zhuqi-lucas Dec 17, 2025
7900cd6
chore: enforce clippy::allow_attributes for proto, pruning, session (…
kumarUjjawal Dec 17, 2025
59dcc36
Update remaining crates to rust 2024 edition (#19361)
timsaucer Dec 17, 2025
e89d0bd
PR review feedback.
Omega359 Dec 17, 2025
f4e0934
PR review feedback.
Omega359 Dec 17, 2025
6d68731
Merge branch 'main' into timestamp-17998
Omega359 Dec 17, 2025
2e95f26
Minor: Make `ProjectionExpr::new` easier to use with constants (#19343)
alamb Dec 17, 2025
fe11ad6
fix: Added nullable return from date_add(#19151) (#19229)
manishkr Dec 17, 2025
3879b24
fix: spark sha1 nullability reporting (#19242)
ShashidharM0118 Dec 17, 2025
1e4bd75
Feat: DefaultListFilesCache prefix-aware for partition pruning optimi…
Yuvraj-cyborg Dec 17, 2025
14cd71e
feat: add compression level configuration for JSON/CSV writers (#18954)
Smotrov Dec 17, 2025
4e7bba4
Extend in_list benchmark coverage (#19376)
geoffreyclaude Dec 18, 2025
d68b629
Add hashing microbenchmark `with_hashes` (#19373)
alamb Dec 18, 2025
db10d2c
Merge branch 'main' into timestamp-17998
Omega359 Dec 18, 2025
cb3fa1c
fix: derive custom nullability for the spark `next_day` (#19253)
kumarUjjawal Dec 18, 2025
0a0416d
Move `newlines_in_values` from `FileScanConfig` to `CsvSource` (#19313)
adriangb Dec 18, 2025
2c9077a
[datafusion-cli] Implement average LIST duration for object store pro…
peterxcli Dec 18, 2025
8120da4
chore(deps): bump taiki-e/install-action from 2.63.3 to 2.64.0 (#19382)
dependabot[bot] Dec 18, 2025
ead8209
update insta snapshots (#19381)
kosiew Dec 18, 2025
d59ebac
feat(spark): implement Spark `try_sum` function (#18569)
davidlghellin Dec 18, 2025
c2747eb
feat: Support log for Decimal32 and Decimal64 (#18999)
Mark1626 Dec 19, 2025
cbf33d1
Fix regression for negative-scale decimal128 in log (#19315)
shifluxxc Dec 19, 2025
d493f3d
Add Decimal support to Ceil and Floor (#18979)
kumarUjjawal Dec 19, 2025
8550010
Fix input handling for encoding functions & various refactors (#18754)
Jefffrey Dec 19, 2025
75d2473
Remove SchemaAdapter (#19345)
adriangb Dec 19, 2025
887aa9f
fix: preserve ListFilesCache TTL when not set in config (#19401)
ShashidharM0118 Dec 19, 2025
91cfb69
feat(proto): Add protobuf serialization for HashExpr (#19379)
adriangb Dec 19, 2025
2e3707e
fix: projection for `CooperativeExec` and `CoalesceBatchesExec` (#19400)
haohuaijin Dec 19, 2025
b3d2cb6
Fix ORDER BY positional reference regression with aliased aggregates …
adriangb Dec 19, 2025
5419ff5
feat: hash partitioning satisfies subset (#19304)
gene-bordegaray Dec 19, 2025
8cc8c11
Optimize muti-column grouping with StringView/ByteView (option 2) - 2…
alamb Dec 20, 2025
6fa9c1a
Optimize hashing for StringView and ByteView (15-70% faster) (#19374)
alamb Dec 20, 2025
eb30c19
Implement disk spilling for all grouping ordering modes in GroupedHas…
pepijnve Dec 20, 2025
32e6fe8
feat: update FFI TableProvider and ExecutionPlan to use FFI Session a…
timsaucer Dec 20, 2025
d8e68a4
refactor: add ParquetOpenerBuilder to reduce test code duplication (#…
ShashidharM0118 Dec 20, 2025
4a91743
Add documentation.
Omega359 Dec 20, 2025
4249e4e
bench: add `range_and_generate_series` (#19428)
rluvaton Dec 20, 2025
8d9cb2e
Refactored ScalarDataType out of the PR.
Omega359 Dec 20, 2025
64a29b4
Merge branch 'main' into timestamp-17998
Omega359 Dec 20, 2025
65a6bc4
chore: use extend instead of manual loop in multi group by (#19429)
rluvaton Dec 21, 2025
2c3566c
doc: add example for cache factory (#19139)
jizezhang Dec 21, 2025
72f2a97
Merge branch 'main' into timestamp-17998
Omega359 Dec 21, 2025
1acaf7a
chore(deps): bump taiki-e/install-action from 2.64.0 to 2.64.2 (#19399)
dependabot[bot] Dec 21, 2025
9fe9ec7
fix: spark crc32 custom nullability (#19271)
watanaberin Dec 21, 2025
9a9e4dd
Add recursive protection on planner's `create_physical_expr` (#19299)
rgehan Dec 22, 2025
d9d55cf
chore(deps): bump aws-config from 1.8.11 to 1.8.12 (#19453)
dependabot[bot] Dec 22, 2025
edc693f
chore(deps): bump log from 0.4.28 to 0.4.29 (#19452)
dependabot[bot] Dec 22, 2025
c7f9fdf
chore(deps): bump taiki-e/install-action from 2.64.2 to 2.65.1 (#19451)
dependabot[bot] Dec 22, 2025
8e95627
chore(deps): bump sphinx-reredirects from 1.0.0 to 1.1.0 in /docs (#1…
dependabot[bot] Dec 22, 2025
5fedb84
chore(deps): bump insta from 1.44.3 to 1.45.0 (#19454)
dependabot[bot] Dec 22, 2025
1579ec3
Merge pull request #7
kosiew Dec 22, 2025
783b684
Small doc fix.
Omega359 Dec 22, 2025
a886b9e
added support for negative scale for log decimal32/64 and power (#19409)
shifluxxc Dec 22, 2025
1e59164
Remove core dependency from ffi (#19422)
timsaucer Dec 22, 2025
5764925
Merge branch 'main' into timestamp-17998
Omega359 Dec 22, 2025
bb9a4a7
bench: increase in_list benchmark coverage (#19443)
geoffreyclaude Dec 22, 2025
48f5d0b
fix: Fix skip aggregate test to cover regression (#19461)
kumarUjjawal Dec 23, 2025
d0d9311
fix: [19450]Added flush for tokio file(substrait) write (#19456)
manishkr Dec 23, 2025
258e18c
Use SortMergeJoinExec name consistently in physical plan outputs (#19…
xavlee Dec 23, 2025
d844f86
Add:arrow_metadata() UDF (#19435)
xonx4l Dec 23, 2025
4a1f69f
Update date_bin to support Time32 and Time64 data types (#19341)
Omega359 Dec 23, 2025
72f1746
feat: Add decimal support for round (#19384)
kumarUjjawal Dec 23, 2025
e6faacb
Fix panic during spill to disk in clickbench query (#19421)
alamb Dec 23, 2025
677c543
Optimize memory footprint of view arrays from `ScalarValue::to_array_…
Jefffrey Dec 23, 2025
33ac70d
minor: refactoring of some `ScalarValue` code (#19439)
Jefffrey Dec 23, 2025
0bd8809
fix: csv schema_infer_max_records set to 0 return null datatype (#19432)
haohuaijin Dec 24, 2025
902d3b3
fix: Add custom nullability for Spark LIKE function (#19218)
KaranPradhan266 Dec 24, 2025
67b526a
Refactor Spark crc32 & sha1 to remove unnecessary scalar argument che…
Jefffrey Dec 24, 2025
47ddd50
Add link to arrow-rs ticket in comments (#19479)
alamb Dec 24, 2025
a405d3f
Support nested field access in `get_field` with multiple path argumen…
adriangb Dec 24, 2025
d2830b6
chore(deps): bump taiki-e/install-action from 2.65.1 to 2.65.2 (#19474)
dependabot[bot] Dec 24, 2025
6ce2374
Improve plan_to_sql handling of empty projections with dialect-specif…
kosiew Dec 25, 2025
ef2c1a3
examples: replace sql_dialect with custom_sql_parser example (#19383)
geoffreyclaude Dec 25, 2025
03904e1
Replace custom merge operator with arrow-rs implementation (#19424)
pepijnve Dec 25, 2025
ea2e22c
Implement nested recursive CTEs (#18956)
Tpt Dec 25, 2025
e586ff5
fix: implement custom nullability for spark abs function (#19395)
batmnnn Dec 25, 2025
8bd8df0
Merge branch 'main' into timestamp-17998
Omega359 Dec 25, 2025
058bcb0
fix: custom nullability for format_string (#19173) (#19190)
skushagra Dec 26, 2025
6274080
Update `to_unixtime` udf function to support a consistent set of argu…
kumarUjjawal Dec 26, 2025
ed7af0b
Add: PI upper/lower bound f16 constants to ScalarValue (#19497)
xonx4l Dec 27, 2025
8532731
chore: enforce clippy::allow_attributes for datafusion-ffi crate (#19…
chakkk309 Dec 27, 2025
ae35177
Add CI check to ensure examples are documented in README (#19371)
cj-zhukov Dec 27, 2025
e5ca510
perf: Improve performance of `to_hex` (> 2x) (#19503)
andygrove Dec 27, 2025
d20c5d6
fix : snapshot to the modern multiline format (#19517)
Nachiket-Roy Dec 28, 2025
5b90cee
perf: improve performance of string repeat (#19502)
andygrove Dec 28, 2025
134be4c
chore(deps): bump taiki-e/install-action from 2.65.2 to 2.65.3 (#19499)
dependabot[bot] Dec 28, 2025
d825e5f
docs : clarify unused test utility (#19508)
Nachiket-Roy Dec 28, 2025
bb4e0ec
perf: Optimize `starts_with` and `ends_with` for scalar arguments (#1…
andygrove Dec 28, 2025
85c696d
Date / time / interval arithmetic improvements (#19460)
Omega359 Dec 28, 2025
8246631
fix: Implement `reset_state` for `LazyMemoryExec` (#19362)
nuno-faria Dec 28, 2025
6ac7b89
Preserve ORDER BY in Unparser for projection -> order by pattern (#19…
adriangb Dec 29, 2025
9eddf47
fix: CteWorkTable: properly apply TableProvider::scan projection argu…
Tpt Dec 29, 2025
4960284
fix: Median() integer overflow (#19509)
kumarUjjawal Dec 29, 2025
10db6b3
Redesign the try_reverse_output to support more cases (#19446)
zhuqi-lucas Dec 29, 2025
a95c7fc
feat: fix matching for named parameters with non-lowercase signatures…
bubulalabu Dec 29, 2025
83ed192
refactor: Spark `ascii` signature away from `user_defined` (#19513)
kumarUjjawal Dec 29, 2025
36df145
feat: Add per-expression evaluation timing metrics to ProjectionExec …
2010YOUY01 Dec 29, 2025
3aa0ab7
Fix: SparkAscii nullability to depend on input nullability (#19531)
Yuvraj-cyborg Dec 29, 2025
55845e0
Merge remote-tracking branch 'upstream/main' into timestamp-17998
Omega359 Dec 29, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
63 changes: 3 additions & 60 deletions .asf.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -54,72 +54,14 @@ github:
# needs to be updated as part of the release process
# .asf.yaml doesn't support wildcard branch protection rules, only exact branch names
# https://github.com/apache/infrastructure-asfyaml?tab=readme-ov-file#branch-protection
# Keeping set of protected branches for future releases
# Meanwhile creating a prerelease script that will update the branch protection names
# automatically. Keep track on it https://github.com/apache/datafusion/issues/17134
# these branches protection blocks autogenerated during release process which is described in
# https://github.com/apache/datafusion/tree/main/dev/release#2-add-a-protection-to-release-candidate-branch
branch-50:
required_pull_request_reviews:
required_approving_review_count: 1
branch-51:
required_pull_request_reviews:
required_approving_review_count: 1
branch-52:
required_pull_request_reviews:
required_approving_review_count: 1
branch-53:
required_pull_request_reviews:
required_approving_review_count: 1
branch-54:
required_pull_request_reviews:
required_approving_review_count: 1
branch-55:
required_pull_request_reviews:
required_approving_review_count: 1
branch-56:
required_pull_request_reviews:
required_approving_review_count: 1
branch-57:
required_pull_request_reviews:
required_approving_review_count: 1
branch-58:
required_pull_request_reviews:
required_approving_review_count: 1
branch-59:
required_pull_request_reviews:
required_approving_review_count: 1
branch-60:
required_pull_request_reviews:
required_approving_review_count: 1
branch-61:
required_pull_request_reviews:
required_approving_review_count: 1
branch-62:
required_pull_request_reviews:
required_approving_review_count: 1
branch-63:
required_pull_request_reviews:
required_approving_review_count: 1
branch-64:
required_pull_request_reviews:
required_approving_review_count: 1
branch-65:
required_pull_request_reviews:
required_approving_review_count: 1
branch-66:
required_pull_request_reviews:
required_approving_review_count: 1
branch-67:
required_pull_request_reviews:
required_approving_review_count: 1
branch-68:
required_pull_request_reviews:
required_approving_review_count: 1
branch-69:
required_pull_request_reviews:
required_approving_review_count: 1
branch-70:
required_pull_request_reviews:
required_approving_review_count: 1
pull_requests:
# enable updating head branches of pull requests
allow_update_branch: true
Expand All @@ -129,3 +71,4 @@ github:
# https://datafusion.apache.org/
publish:
whoami: asf-site

14 changes: 14 additions & 0 deletions .github/actions/setup-builder/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -46,3 +46,17 @@ runs:
# https://github.com/actions/checkout/issues/766
shell: bash
run: git config --global --add safe.directory "$GITHUB_WORKSPACE"
- name: Remove unnecessary preinstalled software
shell: bash
run: |
echo "Disk space before cleanup:"
df -h
apt-get clean
# remove tool cache: about 8.5GB (github has host /opt/hostedtoolcache mounted as /__t)
rm -rf /__t/* || true
# remove Haskell runtime: about 6.3GB (host /usr/local/.ghcup)
rm -rf /host/usr/local/.ghcup || true
# remove Android library: about 7.8GB (host /usr/local/lib/android)
rm -rf /host/usr/local/lib/android || true
echo "Disk space after cleanup:"
df -h
4 changes: 2 additions & 2 deletions .github/workflows/audit.yml
Original file line number Diff line number Diff line change
Expand Up @@ -40,9 +40,9 @@ jobs:
security_audit:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
- name: Install cargo-audit
uses: taiki-e/install-action@6f9c7cc51aa54b13cbcbd12f8bbf69d8ba405b4b # v2.62.47
uses: taiki-e/install-action@de7896b7cd1c7d181266425abbe571b5a8c757bc # v2.65.3
with:
tool: cargo-audit
- name: Run audit check
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/dependencies.yml
Original file line number Diff line number Diff line change
Expand Up @@ -44,7 +44,7 @@ jobs:
container:
image: amd64/rust
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
submodules: true
fetch-depth: 1
Expand All @@ -62,7 +62,7 @@ jobs:
container:
image: amd64/rust
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
- name: Install cargo-machete
run: cargo install cargo-machete --version ^0.9 --locked
- name: Detect unused dependencies
Expand Down
34 changes: 21 additions & 13 deletions .github/workflows/dev.yml
Original file line number Diff line number Diff line change
Expand Up @@ -32,8 +32,9 @@ jobs:
runs-on: ubuntu-latest
name: Check License Header
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
- name: Install HawkEye
# This CI job is bound by installation time, use `--profile dev` to speed it up
run: cargo install hawkeye --version 6.2.0 --locked --profile dev
- name: Run license header check
run: ci/scripts/license_header.sh
Expand All @@ -42,18 +43,25 @@ jobs:
name: Use prettier to check formatting of documents
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/setup-node@2028fbc5c25fe9cf00d9f06a71cc4710d4507903 # v6.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
- uses: actions/setup-node@395ad3262231945c25e8478fd5baf05154b1d79f # v6.1.0
with:
node-version: "20"
- name: Prettier check
run: |
# if you encounter error, rerun the command below and commit the changes
#
# ignore subproject CHANGELOG.md because they are machine generated
npx [email protected] --write \
'{datafusion,datafusion-cli,datafusion-examples,dev,docs}/**/*.md' \
'!datafusion/CHANGELOG.md' \
README.md \
CONTRIBUTING.md
git diff --exit-code
# if you encounter error, see instructions inside the script
run: ci/scripts/doc_prettier_check.sh

typos:
name: Spell Check with Typos
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
persist-credentials: false
# Version fixed on purpose. It uses heuristics to detect typos, so upgrading
# it may cause checks to fail more often.
# We can upgrade it manually once a while.
- name: Install typos-cli
run: cargo install typos-cli --locked --version 1.37.0
- name: Run typos check
run: ci/scripts/typos_check.sh
6 changes: 3 additions & 3 deletions .github/workflows/docs.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -32,16 +32,16 @@ jobs:
runs-on: ubuntu-latest
steps:
- name: Checkout docs sources
uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1

- name: Checkout asf-site branch
uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
ref: asf-site
path: asf-site

- name: Setup Python
uses: actions/setup-python@e797f83bcb11b83ae66e0230d6156d7c80228e7c # v6.0.0
uses: actions/setup-python@83679a892e2d95755f2dac6acb0bfd1e9ac5d548 # v6.1.0
with:
python-version: "3.12"

Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/docs_pr.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -40,12 +40,12 @@ jobs:
name: Test doc build
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
submodules: true
fetch-depth: 1
- name: Setup Python
uses: actions/setup-python@e797f83bcb11b83ae66e0230d6156d7c80228e7c # v6.0.0
uses: actions/setup-python@83679a892e2d95755f2dac6acb0bfd1e9ac5d548 # v6.1.0
with:
python-version: "3.12"
- name: Install doc dependencies
Expand Down
8 changes: 4 additions & 4 deletions .github/workflows/extended.yml
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,7 @@ jobs:
runs-on: ubuntu-latest
# note: do not use amd/rust container to preserve disk space
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
ref: ${{ github.event.inputs.pr_head_sha }} # will be empty if triggered by push
submodules: true
Expand All @@ -93,7 +93,7 @@ jobs:
runs-on: ubuntu-latest
# note: do not use amd/rust container to preserve disk space
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
ref: ${{ github.event.inputs.pr_head_sha }} # will be empty if triggered by push
submodules: true
Expand Down Expand Up @@ -137,7 +137,7 @@ jobs:
container:
image: amd64/rust
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
ref: ${{ github.event.inputs.pr_head_sha }} # will be empty if triggered by push
submodules: true
Expand All @@ -158,7 +158,7 @@ jobs:
container:
image: amd64/rust
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
ref: ${{ github.event.inputs.pr_head_sha }} # will be empty if triggered by push
submodules: true
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/labeler.yml
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@ jobs:
contents: read
pull-requests: write
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1

- name: Assign GitHub labels
if: |
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/large_files.yml
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ jobs:
check-files:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: actions/checkout@8e8c483db84b4bee98b60c0593521ed34d9990e8 # v6.0.1
with:
fetch-depth: 0
- name: Check size of new Git objects
Expand Down
Loading