Skip to content

Commit 3cdafa4

Browse files
committed
rebasing
1 parent dc13ab9 commit 3cdafa4

File tree

1 file changed

+2
-29
lines changed

1 file changed

+2
-29
lines changed

docs/blog/posts/spatial-tables-data-lakehouses.md

Lines changed: 2 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,13 @@
11
---
22
date:
3-
created: 2025-04-30
3+
created: 2025-7-26
44
authors:
55

66
- matt_powers
77
- kelly
88
title: Geospatial Data on Iceberg - The Lakehouse Advantage
99
---
1010

11-
<<<<<<< HEAD
12-
=======
1311
<!--
1412
Licensed to the Apache Software Foundation (ASF) under one
1513
or more contributor license agreements. See the NOTICE file
@@ -29,32 +27,12 @@ title: Geospatial Data on Iceberg - The Lakehouse Advantage
2927
under the License.
3028
-->
3129

32-
<<<<<<< HEAD
33-
<<<<<<< HEAD
34-
>>>>>>> 14074d417b ([DOCS] putting metadata at top)
35-
# Geospatial Data on Iceberg: The Lakehouse Advantage
36-
37-
This post delves into the benefits of Lakehouse architecture for spatial tables
38-
and differentiate its approach from standard data warehouses and data lakes.
39-
=======
4030
This post discusses the benefits of Lakehouse architecture for spatial
4131
tables, comparing the Lakehouse approach to standard data warehouses and data lakes.
42-
>>>>>>> 3128e022f4 (blog refinements)
43-
=======
44-
TODO: Rework intro
45-
46-
This post discusses the benefits of Lakehouse architecture for spatial
47-
tables, comparing the Lakehouse approach to that of data warehouses and data lakes.
48-
>>>>>>> bad3190a21 (formatting fixes)
4932

5033
While spatial data requires different types of metadata and optimizations,
5134
it _doesn't_ require entirely different file formats.
5235

53-
#### Key Points
54-
55-
* Geospatial Data has native support in Apache Parquet and Apache Iceberg.
56-
* TODO
57-
5836
Recent advancements, specifically the addition of native support for geometry/geography types to
5937
Apache Parquet and the Apache Iceberg V3 specification, enable the spatial data community
6038
to fully integrate with and leverage the benefits of Lakehouse architectures.
@@ -319,12 +297,8 @@ and `Polygon` geometries for purchases tied to an approximate region.
319297

320298
## Joining tables containing spatial and non-spatial data
321299

322-
<<<<<<< HEAD
323-
Let's discuss how to join the customers and customer_purchases tables.
324-
=======
325300
Let's discuss how to use Sedona to join the non-spatial data
326301
in the `customers` table with the spatial data in the `customer_purchases` table.
327-
>>>>>>> bad3190a21 (formatting fixes)
328302

329303
```py
330304
customers = sedona.table("local.db.customers")
@@ -601,8 +575,7 @@ filter. The required process involves reading the entire existing dataset, apply
601575
the modification to all records in memory, and then rewriting the whole modified
602576
dataset back to storage, overwriting the original.
603577

604-
This highlights key data
605-
lake disadvantages: the inefficiency of the full read/rewrite cycle, and the
578+
This highlights key data lake disadvantages: the inefficiency of the full read/rewrite cycle, and the
606579
critical lack of atomicity in the overwrite step, which risks data corruption
607580
or loss if the write operation fails partway through.
608581

0 commit comments

Comments
 (0)