Module watermark

High-water / incremental row filter applied after ingest (file or DB).

When super::IngestionOptions::watermark_column and super::IngestionOptions::watermark_exclusive_above are both set, only rows where the watermark column is strictly greater than the high-water value are kept. Rows with a null in that column are dropped.
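The rule above can be sketched in a few lines. This is only an illustration, not the crate's implementation: the `Row` struct, its `updated_at` field, and the `keep_rows_above` helper are hypothetical names, and the watermark column is assumed to be a nullable 64-bit integer.

```rust
/// Hypothetical stand-in for one ingested row: an id plus a nullable watermark cell.
#[derive(Debug, Clone, PartialEq)]
struct Row {
    id: u32,
    updated_at: Option<i64>,
}

/// Keep only rows whose watermark cell is non-null and strictly greater than `floor`.
/// Rows equal to `floor` and rows with a null cell are both dropped.
fn keep_rows_above(rows: &[Row], floor: i64) -> Vec<Row> {
    rows.iter()
        .filter(|row| matches!(row.updated_at, Some(v) if v > floor))
        .cloned()
        .collect()
}
```

With a floor of 10, a row whose `updated_at` is `Some(10)` is dropped because the comparison is strict, a row with `None` is dropped because nulls never pass, and a row with `Some(11)` is kept.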

Functions

apply_watermark_after_ingest
Apply watermark filtering when options request it (call after validate_watermark_config).
apply_watermark_filter
Keep only rows where column compares strictly greater than floor (per column DataType).
max_value_in_column
Maximum value in column over non-null cells (ordering matches the column DataType: Int64 / Bool / Utf8 use Ord; Float64 uses IEEE total order via f64::total_cmp).
validate_watermark_config
Ensure watermark options are consistent with schema and with each other.
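The ordering described for max_value_in_column can be illustrated with standard-library calls alone. The sketch below is not the crate's code: `max_i64` and `max_f64` are hypothetical helpers, and a column is modeled as a slice of `Option` cells where `None` stands for a null.

```rust
/// Maximum of the non-null cells of an integer column, using `Ord`.
/// Returns `None` when the column is empty or all cells are null.
fn max_i64(cells: &[Option<i64>]) -> Option<i64> {
    cells.iter().flatten().copied().max()
}

/// Maximum of the non-null cells of a float column, using IEEE total order.
/// `f64` is not `Ord`, so `max_by` with `f64::total_cmp` supplies the ordering.
fn max_f64(cells: &[Option<f64>]) -> Option<f64> {
    cells
        .iter()
        .flatten()
        .copied()
        .max_by(|a, b| a.total_cmp(b))
}
```

One consequence of using `f64::total_cmp` is that the ordering is defined for every bit pattern: `-0.0` sorts below `+0.0`, and a NaN with a positive sign bit sorts above positive infinity, so a column containing such a NaN would report it as the maximum.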