Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch to new data passing API #109

Merged
merged 113 commits into from
Mar 4, 2024
Merged
Show file tree
Hide file tree
Changes from 106 commits
Commits
Show all changes
113 commits
Select commit Hold shift + click to select a range
6c9efe1
rename table client -> engine client
nicklan Jan 24, 2024
ec08901
add engine_data and trait method
nicklan Jan 24, 2024
f6c1072
checkpoint, working parsing metadata/protocol
nicklan Jan 25, 2024
0643945
checkpoint, can parse metadata in snapshot
nicklan Jan 26, 2024
53c8850
make scan tests pass (with some todos)
nicklan Jan 27, 2024
32a28a9
All internal tests pass
nicklan Jan 27, 2024
d7930e6
clean up parsing a bit
nicklan Jan 27, 2024
0e54c24
remove old comment
nicklan Jan 27, 2024
438a1e2
add todo comment
nicklan Jan 27, 2024
5ab0021
New better way to do list/map
nicklan Jan 29, 2024
ab89728
put back old test
nicklan Jan 29, 2024
e182e3d
add dv support
nicklan Jan 30, 2024
6b3efe3
start to think about DVs
nicklan Jan 30, 2024
fd80a97
make dv.rs tests work
nicklan Jan 31, 2024
4bf5b47
fmt and small fix
nicklan Jan 31, 2024
304da21
add some tests back
nicklan Jan 31, 2024
a7393fe
fix "list_from"
nicklan Jan 31, 2024
54a0c1a
remove todo
nicklan Jan 31, 2024
54a0d7f
allow default client to use EngineData
nicklan Feb 1, 2024
a8e8920
use some unsafe magic to make more tests pass
nicklan Feb 1, 2024
33923bb
fmt
nicklan Feb 1, 2024
82cff1b
some clippy guided cleanup
nicklan Feb 1, 2024
dc6e963
switch to selection vec
nicklan Feb 3, 2024
1cff896
make parse_json not refrence record batch anymore
nicklan Feb 5, 2024
e6d5eb5
almost have arrow out of lib.rs
nicklan Feb 5, 2024
28031fa
add back md test
nicklan Feb 6, 2024
4294020
put back dataskippingfilter (but it still uses arrow)
nicklan Feb 6, 2024
ef0e79c
make comment more clear
nicklan Feb 6, 2024
b6b560c
doc updates
nicklan Feb 6, 2024
ae12b63
add back parquet reader test
nicklan Feb 6, 2024
f6cdd00
remove commented code
nicklan Feb 6, 2024
f4c8507
put back default client json tests
nicklan Feb 6, 2024
4d3631a
remove unsafe (woo!)
nicklan Feb 6, 2024
95c75ce
minor cleanup
nicklan Feb 6, 2024
3902fcb
rename to try_from_engine_data
nicklan Feb 6, 2024
200a8ac
all arrow out of lib :)
nicklan Feb 7, 2024
3ac637f
move simple_client to a feature, make it default
nicklan Feb 7, 2024
49253b7
add materialize for map_item
nicklan Feb 7, 2024
9a286e6
mostly remove old action parsing code and make maps work right
nicklan Feb 7, 2024
8b0a60f
fmt + clippy
nicklan Feb 7, 2024
15f0122
test_read_files
nicklan Feb 8, 2024
bdf540b
Apply suggestions from code review
nicklan Feb 12, 2024
5905ed2
address some minor comments
nicklan Feb 14, 2024
1f83540
switch to add_string
nicklan Feb 14, 2024
5f3e6d4
switch to using as_list
nicklan Feb 14, 2024
9aa7d36
address more comments:
nicklan Feb 14, 2024
2fb11f5
cleaner min_file_name
nicklan Feb 14, 2024
9834460
clean up `list_from` a bit + fmt
nicklan Feb 14, 2024
cff7b98
Update kernel/src/simple_client/json.rs
nicklan Feb 14, 2024
eb0f3c6
Apply suggestions from code review
nicklan Feb 14, 2024
7f8ab66
fixups for review merge
nicklan Feb 14, 2024
01bcd9a
address comment re itereator chain
nicklan Feb 14, 2024
e5f0116
fix bug
nicklan Feb 14, 2024
93e346b
Apply suggestions from code review
nicklan Feb 14, 2024
d1f5531
validate magic in DV
nicklan Feb 14, 2024
7198c90
get_json_filename
nicklan Feb 14, 2024
ab64aec
only default-client needs tokio
nicklan Feb 14, 2024
1194be2
rename res_arry -> res_array
nicklan Feb 14, 2024
c288f2e
Update kernel/src/simple_client/parquet.rs
nicklan Feb 14, 2024
ef33863
fix lints
nicklan Feb 14, 2024
5b47c93
address comments
nicklan Feb 14, 2024
23b7196
Initial bit of extract_into
nicklan Feb 15, 2024
65a21cf
doc comments and fmt
nicklan Feb 15, 2024
ff85d93
fully switch to extract_into
nicklan Feb 15, 2024
5fea8d3
fmt
nicklan Feb 15, 2024
423853e
initial work on row-getter data passing
nicklan Feb 16, 2024
e2913ff
add get_data_item.rs
nicklan Feb 16, 2024
b30bb2c
cleanup extract, support all previous types, better errors
nicklan Feb 16, 2024
baf1997
extract references to Maps/Lists
nicklan Feb 16, 2024
f96314c
cleanup extract
nicklan Feb 16, 2024
89f920d
switch to new data passing style
nicklan Feb 17, 2024
46b7e1f
reformat a bit
nicklan Feb 17, 2024
3536132
only getters need to implement ExtractInto
nicklan Feb 17, 2024
190c05c
make GetDataItem macro for impls
nicklan Feb 17, 2024
1a6b444
remove trivial casts warning, address minor comments
nicklan Feb 20, 2024
0dbb5ed
get rid of DataItem
nicklan Feb 21, 2024
582e8be
break once we've found p&m
nicklan Feb 21, 2024
ab88f14
extract error handling
nicklan Feb 21, 2024
4458c9a
add doc comment
nicklan Feb 21, 2024
ca37359
fold Extractor into EngineData trait
nicklan Feb 21, 2024
a6ed21e
add materialize for list
nicklan Feb 21, 2024
cee7d16
fmt
nicklan Feb 21, 2024
7ff8589
make magic constant a `const`
nicklan Feb 21, 2024
72dcb54
use try_collect
nicklan Feb 21, 2024
b9601a7
remove commented code
nicklan Feb 21, 2024
8d78cf7
comment updates
nicklan Feb 21, 2024
67fe622
error improvements
nicklan Feb 21, 2024
3fab759
fix comment
nicklan Feb 21, 2024
ace56fa
better macro format
nicklan Feb 21, 2024
d7afb3b
impl TypedGetData for Vec and Map
nicklan Feb 21, 2024
db8d28d
return Options for try_new_from_data
nicklan Feb 21, 2024
2d57456
created_time is optional
nicklan Feb 21, 2024
22d85ca
stats are String
nicklan Feb 21, 2024
3f13937
doc comment and fmt
nicklan Feb 21, 2024
619bffd
refactor action defs and parsing. remove ActionType enum
nicklan Feb 21, 2024
a72eac6
add todo
nicklan Feb 21, 2024
977e251
add missing files
nicklan Feb 21, 2024
6750a2a
add comment
nicklan Feb 21, 2024
bd9e17b
GIANT Merge branch 'main' into new-data-passing-api
nicklan Feb 22, 2024
a2dedf5
checkpoint, actually compiles. needs expressions to be put back
nicklan Feb 23, 2024
f5d80c3
woo, all tests working.
nicklan Feb 23, 2024
0a9cd11
add is_empty to make clippy happy
nicklan Feb 23, 2024
b268b4e
final clippy fixes
nicklan Feb 23, 2024
aab7fb8
import style
nicklan Feb 23, 2024
8d397e6
put back test that merge inexplicably removed
nicklan Feb 23, 2024
a2cb373
use error constructors where strings are static
nicklan Feb 23, 2024
1a19d19
missed one
nicklan Feb 23, 2024
c130692
impl From for SimpleData -> RecordBatch
nicklan Feb 23, 2024
fdd927a
bunch of doc fixes
nicklan Feb 23, 2024
b4699c0
small comment changes
nicklan Feb 26, 2024
1c922c9
fmt
nicklan Feb 26, 2024
4a14694
Update kernel/src/snapshot.rs
nicklan Mar 4, 2024
14d0aff
fix typo
nicklan Mar 4, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
23 changes: 15 additions & 8 deletions kernel/Cargo.toml
Original file line number Diff line number Diff line change
Expand Up @@ -39,34 +39,41 @@ arrow-ord = { version = "^49.0", optional = true }
arrow-schema = { version = "^49.0", optional = true }
futures = { version = "0.3", optional = true }
object_store = { version = "^0.8.0", optional = true }
parquet = { version = "^49.0", optional = true, features = [
"async",
"object_store",
] }
# Used in default and simple client
parquet = { version = "^49.0", optional = true }

# optionally used with default client (though not required)
tokio = { version = "1", optional = true, features = ["rt-multi-thread"] }

[features]
default = ["default-client"]
arrow-conversion = ["arrow-schema"]
default = ["simple-client"]
default-client = [
"arrow-conversion",
"arrow-arith",
"arrow-json",
"arrow-ord",
"arrow-schema",
"futures",
"object_store",
"parquet",
"parquet/async",
"parquet/object_store",
"tokio"
]

developer-visibility = []
simple-client = [
"arrow-conversion",
"arrow-json",
"parquet"
]

[dev-dependencies]
arrow = { version = "^49.0", features = ["json", "prettyprint"] }
deltakernel = { path = ".", features = ["tokio"] }
deltakernel = { path = ".", features = ["default-client"] }
test-log = { version = "0.2", default-features = false, features = ["trace"] }
tempfile = "3"
test-case = { version = "3.1.0" }
tokio = { version = "1" }
tracing-subscriber = { version = "0.3", default-features = false, features = [
"env-filter",
"fmt",
Expand Down
282 changes: 282 additions & 0 deletions kernel/src/actions/deletion_vector.rs
Original file line number Diff line number Diff line change
@@ -0,0 +1,282 @@
//! Code relating to parsing and using deletion vectors

use std::io::{Cursor, Read};
use std::sync::Arc;

use roaring::RoaringTreemap;
use url::Url;

use crate::{DeltaResult, Error, FileSystemClient};

#[derive(Debug, Clone, PartialEq, Eq)]
pub struct DeletionVectorDescriptor {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we want this to be pub? Could it just be pub(crate)?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we do want actions to be pub, since we expect external systems might use them. at the very least, all of the action stuff is pub for now, so I'll leave it for a follow-up discussion :)

/// A single character to indicate how to access the DV. Legal options are: ['u', 'i', 'p'].
pub storage_type: String,

/// Three format options are currently proposed:
/// - If `storageType = 'u'` then `<random prefix - optional><base85 encoded uuid>`:
/// The deletion vector is stored in a file with a path relative to the data
/// directory of this Delta table, and the file name can be reconstructed from
/// the UUID. See Derived Fields for how to reconstruct the file name. The random
/// prefix is recovered as the extra characters before the (20 characters fixed length) uuid.
/// - If `storageType = 'i'` then `<base85 encoded bytes>`: The deletion vector
/// is stored inline in the log. The format used is the `RoaringBitmapArray`
/// format also used when the DV is stored on disk and described in [Deletion Vector Format].
/// - If `storageType = 'p'` then `<absolute path>`: The DV is stored in a file with an
/// absolute path given by this path, which has the same format as the `path` field
/// in the `add`/`remove` actions.
///
/// [Deletion Vector Format]: https://github.com/delta-io/delta/blob/master/PROTOCOL.md#Deletion-Vector-Format
pub path_or_inline_dv: String,

/// Start of the data for this DV in number of bytes from the beginning of the file it is stored in.
/// Always None (absent in JSON) when `storageType = 'i'`.
pub offset: Option<i32>,

/// Size of the serialized DV in bytes (raw data size, i.e. before base85 encoding, if inline).
pub size_in_bytes: i32,

/// Number of rows the given DV logically removes from the file.
pub cardinality: i64,
}

impl DeletionVectorDescriptor {
pub fn unique_id(&self) -> String {
if let Some(offset) = self.offset {
format!("{}{}@{offset}", self.storage_type, self.path_or_inline_dv)
} else {
format!("{}{}", self.storage_type, self.path_or_inline_dv)
}
}

pub fn absolute_path(&self, parent: &Url) -> DeltaResult<Option<Url>> {
match self.storage_type.as_str() {
"u" => {
let prefix_len = self.path_or_inline_dv.len() as i32 - 20;
if prefix_len < 0 {
return Err(Error::deletion_vector("Invalid length"));
}
let decoded = z85::decode(&self.path_or_inline_dv[(prefix_len as usize)..])
.map_err(|_| Error::deletion_vector("Failed to decode DV uuid"))?;
let uuid = uuid::Uuid::from_slice(&decoded)
.map_err(|err| Error::DeletionVector(err.to_string()))?;
let dv_suffix = if prefix_len > 0 {
format!(
"{}/deletion_vector_{uuid}.bin",
&self.path_or_inline_dv[..(prefix_len as usize)]
)
} else {
format!("deletion_vector_{uuid}.bin")
};
let dv_path = parent
.join(&dv_suffix)
.map_err(|_| Error::DeletionVector(format!("invalid path: {dv_suffix}")))?;
Ok(Some(dv_path))
}
"p" => Ok(Some(Url::parse(&self.path_or_inline_dv).map_err(|_| {
Error::DeletionVector(format!("invalid path: {}", self.path_or_inline_dv))
})?)),
"i" => Ok(None),
other => Err(Error::DeletionVector(format!(
"Unknown storage format: '{other}'."
))),
}
}

pub fn read(
&self,
fs_client: Arc<dyn FileSystemClient>,
parent: Url,
) -> DeltaResult<RoaringTreemap> {
match self.absolute_path(&parent)? {
None => {
let bytes = z85::decode(&self.path_or_inline_dv)
.map_err(|_| Error::deletion_vector("Failed to decode DV"))?;
RoaringTreemap::deserialize_from(&bytes[12..])
.map_err(|err| Error::DeletionVector(err.to_string()))
}
Some(path) => {
let offset = self.offset;
let size_in_bytes = self.size_in_bytes;

let dv_data = fs_client
.read_files(vec![(path, None)])?
.next()
.ok_or(Error::missing_data("No deletion vector data"))??;

let mut cursor = Cursor::new(dv_data);
if let Some(offset) = offset {
// TODO should we read the datasize from the DV file?
// offset plus datasize bytes
nicklan marked this conversation as resolved.
Show resolved Hide resolved
cursor.set_position((offset + 4) as u64);
}

let mut buf = vec![0; 4];
cursor
.read(&mut buf)
.map_err(|err| Error::DeletionVector(err.to_string()))?;
let magic = i32::from_le_bytes(
buf.try_into()
.map_err(|_| Error::deletion_vector("failed to read magic bytes"))?,
);
if magic != 1681511377 {
return Err(Error::DeletionVector(format!("Invalid magic {magic}")));
}

let mut buf = vec![0; size_in_bytes as usize];
cursor
.read(&mut buf)
.map_err(|err| Error::DeletionVector(err.to_string()))?;

RoaringTreemap::deserialize_from(Cursor::new(buf))
.map_err(|err| Error::DeletionVector(err.to_string()))
}
}
}
}

pub(crate) fn treemap_to_bools(treemap: RoaringTreemap) -> Vec<bool> {
fn combine(high_bits: u32, low_bits: u32) -> usize {
((u64::from(high_bits) << 32) | u64::from(low_bits)) as usize
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rescuing #109 (comment) from github oblivion...

this would silently lose information if sizeof(usize) < sizeof(u64); we should add a static assertion to make it fail at compile time instead.

(even if we move it from kernel to engine, the same issue would remain in our default clients)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep, I've filed #124 since we'd need to add a dependency

}

match treemap.max() {
Some(max) => {
// there are values in the map
//TODO(nick) panic if max is > MAX_USIZE
let mut result = vec![true; max as usize + 1];
let bitmaps = treemap.bitmaps();
for (index, bitmap) in bitmaps {
for bit in bitmap.iter() {
let vec_index = combine(index, bit);
result[vec_index] = false;
}
}
result
}
None => {
// empty set, return empty vec
vec![]
}
}
}

#[cfg(test)]
mod tests {
use roaring::RoaringTreemap;
use std::path::PathBuf;

use super::*;
use crate::{simple_client::SimpleClient, EngineInterface};

use super::DeletionVectorDescriptor;

fn dv_relateive() -> DeletionVectorDescriptor {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit

Suggested change
fn dv_relateive() -> DeletionVectorDescriptor {
fn dv_relative() -> DeletionVectorDescriptor {

DeletionVectorDescriptor {
storage_type: "u".to_string(),
path_or_inline_dv: "ab^-aqEH.-t@S}K{vb[*k^".to_string(),
offset: Some(4),
size_in_bytes: 40,
cardinality: 6,
}
}

fn dv_absolute() -> DeletionVectorDescriptor {
DeletionVectorDescriptor {
storage_type: "p".to_string(),
path_or_inline_dv:
"s3://mytable/deletion_vector_d2c639aa-8816-431a-aaf6-d3fe2512ff61.bin".to_string(),
offset: Some(4),
size_in_bytes: 40,
cardinality: 6,
}
}

fn dv_inline() -> DeletionVectorDescriptor {
DeletionVectorDescriptor {
storage_type: "i".to_string(),
path_or_inline_dv: "wi5b=000010000siXQKl0rr91000f55c8Xg0@@D72lkbi5=-{L".to_string(),
offset: None,
size_in_bytes: 40,
cardinality: 6,
}
}

fn dv_example() -> DeletionVectorDescriptor {
DeletionVectorDescriptor {
storage_type: "u".to_string(),
path_or_inline_dv: "vBn[lx{q8@P<9BNH/isA".to_string(),
offset: Some(1),
size_in_bytes: 36,
cardinality: 2,
}
}

#[test]
fn test_deletion_vector_absolute_path() {
let parent = Url::parse("s3://mytable/").unwrap();

let relative = dv_relateive();
let expected =
Url::parse("s3://mytable/ab/deletion_vector_d2c639aa-8816-431a-aaf6-d3fe2512ff61.bin")
.unwrap();
assert_eq!(expected, relative.absolute_path(&parent).unwrap().unwrap());

let absolute = dv_absolute();
let expected =
Url::parse("s3://mytable/deletion_vector_d2c639aa-8816-431a-aaf6-d3fe2512ff61.bin")
.unwrap();
assert_eq!(expected, absolute.absolute_path(&parent).unwrap().unwrap());

let inline = dv_inline();
assert_eq!(None, inline.absolute_path(&parent).unwrap());

let path =
std::fs::canonicalize(PathBuf::from("./tests/data/table-with-dv-small/")).unwrap();
let parent = url::Url::from_directory_path(path).unwrap();
let dv_url = parent
.join("deletion_vector_61d16c75-6994-46b7-a15b-8b538852e50e.bin")
.unwrap();
let example = dv_example();
assert_eq!(dv_url, example.absolute_path(&parent).unwrap().unwrap());
}

#[test]
fn test_deletion_vector_read() {
let path =
std::fs::canonicalize(PathBuf::from("./tests/data/table-with-dv-small/")).unwrap();
let parent = url::Url::from_directory_path(path).unwrap();
let simple_client = SimpleClient::new();
let fs_client = simple_client.get_file_system_client();

let example = dv_example();
let tree_map = example.read(fs_client, parent).unwrap();

let expected: Vec<u64> = vec![0, 9];
let found = tree_map.iter().collect::<Vec<_>>();
assert_eq!(found, expected)
}

// this test is ignored by default as it's expensive to allocate such big vecs full of `true`. you can run it via:
// cargo test actions::action_definitions::tests::test_dv_to_bools
#[test]
#[ignore]
fn test_dv_to_bools() {
let mut rb = RoaringTreemap::new();
rb.insert(0);
rb.insert(2);
rb.insert(7);
rb.insert(30854);
rb.insert(4294967297);
rb.insert(4294967300);
let bools = super::treemap_to_bools(rb);
let mut expected = vec![true; 4294967301];
expected[0] = false;
expected[2] = false;
expected[7] = false;
expected[30854] = false;
expected[4294967297] = false;
expected[4294967300] = false;
assert_eq!(bools, expected);
}
}
Loading
Loading