diff --git a/README.md b/README.md
index c8a5efc..57c09d9 100644
--- a/README.md
+++ b/README.md
@@ -11,6 +11,7 @@ Capillaries is a data processing framework that:
 
 | | BEFORE | AFTER |
 | ----------- | ------ |------ |
+| Cloud-friendly | depends | Can be deployed to the cloud within minutes, Docker-ready |
 | Data aggregation | SQL joins | Capillaries [lookups](doc/glossary.md#lookup) in Cassandra + [Go expressions](doc/glossary.md#go-expressions) (scalability, parallel execution) |
 | Data filtering | SQL queries, custom code | [Go expressions](doc/glossary.md#go-expressions) (scalability, maintainability) |
 | Data transform | SQL expressions, custom code | [Go expressions](doc/glossary.md#go-expressions), Python [formulas](doc/glossary.md#py_calc-processor) (parallel execution, maintainability) |
diff --git a/doc/beforeafter.png b/doc/beforeafter.png
old mode 100644
new mode 100755
index 99a54f6..c847f2c
Binary files a/doc/beforeafter.png and b/doc/beforeafter.png differ
diff --git a/doc/beforeafter.svg b/doc/beforeafter.svg
index 7427e28..e70cd1a 100644
--- a/doc/beforeafter.svg
+++ b/doc/beforeafter.svg
@@ -1,6 +1,7 @@
-
+
\ No newline at end of file
diff --git a/doc/qna.md b/doc/qna.md
index 9376f53..fb75907 100644
--- a/doc/qna.md
+++ b/doc/qna.md
@@ -105,11 +105,11 @@ A. Here are some, in no particular order:
 
 1. Performance enhancements, espcecially those related to the efficient use of Cassandra.
 
-2. Read/write from/to other file formats, maybe databases.
+2. Read/write from/to other file formats, maybe databases. Update 2023: Apache Parquet support was added.
 
 3. Creating node configuration is a tedious job. Consider adding a toolbelt command that takes a CSV file as an input and generates JSON for a corresponding file_table/table_file node. Update 2023: done, see [proto_file_reader_creator test](../test/code/proto_file_reader_creator/README.md).
 
-4. Is the lack of NULL support a deal-breaker?
+4. Is the lack of NULL values support a deal-breaker? Update March 2024: support for *_if aggregate functions was added, it should help mitigate the lack of NULL support.
 
 5. Need a strategy to mitigate potential security threats introduced by py_calc. SELinux/AppArmor?