Closed
Description
StreamReader& StreamReader::operator>>(optional<std::string>& v) {
  CheckColumn(Type::BYTE_ARRAY, ConvertedType::UTF8);
  ByteArray ba;

StreamWriter& StreamWriter::WriteVariableLength(const char* data_ptr,
                                                std::size_t data_len) {
  CheckColumn(Type::BYTE_ARRAY, ConvertedType::UTF8);
Although the C++ parquet::schema::Node allows a physical type of BYTE_ARRAY with ConvertedType NONE, the StreamReader and StreamWriter classes throw when the ConvertedType is not UTF8, so raw binary columns cannot be read or written through the stream API.
std::string is, unfortunately, the canonical byte buffer class in C++.
A simple approach might be to add an operator>> overload for parquet::ByteArray that calls CheckColumn(Type::BYTE_ARRAY, ConvertedType::NONE), and let the user take it from there. It could reuse the existing methods that the std::string overload uses. Just an idea.
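To make the idea concrete, here is a minimal, self-contained sketch of the proposed overload. It does not use the real Parquet classes: ToyStreamReader, ToyColumn, ToyByteArray and the two Toy enums are hypothetical stand-ins that only model the column check, so this should be read as an illustration of the shape of the change rather than as a patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Toy stand-ins for illustration only; these are NOT the real parquet:: types.
enum class ToyPhysicalType { BYTE_ARRAY, INT32 };
enum class ToyConvertedType { NONE, UTF8 };

struct ToyColumn {
  ToyPhysicalType physical;
  ToyConvertedType converted;
  std::string raw;  // the single stored value for this column
};

// Hypothetical raw-bytes holder, playing the role of a byte array value.
struct ToyByteArray {
  std::vector<std::uint8_t> bytes;
};

class ToyStreamReader {
 public:
  explicit ToyStreamReader(std::vector<ToyColumn> columns)
      : columns_(std::move(columns)) {}

  // Existing behaviour: strings are only accepted from UTF8 columns.
  ToyStreamReader& operator>>(std::string& v) {
    CheckColumn(ToyPhysicalType::BYTE_ARRAY, ToyConvertedType::UTF8);
    v = columns_[next_++].raw;
    return *this;
  }

  // Proposed addition: raw bytes are accepted from columns with no converted type.
  ToyStreamReader& operator>>(ToyByteArray& v) {
    CheckColumn(ToyPhysicalType::BYTE_ARRAY, ToyConvertedType::NONE);
    const std::string& raw = columns_[next_++].raw;
    v.bytes.assign(raw.begin(), raw.end());
    return *this;
  }

 private:
  // Throws if the next column does not have the expected types.
  void CheckColumn(ToyPhysicalType physical, ToyConvertedType converted) const {
    assert(next_ <= columns_.size());
    if (next_ == columns_.size()) throw std::out_of_range("no more columns");
    const ToyColumn& col = columns_[next_];
    if (col.physical != physical || col.converted != converted)
      throw std::runtime_error("column type mismatch");
  }

  std::vector<ToyColumn> columns_;
  std::size_t next_ = 0;
};
```

In this toy model, reading a BYTE_ARRAY/NONE column into a std::string still throws, while reading it into a ToyByteArray succeeds and preserves embedded zero bytes. A matching writer overload would follow the same pattern.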
I am new to this forum, and have assigned MAJOR to this bug, but gladly defer to those who have a better grasp of classification.
Reporter: ian
Note: This issue was originally created as PARQUET-1958. Please see the migration documentation for further details.