PARQUET-423: Replace old Log class with SLF4J Logging #369
|
I have no clue how to fix this error and what it has to do with my changes:
Please advise. |
|
What do you want?
|
|
I already created PARQUET-725 for those failing AVRO tests. |
|
The root cause is AVRO-1799 which surfaced after changing the logging setup. |
|
@julienledem : Can you comment on this please? Thanks. |
|
Hi @nielsbasjes |
|
I ran some performance tests to see what the differences are, focusing on MessageColumnIO, which seems to be the bottleneck where logging is concerned. Nearly every method in there makes 2 calls to logging functions. The original code does this:
Simply doing a
Adding a
Same but now
Moving the
My conclusions: Apparently the So this is what I implemented now: As you can see above this makes the performance pretty close to the original performance. |
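The snippets from the comment above did not survive on this page, so the following is only a minimal sketch of the general idea under discussion, not the code that was actually benchmarked or merged. It assumes the cost being measured comes from building debug messages in a hot loop even when debug output is off, and it shows how reading the debug flag once and guarding each call avoids that work. `CountingLogger`, `describe`, `unguarded` and `guarded` are hypothetical names invented for illustration; `CountingLogger` is a stand-in so the example runs without dependencies and is not the SLF4J API.

```java
public class GuardedLoggingSketch {

    /** Hypothetical stand-in for a logger; NOT the SLF4J API. */
    static final class CountingLogger {
        private final boolean debugEnabled;
        int messagesBuilt = 0;   // how often a message string was constructed
        int messagesLogged = 0;  // how often a message was actually emitted

        CountingLogger(boolean debugEnabled) {
            this.debugEnabled = debugEnabled;
        }

        boolean isDebugEnabled() {
            return debugEnabled;
        }

        void debug(String message) {
            if (debugEnabled) {
                messagesLogged++;
            }
        }
    }

    /** Builds a debug message and records that the work was done. */
    static String describe(CountingLogger log, int r, int d) {
        log.messagesBuilt++;
        return "r=" + r + ", d=" + d;
    }

    /** Unguarded: the message is built on every iteration, even if debug is off. */
    static void unguarded(CountingLogger log, int iterations) {
        for (int i = 0; i < iterations; i++) {
            log.debug(describe(log, i, i));
        }
    }

    /** Guarded: the flag is read once, and no message is built when debug is off. */
    static void guarded(CountingLogger log, int iterations) {
        final boolean debug = log.isDebugEnabled();
        for (int i = 0; i < iterations; i++) {
            if (debug) {
                log.debug(describe(log, i, i));
            }
        }
    }

    public static void main(String[] args) {
        CountingLogger a = new CountingLogger(false);
        unguarded(a, 1000);
        CountingLogger b = new CountingLogger(false);
        guarded(b, 1000);
        System.out.println("unguarded built " + a.messagesBuilt + " messages");
        System.out.println("guarded built " + b.messagesBuilt + " messages");
    }
}
```

With debug disabled, the unguarded loop still constructs every message while the guarded loop constructs none; with debug enabled, both behave the same. Whether this matches the measured differences in MessageColumnIO depends on details that are missing from this page.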
|
The rebase/pull/merge seems to mark a few files as 'changed by me' that I haven't touched. |
|
In general this looks good to me. When you do In some classes (ColumnWriter) we could leave a local |
…ions during build time
|
@julienledem : Please check the latest commits as they should implement the changes you mentioned. |
|
Hi @nielsbasjes, |
|
@julienledem Done. |
|
@nielsbasjes thanks for your patience. Merged! |
|
Thanks @nielsbasjes! It's great to have this done! |
And make writing files less noisy

Author: Niels Basjes <nbasjes@bol.com>

Closes apache#369 from nielsbasjes/PARQUET-423-2 and squashes the following commits:

b31e30f [Niels Basjes] Merge branch 'master' of github.com:apache/parquet-mr into PARQUET-423-2
2d4db4b [Niels Basjes] Merge branch 'PARQUET-423-2' of github.com:nielsbasjes/parquet-mr into PARQUET-423-2
49fcaa7 [Niels Basjes] PARQUET-423: Remove debug logging statements in high performance sections during build time
aaaf4a6 [Niels Basjes] Merge branch 'PARQUET-423-2' of github.com:nielsbasjes/parquet-mr into PARQUET-423-2
745666e [Niels Basjes] Undo needless change
94e0c7a [Niels Basjes] PARQUET-423: Further optimize logging performance
b72f924 [Niels Basjes] PARQUET-423: Improved the performance
cb7eb61 [Niels Basjes] PARQUET-423: Workaround AVRO errors
7d161b3 [Niels Basjes] PARQUET-423: Restore the old (obsolete) Log class
05d6a47 [Niels Basjes] PARQUET-423: Replace old Log class with SLF4J Logging
692ebfb [Niels Basjes] Undo needless change
f1ede3d [Niels Basjes] PARQUET-423: Further optimize logging performance
a0c6b59 [Niels Basjes] PARQUET-423: Improved the performance
67bef9b [Niels Basjes] PARQUET-423: Workaround AVRO errors
87cd64f [Niels Basjes] PARQUET-423: Restore the old (obsolete) Log class
96d97d5 [Niels Basjes] PARQUET-423: Replace old Log class with SLF4J Logging