feat: Index the (text) datasets contents to enable full-text search - DuckDB #1296
Using a non-shared temporary directory would also have worked, no?
I'm fine with using a shared storage though
I think it's due to the size: the pods have nearly no space, while the shared storage (EFS) has as much space as we need.
(even if, by default, the local duckdb will be small)
If we want to increase this, we'll need to know how much time it takes to index datasets and to query them. We can look at that later by adding some profiling.
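As a rough idea of what that profiling could look like, here is a minimal sketch using only the standard library. The `timed` helper and the step names `"index"` and `"query"` are hypothetical, and the bodies are placeholders standing in for the real DuckDB indexing and search calls, which are not shown here.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(step, timings):
    """Record the wall-clock duration of a block under timings[step]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[step] = time.perf_counter() - start


timings = {}

with timed("index", timings):
    # Placeholder: building the full-text index would go here.
    sum(range(100_000))

with timed("query", timings):
    # Placeholder: running a full-text search query would go here.
    sorted(range(1_000), reverse=True)

# Durations in seconds, keyed by step name.
print(timings)
```

Logging these durations per dataset would give a first estimate of indexing and query cost before changing any limit.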