You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I still don't know if the Colpali model is only contextual to each or all documents. If only contextual to each document, then we could integrate late chunking to maximize the ColPali performance. Currently, this late chunking method is easier to implement, more efficient, and more robust to missing context than traditional chunking methods, as ColPali still keeps 1-page chunking. These might further remove ColPali from the need for pre-processing pipelines.
Would this mean that you would do "patching" on the embedding space, rather than pixels?
Is there currently some hyperparameter that restricts the chunking to a single page?
I still don't know if the Colpali model is only contextual to each or all documents. If only contextual to each document, then we could integrate late chunking to maximize the ColPali performance. Currently, this late chunking method is easier to implement, more efficient, and more robust to missing context than traditional chunking methods, as ColPali still keeps 1-page chunking. These might further remove ColPali from the need for pre-processing pipelines.
Details:
The text was updated successfully, but these errors were encountered: