Skip to content
This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

[FiD] Add specialized chunking to search engine retrievers #4227

Merged
merged 4 commits into from
Dec 8, 2021

Conversation

klshuster
Copy link
Contributor

Patch description
This PR is intended to copy the functionality of the woi_chunk_retrieved_docs mutator within the search engine retriever.

Testing steps
Added CI

for i in range(0, len(tokens), self.len_chunk)
]
return self.chunk_reranker.get_top_chunks(query, doc_chunks)
if self.chunk_reranker != 'woi_chunk_retrieved_docs':
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shouldn't this be the chunk_ranker_type? I think you are comparing the string to the ChunkRanker object here.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oh yes you're right

Copy link
Contributor

@mojtaba-komeili mojtaba-komeili left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Other than the comment I had the rest LGTM.

@klshuster klshuster merged commit 6012f84 into main Dec 8, 2021
@klshuster klshuster deleted the woi_chunking branch December 8, 2021 19:49
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants