[Examples][Store] Implement indexing pipeline #465
643b560 to d3a3a15
0827740 to bbc3145
…ark)

This PR was squashed before being merged into the main branch.

Discussion
----------

[AI Bundle][Demo] Make vectorizers configurable

| Q             | A
| ------------- | ---
| Bug fix?      | no
| New feature?  | yes
| Docs?         | no
| Issues        | Refs #465
| License       | MIT

Add support for configuring vectorizers via ai.yaml configuration, allowing reuse across multiple indexers and centralized vectorizer management.

Commits
-------

7acf871 [AI Bundle][Demo] Make vectorizers configurable
53486dc to a6b0ace
a6b0ace to 242501e
This comment was marked as outdated.
0b5dc7e to 15236b3
…arStark)

This PR was merged into the main branch.

Discussion
----------

[Store] Add `TextDocument::withContent` method

| Q             | A
| ------------- | ---
| Bug fix?      | no
| New feature?  | yes
| Docs?         | no
| Issues        | Helpful for #465
| License       | MIT

Commits
-------

4a3014f [Store] Add withContent method to TextDocument with test
6d2dcf7 to a49521a
a49521a to 6bfb1ed
382151b to b668771
@@ -32,7 +34,7 @@ protected function execute(InputInterface $input, OutputInterface $output): int
 $io = new SymfonyStyle($input, $output);
 $io->title('Loading RSS of Symfony blog as embeddings into ChromaDB');

-$this->embedder->embedBlog();
+$this->indexer->index('https://feeds.feedburner.com/symfony/blog');
would be the source - as a next step
src/store/src/IndexerInterface.php (Outdated)
 */
-public function index(TextDocument|iterable $documents, int $chunkSize = 50): void;
+public function index(null|string|array $source = null, array $options = []): void;
would also argue for removing `null` here
src/store/src/Indexer.php (Outdated)
-if ($documents instanceof TextDocument) {
-$documents = [$documents];
+// Prevent conflicting sources
+if (null !== $source && null !== $this->source) {
This way we have both: we can set a default source at config level, but it cannot be overridden afterwards, because one user would want to overwrite it while another would want to merge. You can now define/configure an indexer with `source = null`; in that case you provide the source via the method.
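A minimal sketch of the rule described in this comment, for illustration only. `SketchIndexer` and `resolveSources()` are hypothetical names, not part of this PR. Only the `null !== $source && null !== $this->source` check comes from the diff above; the exception type and messages are assumptions.

```php
<?php

declare(strict_types=1);

// Hypothetical stand-in for illustration; NOT the Indexer class from this PR.
final class SketchIndexer
{
    /**
     * @param string|string[]|null $source Default source set at configuration time (assumed shape).
     */
    public function __construct(private string|array|null $source = null)
    {
    }

    /**
     * Decides which source(s) to index and rejects ambiguous combinations.
     *
     * @param string|string[]|null $source Source passed at call time.
     *
     * @return string[]
     */
    public function resolveSources(string|array|null $source = null): array
    {
        // Prevent conflicting sources: "overwrite" and "merge" are both plausible
        // expectations, so a configured source plus a call-time source is refused.
        if (null !== $source && null !== $this->source) {
            throw new InvalidArgumentException('A source is already configured; it cannot also be passed to the method.');
        }

        $resolved = $source ?? $this->source;

        if (null === $resolved) {
            throw new InvalidArgumentException('No source configured and none provided.');
        }

        // Normalize a single string to a one-element list.
        return (array) $resolved;
    }
}
```

With this shape, an indexer configured with a source is called without arguments, while an indexer configured with `source = null` must be given the source on each call. Passing both is an error rather than a silent overwrite or merge.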
ef78629 to 659778d
d77a852 to 29349c1
Thank you @OskarStark.
Needs
./runner indexer
bin/console app:blog:embed
(removed) bin/console app:blog:query
bin/console ai:store:index blog