feat: use milvus in chat llamaindex #4

thucpn · 2024-07-31T13:55:43Z

No description provided.

vercel · 2024-07-31T13:55:47Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
chat-llamaindex	❌ Failed (Inspect)			Aug 1, 2024 10:42am

marcusschiesser · 2024-07-31T14:31:18Z

app/api/chat/engine/index.ts

+    );
+  }
+
+  return await VectorStoreIndex.fromVectorStore(store);


Can we move a shared function getIndex to shared.ts?

marcusschiesser · 2024-07-31T14:32:41Z

app/api/chat/engine/generate.ts

    });
+    const storageContext = await storageContextFromDefaults({ vectorStore });


Try using the shared function getIndex here and use is the index directly in fromDocuments below (without storage context)

Yes, we can do:

const index = await getIndex(datasource); const documents = await getDocuments(datasource); // Set private=false to mark the document as public (required for filtering) documents.forEach((doc) => { doc.metadata["private"] = "false"; }); await runPipeline(index, documents);

But inside runPipeline function, we are setting private = true (we need to set private = false when generating public documents)
I guess we shouldn't append metadata to documents inside runPipeline function (can move it outside)

I just updated to split the getIndex function. Thank you! That's a cool idea that solves my pain.
Previously, when creating a new bot, users had to manually create a collection to start chatting.
But now, when creating a new bot, they can start by uploading a private file. It will automatically create a collection on Milvus

marcusschiesser · 2024-08-01T08:32:32Z

app/api/chat/engine/chat.ts

+  const isCollectionExist = await getMilvusClient().hasCollection({
+    collection_name: datasource,
+  });
+  if (!isCollectionExist.value) {
+    throw new Error(
+      `Collection "${datasource}" does not exist! Run the generate script or try uploading a file.`,
+    );
+  }


how about we move this check into getIndex?

It won't work when running generate (because when generating we will create a new collection)

how about adding a checkExists parameter to getIndex then?

getIndex({datasource, checkExists: false})

Yes, let me try

marcusschiesser · 2024-08-01T08:34:26Z

app/api/chat/engine/shared.ts

+  }
+}
+
+export async function getIndex(datasource: string) {


to mimize changes with base branch, call this getDatasource and move it to index.ts

marcusschiesser · 2024-08-01T08:34:58Z

app/api/chat/upload/pipeline.ts

 ) {
-  // Update documents with metadata


good we can also do this in CL directly

marcusschiesser · 2024-08-01T08:35:03Z

app/api/chat/upload/upload.ts

@@ -14,6 +14,16 @@ export async function uploadDocument(
  const fileBuffer = Buffer.from(content, "base64");
  const documents = await loadDocuments(fileBuffer, mimeType);
  const { filename } = await saveDocument(fileBuffer, mimeType);
-  const index = await getDataSource(datasource);
-  return await runPipeline(index, documents, filename);
+  const index = await getIndex(datasource);


good we can also do this in CL directly

marcusschiesser · 2024-08-01T08:36:13Z

create-llama.sh

@@ -20,7 +20,7 @@ npx -y create-llama@0.1.25 \
    --post-install-action none \
    --no-llama-parse \
    --example-file \
-    --vector-db none \
+    --vector-db milvus \


goal: we just need to change this from none to milvus to use Milvus in chat llamaindex

thucpn · 2024-08-01T10:49:04Z

Open new PR here: #5

thucpn mentioned this pull request Jul 31, 2024

feat: use latest create-llama and llamaindex run-llama/chat-llamaindex#100

Closed

marcusschiesser reviewed Jul 31, 2024

View reviewed changes

thucpn added 2 commits July 31, 2024 21:57

feat: use milvus vector store in create-llama command

870a31b

feat: update milvus engine

3175f76

thucpn force-pushed the feat/use-milvus-in-chat-llamaindex branch from 256bd96 to 3175f76 Compare July 31, 2024 14:58

vercel bot had a problem deploying to Preview July 31, 2024 14:58 Failure

refactor: split getIndex function

50fe446

vercel bot had a problem deploying to Preview July 31, 2024 15:31 Failure

thucpn requested a review from marcusschiesser July 31, 2024 15:31

marcusschiesser reviewed Aug 1, 2024

View reviewed changes

thucpn added 5 commits August 1, 2024 16:21

refactor: use create-llama on main to reduce code

2d0f99d

feat: use milvus vector store in create-llama command

8c6bcd9

feat: update milvus engine

995c01f

refactor: split getIndex function

b8b0e8a

chore: resolve conflict

d9e155c

vercel bot had a problem deploying to Preview August 1, 2024 10:42 Failure

thucpn closed this Aug 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use milvus in chat llamaindex #4

feat: use milvus in chat llamaindex #4

thucpn commented Jul 31, 2024

vercel bot commented Jul 31, 2024 •

edited

Loading

marcusschiesser Jul 31, 2024

marcusschiesser Jul 31, 2024

thucpn Jul 31, 2024

thucpn Jul 31, 2024

marcusschiesser Aug 1, 2024

thucpn Aug 1, 2024

marcusschiesser Aug 1, 2024

thucpn Aug 1, 2024

marcusschiesser Aug 1, 2024

marcusschiesser Aug 1, 2024

marcusschiesser Aug 1, 2024

marcusschiesser Aug 1, 2024

thucpn commented Aug 1, 2024

		});
		const storageContext = await storageContextFromDefaults({ vectorStore });

feat: use milvus in chat llamaindex #4

feat: use milvus in chat llamaindex #4

Conversation

thucpn commented Jul 31, 2024

vercel bot commented Jul 31, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

thucpn commented Aug 1, 2024

vercel bot commented Jul 31, 2024 •

edited

Loading