Skip to content

Commit 59eac14

Browse files
committed
Merge branch 'lightspeed-core:main' into stream_query, also build common structure for both query and streaming_query
2 parents dc04bf4 + b5eb048 commit 59eac14

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

45 files changed

+4831
-756
lines changed

docs/byok_guide.md

Lines changed: 479 additions & 0 deletions
Large diffs are not rendered by default.

docs/config.png

218 Bytes
Loading

docs/config.puml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ class "ConversationCacheConfiguration" as src.models.config.ConversationCacheCon
4747
memory : Optional[InMemoryCacheConfig]
4848
postgres : Optional[PostgreSQLDatabaseConfiguration]
4949
sqlite : Optional[SQLiteDatabaseConfiguration]
50-
type : Literal['memory', 'sqlite', 'postgres'] | None
50+
type : Literal['noop', 'memory', 'sqlite', 'postgres'] | None
5151
check_cache_configuration() -> Self
5252
}
5353
class "CustomProfile" as src.models.config.CustomProfile {

docs/config.svg

Lines changed: 269 additions & 269 deletions
Loading

docs/openapi.json

Lines changed: 507 additions & 7 deletions
Large diffs are not rendered by default.

docs/openapi.md

Lines changed: 156 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -348,6 +348,69 @@ Returns:
348348
| 404 | Not Found | |
349349
| 503 | Service Unavailable | |
350350
| 422 | Validation Error | [HTTPValidationError](#httpvalidationerror) |
351+
## GET `/v2/conversations`
352+
353+
> **Get Conversations List Endpoint Handler**
354+
355+
Handle request to retrieve all conversations for the authenticated user.
356+
357+
358+
359+
360+
361+
### ✅ Responses
362+
363+
| Status Code | Description | Component |
364+
|-------------|-------------|-----------|
365+
| 200 | Successful Response | [ConversationsListResponseV2](#conversationslistresponsev2) |
366+
## GET `/v2/conversations/{conversation_id}`
367+
368+
> **Get Conversation Endpoint Handler**
369+
370+
Handle request to retrieve a conversation by ID.
371+
372+
373+
374+
### 🔗 Parameters
375+
376+
| Name | Type | Required | Description |
377+
|------|------|----------|-------------|
378+
| conversation_id | string | True | |
379+
380+
381+
### ✅ Responses
382+
383+
| Status Code | Description | Component |
384+
|-------------|-------------|-----------|
385+
| 200 | Successful Response | [ConversationResponse](#conversationresponse) |
386+
| 400 | Missing or invalid credentials provided by client | [UnauthorizedResponse](#unauthorizedresponse) |
387+
| 401 | Unauthorized: Invalid or missing Bearer token | [UnauthorizedResponse](#unauthorizedresponse) |
388+
| 404 | Not Found | |
389+
| 422 | Validation Error | [HTTPValidationError](#httpvalidationerror) |
390+
## DELETE `/v2/conversations/{conversation_id}`
391+
392+
> **Delete Conversation Endpoint Handler**
393+
394+
Handle request to delete a conversation by ID.
395+
396+
397+
398+
### 🔗 Parameters
399+
400+
| Name | Type | Required | Description |
401+
|------|------|----------|-------------|
402+
| conversation_id | string | True | |
403+
404+
405+
### ✅ Responses
406+
407+
| Status Code | Description | Component |
408+
|-------------|-------------|-----------|
409+
| 200 | Successful Response | [ConversationDeleteResponse](#conversationdeleteresponse) |
410+
| 400 | Missing or invalid credentials provided by client | [UnauthorizedResponse](#unauthorizedresponse) |
411+
| 401 | Unauthorized: Invalid or missing Bearer token | [UnauthorizedResponse](#unauthorizedresponse) |
412+
| 404 | Not Found | |
413+
| 422 | Validation Error | [HTTPValidationError](#httpvalidationerror) |
351414
## GET `/readiness`
352415

353416
> **Readiness Probe Get Method**
@@ -562,6 +625,21 @@ Global service configuration.
562625
| authorization | | |
563626
| customization | | |
564627
| inference | | |
628+
| conversation_cache | | |
629+
630+
631+
## ConversationCacheConfiguration
632+
633+
634+
Conversation cache configuration.
635+
636+
637+
| Field | Type | Description |
638+
|-------|------|-------------|
639+
| type | | |
640+
| memory | | |
641+
| sqlite | | |
642+
| postgres | | |
565643

566644

567645
## ConversationDeleteResponse
@@ -697,6 +775,20 @@ Example:
697775
| conversations | array | |
698776

699777

778+
## ConversationsListResponseV2
779+
780+
781+
Model representing a response for listing conversations of a user.
782+
783+
Attributes:
784+
conversations: List of conversation IDs associated with the user.
785+
786+
787+
| Field | Type | Description |
788+
|-------|------|-------------|
789+
| conversations | array | |
790+
791+
700792
## CustomProfile
701793

702794

@@ -881,6 +973,17 @@ Model representing response for forbidden access.
881973
| detail | array | |
882974

883975

976+
## InMemoryCacheConfig
977+
978+
979+
In-memory cache configuration.
980+
981+
982+
| Field | Type | Description |
983+
|-------|------|-------------|
984+
| max_entries | integer | |
985+
986+
884987
## InferenceConfiguration
885988

886989

@@ -1102,12 +1205,36 @@ Model representing LLM response to a query.
11021205
Attributes:
11031206
conversation_id: The optional conversation ID (UUID).
11041207
response: The response.
1208+
rag_chunks: List of RAG chunks used to generate the response.
1209+
referenced_documents: The URLs and titles for the documents used to generate the response.
1210+
tool_calls: List of tool calls made during response generation.
1211+
TODO: truncated: Whether conversation history was truncated.
1212+
TODO: input_tokens: Number of tokens sent to LLM.
1213+
TODO: output_tokens: Number of tokens received from LLM.
1214+
TODO: available_quotas: Quota available as measured by all configured quota limiters
1215+
TODO: tool_results: List of tool results.
11051216

11061217

11071218
| Field | Type | Description |
11081219
|-------|------|-------------|
11091220
| conversation_id | | The optional conversation ID (UUID) |
11101221
| response | string | Response from LLM |
1222+
| rag_chunks | array | |
1223+
| tool_calls | | List of tool calls made during response generation |
1224+
| referenced_documents | array | List of documents referenced in generating the response |
1225+
1226+
1227+
## RAGChunk
1228+
1229+
1230+
Model representing a RAG chunk used in the response.
1231+
1232+
1233+
| Field | Type | Description |
1234+
|-------|------|-------------|
1235+
| content | string | The content of the chunk |
1236+
| source | | Source document or URL |
1237+
| score | | Relevance score |
11111238

11121239

11131240
## ReadinessResponse
@@ -1143,6 +1270,22 @@ Example:
11431270
| providers | array | List of unhealthy providers in case of readiness failure. |
11441271

11451272

1273+
## ReferencedDocument
1274+
1275+
1276+
Model representing a document referenced in generating a response.
1277+
1278+
Attributes:
1279+
doc_url: Url to the referenced doc.
1280+
doc_title: Title of the referenced doc.
1281+
1282+
1283+
| Field | Type | Description |
1284+
|-------|------|-------------|
1285+
| doc_url | | URL of the referenced document |
1286+
| doc_title | string | Title of the referenced document |
1287+
1288+
11461289
## SQLiteDatabaseConfiguration
11471290

11481291

@@ -1209,6 +1352,19 @@ TLS configuration.
12091352
| tls_key_password | | |
12101353

12111354

1355+
## ToolCall
1356+
1357+
1358+
Model representing a tool call made during response generation.
1359+
1360+
1361+
| Field | Type | Description |
1362+
|-------|------|-------------|
1363+
| tool_name | string | Name of the tool called |
1364+
| arguments | object | Arguments passed to the tool |
1365+
| result | | Result from the tool |
1366+
1367+
12121368
## UnauthorizedResponse
12131369

12141370

0 commit comments

Comments
 (0)