fix(ci): modify docs embeddings ci to only run on english documentation#1455
Merged
waleedlatif1 merged 1 commit intostagingfrom Sep 26, 2025
Merged
fix(ci): modify docs embeddings ci to only run on english documentation#1455waleedlatif1 merged 1 commit intostagingfrom
waleedlatif1 merged 1 commit intostagingfrom
Conversation
Contributor
There was a problem hiding this comment.
Greptile Overview
Summary
Modified the documentation embeddings CI process to target only English documentation by changing the default docs path from /docs to /docs/en. This optimization reduces processing time by excluding German, Spanish, French, Japanese, and Chinese documentation from the embeddings generation.
- Performance improvement: Processes only English docs instead of all 6 languages
- Path change: Updated default
docsPathfromdocs/todocs/en/ - Maintains compatibility: Still allows custom path override via options parameter
Confidence Score: 5/5
- This PR is safe to merge with minimal risk
- Single line change that correctly appends '/en' to existing path structure, verified directory exists, maintains all existing functionality while improving performance
- No files require special attention
Important Files Changed
File Analysis
| Filename | Score | Overview |
|---|---|---|
| apps/sim/scripts/process-docs-embeddings.ts | 5/5 | Simple path modification to target only English docs - correctly appends '/en' to existing path structure |
Sequence Diagram
sequenceDiagram
participant CI as CI Pipeline
participant Script as process-docs-embeddings.ts
participant FS as File System
participant Chunker as DocsChunker
participant DB as Database
Note over CI,DB: Before: Processing all languages
CI->>Script: Execute with default config
Script->>FS: Read docs from /docs/ (all languages)
FS-->>Script: Return docs from en/, es/, fr/, de/, ja/, zh/
Script->>Chunker: Process all language docs
Chunker-->>Script: Generate embeddings for all languages
Script->>DB: Store embeddings
Note over CI,DB: After: Processing only English docs
CI->>Script: Execute with modified config
Script->>FS: Read docs from /docs/en/ (English only)
FS-->>Script: Return docs from en/ only
Script->>Chunker: Process English docs only
Chunker-->>Script: Generate embeddings for English only
Script->>DB: Store English embeddings
1 file reviewed, no comments
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
modify docs embeddings ci to only run on english documentation. docs embeddings were running in every language and updates were taking a while
Type of Change
Testing
Tested manually.
Checklist