feat(copilot): JSON sanitization logic + operations sequence diff correctness #1521

icecrasher321 · 2025-10-02T00:32:21Z

Summary

Make copilot JSON language tracking consistent across workflows and operations.

Type of Change

New feature

Testing

Manually via training modal

Checklist

Code follows project style guidelines
Self-reviewed my changes
Tests added/updated and passing
No new warnings introduced
I confirm that I have read and agree to the terms outlined in the Contributor License Agreement (CLA)

vercel · 2025-10-02T00:32:26Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Preview	Comments	Updated (UTC)
sim	Ready	Preview	Comment	Oct 2, 2025 10:13pm

1 Skipped Deployment

Project	Deployment	Preview	Comments	Updated (UTC)
docs	Skipped			Oct 2, 2025 10:13pm

greptile-apps

Greptile Overview

Summary

This PR introduces significant architectural changes to standardize JSON serialization and workflow data structures for copilot training consistency. The main changes include:

Core Architecture Refactoring:

JSON Sanitization Overhaul: The json-sanitizer.ts file has been completely restructured to embed connections directly within blocks rather than maintaining separate edges/loops/parallels arrays. This creates a more nested, self-contained data structure that aligns with how copilot operations work internally.
Enhanced Security: Added comprehensive sensitive data detection with regex patterns and OAuth input type checking to prevent API keys and secrets from being included in training data.
Simplified Data Structures: Replaced complex ReactFlow-specific structures with a unified nested approach where loops/parallels are represented as nestedNodes within their parent blocks.

Training Modal Enhancement:
Added a new "Send Live State" tab to the training modal (training-modal.tsx) that allows users to capture and submit the current workflow state directly for copilot training, complementing the existing session recording functionality. This provides a quick way to submit complete workflow examples without going through the full editing session process.

Operations System Updates:

Edit Sequence Computation: Refactored compute-edit-sequence.ts to work with the new sanitized format, introducing extractAllEdgesFromBlocks function and unified nestedNodes handling.
Workflow Editing: Enhanced edit-workflow.ts with comprehensive nested node support for loop and parallel blocks, enabling proper hierarchical block structure management.
Metadata Standardization: Renamed copilot block metadata properties (commonParameters → inputSchema, inputs → inputDefinitions) for better semantic clarity.

User Settings Integration:
Added two new boolean settings (showFloatingControls and showTrainingControls) to support the new UI controls, with appropriate defaults to maintain backward compatibility while allowing users to opt into new training features.

These changes work together to create a more consistent and robust foundation for copilot training data, moving away from UI-specific ReactFlow concepts toward a more standardized, nested data representation that better reflects actual workflow execution patterns.

Important Files Changed

Changed Files

Filename	Score	Overview
apps/sim/app/workspace/[workspaceId]/w/[workflowId]/components/training-controls/training-modal.tsx	4/5	Adds new "Send Live State" tab for direct workflow state submission to copilot training
apps/sim/lib/workflows/training/compute-edit-sequence.ts	3/5	Major refactor to work with sanitized JSON format, removes edge removal tracking
apps/sim/lib/workflows/json-sanitizer.ts	4/5	Complete restructuring to embed connections in blocks and use nested structures
apps/sim/lib/copilot/tools/server/blocks/get-blocks-metadata-tool.ts	5/5	Renames metadata properties for better semantic consistency
apps/sim/app/api/users/me/settings/route.ts	5/5	Adds user settings for training UI controls with proper defaults
apps/sim/lib/copilot/tools/server/workflow/edit-workflow.ts	4/5	Enhances workflow editing with comprehensive nested node support

Confidence score: 3/5

This PR involves significant architectural changes that affect core serialization logic and may have widespread implications across the codebase
Score reflects the complexity and scope of the refactoring, particularly the structural changes to JSON sanitization that could impact other systems consuming this data
Pay close attention to compute-edit-sequence.ts and json-sanitizer.ts as they contain the most significant structural changes

Sequence Diagram

sequenceDiagram
    participant User
    participant TrainingModal
    participant CopilotTrainingStore
    participant JSONSanitizer
    participant TrainingAPI
    participant AgentIndexer

    User->>TrainingModal: "Open Training Modal"
    TrainingModal->>CopilotTrainingStore: "Get current state"
    CopilotTrainingStore-->>TrainingModal: "Return training data"

    User->>TrainingModal: "Start Training Session"
    TrainingModal->>CopilotTrainingStore: "startTraining(title, prompt)"
    CopilotTrainingStore->>CopilotTrainingStore: "Capture start snapshot"
    CopilotTrainingStore-->>TrainingModal: "Training session started"

    User->>User: "Edit workflow (add/edit/delete blocks)"

    User->>TrainingModal: "Stop Training"
    CopilotTrainingStore->>JSONSanitizer: "sanitizeForCopilot(startState)"
    JSONSanitizer-->>CopilotTrainingStore: "Sanitized start state"
    CopilotTrainingStore->>JSONSanitizer: "sanitizeForCopilot(endState)"
    JSONSanitizer-->>CopilotTrainingStore: "Sanitized end state"
    CopilotTrainingStore->>CopilotTrainingStore: "computeEditSequence(start, end)"
    CopilotTrainingStore->>CopilotTrainingStore: "Save dataset with operations"
    CopilotTrainingStore-->>TrainingModal: "Dataset saved"

    User->>TrainingModal: "Send Dataset to Indexer"
    TrainingModal->>JSONSanitizer: "sanitizeForCopilot(input/output)"
    JSONSanitizer-->>TrainingModal: "Sanitized workflow states"
    TrainingModal->>TrainingAPI: "POST /api/copilot/training"
    TrainingAPI->>AgentIndexer: "POST /examples/add"
    AgentIndexer-->>TrainingAPI: "Success/Error response"
    TrainingAPI-->>TrainingModal: "Training result"
    TrainingModal->>CopilotTrainingStore: "markDatasetSent(id)"

    User->>TrainingModal: "Send Live Workflow"
    TrainingModal->>JSONSanitizer: "sanitizeForCopilot(currentWorkflow)"
    JSONSanitizer-->>TrainingModal: "Sanitized workflow"
    TrainingModal->>TrainingAPI: "POST /api/copilot/training/examples"
    TrainingAPI->>AgentIndexer: "POST /examples/add"
    AgentIndexer-->>TrainingAPI: "Success/Error response"
    TrainingAPI-->>TrainingModal: "Result"

Additional Comments (1)

apps/sim/lib/workflows/json-sanitizer.ts, line 277 (link)

style: Export sanitization uses different logic than copilot sanitization - could lead to inconsistencies

_{6 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

apps/sim/lib/copilot/tools/server/workflow/edit-workflow.ts

…rectness (#1521) * add state sending capability * progress * add ability to add title and description to workflow state * progress in language * fix * cleanup code * fix type issue * fix subflow deletion case * Workflow console tool * fix lint --------- Co-authored-by: Siddharth Ganesan <siddharthganesan@gmail.com>

icecrasher321 added 4 commits September 30, 2025 16:03

add state sending capability

c999586

progress

9d861a4

add ability to add title and description to workflow state

2b08778

progress in language

6d3d094

greptile-apps bot reviewed Oct 2, 2025

View reviewed changes

apps/sim/lib/copilot/tools/server/workflow/edit-workflow.ts Show resolved Hide resolved

apps/sim/lib/copilot/tools/server/workflow/edit-workflow.ts Show resolved Hide resolved

fix

2dedaa8

vercel bot temporarily deployed to Preview – docs October 2, 2025 02:16 Inactive

vercel bot had a problem deploying to Preview – sim October 2, 2025 02:20 Failure

cleanup code

dffa234

vercel bot temporarily deployed to Preview – docs October 2, 2025 02:30 Inactive

vercel bot had a problem deploying to Preview – sim October 2, 2025 02:33 Failure

fix type issue

0e9ed39

vercel bot temporarily deployed to Preview – docs October 2, 2025 02:54 Inactive

vercel bot deployed to Preview – sim October 2, 2025 02:58 View deployment

fix subflow deletion case

309bab0

vercel bot temporarily deployed to Preview – docs October 2, 2025 20:02 Inactive

vercel bot deployed to Preview – sim October 2, 2025 20:06 View deployment

Merge branch 'staging' into feat/add-example-copilot-training

9af869a

vercel bot deployed to Preview – docs October 2, 2025 22:04 View deployment

vercel bot deployed to Preview – sim October 2, 2025 22:05 View deployment

Workflow console tool

37d848b

vercel bot temporarily deployed to Preview – docs October 2, 2025 22:05 Inactive

fix lint

1484ac6

vercel bot temporarily deployed to Preview – docs October 2, 2025 22:07 Inactive

icecrasher321 merged commit 4bc37db into staging Oct 2, 2025
9 of 10 checks passed

vercel bot deployed to Preview – sim October 2, 2025 22:13 View deployment

waleedlatif1 mentioned this pull request Oct 2, 2025

v0.4.3: posthog, docs updates, search modal improvements #1534

Merged

waleedlatif1 deleted the feat/add-example-copilot-training branch October 7, 2025 23:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(copilot): JSON sanitization logic + operations sequence diff correctness #1521

feat(copilot): JSON sanitization logic + operations sequence diff correctness #1521

Uh oh!

icecrasher321 commented Oct 2, 2025

Uh oh!

vercel bot commented Oct 2, 2025 •

edited

Loading

Uh oh!

greptile-apps bot left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat(copilot): JSON sanitization logic + operations sequence diff correctness #1521

feat(copilot): JSON sanitization logic + operations sequence diff correctness #1521

Uh oh!

Conversation

icecrasher321 commented Oct 2, 2025

Summary

Type of Change

Testing

Checklist

Uh oh!

vercel bot commented Oct 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greptile-apps bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Greptile Overview

Summary

Important Files Changed

Confidence score: 3/5

Sequence Diagram

Additional Comments (1)

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vercel bot commented Oct 2, 2025 •

edited

Loading

greptile-apps bot left a comment •

edited

Loading