Operationalising Foundry Local at the Disconnected Edge: One-Month Prototype Lessons #236
Liam-dev
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
How we stood up a mission‑critical prototype of Foundry Local in a fully disconnected environment, the architectural decisions we made, what worked, what hurt, and what we’re asking the community (and platform team) for—especially around embeddings and RAG.
Pace Applied Solutions – Edge AI Team
1. Why We Did This
We’re prototyping a high‑value, mission‑critical Edge AI capability that must:
Foundry Local emerged as the core inference layer after multiple design passes across alternative stacks.
2. Environment Constraints (The Real Battle)
Operating conditions:
Key principle that emerged:
3. Final Architecture (Current Prototype)
4. Foundry Local Deployment Workflow (Offline Pattern)
This has been reliable for:
5. Lightweight UI Wrapper Strategy
Why not heavier frameworks? Faster to bootstrap + fewer build dependencies.
What it does:
Port Resolution Logic (simplified sketch):
6. What Worked Well
7. The Hard Stuff
8. RAG & Embeddings: The Major Gap
We have functional chat + agents, but RAG is where friction spikes – and this is critical at user need at edge:
What would dramatically improve disconnected adoption:
If any of this is already possible today (especially #2), we would love a pointer.
9. Value Delivered (Already)
Even in month one:
10. Quick Start Tips (For Others Attempting Similar)
11. Open Questions (Community / Team Ask)
We’d appreciate guidance or confirmation on:
12. What We Can Share Back
Happy to share any learnings or insights if useful.
13. Next Steps (Our Side)
Work with users on the current prototype and keep experimenting with other options, eagerly look for new releases and help test out live!
14. Final Note
Foundry Local has been genuinely impressive as a core inference backbone in constrained environments. Even with the friction points, it’s already accelerated our trajectory. Looking forward to future iterations—this is a strong foundation.
If you have tips, corrections, or can unblock the embeddings question—reach out. We’re only a month in and eager to refine.
Beta Was this translation helpful? Give feedback.
All reactions