feat(context optimization): Optimize LLM Context Management and File Handling #578

Merged
merged 4 commits into stackblitz-labs:main on Dec 13, 2024

Conversation

thecodacus (Collaborator)

Optimize LLM Context Management and File Handling

Overview

This PR significantly improves how we manage LLM context and file handling in chat interactions. The changes optimize memory usage, extend chat context length, and provide a single source of truth for file content, resulting in more reliable and efficient AI operations.

Key Changes

1. Context Optimization

  • Implemented a new file context system that maintains a single source of truth for code files (see the sketch after this list)
  • Added file content filtering using gitignore-style patterns to exclude irrelevant files
  • Simplified bolt actions in chat history by truncating file content
  • Added line numbers to code context for better reference tracking
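
As a rough illustration of that approach, the sketch below (not the merged implementation: the `<codebase>`/`<file>` tag shape, the IGNORE_PATTERNS list, and the FileMap type are assumptions) builds one context block, filters paths with the `ignore` npm package, and prefixes every line with its number:

```ts
import ignore from 'ignore';

// Illustrative defaults; the PR uses a set of common gitignore-style patterns.
const IGNORE_PATTERNS = ['node_modules/**', 'dist/**', '.git/**', '*.lock'];

type FileMap = Record<string, string>; // relative path -> file content

function createFilesContext(files: FileMap): string {
  const ig = ignore().add(IGNORE_PATTERNS);

  const sections = Object.entries(files)
    .filter(([path]) => !ig.ignores(path)) // drop irrelevant files
    .map(([path, content]) => {
      // Prefix each line with its number so the model can reference exact locations.
      const numbered = content
        .split('\n')
        .map((line, i) => `${i + 1}|${line}`)
        .join('\n');
      return `<file path="${path}">\n${numbered}\n</file>`;
    });

  return `<codebase>\n${sections.join('\n')}\n</codebase>`;
}
```

Generating this block from the current files gives the model a single source of truth, instead of stale copies scattered through the chat history.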

2. Chat History Management

  • Optimized message processing to remove redundant file contexts from chat history
  • Implemented content simplification for assistant messages containing file actions (see the sketch after this list)
  • Added support for streaming file contexts efficiently
  • Modified the chat client to handle the new file context system
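
A minimal sketch of that simplification (assuming bolt's `<boltAction type="file">` artifact tags; the regex and placeholder text are illustrative, not the exact code that was merged):

```ts
// Strip file bodies out of earlier assistant messages; the up-to-date content
// lives only in the file context, so the history stays small.
function simplifyBoltActions(content: string): string {
  const fileActionRegex =
    /(<boltAction[^>]*type="file"[^>]*>)([\s\S]*?)(<\/boltAction>)/g;

  return content.replace(
    fileActionRegex,
    (_match, openTag, _body, closeTag) =>
      `${openTag}\n[file content truncated - see the file context for the current version]\n${closeTag}`,
  );
}
```

Applied to every assistant message before the history is sent back to the model, this keeps the record that a file was written without repeating its contents.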

3. Workbench Improvements

  • Enhanced action execution queue management
  • Added duplicate action execution prevention (see the sketch after this list)
  • Improved file action handling in the webcontainer
  • Modified artifact management for better state tracking
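
For illustration, a queued runner with duplicate prevention could look like this (a sketch only; ActionRunner, ActionData, and runAction are placeholder names rather than the workbench's real API):

```ts
interface ActionData {
  actionId: string;
  type: 'file' | 'shell';
  content: string;
}

class ActionRunner {
  private executed = new Set<string>();
  private queue: Promise<void> = Promise.resolve();

  addAction(action: ActionData, runAction: (a: ActionData) => Promise<void>) {
    // Duplicate prevention: an action id is only ever executed once.
    if (this.executed.has(action.actionId)) {
      return;
    }
    this.executed.add(action.actionId);

    // Chain onto the queue so actions run one at a time, in order.
    this.queue = this.queue
      .then(() => runAction(action))
      .catch((error) => console.error('action failed', error));
  }
}
```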

Benefits

  • Extended Chat Context: Enables longer conversations by reducing redundant content
  • Improved Accuracy: Single source of truth for file content reduces hallucinations
  • Better Memory Usage: Optimized context management reduces memory overhead
  • Enhanced Reliability: Consistent file context handling improves AI response accuracy

Technical Details

  • Added createFilesContext function to generate structured file contexts
  • Implemented simplifyBoltActions for optimizing chat history
  • Modified stream handling to incorporate file context efficiently (see the sketch after this list)
  • Added proper file filtering using common ignore patterns
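
Putting the pieces together, the request path could look roughly like the sketch below. It builds on the two sketches above and assumes the Vercel AI SDK's streamText; the surrounding names are illustrative only:

```ts
import { streamText, type CoreMessage, type LanguageModel } from 'ai';

// Assumed helpers from the earlier sketches:
declare function createFilesContext(files: Record<string, string>): string;
declare function simplifyBoltActions(content: string): string;

async function chatWithContext(
  model: LanguageModel,
  systemPrompt: string,
  messages: CoreMessage[],
  files: Record<string, string>,
) {
  // Drop stale file bodies from previous assistant turns.
  const history = messages.map((m) =>
    m.role === 'assistant' && typeof m.content === 'string'
      ? { ...m, content: simplifyBoltActions(m.content) }
      : m,
  );

  // The current files appear exactly once, in the system prompt, with line numbers.
  return streamText({
    model,
    system: `${systemPrompt}\n\nCurrent project files:\n${createFilesContext(files)}`,
    messages: history,
  });
}
```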

Testing

  • Verified context optimization with large codebases
  • Tested chat history management with complex interactions
  • Validated file content handling and reference accuracy
  • Confirmed proper handling of ignore patterns

Migration Notes

No breaking changes. Existing chat interactions will automatically benefit from the optimizations.

Future Improvements

  • Implement differential context updates
  • Optimize context generation for even larger repositories

Preview

Context.Optimization.demo.mp4

coleam00 (Collaborator) commented on Dec 7, 2024

@thecodacus I am going to review this over the next couple of days, this is absolutely fantastic and I want to give it some proper testing!

coleam00 (Collaborator) left a comment


Fantastic work @thecodacus!! I took a look at everything and tested it on my end.

I think this is ready to merge except I just have a couple of questions:

  1. Do you intend to remove all the debug messages in the terminal (the ones that print out processedMessages)? They were super helpful for me to see what is happening behind the scenes, btw! But I'm thinking it would be good to remove them before merging.

  2. Which LLMs have you tested this with? I tested with Qwen 2.5 Coder 32B and seemed to get suboptimal results compared to what I usually get, but then again, you never know with local LLMs; it could be a fluke.

thecodacus (Collaborator, Author)

Tested with some local models (7B ones and Llama 3.2); not much difference there. Also tested with GPT-4o and Claude Sonnet and got almost identical output.

thecodacus (Collaborator, Author) commented on Dec 12, 2024

@wonderwhy-er, since this is a small architectural change, I'd like to confirm with you whether it conflicts with any changes you have planned for the future.

If not, I will merge.

aliasfoxkde (Collaborator)

Fantastic work @thecodacus!! I took a look at everything and tested it on my end.

I think this is ready to merge except I just have a couple of questions:

  1. Do you intend to remove all the debug messages in the terminal (the ones that print out processedMessages)? They were super helpful for me to see what is happening behind the scenes, btw! But I'm thinking it would be good to remove them before merging.
  2. Which LLMs have you tested this with? I tested with Qwen 2.5 Coder 32B and seemed to get suboptimal results compared to what I usually get, but then again, you never know with local LLMs; it could be a fluke.

I like this a lot, but yes, the console logging is very verbose. The way I would imagine handling this is to disable console logging for production builds, so the build command would turn it off (or something like that). For development and tracking down issues, this is great!
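
One way to get that behavior (a sketch, assuming a Vite-style build where import.meta.env.DEV is available) is a small logger that only prints during development:

```ts
// No-op in production builds; verbose only while developing.
const debugLog = (...args: unknown[]) => {
  if (import.meta.env.DEV) {
    console.log('[context]', ...args);
  }
};

// debugLog('processedMessages', processedMessages);
```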

thecodacus merged commit 8c4397a into stackblitz-labs:main on Dec 13, 2024
1 check passed
JJ-Dynamite pushed a commit to val-x/valenClient that referenced this pull request on Jan 29, 2025: feat(context optimization): Optimize LLM Context Management and File Handling