SLM Integration
Technical breakdown of Phase 2 "Smart Compiler": Integrating SLMs for context-aware error correction.
The Turf project is evolving its “Smart Compiler” capabilities from static heuristic-based suggestions (Phase 1) to dynamic, context-aware error correction powered by Small Language Models (SLMs).
The Core Concept
Traditional compilers provide error messages based on predefined rules. Phase 2 of the Turf compiler aims to provide human-like reasoning for why an error occurred and how to fix it by feeding the compiler’s internal state into an SLM.
Technical Architecture
The integration follows a structured pipeline designed to minimize latency and maximize the relevance of the SLM’s output.
1. Context Extraction
When a TurfError is raised, the DiagnosticEngine captures more than just the error message. It extracts a Semantic Context Block:
- Source Snippet: The offending line plus 5-10 lines of surrounding code.
- Symbol Table State: Names and types of all variables currently in scope.
- Parser State: The current token being processed and the expected grammar production.
- Type Information: Details about inferred types and conversion failures.
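The Semantic Context Block can be modeled as a plain data structure. A minimal sketch in Python, with field names that are assumptions for illustration (the real Turf DiagnosticEngine interface is not shown in this document):

```python
from dataclasses import dataclass, field

@dataclass
class SemanticContextBlock:
    """Snapshot of compiler state captured when a TurfError is raised."""
    source_snippet: str        # offending line plus surrounding code
    snippet_start_line: int    # line number where the snippet begins
    symbols_in_scope: dict[str, str] = field(default_factory=dict)  # name -> type
    current_token: str = ""          # token the parser was processing
    expected_production: str = ""    # grammar rule the parser expected next
    type_notes: list[str] = field(default_factory=list)  # inference/conversion details

# Hypothetical example of what the DiagnosticEngine might capture:
ctx = SemanticContextBlock(
    source_snippet="let total = count + label",
    snippet_start_line=41,
    symbols_in_scope={"count": "Int", "label": "String"},
    current_token="label",
    expected_production="arithmetic_operand",
    type_notes=["cannot add Int and String"],
)
```

Keeping the block as structured data (rather than a pre-rendered string) lets the prompt-building stage decide how much of it to include per error class.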
2. Prompt Engineering
The extracted context is transformed into a structured prompt. A typical prompt structure includes:
- System Instruction: “You are an expert Turf language compiler assistant.”
- Code Context: The source snippet with line numbers.
- Error Identification: The raw error message generated by the compiler (e.g., TypeError).
- Objective: “Suggest the most likely fix and explain the mistake in one concise sentence.”
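The four-part structure above can be assembled mechanically. A sketch, assuming a hypothetical `build_prompt` helper and the snippet/error names used here purely for illustration:

```python
def build_prompt(snippet: str, start_line: int, error_message: str) -> str:
    """Assemble the four-part prompt: system instruction, numbered code
    context, error identification, and objective (illustrative layout)."""
    numbered = "\n".join(
        f"{start_line + i:4d} | {line}"
        for i, line in enumerate(snippet.splitlines())
    )
    return (
        "You are an expert Turf language compiler assistant.\n\n"
        f"Code context:\n{numbered}\n\n"
        f"Compiler error: {error_message}\n\n"
        "Objective: Suggest the most likely fix and explain the mistake "
        "in one concise sentence."
    )

prompt = build_prompt(
    "let total = count + label",
    41,
    "TypeError: cannot add Int and String",
)
```

Numbering the snippet lines lets the SLM refer to an exact location, which makes its replacement easier to apply and verify downstream.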
3. Feedback Loop & Verification
To ensure suggestions are valid, the compiler can implement a Speculative Re-parsing mechanism:
- The SLM provides a suggested code replacement.
- The compiler applies the fix in an isolated, temporary memory buffer.
- The parser and linter are run against the modified buffer.
- If the error is resolved (and no new ones are introduced), the suggestion is presented to the user with a high confidence score.
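The speculative re-parsing loop above can be sketched as a pure function. This is a minimal illustration, not the Turf implementation: `check` stands in for the real parser-plus-linter pass, and the toy checker and snippets are invented for the example:

```python
def verify_suggestion(original: str, suggested: str, check) -> tuple[bool, str]:
    """Apply the SLM's fix to a temporary copy of the source, re-run the
    checker, and accept only if errors are resolved and none are introduced."""
    baseline_errors = set(check(original))
    new_errors = set(check(suggested))
    resolved = baseline_errors - new_errors
    introduced = new_errors - baseline_errors
    accepted = bool(resolved) and not introduced
    return accepted, suggested if accepted else original

# Toy checker standing in for the Turf parser/linter: flags one known pattern.
def toy_check(src: str) -> list[str]:
    return ["TypeError"] if "count + label" in src else []

ok, text = verify_suggestion(
    "let total = count + label",
    "let total = count + int(label)",  # hypothetical SLM suggestion
    toy_check,
)
```

Because verification runs against an isolated buffer, a rejected suggestion leaves the user's source untouched, and the confidence score can be derived directly from the resolved/introduced error sets.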
Implementation Roadmap
- Prototype: Integration with local SLM runners (e.g., Ollama) via CLI hooks.
- Optimization: Caching common error patterns and their SLM-generated fixes to reduce repeated API calls.
- UX: Direct integration into the turf CLI output, providing a --fix flag to automatically apply the SLM’s suggestion.
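The caching step in the roadmap could key SLM-generated fixes on a normalized error signature, so that repeated identical errors skip the model call entirely. A hypothetical scheme (not the project's actual cache design):

```python
import hashlib

class FixCache:
    """Cache SLM-generated fixes keyed by a hash of the error message and
    snippet, with whitespace normalized so reformatted code still hits."""
    def __init__(self):
        self._store: dict[str, str] = {}
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(error_message: str, snippet: str) -> str:
        normalized = " ".join((error_message + " " + snippet).split())
        return hashlib.sha256(normalized.encode()).hexdigest()

    def get_or_compute(self, error_message: str, snippet: str, slm_call) -> str:
        key = self._key(error_message, snippet)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        self.misses += 1
        fix = slm_call(error_message, snippet)  # the expensive model invocation
        self._store[key] = fix
        return fix

cache = FixCache()
fake_slm = lambda err, code: "let total = count + int(label)"  # stand-in for the SLM
first = cache.get_or_compute("TypeError", "let total = count + label", fake_slm)
second = cache.get_or_compute("TypeError", "let total = count + label", fake_slm)
```

Hashing a normalized signature rather than raw text keeps the cache small and makes hit rates meaningful across trivially different formattings of the same mistake.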
Privacy & Security
Turf’s SLM integration is designed to be Privacy-First. Because SLM execution defaults to the user’s local machine, no source code ever leaves it, making the feature suitable for sensitive and professional development environments.