Phase 6: Inference Agent

vLLM + Qwen-Agent + MCP tool integration for production inference. Planned architecture includes screenshot ingestion endpoint, code generation API, and IDE plugin.

Status: planned

Technologies

  • vLLM
  • Qwen-Agent
  • MCP
  • FastAPI

Outputs

  • Inference API endpoint
  • IDE plugin
← Phase 5