Phase 6: Inference Agent
vLLM + Qwen-Agent + MCP tool integration for production inference. Planned architecture includes screenshot ingestion endpoint, code generation API, and IDE plugin.
Status: planned
Technologies
vLLMQwen-AgentMCPFastAPI
Outputs
- Inference API endpoint
- IDE plugin