High-Performance LLM Proxy
Specifically built for Claude (Claude.ai) and Gemini (Google AI Studio, Google Vertex AI)
Core Advantages
Full-Featured Frontend
- Integrated React frontend providing a complete functional experience
Efficient Architecture
- Uses roughly one-tenth the resources of scripting-language implementations while delivering about ten times the performance, easily handling thousands of requests per second
- Event-driven design, decoupled logic, supports hot reloading and multiple configuration methods
- High-performance response caching powered by Moka
- Multi-threaded asynchronous processing based on Tokio and Axum
- Rquest HTTP client with fingerprint-level Chrome emulation
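
To make the response-caching idea concrete, here is a minimal sketch that uses only the Rust standard library instead of Moka. The `ResponseCache` type, its fixed time-to-live policy, and the choice to key entries by the raw request body are all assumptions made for illustration; they are not taken from the project's code.

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

/// Hypothetical in-memory response cache with a fixed time-to-live.
/// Illustrative only: the project itself relies on Moka for caching.
pub struct ResponseCache {
    ttl: Duration,
    // Request body -> (time stored, cached response).
    entries: HashMap<String, (Instant, String)>,
}

impl ResponseCache {
    pub fn new(ttl: Duration) -> Self {
        Self { ttl, entries: HashMap::new() }
    }

    /// Store a response under the request body that produced it.
    pub fn insert(&mut self, request_body: &str, response: String) {
        self.entries
            .insert(request_body.to_string(), (Instant::now(), response));
    }

    /// Return a cached response if one exists and has not expired.
    /// Expired entries are evicted on lookup.
    pub fn get(&mut self, request_body: &str) -> Option<String> {
        let expired = match self.entries.get(request_body) {
            Some((stored_at, _)) => stored_at.elapsed() >= self.ttl,
            None => return None,
        };
        if expired {
            self.entries.remove(request_body);
            return None;
        }
        self.entries
            .get(request_body)
            .map(|(_, response)| response.clone())
    }
}
```

A proxy handler would call `get` before forwarding a request upstream and `insert` once the upstream response arrives. A concurrent server would additionally need to wrap the cache in a lock or use a concurrent cache such as Moka.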
Intelligent Cookie Management
- Automatic classification and management of account status
- Fine-grained rotation (polling) across accounts to maximize resource utilization
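
One way to picture status classification combined with rotation is the sketch below. The `CookieStatus` variants, the `CookiePool` type, and the round-robin policy are hypothetical names and choices introduced for illustration; the project's actual states and scheduling may differ.

```rust
use std::collections::VecDeque;

/// Hypothetical account states; the real project may classify differently.
#[derive(Debug, Clone, PartialEq)]
pub enum CookieStatus {
    /// Usable for new requests.
    Valid,
    /// Rate-limited; skipped during rotation until marked valid again.
    Exhausted,
    /// Rejected upstream; removed from the pool entirely.
    Invalid,
}

#[derive(Debug, Clone)]
pub struct Cookie {
    pub value: String,
    pub status: CookieStatus,
}

/// Round-robin pool that only hands out cookies marked `Valid`.
pub struct CookiePool {
    queue: VecDeque<Cookie>,
}

impl CookiePool {
    pub fn new(values: &[&str]) -> Self {
        let queue = values
            .iter()
            .map(|v| Cookie { value: v.to_string(), status: CookieStatus::Valid })
            .collect();
        Self { queue }
    }

    /// Return the next valid cookie, rotating the queue as it goes.
    /// Returns `None` if no valid cookie is left.
    pub fn next_valid(&mut self) -> Option<String> {
        for _ in 0..self.queue.len() {
            let cookie = self.queue.pop_front()?;
            let usable = cookie.status == CookieStatus::Valid;
            let value = cookie.value.clone();
            self.queue.push_back(cookie);
            if usable {
                return Some(value);
            }
        }
        None
    }

    /// Reclassify a cookie; invalid cookies are dropped from the pool.
    pub fn mark(&mut self, value: &str, status: CookieStatus) {
        if status == CookieStatus::Invalid {
            self.queue.retain(|c| c.value != value);
        } else if let Some(c) = self.queue.iter_mut().find(|c| c.value == value) {
            c.status = status;
        }
    }
}
```

Each request takes a cookie from `next_valid`, and the response outcome feeds back through `mark`, so rate-limited accounts are skipped and rejected ones stop consuming rotation slots.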
Full Platform Compatibility
- Statically compiled Rust, single-binary deployment, no runtime dependencies required
- Native support for macOS/Android and other platforms
- Extremely low memory usage (only single-digit MB)
- No need for virtual machines or complex dependencies
Enhanced Features
- Built-in proxy server support (no TUN required)
- Concurrent cache request handling
- Additional Gemini support:
  - Google AI Studio and Google Vertex AI
  - OpenAI-compatible mode / Gemini format
  - Painless HTTP Keep-Alive support
- Additional Claude support:
  - OpenAI-compatible mode / Claude format
  - Extended thinking
  - Stop sequences implemented on the proxy side
  - Image attachment uploads
  - Web search
  - Claude Max
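
Proxy-side stop sequences can be sketched as a truncation step applied to the generated text before it is returned to the client. The function name `apply_stop_sequences` and its signature are assumptions for illustration, not the project's actual implementation.

```rust
/// Truncate `text` at the earliest occurrence of any stop sequence.
/// Returns the kept prefix and the stop sequence that matched, if any.
/// Hypothetical helper: a sketch of the idea, not the project's code.
pub fn apply_stop_sequences<'t, 's>(
    text: &'t str,
    stops: &[&'s str],
) -> (&'t str, Option<&'s str>) {
    let mut earliest: Option<(usize, &'s str)> = None;
    for &stop in stops {
        // An empty stop sequence would match at position 0; ignore it.
        if stop.is_empty() {
            continue;
        }
        if let Some(pos) = text.find(stop) {
            // Keep whichever stop sequence appears first in the text.
            if earliest.map_or(true, |(best, _)| pos < best) {
                earliest = Some((pos, stop));
            }
        }
    }
    match earliest {
        Some((pos, stop)) => (&text[..pos], Some(stop)),
        None => (text, None),
    }
}
```

This version works on a complete response. With streaming output, a stop sequence may be split across chunks, so a proxy would need to buffer a tail at least as long as the longest stop sequence before emitting text.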