Fix
- Adds handling for network resets during long polls.
Improvement
- Accommodates new changes for LangGraph handlers.
// pip
pip install maxim-py==3.4.11
// UV
uv add maxim-py==3.4.11
New
- New cookbook to use dSPY evaluators using Maxim SDK
// pip
pip install maxim-py==3.4.11
// UV
uv add maxim-py==3.4.11
Whether you're building dynamic prompts, injecting structured context, or running personalized prompt tests at scaleāJinja2 syntax makes it easier than ever to craft powerful, composable prompts right inside our Prompt playground.
With Jinja2 support, you can structure your prompts like actual codeāloop through lists, add conditional statements, and dynamically inject variablesāall while keeping your prompt logic clean and reusable. Follow this demo to learn more.
Obsessed with Grok? As a step towards becoming your one-stop solution for all experimentation and evaluation needs, weāve added support for xAI modelsāgiving you a greater selection pool and flexibility to test, compare, and refine your AI workflows.
No complex setupājust plug in your API key and start building in minutes.
Google's latest Gemini 2.5 Pro model is now available on Maxim. Leverage its advanced reasoning and multimodal capabilities to design custom evaluators and prompt experiments.
Start using this model via the Google provider:
ā
Go to Settings > Models > Google and add Gemini 2.5 Pro Experimental
Tired of managing your secrets across multiple places? Vault is a centralized, encrypted home for all your sensitive variablesāfrom API keys to tokens. Use your secrets in Maxim workflows or API evals by simply referencing them from Vault.
All secrets are encrypted at rest and in transit using RSA 2048-bit encryption, ensuring that even if someone gains access to the system, your data remains protected without the private key. Vault is built for robust security and peace of mind.
While our SDK support has made it seamless to work with Maxim, weāve now introduced a comprehensive REST APIāgiving you even more flexibility to integrate Maxim into your existing workflows. Use Maxim APIs to:
Teams today spend countless hours manually going back and forth with their agents to assess quality and identify failure modes. And itās not a one-time thingā unintended regressions happen all the time, whether due to model updates or optimizations based on user feedback.
Introducing Maximās agent simulation! Now, you can evaluate your AI agents in just three simple steps:
1ļøā£ Define the real-world scenarios, user personas, and any other context you want to test your agents on.
2ļøā£ Pick the right evaluators, from predefined evaluators to custom metrics or human reviews, for your use case and trigger a test run.
3ļøā£ Analyze your agentās performance, debug issues, and iterate.
The best part? Simply bring your agents via an API endpoint and get started in minutes!
Teams store sensitive customer data, API keys, and production logs on the Maxim platformāand we take that trust seriously. To enhance account security, Maxim now supports two-factor authentication (2FA), adding an extra layer of protection against unauthorized access to your data.
Settings ā”ļø Organization info ā”ļø Two-factor authentication.
Our new Prompt Diff feature lets you compare two prompt versions side-by-side with a Git diff-style view revealing every tweak. See how your prompt is evolving over time and correlate changes in performance with prompt changes.
Need help? With our in-app support, you can get assistance, ask questions, and share feedback ā all with a click of a button. With 24/7 support, weāre here to ensure youāre never blocked and can stay focused on building, testing, and scaling your AI workflows.