Remote Agents in Vibe: Fueled by Mistral Medium 3.5

Until now, coding agents have mostly been tied to our laptops. That is changing as agents move to the cloud: coding tasks can be started remotely, run autonomously and in parallel, and notify you when they finish. You can start them from the Mistral Vibe CLI or directly within Le Chat, offloading coding work without interrupting your conversation.

At the heart of this release is Mistral Medium 3.5, currently in public preview. It is the new default model in both Mistral Vibe and Le Chat, and it is designed for long-running coding and productivity tasks. The new Work mode in Le Chat (also in preview) builds on it with an agent that handles complex, multi-step tasks such as research, analysis, and cross-tool operations.

Highlights of Mistral Medium 3.5

  1. Mistral Medium 3.5 is our new flagship model: a 128-billion-parameter dense model that combines instruction-following, reasoning, and coding in a single model, released with open weights under a modified MIT license.
  2. Real-world performance is strong, and the model can be self-hosted on as few as four GPUs.
  3. Mistral Vibe introduces remote agents for asynchronous coding. Sessions run in the cloud and can be started from the CLI or Le Chat, and an ongoing local CLI session can be teleported to the cloud with its history intact.
  4. Coding tasks started in Le Chat run in the remote runtime while you attend to other matters.
  5. Work mode in Le Chat uses Mistral Medium 3.5 to complete complex multi-step tasks, calling tools in parallel until the task is finished.

What is Mistral Medium 3.5?

Mistral Medium 3.5 is our first flagship merged model, available in public preview. The 128B model has a 256k context window and handles instruction-following, reasoning, and coding in a single model. It performs well in real-world use and can be self-hosted on as few as four GPUs. We built the vision encoder from scratch to handle varying image sizes and aspect ratios.
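
To put the four-GPU figure in context, here is a rough back-of-the-envelope estimate of the memory needed for the weights alone. The bytes-per-parameter values are assumptions (the article does not state the serving precision), and the estimate ignores the KV cache, activations, and runtime overhead, so real requirements will be higher.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory needed for the weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


NUM_PARAMS = 128e9  # 128B dense parameters, as stated in the article
NUM_GPUS = 4        # the four-GPU figure quoted for self-hosting

# Assumed precisions; the article does not say which one is used for serving.
for label, bytes_per_param in [("16-bit", 2), ("8-bit", 1)]:
    total = weight_memory_gb(NUM_PARAMS, bytes_per_param)
    print(f"{label}: {total:.0f} GB total, {total / NUM_GPUS:.0f} GB per GPU")
```

Under these assumptions, 16-bit weights come to 256 GB in total, or 64 GB per GPU when split evenly across four devices.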

Mistral Medium 3.5 scores 77.6% on SWE-Bench Verified, ahead of models such as Devstral 2 and Qwen3.5 397B A17B. It also scores 91.4 on τ³-Telecom, which reflects its agentic capabilities.

The model is well suited to long-horizon tasks: it reliably calls multiple tools and produces structured outputs that downstream code can consume directly. This reliability is what makes asynchronous cloud agents in Vibe practical.
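
As an illustration of what "consumed by downstream code" can look like, the sketch below parses a JSON report into typed records. The report shape (`changes`, `path`, `summary`) and the names `FileChange` and `parse_agent_report` are hypothetical, made up for this example; they are not a documented Mistral output format.

```python
import json
from dataclasses import dataclass


@dataclass
class FileChange:
    path: str
    summary: str


def parse_agent_report(raw: str) -> list[FileChange]:
    """Parse a JSON report; a missing required field raises KeyError."""
    data = json.loads(raw)
    return [
        FileChange(path=item["path"], summary=item["summary"])
        for item in data["changes"]
    ]


# Hypothetical structured output from an agent run.
raw_output = (
    '{"changes": [{"path": "src/app.py", '
    '"summary": "Refactor config loading"}]}'
)

for change in parse_agent_report(raw_output):
    print(f"{change.path}: {change.summary}")
```

Failing on a missing field, rather than silently defaulting, keeps malformed output from propagating into later steps of a pipeline.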

Vibe Remote Agents

With remote agents in Vibe, coding sessions can work through long tasks while you focus on other priorities. Multiple agents can run in parallel, so you are no longer the bottleneck at each step. Cloud agents can be launched from the Mistral Vibe CLI or Le Chat, and you can follow what each agent is doing through file diffs, tool calls, and progress updates. An ongoing local CLI session can also be moved to the cloud with its history and task state preserved.

Vibe integrates directly with the systems developers already use every day and allows human intervention where necessary. It connects to GitHub for code management, Linear and Jira for issue tracking, Sentry for incident management, and Slack and Teams for updates. Each coding session runs in an isolated sandbox, where the agent can make broad edits and install dependencies. When a task completes, the agent can open a pull request on GitHub and notify you, so you review the outcome rather than every keystroke.

This configuration suits high-volume, well-defined developer work: module refactors, test generation, dependency upgrades, continuous integration evaluations, and bug fixes.

Vibe remote agents run on Workflows orchestrated through Mistral Studio. The setup was first developed for internal use and then extended to enterprise customers. It is now available to all users, who can start coding tasks from the web as well as the CLI.

Coding sessions can also be launched directly in Le Chat: a task described in the conversation is handed off to the cloud and comes back later as a completed branch or draft pull request.

Introducing Work Mode in Le Chat (Preview)

Work mode in Le Chat is a new agentic mode for complex tasks. Powered by a new harness and Mistral Medium 3.5, the agent serves as the assistant's execution backend: it can read and write, and it uses multiple tools concurrently, working on a task until it is complete.

Here’s a glimpse of what Work mode enables you to accomplish today:

  1. Cross-tool workflows: catch up on emails, messages, and calendars in one pass, or prepare for a meeting by gathering context and relevant information.
  2. Research and synthesis: dig into a topic across the web and internal documents, then produce a brief or report you can refine before sharing.
  3. Inbox and team work: triage your inbox and draft replies, create Jira issues from discussions, and summarize communication for your team on Slack.

A Work mode session runs longer than a traditional chat exchange, which lets the agent work through complex, iterative problems. Every action it takes is visible: you can see tool calls and the rationale behind each decision, and sensitive actions such as sending messages or modifying data require explicit permission.
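
The combination of parallel tool calls and explicit permission for sensitive actions can be sketched with `asyncio`. This is an illustrative pattern only, not how Work mode is implemented: the tool names, `SENSITIVE_TOOLS`, `call_tool`, and `run_step` are all hypothetical, and the tool call itself is simulated with a short sleep.

```python
import asyncio

# Hypothetical: tools that must be approved by the user before running.
SENSITIVE_TOOLS = {"send_message"}


async def call_tool(name: str, argument: str) -> str:
    """Simulated tool call; the sleep stands in for network I/O."""
    await asyncio.sleep(0.01)
    return f"{name}({argument}) done"


async def run_step(calls, approve):
    """Run calls concurrently, skipping sensitive ones the user rejects."""
    allowed = [
        (name, arg)
        for name, arg in calls
        if name not in SENSITIVE_TOOLS or approve(name, arg)
    ]
    # gather() runs the coroutines concurrently and keeps input order.
    return list(await asyncio.gather(*(call_tool(n, a) for n, a in allowed)))


calls = [
    ("search_web", "quarterly report"),
    ("read_calendar", "today"),
    ("send_message", "draft reply"),
]

# Here the user declines every sensitive action, so only two calls run.
results = asyncio.run(run_step(calls, approve=lambda name, arg: False))
print(results)
```

In this sketch the approval check happens before anything is scheduled, so a rejected action never starts.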

Getting Started with Mistral Medium 3.5

Mistral Medium 3.5 is available today in both Mistral Vibe and Le Chat, where it powers remote coding agents and the new Work mode for Pro, Team, and Enterprise plans. API pricing is $1.5 per million input tokens and $7.5 per million output tokens. Open weights are hosted on Hugging Face under a modified MIT license.
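
For a sense of scale, the per-token prices translate into request costs as follows. The prices come from the paragraph above; the token counts are invented for the example, and the function name `estimate_cost_usd` is hypothetical.

```python
INPUT_PRICE_PER_MILLION = 1.5   # USD per million input tokens (from the article)
OUTPUT_PRICE_PER_MILLION = 7.5  # USD per million output tokens (from the article)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate API cost in USD for a given number of tokens."""
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000


# Made-up example: a long agent session with 200k input and 20k output tokens.
cost = estimate_cost_usd(200_000, 20_000)
print(f"${cost:.2f}")  # prints $0.45
```

Because output tokens cost five times as much as input tokens, sessions that generate a lot of code will be weighted toward the output side of the bill.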

For those interested in prototyping, Mistral Medium 3.5 is hosted on NVIDIA GPU-accelerated endpoints, available for exploration on build.nvidia.com. It can also be accessed as a scalable containerized inference microservice via NVIDIA NIM.

Join Us at the Frontiers of Coding Agents

We are on the lookout for talented individuals across research, engineering, and product roles to push the boundaries of agentic systems. Explore our current openings and consider becoming part of this revolutionary journey.
