Releases: modular/modular
Modular 25.5
Modular Platform 25.5 is here, introducing Large Scale Batch Inference: a highly asynchronous, at-scale batch API built on open standards and powered by Mammoth.
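Because the API is built on open standards, a client-side sketch can mirror the OpenAI-compatible Batch workflow. The endpoint URL, API key, and input file below are placeholders for illustration, not a confirmed Modular interface:

```python
# Hedged sketch: submit an asynchronous batch job through an
# OpenAI-compatible Batch API. The base_url, key, and file name are
# placeholders, not a confirmed Modular endpoint.
from openai import OpenAI

client = OpenAI(base_url="https://your-endpoint.example.com/v1", api_key="YOUR_KEY")

# requests.jsonl holds one JSON request per line (e.g. chat completions).
batch_file = client.files.create(file=open("requests.jsonl", "rb"), purpose="batch")

# Submit the batch; results are produced asynchronously and fetched later.
batch = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)
print(batch.id, batch.status)  # poll later with client.batches.retrieve(batch.id)
```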
This release also features the open source launch of the MAX Graph API and expanded support for writing custom PyTorch operators directly in MAX. In addition, we've made Modular Platform development and deployment easier with optimized Docker containers and new standalone Mojo Conda packages. Check out all of 25.5's updates in the full MAX and Mojo changelogs.
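With the MAX Graph API now open source, a minimal build-and-run sketch looks roughly like this. It assumes the Python max.graph quickstart shape from the MAX docs; exact signatures vary between releases:

```python
# Hedged sketch of the open-source Python MAX Graph API: build a tiny
# graph that adds two tensors, then compile and run it. Names follow the
# MAX docs quickstart; exact signatures vary between releases (for
# example, older releases omit the TensorType device argument).
import numpy as np
from max import engine
from max.dtype import DType
from max.graph import DeviceRef, Graph, TensorType, ops

input_type = TensorType(DType.float32, (2,), device=DeviceRef.CPU())
with Graph("simple_add", input_types=(input_type, input_type)) as graph:
    lhs, rhs = graph.inputs
    graph.output(ops.add(lhs, rhs))

session = engine.InferenceSession()
model = session.load(graph)
print(model.execute(np.ones(2, np.float32), np.ones(2, np.float32)))
```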
Modular 25.4
We're excited to announce Modular Platform 25.4, a major release that brings the full power of AMD GPUs to our entire platform. This marks a significant leap toward democratizing access to high-performance AI by enabling seamless portability to AMD GPUs. Developers can now build and deploy models optimized for peak performance with zero reliance on any single hardware vendor, unlocking greater flexibility, lower costs, and broader access to compute.
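In practice, this kind of portability means device selection changes while model code does not. A minimal sketch, assuming the max.driver names (accelerator_count, Accelerator, CPU) used in MAX example code:

```python
# Hedged sketch: pick whatever accelerator is present (NVIDIA or AMD) and
# fall back to CPU. Names are taken from MAX example code and are
# assumptions about this release's exact API.
from max.driver import CPU, Accelerator, accelerator_count

device = Accelerator() if accelerator_count() > 0 else CPU()
print(f"Running on: {device}")
# Model-loading code then targets `device`, regardless of GPU vendor.
```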
For more details, see the 25.4 changelog and the release blog post.
Modular 25.3
Modular Platform's 25.3 release introduces a unified pip install modular package, granting access to both Mojo and MAX. This release open-sources the MAX Kernels and Serving APIs, totaling over 500,000 lines of code. Google Colab support is now available, enabling execution of MAX models, and a simplified community license for MAX and Mojo lowers the barrier to entry. This update reflects a commitment to building in the open and putting the community first.
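A quick, standard-library-only way to confirm the unified package landed (only the distribution name modular comes from this release note):

```python
# Verify the unified package after running: pip install modular
from importlib.metadata import version

print(version("modular"))  # prints the installed Modular Platform version
```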
For additional details, check out the changelog.
MAX 25.2
Announcing MAX 25.2, featuring significant enhancements for large-scale AI deployment and GPU optimization. This release adds comprehensive NVIDIA Hopper support with high-performance kernels, multi-GPU tensor parallelism for large models like Llama-3.3-70B, and expanded model support (Phi3, Olmo, Granite). Key additions include GPTQ quantization for memory efficiency, advanced long-context optimizations (in-flight batching, chunked prefill, copy-on-write), and improved kernel caching that cuts compilation times by up to 28%. New Mojo GPU APIs offer developers greater control and performance.
For additional details, check out the changelog.
Mojo 25.1
We're excited to announce the release of MAX 25.1, marking a significant evolution in our approach to delivering cutting-edge AI development tools to our community. This release substantially improves the developer experience for agentic and LLM workflows, introduces a new nightly release model that includes a new GPU programming interface, and launches MAX Builds: your one-stop destination for GenAI development.
For additional details, check out the changelog.
Mojo 24.6
We are excited to announce the release of MAX 24.6, featuring a preview of MAX GPU! At the heart of this release is MAX GPU: the first vertically integrated Generative AI serving stack that eliminates the dependency on vendor-specific computation libraries like NVIDIA's CUDA.
MAX GPU is built on two groundbreaking technologies. The first is MAX Engine, a high-performance AI model compiler and runtime built with innovative Mojo GPU kernels for NVIDIA GPUs, free from CUDA or ROCm dependencies. The second is MAX Serve, a sophisticated Python-native serving layer specifically engineered for LLM applications. MAX Serve expertly handles complex request batching and scheduling, delivering consistent and reliable performance, even under heavy workloads.
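A natural smoke test for a serving layer like this is an OpenAI-style client request. The sketch below assumes an OpenAI-compatible endpoint on localhost:8000 and uses a placeholder model name:

```python
# Hedged sketch: query a locally running MAX Serve instance through an
# OpenAI-compatible client. URL, port, and model id are placeholders.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="modularai/llama-3.1",  # placeholder model id
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```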
For additional details, check out the changelog and the release announcement.
Mojo 24.5
We are excited to announce the release of MAX 24.5! This release includes support for installing MAX as a conda package with magic, a powerful new package and virtual environment manager. We're also introducing two new Python APIs for MAX Graph and MAX Driver, which will ultimately provide the same low-level programming interface as the Mojo Graph API. MAX Engine performance has improved for Llama3, with 24.5 generating tokens on average 15% to 48% faster. Lastly, this release adds support for Python 3.12 and drops support for Python 3.8 and Ubuntu 20.04.
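Of the two new Python APIs, MAX Driver is the lower-level device-and-tensor interface. A minimal sketch, assuming the Tensor.from_numpy and CPU names seen in MAX example code (signatures may differ by release):

```python
# Hedged sketch of the MAX Driver Python API: allocate a tensor from NumPy
# on a device. Names mirror MAX example code; exact signatures are
# assumptions and may have changed between releases.
import numpy as np
from max.driver import CPU, Tensor

device = CPU()
t = Tensor.from_numpy(np.ones((2, 2), dtype=np.float32)).to(device)
print(t.shape)
```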
For additional details, check out the changelog and the release announcement.