Arcee Trinity Models: Rebooting US Open Source AI with Apache 2.0

by Anika Shah - Technology
0 comments

For much of 2025, the frontier of open-weight language models has been defined not in Silicon Valley or New York City, but in Beijing and Hangzhou.

Chinese research labs including Alibaba’s Qwen, DeepSeek, Moonshot and Baidu have rapidly set the pace in developing large-scale, open Mixture-of-Experts (MoE) models – often with permissive licenses and leading benchmark performance. While openai fielded its own open source, general purpose LLM this summer as well – gpt-oss-20B and 120B – the uptake has been slowed by so many equally or better performing alternatives.

Now, one small U.S. company is pushing back.

Today, Arcee Ai Announced the release of Trinity Mini and Trinity Nano Preview, the first two models in its new “Trinity” family-an open-weight MoE model suite fully trained in the United States.

Users can try the former directly for themselves in a chatbot format on Acree’s new website, chat.arcee.aiand developers can download the code for both models on Hugging Face and run it themselves,as well as modify them/fine-tune to their liking – all for free under an enterprise-kind Apache 2.0 license.

While small compared to the largest frontier models, these releases represent a rare attempt by a U.S. startup to build end-to-end open-weight models at scale-trained from scratch, on American infrastructure, using a U.S.-curated dataset pipeline.

“I’m experiencing a combination of extreme pride in my team and crippling exhaustion, so I’m struggling to put into words just how excited I am to have these models out,” wrote Arcee Chief Technology Officer (CTO) Lucas Atkins in a post on the social network X (formerly Twitter). “Especially Mini.”

A third model, Trinity Large, is already in training: a 420B parameter model with 13B active parameters per token, scheduled to launch in January 2026.

“We want to add something that has been missing in that picture,” Atkins wrote in the Trinity launch manifesto published on Arcee’s website.”A serious open weight model family trained end“`html



Arcee’s Trinity Mini: A High-Performing Open-Source LLM

Arcee’s Trinity Mini: A High-Performing open-Source LLM

Arcee has released Trinity Mini, a 7 billion parameter language model (LLM) designed for performance and accessibility.built with a Mixture-of-Experts (MoE) architecture, Trinity Mini aims to deliver strong reasoning capabilities in a relatively compact size. A smaller variant, Trinity Nano, is also available, demonstrating the viability of sparse MoE architectures with under 1 billion active parameters per token.

Chart showing performance of Arcee Trinity Mini LLM compared to other slightly larger parameter models. Credit: Arcee

Benchmarks show Trinity Mini performing competitively with larger models across reasoning tasks, including outperforming gpt-oss on the SimpleQA benchmark (tests factual recall and whether the model admits uncertainty), MMLU (Zero shot, measuring broad academic knowledge and reasoning across many subjects without examples), and BFCL V3 (evaluates multi-step function calling and real-world tool use):

  • MMLU (zero-shot): 84.95

  • Math-500: 92.10

  • GPQA-Diamond: 58.55

  • BFCL V3: 59.67

Latency and throughput numbers across providers like Together and Clarifai show 200+ tokens per second throughput with sub-three-second E2E latency-making Trinity Mini viable for interactive applications and agent pipelines.

Access, Pricing, and Ecosystem Integration

Both Trinity models are released under the permissive, enterprise-friendly, Apache 2.0 license allowing unrestricted commercial and research use. Trinity Mini is available via:

API pricing for Trinity Mini via OpenRouter:

  • $0.045 per million input tokens

  • $0.15 per million output tokens

  • A free tier is available for a limited time on openrouter

The model is already integrated into apps including Benchable.ai, Open WebUI, and SillyTavern. it’s supported in Hugging Face Transformers, VLLM, LM Studio, and llama.cpp.

Data Without Compromise: DatologyAI’s Role

Central to Arcee’s approach is control over training data-a sharp contrast to many open models trained on web-scraped or legally ambiguous datasets. That’s where DatologyAI, a data curation startup co-founded by former Meta and DeepMind researcher Ari Morcos, plays a critical role.

DatologyAI’s platform automates data filtering, deduplication, and quality enhancement across modalities, ensuring Arcee’s training corpus avoids the pitfalls of noisy, biased, or copyright-risk content.

For Trinity, DatologyAI helped construct a 10 trillion token curriculum organized into three phases: 7T general data, 1.8T high-quality text, and 1.2T STEM-heavy material,including math and code.

This is the same partnership that powered Arcee’s AFM-4.5B-but scaled substantially in both size and complexity. According to Arcee, it was Datology’s filtering and data-ranking tools that allowed Trinity to scale cleanly while improving performance on tasks like mathematics, QA, and agent tool use.

Datology’s contribution also extends into synthetic data generation. For Trinity Large,the company has produced over 10 trillion synthetic

Related Posts

Leave a Comment