One trillion parameters. 37 billion active per token. A million-token context window. $0.42 per million output tokens. And not a single NVIDIA chip was used to […]
$0.07 per million tokens: DeepSeek's cloud pricing has shattered the industry's price floor. At roughly 1/140th of GPT-4 Turbo's $10 per million, […]
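A quick sanity check on that multiplier, assuming the two quoted figures are directly comparable per-million-token prices:

$$\frac{\$10.00}{\$0.07} \approx 142.9,$$

so "roughly 1/140th" is consistent with the numbers as quoted, rounded slightly down.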
What if a 500-billion-parameter model could match frontier performance while activating only 50 billion parameters at inference time? That is not a hypothetical; it is […]
Finally: an open model family that doesn't force you to choose between accuracy and throughput. NVIDIA Nemotron 3 just dropped with a hybrid Mamba-Transformer MoE […]
A 109-billion-parameter multimodal AI model that fits on a single GPU. Meta actually pulled it off. Llama 4 Scout, unveiled in April 2025, runs […]
Three major model drops from Alibaba in a single month: Qwen3-Max, Qwen3-Next, and Qwen3-Omni shipped in rapid succession. Meanwhile, DeepSeek quietly published V3.2-Exp with a sparse attention […]
A 132B-parameter model activates just 36B parameters at inference, and still outperforms models with nearly twice its active parameter count. That is not a theoretical claim. […]
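To put "nearly twice its active parameter count" in numbers, reading that phrase as a multiple of the 36B active parameters:

$$2 \times 36\text{B} = 72\text{B},$$

i.e., the claim is that it competes with models in the roughly 70B-parameter class while activating only half as many parameters per token.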