Llama: AI News Week Ending 04/17/2026

Model	Size/Class	Format	Hosted Provider	Best Local Path	Notes
Huihui Gemma 4 E2B Abliterated v2	E2B	GGUF	No	Ollama / llama.cpp	Gemma 4 MoE with ~2B active params. Multimodal (image+text in, text out). Abliterated for reduced refusal. Lightweight enough to run fast, but MoE active-param sizing means quality punches above its weight class.
Huihui Gemma 4 E4B Abliterated	E4B	GGUF	No	Ollama / llama.cpp	Same Gemma 4 MoE family as E2B but with ~4B active params. Multimodal. Better quality ceiling than E2B at the cost of more compute per token.
SultrySilicon V2	7B	GGUF	No	Ollama / llama.cpp	Roleplay-focused 7B model. Smallest in the set. Good for quick creative/RP sanity checks, not for reasoning or instruction-following benchmarks.
Huihui-GLM-4.6V-Flash-Abliterated	9B	GGUF	No	Ollama / llama.cpp	Based on Z.ai GLM-4.6V-Flash. Vision-language model (image+text). Abliterated. Bilingual Chinese/English. Fast inference variant of the GLM-4.6V family.
Gemma-2-Ataraxy-9B	9B	GGUF	No	Ollama / llama.cpp	Merge of Gemma-2-9B-SimPO and Gemma-2-Gutenberg-9B. Creative writing and roleplay oriented. Scored well on EQ-Bench. Good balance of instruction-following and literary quality at 9B.
MythoMax-L2-13B	13B	GGUF	No	Ollama / llama.cpp	By Gryphe. Llama 2 merge of MythoLogic-L2 and Huginn using experimental per-tensor gradient merging. One of the most downloaded RP/creative models ever (~59k GGUF downloads). Strong at both roleplay and storywriting. Alpaca format. The OG.
Dan’s PersonalityEngine V1.3.0	24B	GGUF	No	Ollama / llama.cpp	Fine-tuned from Mistral Small 3.1 24B Base. Trained on a massive mix: roleplay, storywriting, tool use, math, reasoning, code, medical, legal, and survival topics. Multilingual (EN, AR, DE, FR, ES, HI, PT, JA, KO). A genuine generalist with personality.
SuperGemma4 26B Abliterated Multimodal	26B multimodal	GGUF	No	custom multimodal stack	Based on Gemma 4 26B-A4B. Multimodal (image-text-to-text). Abliterated with low refusal. Optimized for Apple Silicon (MLX). Supports Korean + English. Tool use and coding tags.
Gemma 3 27B Abliterated	27B	GGUF	No	Ollama / llama.cpp	Abliterated version of Google’s Gemma 3 27B instruct. Multimodal (image-text-to-text). Reduced refusal behavior while preserving instruction-following quality.
Huihui Gemma 4 31B Abliterated	31B	GGUF	No	Ollama / llama.cpp	Abliterated Gemma 4 31B instruct. Multimodal (any-to-any pipeline tag). Dense 31B, not MoE. Strongest Gemma 4 dense abliterated option.
Gemma 4 31B Abliterated	31B	GGUF + safetensors	No	Ollama / llama.cpp	Same base as above (Gemma 4 31B-it) but different abliteration method using mlabonne’s harmful_behaviors + harmless_alpaca datasets. Both formats in one repo.
Huihui-Qwen3.5-35B-A3B-Claude-4.6-Opus-Abliterated	35B A3B	GGUF	No	Ollama / llama.cpp	Qwen 3.5 MoE (35B total, ~3B active). Distilled from Claude 4.6 Opus reasoning. Chain-of-thought and reasoning-focused. Abliterated. Multimodal. Punches well above its active param count on reasoning tasks.
Midnight Rose 70B v2.0.3	70B	GGUF	No	Ollama / llama.cpp	By sophosympatheia. Complex multi-stage SLERP/DARE-TIES merge of WizardLM, Tulu-2-DPO, Dolphin, and earlier Midnight Rose versions. Uncensored. Designed for roleplay and storytelling. Scored surprisingly high on EQ-Bench even at low quants. ~6k context sweet spot.
Midnight Miqu 70B v1.5	70B	GGUF	No	Ollama / llama.cpp	Llama-family merge of Midnight-Miqu v1.0 and Tess-70B. Creative writing and roleplay focused. 32k context. Known for strong prose quality and character consistency at 70B scale.
Midnight Rose 103B v2.0.3	103B	GGUF	No	heavy self-host	Same lineage as the 70B but scaled up. Importance-matrix GGUF by mradermacher. Firmly in the “need real hardware” category.
DeepSeek V3	671B A37B	safetensors	Yes: DeepInfra, Novita	Hosted preferred	Massive MoE. 671B total, 37B active. Strong on code, math, and instruction-following. Pre-trained on ~15T tokens. Use via OpenRouter, not locally.
DeepSeek V3.2	685B A37B	safetensors	No confirmed provider yet	Hosted preferred	Successor to V3. Same general architecture class. Not a local play.
Behemoth-123B-v1	123B	GGUF	No	heavy self-host	Mistral-family 123B. Creative/RP community model. Massive parameter count makes it impractical for casual local use but prized for output quality in the r/LocalLLM community.
Monstral-123B	123B	GGUF	No	heavy self-host	Mistral-family 123B. Text generation and chat focused. Same weight class as Behemoth, different training mix and community lineage.
BlackSheep-Large	~27B	GGUF	No	Ollama / llama.cpp	By TroyDoesAI. Canonical repo is gated. Q8_0 is ~29.5 GB, placing it in the 27B-class. Community RP/creative model.

Ethan B. Holland

Leave a ReplyCancel reply

AI News #137: Week Ending May 15, 2026 with 44 Executive Summaries

AI News #136: Week Ending May 08, 2026 with 61 Executive Summaries

AI News #135: Week Ending May 01, 2026 with 54 Executive Summaries

AI News #134: Week Ending April 24, 2026 with 49 Executive Summaries

AI News #133: Week Ending April 17, 2026 with 50 Executive Summaries

AI News #132: Week Ending April 10, 2026 with 54 Executive Summaries

AI News 131: Week Ending April 03, 2026 with 48 Executive Summaries

AI News 130: Week Ending March 27, 2026 with 83 Executive Summaries

AI News 129: Week Ending March 20, 2026 with 60 Executive Summaries

AI News #128: Week Ending March 13, 2026 with 49 Executive Summaries

AI News #127: Week Ending March 06, 2026 with 32 Executive Summaries

AI News #126: Week Ending February 27, 2026 with 37 Executive Summaries

AI News #125: Week Ending February 20, 2026 with 49 Executive Summaries

AI News #124: Week Ending February 13, 2026 with 35 Executive Summaries

Vail YOLO Adventure Part II: Reuniting with the Gore Range and Sending Off My Daughter 30 Years Later

My YOLO Vail Story – 19 Years Old With a One Way Ticket and No Plan

My friend Mike’s gift of love

Eulogy for Michael Bernstein, my buddy

AI News #89: Week Ending June 13, 2025 with 33 Executive Summaries, Top 45 Links, and 7 Helpful Visuals

AI News #86: Week Ending May 23, 2025 with 18 Executive Summaries, Top 93 Links, and 12 Helpful Visuals

AI News #85: Week Ending May 16, 2025 with 25 Executive Summaries, Top 56 Links, and 10 Helpful Visuals

Delaware Technical and Community College – Artificial Intelligence Keynote – Ethan Holland – April 2025

AI News #82: Week Ending April 25, 2025 with 35 Executive Summaries, Top 67 Links, and 2 Helpful Visuals

Always be kind. To everyone.

Billie Eilish – Bag Guy

Cage The Elephant

Handmade memes had a posse

Apple is pulling a Braveheart and can change the way we use phones whenever they choose

The AI Future: Exploring the Adjacent Possible with Emerging AI Solutions

Chesapeake AP Broadcasters Association 2026 Convention: AI Trends and Demos – Ethan Holland

Zhipu AI: AI News Week Ending 05/15/2026

Trending

Chesapeake AP Broadcasters Association 2026 Convention: AI Trends and Demos – Ethan Holland

Zhipu AI: AI News Week Ending 05/15/2026

Qwen: AI News Week Ending 05/15/2026

HuggingFace: AI News Week Ending 05/15/2026

Llama: AI News Week Ending 04/17/2026

Share this:

Like this:

Leave a ReplyCancel reply

Trending

Discover more from Ethan B. Holland