tag: models
2026-03-31
Qwen3.5-35B A3B Uncensored — HauhauCS (Aggressive)
Hugging Face model page for "Qwen3.5-35B A3B Uncensored" by HauhauCS, an uncensored, aggressively tuned 35B variant of Qwen3.5. Use with caution; may produce unsafe or disallowed outputs.
2026-04-01
free-coding-models — vava-nessa
Community-curated list of free/open coding models, checkpoints and resources for local code generation, research and experimentation.
Introducing Mercury 2
InceptionLabs announces Mercury 2, a new-generation model focused on improved reasoning, multimodal capabilities, and efficiency for production deployments. Blog post with technical highlights and links to model cards and docs.
2026-04-02
LFM2.5-350M — 350M model trained on 28T tokens
Announcement of LFM2.5-350M: a 350M‑parameter model trained on ~28T tokens aimed at reliable data extraction and tool use. Under 500MB when quantized, optimized for constrained compute, memory and low latency; highlights agentic loop capabilities at small scale.
PrismML — Bonsai 1‑bit 8B (launch announcement)
PrismML emerges from stealth and announces the Bonsai family: 1‑bit Bonsai 8B (≈1.15 GB), plus 4B and 1.7B variants. The tweet highlights extreme compression for high "intelligence density", edge deployment, and open‑sourcing under Apache‑2.0.
2026-04-03
Gemma 4 model page
Official Google DeepMind page for Gemma 4, covering model family details, capabilities, and release information.
Gemma 4 on YouTube
Video overview of Gemma 4.
Unsloth releases Gemma 4 31B Instruct GGUF on Hugging Face
Unsloth published Gemma 4 31B Instruct in GGUF format on Hugging Face for easier local inference in llama.cpp-compatible runtimes.
2026-04-07
DeepSeek V4 model will run entirely on Huawei AI chips
Huawei Central report that DeepSeek V4 will run entirely on Huawei AI chips, highlighting model-hardware alignment and domestic AI infrastructure.
2026-04-09
Meta introduces Muse Spark MSL
Meta AI blog post introducing Muse Spark MSL, a new model release or system announcement from Meta.
2026-04-15
llama.cpp
High-performance C/C++ inference engine for running LLMs locally across CPUs and GPUs.
2026-04-20
HY-World 2.0
HY-World 2.0 is a multimodal world model for reconstructing, generating, and simulating 3D worlds, with open-source code and models for world reconstruction.
2026-04-21
Kimi K2.6
Kimi announces Kimi K2.6, an open-source model focused on coding, long-horizon execution, and agent swarm workflows.
2026-05-04