ggml and llama.cpp join Hugging Face & Custom AI chips for fast inference - Hacker News (Feb 20, 2026)
Please support this podcast by checking out our sponsors: - KrispCall: Agentic Cloud Telephony - https://try.krispcall.com/tad - Discover the Future of AI Audio with ElevenLabs - https://try.elevenlabs.io/tad - Build Any Form, Without Code with Fillout. 50% extra signup credits - https://try.fillout.com/the_automated_daily Support The Automated Daily directly: Buy me a coffee: https://buymeacoffee.com/theautomateddaily Today's topics: ggml and llama.cpp join Hugging Face - ggml.ai’s core team, including llama.cpp maintainer Georgi Gerganov, is joining Hugging Face to scale “Local AI” support while keeping ggml-org projects open-source and community-governed. Keywords: ggml, llama.cpp, Hugging Face, GGUF, local inference, open source. Custom AI chips for fast inference - Startup Taalas claims it can compile an AI model into dedicated silicon in about two months, targeting sub-millisecond latency and radically lower cost and power. Keywords: custom silicon, Llama 3.1 8B, tokens/sec, DRAM
Today's Hacker News Topics
- 01
ggml and llama.cpp join Hugging Face
— ggml.ai’s core team, including llama.cpp maintainer Georgi Gerganov, is joining Hugging Face to scale “Local AI” support while keeping ggml-org projects open-source and community-governed. Keywords: ggml, llama.cpp, Hugging Face, GGUF, local inference, open source. - 02
Custom AI chips for fast inference
— Startup Taalas claims it can compile an AI model into dedicated silicon in about two months, targeting sub-millisecond latency and radically lower cost and power. Keywords: custom silicon, Llama 3.1 8B, tokens/sec, DRAM-like density, quantization, inference cost. - 03
Gemini 3.1 Pro and agentic tools
— Google rolls out Gemini 3.1 Pro in preview across AI Studio, Vertex AI, Android Studio, and consumer apps, pitching stronger reasoning and agent-ready workflows. Keywords: Gemini 3.1 Pro, ARC-AGI-2, Antigravity, Gemini API, NotebookLM, reasoning. - 04
Faster diffusion language model decoding
— Together AI proposes Consistency Diffusion Language Models to speed diffusion-style text generation using block-wise causal attention, KV caching, and trajectory distillation. Keywords: diffusion language models, CDLM, KV cache, distillation, latency, GSM8K, MBPP. - 05
Learning codebases with visualization tooling
— A developer shows that building a custom event visualizer can turn an unfamiliar codebase into something understandable, illustrated via Next.js Turbopack and a tricky tree-shaking bug. Keywords: Turbopack, Next.js, SWC, PURE annotations, scope hoisting, visualization. - 06
Web Components as framework alternative
— An argument that the modern browser platform—Custom Elements, Shadow DOM, and events—can handle many UI needs without heavy frameworks, avoiding upgrade churn. Keywords: Web Components, Custom Elements, Shadow DOM, Custom Events, standards, React alternative. - 07
Raspberry Pi Pico 2 extreme overclocking
— Pimoroni pushed the RP2350 in the Raspberry Pi Pico 2 to 800–860+ MHz using voltage mods and dry-ice cooling, noting RISC-V cores slightly outperform ARM per MHz. Keywords: RP2350, Pico 2, overclocking, dry ice, core voltage, CoreMark, RISC-V. - 08
C defer cleanup lands in compilers
— C’s proposed defer cleanup feature is now practical: TS 25755 is finalized and Clang 22 ships support, with GCC implementations emerging and portability fallbacks available. Keywords: C defer, TS 25755, Clang 22, GCC, cleanup, resource safety. - 09
Hokusai sketches rediscovered in Europe
— 103 “lost” Hokusai sketches for an unfinished ‘Great Picture Book of Everything’ resurfaced and were acquired by the British Museum, expanding access through digitization. Keywords: Hokusai, ukiyo-e, rediscovery, British Museum, provenance, digitization. - 10
Austin robotics and acoustic hiring
— 9 Mothers is hiring on-site engineers in Austin across AI, computer vision, robotics, and acoustic DSP roles, with high salary bands and equity. Keywords: hiring, Austin, robotics, computer vision, DSP, machine learning, equity.