Cohere Releases North Mini Code, a 30B-Parameter MoE Model for Agentic Software Engineering
Cohere's first coding-specialized model combines sparse mixture-of-experts architecture with reinforcement learning for agent-based development tasks.
Cohere's first coding-specialized model combines sparse mixture-of-experts architecture with reinforcement learning for agent-based development tasks.
A deluge of low-quality AI-generated pull requests and issues is straining open-source project maintainers, forcing triage decisions that slow security patches and feature development.
Developer releases mitmwall, a mitmproxy-based firewall for intercepting unauthorized data flows from AI agents and supply-chain attacks in local environments.
A new open-source tool lets developers branch LLM inference mid-generation, skip redundant prefill computation, and merge agent outputs—addressing a core bottleneck in multi-agent reasoning systems.
Open-source speech-to-text models and existing LLM subscriptions can replicate Wispr Flow's functionality at zero additional cost, according to Wired AI testing.
Warp open-sources its terminal and partners with OpenAI to build 'Open Agentic Development,' a model where AI agents co-create ~90% of pull requests under human supervision.
Hugging Face enables fully on-device speech-to-speech for Reachy Mini robots using open-source models and cascaded AI pipelines.
Outlines, an open-source Python library, enforces schema compliance in LLM responses using finite-state automata and token-level masking, reducing hallucination and parsing failures.
NanoClaw creators turn down a buyout offer and secure seed funding from Valley Capital Partners, Hugging Face's Clem Delangue, and others, betting on community-driven growth over quick exit.
New CrossEncoder rerankers built on ModernBERT achieve state-of-the-art performance at multiple scales with published training recipes.
Maintainers describe the influx of automated vulnerability submissions as 'almost unmanageable,' prompting debate over AI tooling governance.
Automated vulnerability detection tools are flooding the Linux security mailing list with low-quality reports, making it difficult to triage genuine issues.
A new benchmarking framework evaluates complete AI agent systems—not just models—across six diverse tasks, reporting both quality and cost metrics for practical deployment decisions.
Iceland-based developer Hermann Haraldsson built an open source Bluetooth-connected AMOLED display that shows Claude Code token consumption with pixel-art animations.
IBM releases two Apache 2.0 multilingual embedding models built on ModernBERT, with 32K-token context and coverage for 200+ languages.
DeepSeek is raising its first outside funding at a $45B valuation, led by a Chinese state investment vehicle, to counter researcher poaching.
An open-source project packages self-hosted LLMs, speech-to-text, text-to-speech, and MCP tooling into a single Docker Compose deployment.
OpenAI and five hardware partners release MRC through the Open Compute Project to reduce congestion and hardware-fault disruptions in large GPU clusters.
GitHub user erogol's BlaGPT offers an open-source research sandbox for evaluating LM architectures and components on compact datasets.
Google unveiled the Gemini Enterprise Agent Platform, eighth-generation TPUs, and open model Gemma 4 in a month-spanning push to dominate the agentic AI era.
A new open-source repository walks developers through building a modern large language model from scratch, with every line of code annotated and explained in plain language.
The mlc-ai/web-llm project runs language models entirely inside a browser tab via WebGPU, cutting out server round-trips and keeping user data on-device.
The bitsandbytes library applies 4-bit and 8-bit quantization to PyTorch models, making 70B+ parameter LLMs runnable on consumer GPUs and underpinning the QLoRA fine-tuning wave.
The Zig programming language project has formalized a documented rationale for rejecting AI-generated code contributions, offering a governance model for the broader open-source ecosystem.
A developer-showcased tool called OmniForge aims to bring document intelligence and audio capture together under a single local LLM stack.
Canonical's Ubuntu AI rollout triggers community pushback and demands for opt-out controls, while Ubuntu forks position themselves as AI-free alternatives.
IBM's new trio of fully-dense LLMs reaches 512K-token context and outperforms a larger mixture-of-experts predecessor through rigorous data curation alone.
An open-source project adds structured reasoning about knowledge and uncertainty to LLMs — entirely offline, no API required.