Stable-WorldModel: Reproducible World Model Research with Model-Predictive Co...
World model research with model-predictive control has long suffered from fragmentation – researchers must build their own data pipelines, environment wrappers, planning solvers, and evaluation protocols, making it nearly impossible to compare results across papers. Stable-WorldModel, from a team including...
Microsoft Agent Governance Toolkit: Deterministic Security for AI Agents
AI agents are moving from prototypes to production systems that call tools, browse the web, query databases, and delegate tasks to other agents. Once deployed, these agents make decisions autonomously, and the consequences of a misbehaving agent can be severe....
LiteParse: Fast Lightweight PDF Parsing with Bounding Boxes from LlamaIndex
PDF parsing with bounding boxes has long been a challenge for developers who need to preserve spatial layout information – column structures, table alignments, and positional metadata that most parsers discard. LiteParse, from the LlamaIndex team behind the popular LlamaParse...
Dograh AI: Open-Source Voice Agent Platform with Drag-and-Drop Workflow Builder
Dograh AI: Open-Source Voice Agent Platform with Drag-and-Drop Workflow Builder Dograh AI is an open source voice AI platform that gives developers full control over building, deploying, and managing production voice agents. Unlike proprietary alternatives such as Vapi and Retell,...
Inside Cursor's Official Plugin Ecosystem: Architecture, Patterns, and the Fu...
The Cursor plugin ecosystem is not a random collection of tools. When Cursor published its official plugins repository, it shipped 13 plugins that together define how AI coding agents are extended, orchestrated, and governed. Every single plugin falls into the...
Compound Engineering Plugin: AI Skills and Agents That Make Each Unit of Work...
Compound Engineering Plugin: AI Skills and Agents That Make Each Unit of Work Easier Than the Last The Compound Engineering Plugin by Every is a multi-platform AI development toolkit that ships 37 skills and 44 specialized agents across 10 coding...
Sandcastle: Orchestrate AI Coding Agents in Isolated Sandboxes
Introduction AI coding agent sandbox orchestration has become essential infrastructure for teams running autonomous code generation at scale. When AI agents modify your codebase, they can introduce breaking changes, install conflicting dependencies, or execute arbitrary commands that compromise your development...
AutoResearchClaw: Autonomous Research Pipeline From Idea to Paper
AutoResearchClaw: Autonomous Research Pipeline From Idea to Paper AutoResearchClaw is a fully autonomous and self-evolving research pipeline that transforms a single research idea into a conference-ready academic paper. Developed by AIMING Lab at UNC, this 23-stage Python pipeline handles everything...
12-Factor Agents: Building Reliable LLM-Powered Software
Introduction The 12 factor agents methodology provides a principled framework for building reliable, production-grade LLM-powered software. Created by Dex Horthy, founder of Humanlayer, this methodology argues that the best AI agents are not autonomous loops that run until they hit...
Taste Skill: The Anti-Slop Frontend Framework That Makes AI Agents Design Lik...
Every AI coding agent can build a frontend. The problem is that they all build the same frontend: centered hero with a purple gradient, three equal feature cards, Inter font on slate-900, and an em-dash in every headline. Taste Skill,...
Oh-My-Pi: AI Coding Agent for Terminal with Hash-Anchored Edits
What Is Oh-My-Pi? Oh-My-Pi (omp) is a terminal-based AI coding agent that takes the standard “LLM in the loop” pattern and adds four distinguishing primitives: hash-anchored edits that prevent concurrent agent runs from clobbering each other, LSP integration so edits...
Anthropic Knowledge Work Plugins: Open Source Plugins for Claude Cowork
Anthropic Knowledge Work Plugins: Open Source Plugins for Claude Cowork Anthropic has open-sourced knowledge-work-plugins, a collection of 11 role-specific plugins that transform Claude from a general-purpose assistant into a specialist for your job function. Built for Claude Cowork, Anthropic’s agentic...