A framework for few-shot evaluation of language models.
The OpenAI Agents SDK is a lightweight Python framework for building multi-agent workflows. It supports the OpenAI APIs and over 100 other LLMs, and organizes agent collaboration and task automation around a few core concepts: agents, tools, guardrails, and sessions. Key features include agents as tools, sandbox execution, human-in-the-loop mechanisms, conversation history management, built-in tracing, and support for voice agents.
Granian is a high-performance Rust HTTP server for Python applications, built on Hyper and Tokio. It offers a unified, correct implementation of HTTP/1 and HTTP/2 (with HTTP/3 planned) and supports the ASGI/3, RSGI, and WSGI application interfaces. Designed for streamlined deployment with a single dependency, Granian targets high concurrency and stable throughput, particularly for WebSocket connections. It also provides HTTPS, mTLS, and direct static file serving, which makes it a fit for Python deployments that need high performance and concurrency.
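For readers unfamiliar with the ASGI interface mentioned above, the following is a minimal sketch of an ASGI 3 application, the kind of callable such a server hosts. The names `app` and `call_once` are illustrative and not part of Granian; `call_once` is a hypothetical helper that drives the app in-process so the sketch can run without any server installed.

```python
import asyncio


async def app(scope, receive, send):
    """Minimal ASGI 3 application: answers every HTTP request with plain text."""
    if scope["type"] != "http":
        return  # this sketch ignores lifespan and websocket scopes
    await send({
        "type": "http.response.start",
        "status": 200,
        "headers": [(b"content-type", b"text/plain")],
    })
    await send({"type": "http.response.body", "body": b"hello from ASGI"})


def call_once(asgi_app, path="/"):
    """Hypothetical helper: call the app once in-process and collect what it sends."""
    sent = []

    async def receive():
        # A single, empty request body.
        return {"type": "http.request", "body": b"", "more_body": False}

    async def send(message):
        sent.append(message)

    scope = {"type": "http", "method": "GET", "path": path, "headers": []}
    asyncio.run(asgi_app(scope, receive, send))
    return sent


messages = call_once(app)
```

Assuming the file is saved as `main.py`, an application like this would typically be served with `granian --interface asgi main:app`.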
Browser-Use is an AI-driven browser automation tool that lets users automate web interactions, data extraction, and complex tasks through natural language or Python code. It offers an open-source agent for local deployment and a more robust cloud agent service, featuring stealth browsing, proxy rotation, and advanced integrations intended to improve automation efficiency and accuracy.
MemPalace is a local-first AI memory tool that stores conversation history verbatim and retrieves it through semantic search, without summarizing or paraphrasing. It uses a structured index that categorizes information into "wings," "rooms," and "drawers" to enable scoped searches. The retrieval backend is pluggable (ChromaDB by default), and all data stays on the user's machine unless the user chooses otherwise. It reports 96.6% raw recall@5 (R@5) on the LongMemEval benchmark without any LLM or API calls.
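To make the R@5 figure concrete, here is a small sketch of one common reading of recall@k: a query counts as a hit if at least one relevant item appears among the top k retrieved results, and the score is the fraction of queries that hit. The benchmark's exact scoring may differ; the function names and the toy data below are assumptions for illustration only.

```python
def recall_at_k(ranked_ids, relevant_ids, k=5):
    """Return 1.0 if any relevant id appears in the top-k results, else 0.0."""
    top_k = ranked_ids[:k]
    return 1.0 if any(doc_id in relevant_ids for doc_id in top_k) else 0.0


def mean_recall_at_k(runs, k=5):
    """Average the per-query hit indicator over (ranked_ids, relevant_ids) pairs."""
    if not runs:
        return 0.0
    return sum(recall_at_k(ranked, relevant, k) for ranked, relevant in runs) / len(runs)


# Toy data (made up): two queries, each with a ranked result list and a gold set.
runs = [
    (["d3", "d7", "d1", "d9", "d2", "d4"], {"d1"}),  # relevant item at rank 3: hit
    (["d5", "d6", "d8", "d0", "d2", "d4"], {"d4"}),  # relevant item at rank 6: miss at k=5
]
score = mean_recall_at_k(runs, k=5)  # 0.5 for this toy data
```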
Agent Lightning, developed by Microsoft, is a framework-agnostic training platform for optimizing AI agent performance with minimal code changes. It lets developers apply algorithms such as Reinforcement Learning, Automatic Prompt Optimization, and Supervised Fine-tuning to agents built with any framework, including LangChain, AutoGen, or custom Python implementations. A core advantage is its ability to selectively optimize specific agents within complex multi-agent systems. With a lightweight architecture, Agent Lightning provides a path from initial deployment to continuous, algorithm-driven improvement.
AutoAgent by HKUDS is a fully automated, zero-code LLM agent framework. Users build, deploy, and orchestrate complex LLM agent systems purely through natural language, with no programming required. Its core strength lies in agent construction, tool creation, and workflow generation: it adapts and optimizes workflows from high-level task descriptions. AutoAgent offers a “user mode” that serves as an AI research assistant for information retrieval, complex analysis, and report generation, alongside “Agent Editor” and “Workflow Editor” modes for conversational customization of tools and agents. This lowers the barrier to AI development and speeds up the deployment of intelligent applications.
tradingview-mcp is an AI-powered trading intelligence framework and MCP server for real-time market analysis and trading decision support across cryptocurrencies and stocks. It integrates over 30 technical indicators, a multi-strategy backtesting engine, and live social media sentiment analysis, and it deploys specialized AI agents (Technical Analyst, Sentiment & Momentum Analyst, Risk Manager) that reach a collaborative judgment to generate high-confidence buy/sell signals. The open-source framework can be deployed in about 5 minutes without any API keys and is compatible with Claude Desktop and OpenClaw (supporting Telegram, WhatsApp, etc.), making it a cost-effective option for individual investors and quantitative analysts alike.
AI-Trader by HKUDS is a fully automated, agent-native trading platform that gives AI agents a dedicated environment for interacting with financial markets. It offers one-command integration for major AI agents such as OpenClaw and nanobot, enabling collective trading. Key functions include cross-platform signal synchronization, one-click copy trading, and market access spanning stocks, crypto, forex, options, and futures. A built-in reward system incentivizes signal publishing and following. AI-Trader provides both AI agents and human traders with risk-selectable paper trading and copy services.
The "openai-cs-agents-demo" is a customer service interface built upon the OpenAI Agents SDK. It features a Python backend for agent orchestration logic and a Next.js UI, powered by ChatKit, for visualizing the agent collaboration process and providing a chat interface. This demo showcases how a multi-agent system intelligently routes requests, handles complex tasks, and employs guardrails for relevance and jailbreak prevention. It serves as a practical foundation for developers to build extensible and transparent AI agent applications, particularly in customer support scenarios.
MoneyPrinter V2 (MPV2) is an application designed to automate the process of making money online. As the second iteration of the MoneyPrinter project, it has been completely rewritten with a focus on broader features and a more modular architecture. It integrates automated Twitter bots and YouTube Shorts generators, using cron jobs for scheduled publishing. MPV2 also supports Amazon affiliate marketing with Twitter promotion and helps users find local businesses for cold outreach.
HelloGitHub is a monthly publication, updated on the 28th of each month, that curates and shares interesting, entry-level open-source projects from GitHub. It covers a diverse range of content, including innovative projects, open-source books, practical applications, and enterprise-grade solutions. Its mission is to help developers, especially beginners, discover the appeal of open source, lower the barrier to participation, and build a deeper appreciation for open-source development.
mem0 is a universal memory layer for AI agents and assistants that gives them personalization and continuous learning. It retains context and user preferences through multi-level memory (User, Session, Agent state). The V3 memory algorithm combines single-pass extraction, entity linking, and multi-signal retrieval to improve memory accuracy and recall, and the project reports leading performance on key benchmarks. mem0 offers developer-friendly APIs and cross-platform SDKs, making it suited to applications such as customer support and personalized assistants.
RedditVideoMakerBot automates short-video creation for content creators. It generates short videos from Reddit content, removing the manual editing and asset compilation typically required. With a single command, users can produce videos formatted for platforms like TikTok, YouTube, and Instagram. Key features include custom background music, flexible selection of Reddit posts or subreddits, a choice of video backgrounds, voice customization, and NSFW content filtering. The tool outputs ready-to-upload video files, so creators can produce content efficiently and at scale.
supervision is an open-source computer vision utility library developed by Roboflow, designed to streamline the development and deployment of CV applications through an efficient and reusable toolset. It boasts model agnosticism, seamlessly integrating with mainstream classification, detection, and segmentation models from frameworks like Ultralytics and MMDetection. The library offers a rich array of customizable annotators for clear and effective visualization of model outputs. Furthermore, it provides robust dataset management capabilities, supporting loading, splitting, merging, and converting data across various formats including COCO, YOLO, and Pascal VOC. supervision significantly enhances efficiency in data processing, model inference, and result presentation, making it ideal for real-time video stream analysis, object tracking, and behavioral analysis applications such as dwell time analysis and vehicle speed estimation.
MoneyPrinterTurbo is an end-to-end automated short video generator. Given just a keyword or theme, the tool automatically generates a script, sources high-quality footage, synthesizes a voiceover, adds subtitles, and mixes in background music. Built with an MVC architecture, it offers both a Web UI and an API. It integrates with major LLM providers such as OpenAI and DeepSeek, supports portrait and landscape video (9:16 and 16:9), and uses copyright-free materials. It is designed for creators who batch-produce content and can be deployed locally or with Docker.
TradingAgents is a multi-agent Large Language Model (LLM) financial trading framework developed by TauricResearch. It simulates a real-world trading firm by deploying specialized LLM-powered agents: fundamental, sentiment, news, and technical analysts, alongside researchers, traders, risk managers, and portfolio managers. These agents collaboratively assess market conditions and inform trading decisions through dynamic discussions. Built with LangGraph for flexibility and modularity, it supports major LLM providers and features persistent decision logging and checkpoint resume. It is designed primarily for research purposes.
VoxCPM is a tokenizer-free Text-to-Speech system that directly generates continuous speech representations via an end-to-end diffusion autoregressive architecture, producing natural and expressive synthesis. VoxCPM2, the latest model, has 2B parameters and is trained on over 2 million hours of multilingual speech data. It supports 30 languages, Voice Design, Controllable Voice Cloning, and 48 kHz studio-quality audio output with built-in super-resolution.
AiPy is a Python-based AI agent development framework built for the Chinese market as a localized alternative to OpenClaw. It is designed around the workflows and habits of Chinese developers. Its core strength is its adaptation to Chinese Large Language Models (LLMs), which helps agents understand and process Chinese-language contexts. AiPy also integrates with mainstream Chinese cloud platforms, simplifying the building, deployment, and management of agent applications, particularly in scenarios that require localized AI infrastructure and data processing.
The "AI Hedge Fund Simulation System" by virattt is a proof-of-concept project exploring the use of AI in simulated trading decisions. This modular multi-agent system models the investment philosophies of 13 renowned investors, complemented by specialized agents for valuation, sentiment, fundamental analysis, technical analysis, risk management, and portfolio management. Users can generate trading signals, calculate intrinsic stock values, analyze market data, conduct risk assessments, and make simulated portfolio decisions. Designed solely for educational and research purposes, the system offers both command-line and web interfaces, supports multiple large language models, and includes backtesting; it does not execute any real trades.
Nanobot is an open-source, lightweight MCP host developed by the community that simplifies building agents on the Model Context Protocol (MCP). It offers flexible deployment and lets users connect large language models from providers such as OpenAI and Anthropic to various MCP servers through configuration. Developers can build and manage AI agent applications that support multi-modal interactions such as chat and voice. By providing a fully MCP-compliant platform, Nanobot lowers the technical barrier for AI agent development.
GLM-OCR is a multimodal OCR model built on the GLM-V encoder-decoder architecture, specifically designed for complex document understanding. It integrates a CogViT visual encoder and GLM-0.5B language decoder, leveraging Multi-Token Prediction (MTP) loss and reinforcement learning to significantly boost training efficiency, recognition accuracy, and generalization. Achieving a SOTA score of 94.62 on OmniDocBench V1.5, GLM-OCR excels in handling formulas, tables, and information extraction. With only 0.9B parameters, it supports efficient deployment via vLLM and SGLang, offering low inference latency and optimized costs, ideal for high-concurrency and edge scenarios. Fully open-sourced with a comprehensive SDK, GLM-OCR ensures easy installation and integration for accurate and rapid document intelligence.
An AST-based semantic code search tool powered by the high-performance CocoIndex Rust engine. It provides precise code retrieval by understanding code structure rather than text snippets, reducing token usage by 70%. It integrates seamlessly as a Skill or MCP server for agents like Claude Code and Cursor. Featuring zero-config setup and support for both local and cloud embeddings, it enables efficient semantic search within CLI and modern AI coding environments.
Khazix Skills is an open-source collection of AI skills and prompts developed by 'Digital Life Khazix' to make AI agents more useful and efficient. It includes structured skill instruction sets that comply with the Agent Skills open standard and can be loaded by agents such as Claude Code, Codex, and OpenClaw. It also offers copy-paste prompts compatible with leading conversational models like ChatGPT, Claude, and Gemini. The collection covers practical tasks such as automated document synchronization, in-depth research report generation, personalized writing styles, and lookups of trending AI news.
awesome-llm-apps by Shubhamsaboo is a curated repository offering over 100 runnable AI Agent and RAG application templates. It serves as a practical cookbook of ready-to-use code, enabling developers to quickly clone, customize, and deploy production-grade LLM applications. It covers modern AI stacks like AI Agents, multi-agent teams, RAG, voice agents, agent skills, and fine-tuning. Each template is self-contained, original, end-to-end tested, supports various LLMs (Claude, Gemini, OpenAI, Llama, etc.), and comes with free step-by-step tutorials.
A set of ready-to-use Agent Skills for research, science, engineering, analysis, finance, and writing.
code-review-graph is an AI-assisted code review tool that builds a structural map of code using Tree-sitter and incrementally tracks changes to provide precise context to AI assistants via the Model Context Protocol (MCP). This approach significantly reduces token consumption by enabling AI models to read only the minimal set of files relevant to a change, rather than the entire codebase. It aims to solve the token waste problem in large repositories and monorepos, improving the efficiency and quality of AI code reviews.
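The idea of reading only the files relevant to a change can be illustrated with a small dependency-graph sketch. This is not the tool's actual algorithm or API: the import graph is hand-written here (a real tool would derive it by parsing the code), and `impacted_files` is a hypothetical helper that returns the changed files plus everything that transitively imports them.

```python
from collections import defaultdict, deque


def impacted_files(imports, changed):
    """Return the changed files plus every file that transitively imports them."""
    # Invert the "file -> files it imports" map into "file -> files that import it".
    dependents = defaultdict(set)
    for source, targets in imports.items():
        for target in targets:
            dependents[target].add(source)

    # Breadth-first walk over the reverse edges, starting from the changed files.
    seen = set(changed)
    queue = deque(changed)
    while queue:
        current = queue.popleft()
        for dependent in dependents[current]:
            if dependent not in seen:
                seen.add(dependent)
                queue.append(dependent)
    return seen


# Hypothetical repository layout, for illustration only.
imports = {
    "api.py": ["auth.py", "db.py"],
    "auth.py": ["db.py"],
    "cli.py": ["api.py"],
    "docs.py": [],
}
review_set = impacted_files(imports, ["auth.py"])
```

In this toy graph, a change to `auth.py` pulls in `api.py` and `cli.py` but leaves `db.py` and `docs.py` out of the review context, which is where the token savings would come from.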
The Claude Code Plugins Directory is the official registry for extending Anthropic's terminal-based AI agent. It serves as a curated marketplace for both internal and third-party plugins built on the Model Context Protocol (MCP). By providing a standardized structure for slash commands, specialized skills, and agent definitions, it enables seamless integration of external tools and data into the Claude Code environment, allowing developers to build highly customized and automated engineering workflows.
Hermes Agent, developed by Nous Research, is a self-improving AI agent. Its built-in learning loop lets it autonomously create and refine skills, recall knowledge across sessions, and build a progressively deeper understanding of the user. It works with over 200 large language models via OpenRouter and other providers. With a terminal interface and messaging platform integration (Telegram, Discord), Hermes Agent supports scheduled automations and delegation to sub-agents, with deployment options ranging from a VPS to serverless infrastructure. It is positioned as an adaptive, research-ready AI companion for personalized use.
An educational project by shareAI-lab focused on 'Harness Engineering' for AI agents. It provides a masterclass by reverse-engineering the architecture of Anthropic's Claude Code. The repo teaches how to build the 'vehicle' (harness) for the 'driver' (model), covering essential mechanisms like tool implementation, context compression, subagent spawning, and task dependency graphs. It shifts the development focus from prompt plumbing to building robust environments for autonomous models.