Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Python tool for converting files and office documents to Markdown.
Open-Sora: Democratizing Efficient Video Production for All
The fast, Pythonic way to build Model Context Protocol servers 🚀
WhatsApp MCP server
Run AI Agent in your browser.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which lets AI drive data-driven AI.
Build effective agents using Model Context Protocol and simple workflow patterns
A live stream development of RL tunning for LLM agents
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Unified framework for building enterprise RAG pipelines with small, specialized models
Convert PDF to markdown + JSON quickly with high accuracy
The Memory layer for AI Agents
Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow and framework knowledge base to unlock a new frontier of automated agents.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
verl: Volcano Engine Reinforcement Learning for LLMs
aider is AI pair programming in your terminal
✨ 易上手的多平台 LLM 聊天机器人及开发框架 ✨ 平台支持 QQ、QQ频道、Telegram、微信、企微、飞书 | MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify 等。附带 WebUI。
Robust Speech Recognition via Large-Scale Weak Supervision
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Local Deep Research is an AI-powered assistant that transforms complex questions into comprehensive, cited reports by conducting iterative analysis using any LLM across diverse knowledge sources including academic databases, scientific repositories, web content, and private document collections.
auto sign cursor
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Stable Diffusion web UI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
Fully open reproduction of DeepSeek-R1
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
麦麦bot,一款专注于 群组聊天 的赛博网友(非常专注)QQ BOT
Ultralytics YOLO11 🚀
SGLang is a fast serving framework for large language models and vision language models.
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
你还在为自己存放的VV表情包不够多,使用时觉得不够贴切而感到烦恼吗?快来试试这个项目吧!
Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
Python API for JMComic | 提供Python API访问禁漫天堂,同时支持网页端和移动端 | 禁漫天堂GitHub Actions下载器🚀
Build resilient language agents as graphs.