Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides MCP tool-use capabilities.
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
Prompt-To-Agent : Create custom engineering agents for your codebase
📄 PageIndex: Document Index System for Reasoning-Based RAG
[CVPR 2025 Oral] VGGT: Visual Geometry Grounded Transformer
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
aider is AI pair programming in your terminal
Robust Speech Recognition via Large-Scale Weak Supervision
Wan: Open and Advanced Large-Scale Video Generative Models
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web
Yet Another Document Translator
A simple CLI tool to help you remember terminal commands
A collection of sample agents built with Agent Development (ADK)
Implementation for Describe Anything: Detailed Localized Image and Video Captioning
AI 视频笔记生成工具 让 AI 为你的视频做笔记
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
A Python Script to fetch Garmin health data and populate that in a InfluxDB Database, for visualization long term health trends with Grafana
:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
🤗 smolagents: a barebones library for agents that think in python code.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
All Algorithms implemented in Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
A list of developer portfolios for your inspiration
Stable Diffusion web UI
Ultralytics YOLO11 🚀
Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.
🙌 OpenHands: Code Less, Make More
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
verl: Volcano Engine Reinforcement Learning for LLMs
A collaborative note taking, wiki and documentation platform that scales. Built with Django and React. Opensource alternative to Notion or Outline.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Agent S: an open agentic framework that uses computers like a human
Towards Human-Sounding Speech
One Model to Rig Them All: Diverse Skeleton Rigging with UniRig
Build effective agents using Model Context Protocol and simple workflow patterns
(🚧 WIP) a course of LLM serving with MLX for systems engineers.
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Control GenAI interactions with power, precision, and consistency using Conversation Modeling paradigms