Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
欢迎star⭐。🚀从聊天记录创造数字分身的一站式解决方案💡 使用微信聊天记录微调大语言模型,让大模型有“那味儿”。使用微信语音消息➕0.5B大模型实现高质量声音克隆,并绑定到聊天机器人,实现自己的数字分身。 数字克隆/数字分身/数字永生/声音克隆/LLM/大语言模型/微信聊天机器人/LoRA
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Build effective agents using Model Context Protocol and simple workflow patterns
🙌 OpenHands: Code Less, Make More
aider is AI pair programming in your terminal
A collection of sample agents built with Agent Development (ADK)
🤗 smolagents: a barebones library for agents that think in python code.
Stable Diffusion web UI
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Robust Speech Recognition via Large-Scale Weak Supervision
Finetune Llama 4, TTS, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
Towards Human-Sounding Speech
A collaborative note taking, wiki and documentation platform that scales. Built with Django and React. Opensource alternative to Notion or Outline.
Investment Research for Everyone, Everywhere.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
抖音批量下载工具,去水印,支持视频、图集、合集、音乐(原声)。免费!免费!免费!
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Toolkit for linearizing PDFs for LLM datasets/training
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow and framework knowledge base to unlock a new frontier of automated agents.
Wan: Open and Advanced Large-Scale Video Generative Models
OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to intricate anime characters.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
MCP Server for IDA Pro
verl: Volcano Engine Reinforcement Learning for LLMs
✨ 易上手的多平台 LLM 聊天机器人及开发框架 ✨ 平台支持 QQ、QQ频道、Telegram、微信、企微、飞书 | MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify 等。附带 WebUI。
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
A simple, secure MCP-to-OpenAPI proxy server
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.
Ultralytics YOLO11 🚀
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
SkyReels-V2: Infinite-length Film Generative model
Prompt-To-Agent : Create custom engineering agents for your codebase
auto sign cursor
Build resilient language agents as graphs.
Agent S: an open agentic framework that uses computers like a human
Agent Framework / shim to use Pydantic with LLMs
FastAPI framework, high performance, easy to learn, fast to code, ready for production