Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
A generative speech model for daily dialogue.
Hunt down social media accounts by username across social networks
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具
Faster Whisper transcription with CTranslate2
SOTA Open Source TTS
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
📺IPTV电视直播源更新项目『✨秒播级体验🚀』:支持RTMP推流;支持IPv4/IPv6;支持自定义频道;支持本地源、组播源、酒店源、订阅源、关键字搜索;每天自动更新两次,结果可用于TVBox等播放软件;支持工作流、Docker(amd64/arm64/arm v7)、命令行、GUI运行方式 | IPTV live TV source update project
Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Releases from OpenAI Preparedness
Open source multi-modal RAG for building AI apps over private knowledge.
Turns Codebase into Easy Tutorial with AI
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT4.1/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Framework that enables fine-tuning of vision-language grounding models on custom datasets
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
Official implementations for paper: VACE: All-in-One Video Creation and Editing
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Fast and memory-efficient exact attention
Amazon Nova Act is a research preview of a new AI model for developers to build agents that take actions in web browsers
A community-maintained Python framework for creating mathematical animations.
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
bilibili 硬核会员 AI 自动答题脚本,直接调用 B 站 API,非 OCR 实现
TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.
The official ElevenLabs MCP server
SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web
Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 100 clases, 44 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search, crawling, and Python code execution, while giving back to the community that made this possible.
Magnificent app which corrects your previous console command.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Deep research agent to help you find the best GitHub repositories 🕵️!
A list of free LLM inference resources accessible via API.
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
获取b站历史记录,批量下载视频,一键下载用户投稿视频,收藏夹所有视频,生成详细的年度总结,自动化任务,下面链接是对应前端
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
The Python programming language
Focus on prompting and generating
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
In-depth study of the graphrag
Easily download and manage game cheats for your convenience
Free and Open Source Enterprise Resource Planning (ERP)
Access large language models from the command-line