Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
Media Discovery and Download Hub
Universal LLM Deployment Engine with ML Compilation
Composable building blocks to build Llama Apps
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
:mag_right: :chart_with_upwards_trend: :snake: :moneybag: Backtest trading strategies in Python.
s1: Simple test-time scaling
【蓝桥杯Python冲刺课】视频合集 https://space.bilibili.com/398421867/lists?sid=4898042&spm_id_from=333.788.0.0
[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
自动直播录制、投稿工具,支持twitch、ytb频道搬运。
Official PyTorch implementation for "Large Language Diffusion Models"
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
Learn how to use CUA (our Computer Using Agent) via the API on multiple computer environments.
Falcon: A Remote Sensing Vision-Language Foundation Model
A list of Free and Open Source Software (FOSS) for Android – saving Freedom and Privacy.
一些关于目标检测的脚本的改进思路代码,详细请看readme.md
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Deepfakes Software For All
DeepFaceLab is the leading software for creating deepfakes.
An open source engine for your digital products. Sell SaaS and digital products in minutes.
Run Orpheus 3B Locally With LM Studio
An implementation for InfiniteYou
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
汇总多站点数据的AV元数据刮削器
A curated list of awesome DevOps platforms, tools, practices and resources
Redis for LLMs
百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断
A very simple GRPO implement for reproducing r1-like LLM thinking.
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
[CVPR 2025] WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
python爬虫教程系列、从0到1学习python爬虫,包括浏览器抓包,手机APP抓包,如 fiddler、mitmproxy,各种爬虫涉及的模块的使用,如:requests、beautifulSoup、selenium、appium、scrapy等,以及IP代理,验证码识别,Mysql,MongoDB数据库的python使用,多线程多进程爬虫的使用,css 爬虫加密逆向破解,JS爬虫逆向,分布式爬虫,爬虫项目实战实例等
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
An Open Source implementation of Notebook LM with more flexibility and features
:star:Github Ranking:star: Github stars and forks ranking list. Github Top100 stars list of different languages. Automatically update daily. | Github仓库排名,每日自动更新
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
TripoSR: Fast 3D Object Reconstruction from a Single Image
a text-based terminal client for Ollama
Various custom nodes for ComfyUI
This is a project that unifies the management of LLM APIs. It can call multiple backend services through a unified API interface, convert them to the OpenAI format uniformly, and support load balancing. Currently supported backend services include: OpenAI, Anthropic, DeepBricks, OpenRouter, Gemini, Vertex, etc.
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
Question and Answer based on Anything.
Dead simple FLUX LoRA training UI with LOW VRAM support
2025年全网最全即插即用模块,免费分享!CVPR2025,AAAI2025,ICLR2025,TNNLS2025,arXiv2025......包含人工智能全领域(机器学习、深度学习等),适用于图像分类、目标检测、实例分割、语义分割、全景分割、姿态识别、医学图像分割、视频目标分割、图像抠图、图像编辑、单目标跟踪、多目标跟踪、行人重识别、RGBT、图像去噪、去雨、去雾、去阴影、去模糊、超分辨率、去反光、去摩尔纹、图像恢复、图像修复、高光谱图像恢复、图像融合、图像上色、高动态范围成像、视频与图像压缩、3D点云、3D目标检测、3D语义分割、3D姿态识别等各类计算机视觉和图像处理任务,以及自然语言处理、大语言模型、多模态等其他各类人工智能相关任务。持续更新中......
This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning capability.
Official code of ORION
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
A hyperparameter optimization framework