Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning capability.
Official code of ORION
A hyperparameter optimization framework
AWS zero to hero repo for DevOps engineers to learn AWS in 30 days. This repo includes projects, presentations, interview questions and real-time examples.
M3U Playlist for free TV channels
Big & Small LLMs working together
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
Interlocking Layers Post-Processing Script for PrusaSlicer, OrcaSlicer, and BambuStudio
You like pytorch? You like micrograd? You love tinygrad! ❤️
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
Study path based on coding exercises from the MoureDev community for learning and practicing logic using any programming language.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit
A Docker-powered service for PDF document layout analysis. It segments and classifies the different parts of PDF pages, identifying elements such as text, titles, pictures and tables.
Self-hosted Error Tracking
The ultimate training toolkit for finetuning diffusion models
Build autonomous, resilient and observable AI agents with built-in workflow orchestration, security, statefulness and telemetry.
Generative Models by Stability AI
Rapidly build AI apps in Python
Efficient Triton Kernels for LLM Training
Future of Agentic Development in Emacs
🕵️‍♂️ Collect a dossier on a person by username from thousands of sites
Robyn is a Super Fast Async Python Web Framework with a Rust runtime.
A self-hosted habit tracking app without "Goals"
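A tool focused on AI translation: one-click automatic translation of RPG and SLG games, EPUB and TXT novels, SRT, VTT and LRC subtitles, Word and Markdown documents, and other complex long texts.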
DockFlare - CloudFlare Tunnel Controller
A toolkit for developing and comparing reinforcement learning algorithms.
Interact with your documents using the power of GPT, 100% privately, no data leaks
Run macOS on QEMU/KVM. With OpenCore + Monterey + Ventura + Sonoma support now! Only commercial (paid) support is available now to avoid spammy issues. No Mac system is required.
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction.
An open-source solution for full-parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as practical experience and conclusions gathered along the way.
An example starter repo for Python projects
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
⚡ A Fast, Extensible Progress Bar for Python and CLI
Mastering Diverse Domains through World Models
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
🪐 ✨ Model Context Protocol (MCP) Server for Jupyter.
Write scalable load tests in plain Python 🚗💨
Graph Neural Network Library for PyTorch
Zulip server and web application. Open-source team chat that helps teams stay productive and focused.
A GPT-empowered penetration testing tool
SWE-bench [Multimodal]: Can Language Models Resolve Real-world GitHub Issues?
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
bphltaoli
A specialized utility that automates discovering missing items in your TV collection and upgrading existing ones.
The easiest tool for fine-tuning LLMs, synthetic data generation, and collaborating on datasets.