Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
A Telegram RSS bot that cares about your reading experience
Master Federated Learning in 2 Hours—Run It on Your PC!
Turn any computer or edge device into a command center for your computer vision projects.
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
BackTrader中文教程笔记(by:量化投资与机器学习),系统性介绍Bactrader的特性、策略构建、数据结构、回测交易等,彻底掌握量化神器的使用方法。章节:介绍篇、数据篇、指标篇、交易篇、策略篇、可视化篇……(持续更新中)
Minimalistic large language model 3D-parallelism training
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
Official repository of the xLSTM.
A Web UI for easy subtitle using whisper model.
自动收集的IPv4酒店电视直播源,自动测试播放速度,每日自动更新。 有CCTV央视卫视频道,及部分地方频道,播放流畅。也可在openwrt或群辉的docker运行。
我的导航算法学习笔记,内容涵盖导航定位开源程序的源码解读、开源项目梳理、书籍讲义、博客翻译、教程讲座推荐;所有内容都可以随意转载,原始文件都放在这里了,大家可以在我的基础上整理出自己的一些文档。(Tips:①主要是写给初学者,已经有基础的同学应该多看论文和代码,看我的笔记学不到啥;②仓库持续更新中,不建议 fork)
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
In order to make it easier to use the ComfyUI, I have made some optimizations and integrations to some commonly used nodes.
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.
Conventional SGBM depth ranging + yolov5 object detection with deployment on Jeston nano
depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Recipes to train reward model for RLHF.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
High-speed downloader for multiple platforms
An OSINT CLI tool desgined to fast track IP Reputation and Geo-locaton look up for Security Analysts.
An MIT License of YOLOv9, YOLOv7, YOLO-RD
NanoGPT (124M) in 3 minutes
提升部署在cloudflare、vercel或netlify的网页在中国的访问速度和稳定性 Improve the access speed and stability in China of web pages hosted on cloudflare, vercel or netlify by merely changing your CNAME record. cf优选域名 | cf优选ip | cloudflare | vercel | netlify | 加速 | 国内 | 中国 | 境内 | 大陆
Bjorn is a powerful network scanning and offensive security tool for the Raspberry Pi with a 2.13-inch e-Paper HAT. It discovers network targets, identifies open ports, exposed services, and potential vulnerabilities. Bjorn can perform brute force attacks, file stealing, host zombification, and supports custom attack scripts.
Tailor是一款视频智能裁剪、视频生成和视频优化的视频剪辑工具。目前的目标是通过人工智能技术减少视频剪辑的繁琐操作,让普通人也能简单实现专业剪辑人的水准!长远目标是让视频剪辑实现真正的AIGC!
Agent Zero AI framework
Composable building blocks to build Llama Apps
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Auto-configure and then control your Midea M-Smart devices (Air conditioner, Fan, Water heater, Washer, etc) via local area network.
Video generation from text&image, 1st-gen
Onekey Steam Depot Manifest Downloader
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
A modern cookiecutter template for Python projects that use uv for dependency management
自用青龙面板辅助工具,用于自动登录JD获取许可更新青龙面板
16 colors fork of pywal
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Memory-Guided Diffusion for Expressive Talking Video Generation
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
Hints lets you navigate GUI applications in Linux without your mouse by displaying "hints" you can type on your keyboard to interact with GUI elements.