Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
In-depth study of the graphrag
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Dream 7B, a large diffusion language model
One-stop Proxies Crawling and Aggregation Platform
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
A highly configurable Windows status bar written in Python.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Solve Visual Understanding with Reinforced VLMs
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Official inference repo for FLUX.1 models
Rich is a Python library for rich text and beautiful formatting in the terminal.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Command-line program to download videos from YouTube.com and other video sites
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Start building LLM-empowered multi-agent applications in an easier way.
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Basic Memory is a knowledge management system that allows you to build a persistent semantic graph from conversations with AI assistants. All knowledge is stored in standard Markdown files on your computer, giving you full control and ownership of your data. Integrates directly with Obsidan.md
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
python版本的小智ai,主要帮助那些没有硬件却想体验小智功能的人
Ray tracing and hybrid rasterization of Gaussian particles
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
崩坏:星穹铁道全自动 三月七小助手
Lumina-mGPT 2.0: Stand-alone Autoregressive Image Modeling
Run Claude Code on OpenAI models
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
✨ 一款小白也能轻松使用的漫画翻译工具,旨在帮助漫画爱好者轻松跨越语言障碍,畅享原汁原味的日文漫画。 利用先进的 AI 技术,智能检测漫画中的对话气泡,精准识别日文文本,并快速翻译成流畅自然的中文。 ✨ 无论是图片还是 PDF 格式的漫画,Saber-Translator 都能轻松应对,让你无压力阅读心爱的漫画作品。
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
AI 视频笔记生成工具 让 AI 为你的视频做笔记
The official Python library for the OpenAI API
获取微信信息;读取数据库,本地查看聊天记录并导出为csv、html等格式用于AI训练,自动回复等。支持多账户信息获取,支持所有微信版本。
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Official repo for CFG-Zero*
The Web framework for perfectionists with deadlines.
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.
Damn Vulnerable MCP Server
Data validation using Python type hints
Industry leading face manipulation platform
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
Unlock the fullest potential of your device