Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
A natural language interface for computers
Deezer source separation library including pretrained models.
Python ProxyPool for web spider
:cake: Desktop utility to download images/videos/music/text from various websites, and more.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
A Django content management system focused on flexibility and user experience
Open standard for machine learning interoperability
Generative Models by Stability AI
Best DDoS Attack Script Python3, (Cyber / DDos) Attack With 56 Methods
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
State-of-the-Art Text Embeddings
The best free and open-source automated time tracker. Cross-platform, extensible, privacy-focused.
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
An educational resource to help anyone learn deep reinforcement learning.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Hackable and optimized Transformers building blocks, supporting a composable construction.
Quickly rewrite git repository history (filter-branch replacement)
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
A command-line tool to download photos from iCloud
⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
A curated list of awesome libraries, packages, strategies, books, blogs, tutorials for systematic trading.
A Python framework for high performance GPU simulation and graphics
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!
TerminalTextEffects (TTE) is a terminal visual effects engine, application, and Python library.
东北方言编程语言
a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image
On-device AI across mobile, embedded and edge for PyTorch
Mastering Diverse Domains through World Models
The All in One Framework to Build Undefeatable Scrapers
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including OpenAI Agents SDK, CrewAI, Langchain, Autogen, AG2, and CamelAI
Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.
夸克网盘签到、自动转存、命名整理、发推送提醒和刷新媒体库一条龙
Inference Microsoft Florence2 VLM
Bring portraits to life!
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves
Efficient Triton Kernels for LLM Training
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
Unlock the fullest potential of your device
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which lets AI drive data-driven AI.
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
✨ AsrTools: Smart Voice-to-Text Tool | Efficient Batch Processing | User-Friendly Interface | No GPU Required | Supports SRT/TXT Output | Turn your audio into accurate text in an instant!
Janus-Series: Unified Multimodal Understanding and Generation Models
The best OSS video generation models