Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
Generative Models by Stability AI
Code for the paper "Language Models are Unsupervised Multitask Learners"
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Asynchronous HTTP client/server framework for asyncio and Python
Network Analysis in Python
LLM based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Official implementation of AnimateDiff.
The official GitHub page for the survey paper "A Survey of Large Language Models".
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
Book_4_《矩阵力量》 | 鸢尾花书:从加减乘除到机器学习;上架!
Implementation of Nougat Neural Optical Understanding for Academic Documents
A collaboration friendly studio for NeRFs
Pure Python 3 MTProto API Telegram client library, for bots too!
Quickly rewrite git repository history (filter-branch replacement)
Go ahead and axolotl questions
用文本编辑器剪视频
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
The recursive internet scanner for hackers. 🧡
🛰️✨ Free V2ray Configs , Updating Every 10 minutes.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Open source crypto trading bot
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
🤖 Build voice-based LLM agents. Modular + open source.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Adaptive Lighting custom component for Home Assistant
A beautiful, powerful, self-hosted rom manager
Converts text to speech in realtime
Copy playlists and liked music from Spotify to YTMusic
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Benchmarking Generalized Out-of-Distribution Detection
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
Recipes to train reward model for RLHF.
AdalFlow: The library to build & auto-optimize LLM applications.
LLM Agent Framework in ComfyUI includes Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as o1,ollama, gemini, grok, qwen, GLM, deepseek, moonshot,doubao. Adapted to local llms, vlm, gguf such as llama-3.2, Linkage graphRAG / RAG
每个人都能用的数字人
ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots.
Follow along with my AI Agents Masterclass videos! All of the code I create and use in this series on YouTube will be here for you to use and even build on top of!
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which lets AI drive data-driven AI.
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
A tool designed to simplify the creation of OpenCore EFI
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Python API client for AI providers that intends to replace LangChain and LangGraph for most common use cases.