Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
Utilities intended for use with Llama models.
OCR, layout analysis, reading order, table recognition in 90+ languages
Memory-Guided Diffusion for Expressive Talking Video Generation
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
The official Python library for the OpenAI API
Easily download and manage game cheats for your convenience
A chatbot/GraphRAG framework that creates multi-llm-agents from social platform user comments and let them debate on specific topics.
Structured Text Generation
A curated list of resources for using LLMs to develop more competitive grant applications.
A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.
Open-Sora: Democratizing Efficient Video Production for All
Modern YouTube downloader with a clean PyQt6 interface. Download videos in any quality, extract audio, fetch subtitles (including auto-generated), and view video metadata. Built with yt-dlp for reliable performance.
Streamlit — A faster way to build and share data apps.
Infinite Photorealistic Worlds using Procedural Generation
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
The recursive internet scanner for hackers. 🧡
Data validation using Python type hints
Faster Whisper transcription with CTranslate2
适用于AdGuard的去广告合并规则,每8个小时更新一次。
Unlock the fullest potential of your device
Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 100 clases, 44 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...
A generalist Python node editor
The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
A Gradio web UI for Large Language Models with support for multiple inference backends.
NanoGPT (124M) in 5 minutes
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
SGLang is a fast serving framework for large language models and vision language models.
The best OSS video generation models
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
The Web framework for perfectionists with deadlines.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.
[768 Resolution] [Any "SDXL" Model] [Various Conditions] [Arbitrary Views] Official impl. of "MV-Adapter: Multi-view Consistent Image Generation Made Easy"
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
Dockerized Computer Use Agents with Production Ready API’s - MCP Client for Langchain - GCA
Robyn is a Super Fast Async Python Web Framework with a Rust runtime.
基于Python的开源量化交易平台开发框架
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Access large language models from the command-line
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.