Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
Python Backtesting library for trading strategies
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Turns Data and AI algorithms into production-ready web applications in no time.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Train transformer language models with reinforcement learning.
The authentication glue you need.
Mouse and keyboard recording and automation, similar to 按键精灵: automate mouse clicks and keyboard input
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Supercharge Your LLM Application Evaluations 🚀
Automated management tool for NAS media libraries
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
A developer toolkit to implement Serverless best practices and increase developer velocity.
🔄 CLI to convert Webpages to PDFs 🚀
RSS subscription plugin for QQ bots; RSSHub is the recommended feed source
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
A curated list of resources for using LLMs to develop more competitive grant applications.
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
LTX-Video Support for ComfyUI
A chatbot/GraphRAG framework that creates multiple LLM agents from social platform user comments and lets them debate specific topics.
SoftVC VITS Singing Voice Conversion
Real-time face swap for PC streaming or video calls
An open-source PAM tool alternative to CyberArk. A widely popular open-source bastion host.
Write scalable load tests in plain Python 🚗💨
The official source code repository for the calibre ebook manager
State-of-the-art 2D and 3D Face Analysis Project
Open source platform for the machine learning lifecycle
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Official implementation of AnimateDiff.
Daemon to ban hosts that cause multiple authentication errors
Structured outputs for LLMs
Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in.
A framework for few-shot evaluation of language models.
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
A Library for Advanced Deep Time Series Models.
🔰 Home Assistant Operating System
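Xiaohongshu (小红书) link extraction and post collection tool: extracts links to an account's published, favorited, liked, and album posts; extracts post and user links from search results; collects post information; extracts post download URLs; downloads watermark-free post files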
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Automatically integrates all Xiaomi devices into Home Assistant via miot-spec; supports Wi-Fi, BLE, and ZigBee devices. A Home Assistant integration for Xiaomi Mi Home smart home devices.
Big data analysis project
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
AI agent microservice
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
WeChatOpenDevTool: force-enables the developer tools for WeChat Mini Programs
The official Python library for the Google Gemini API
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects