Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a RESTful API.
The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
Building AI agents, atomically
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Fast and memory-efficient exact attention
Re-movery
Inference code for Llama models
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
Cross-platform, fast, feature-rich, GPU based terminal
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, data analysis, visualization, and report writing. Perfect for researchers and data scientists seeking to enhance their workflow and productivity.
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.
Google Gen AI Python SDK provides an interface for developers to integrate Google's generative models into their Python applications. This is an early release. API is subject to change. Please do not use this SDK in production environments at this stage
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
Efficient Track Anything
Tools for controlling webcam LED on ThinkPad X230
A Library for Advanced Deep Time Series Models.
Rich is a Python library for rich text and beautiful formatting in the terminal.
A command-line tool to download photos from iCloud
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
structured outputs for llms
Unified GUI Censorship Resistant Solution Powered by Xray
The official Meta Llama 3 GitHub site
LLM based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
A blender addon for generating meshes with AI
Bjorn is a powerful network scanning and offensive security tool for the Raspberry Pi with a 2.13-inch e-Paper HAT. It discovers network targets, identifies open ports, exposed services, and potential vulnerabilities. Bjorn can perform brute force attacks, file stealing, host zombification, and supports custom attack scripts.
Open source repo for the WhyHow Knowledge Graph Studio
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Free and Open Source Enterprise Resource Planning (ERP)
A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from various relevant websites and do research for you all on its own! And more, not limited to but including saving the findings for you!
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
Video Depth without Video Models
Python Backtesting library for trading strategies
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale