Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
End-to-End Speech Processing Toolkit
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
🥂 Gracefully face hCaptcha challenges with a multimodal large language model.
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.
AlphaFold 3 inference pipeline.
Make Mac apps accessible for AI agents
Discover, run, and compose AI agents from any framework
Code for the paper "Language Models are Unsupervised Multitask Learners"
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
The Database Toolkit for Python
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Self-hosting and DevOps platform for running open-source applications: a web-based Linux panel that works as a lightweight PaaS
Access 100+ robot descriptions from the main Python robotics frameworks
A digital human that everyone can use
This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.
Real-Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗
This is for Ethical Use only.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). The current version is RWKV-7 "Goose". It combines the best of RNNs and transformers: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework
MCP to connect your LLM with Spotify.
Awesome list of open-source startup alternatives to well-known SaaS products 🚀
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
The tiniest PaaS you've ever seen. Piku allows you to do git push deployments to your own servers.
An OSINT tool to search for accounts by username and email in social networks.
Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
Sonic is a method for "Shifting Focus to Global Audio Perception in Portrait Animation"; you can use it in ComfyUI.
Python Interface for FANUC robots
Learn how DevOps engineers can use Gen AI to enhance their productivity in day-to-day tasks.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Linux device manager for Logitech devices
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
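A lightweight, flexible, easy-to-use Python tool for generating and exporting JianYing (剪映) drafts, for building fully automated video editing and remix pipelines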
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Automatic headphone equalization from frequency responses
Official implementation of AnimateDiff.
A curated list of awesome things related to Django
Resume Matcher is a free, open-source tool to improve your resume. It uses AI (reader LLMs) to compare and rank resumes against job descriptions.
Your self-hosted YouTube media server
Movie metadata scraper
An ultra-lightweight digital human model that runs in real time on mobile devices
RAG that intelligently adapts to your use case, data, and queries