Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Python APIs for web automation, testing, and bypassing bot-detection.
PyTorch code and models for V-JEPA self-supervised learning from video.
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!
A collection of examples that show how to use CrewAI framework to automate workflows.
一个基于✨HOOK机制的微信机器人,支持🌱安全新闻定时推送【FreeBuf,先知,安全客,奇安信攻防社区】,👯Kfc文案,⚡漏洞查询,⚡手机号归属地查询,⚡知识库查询,🎉星座查询,⚡天气查询,🌱摸鱼日历,⚡微步威胁情报查询, 🐛视频,⚡图片,👯帮助菜单。📫 支持积分功能,⚡支持自动拉人,,🌱自动群发,👯Ai回复(国内主流AI模型,扣子,FastGpt,Dify全面支持!),⚡视频号解析,😄自定义程度丰富,小白也可轻松上手!
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
Onmyoji Auto Script | 阴阳师脚本
爬虫入门、爬虫进阶、高级爬虫
Brings Apple's vibrant emojis to your Linux experience
Successor of Undetected-Chromedriver. Providing a blazing fast framework for web automation, webscraping, bots and any other creative ideas which are normally hindered by annoying anti bot systems like Captcha / CloudFlare / Imperva / hCaptcha
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Swift / Ultralytics / veRL / MMEngine / Keras etc.
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
Apk builds of piko patches
Follow along with my AI Agents Masterclass videos! All of the code I create and use in this series on YouTube will be here for you to use and even build on top of!
Janus-Series: Unified Multimodal Understanding and Generation Models
Windows Cleaner——专治C盘爆红及各种不服!
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides
Curso de Python desde cero y para todos los públicos con ejercicios
Synthetic Data SDK ✨
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Comprehensive Python Cheatsheet
Image-to-Image Translation in PyTorch
🐧 A list of awesome Linux softwares
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
:rainbow:Python3网络爬虫实战:淘宝、京东、网易云、B站、12306、抖音、笔趣阁、漫画小说下载、音乐电影下载等
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Adding guardrails to large language models.
A Doctor for your data
汇总多站点数据的AV元数据刮削器
Converts text to speech in realtime
WeChatOpenDevTool 微信小程序强制开启开发者工具
Mastering Diverse Domains through World Models
Track emissions from Compute and recommend ways to reduce their impact on the environment.
150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
轻量、灵活、易上手的Python剪映草稿生成及导出工具,构建全自动化视频剪辑/混剪流水线
2025年全网最全即插即用模块,免费分享!CVPR2025,AAAI2025,ICLR2025,TNNLS2025,arXiv2025......包含人工智能全领域(机器学习、深度学习等),适用于图像分类、目标检测、实例分割、语义分割、全景分割、姿态识别、医学图像分割、视频目标分割、图像抠图、图像编辑、单目标跟踪、多目标跟踪、行人重识别、RGBT、图像去噪、去雨、去雾、去阴影、去模糊、超分辨率、去反光、去摩尔纹、图像恢复、图像修复、高光谱图像恢复、图像融合、图像上色、高动态范围成像、视频与图像压缩、3D点云、3D目标检测、3D语义分割、3D姿态识别等各类计算机视觉和图像处理任务,以及自然语言处理、大语言模型、多模态等其他各类人工智能相关任务。持续更新中......
A pipeline parallel training script for diffusion models.
Run AI models end-to-end encrypted.
⚡️鸿蒙Next Hap安装包合集,如果您觉得有帮助,还请点亮一下 Star 🌟 哦~ 万分感谢!
A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefNet models, SAM, and GroundingDINO.
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open