Stay updated with the latest and most popular GitHub repositories. We fetch data from the official GitHub API, analyze it in-house, and update our listings every hour to bring you the freshest trends.
A series of math-specific large language models of our Qwen2 series.
Helpful tools and examples for working with flex-attention
将知乎专栏文章转换为 Markdown 文件保存到本地
Video generation from text&image, 1st-gen
[CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
Awesome list of 300+ agentic AI resources
You can using StoryDiffusion in ComfyUI
A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.
2024智慧树刷课脚本 基于Python Playwright的自动化程序 [有免安装版]
[ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
Compose multimodal datasets 🎹
Medical Graph RAG: Graph RAG for the Medical Data
[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
Effortlessly request recommended movies, TV shows and anime to Jellyseer/Overseer based on your recently watched content on Jellyfin, Plex or Emby—let SuggestArr handle it all automatically, keeping your library fresh with new and exciting content!
A trainable PyTorch reproduction of AlphaFold 3.
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
Efficient and easy multi-instance LLM serving
Unifying 3D Mesh Generation with Language Models
A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from various relevant websites and do research for you all on its own! And more, not limited to but including saving the findings for you!
微信云备份,备份到服务器、Docker、NAS,Web访问。
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling
ShodanX is a tool to gather information of targets using shodan dorks⚡.
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
Build fast and accurate GenAI apps with GraphRAG SDK at scale.
Medical o1, Towards medical complex reasoning with LLMs
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
ja-netfilter修改版,jetbrains全家桶激活(IDEA/PyCharm/GoLand/PhpStorm et al.)
[CVPR 2025] Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"
🚀 Cross attention map tools for huggingface/diffusers
the scikit-learn sidekick
一个masa mods的汉化资源包
Free AI-Tips is a FREE Newsletter provided by Business Science. It comes with bite-sized Python AI for Business tutorials every week. Sign up here:
Open source conversation framework and visual editor for structured Pipecat dialogues
这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。
Expert Parallelism Load Balancer
[CVPR2025] HVI: A New Color Space for Low-light Image Enhancement && "You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement"
[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models
一个完全开源的 Solana 链上交易机器人,支持跟单交易和自动交易功能。 A fully open-source Solana on-chain trading bot that supports copy trading and automated trading features.