Enjoy the magic of Diffusion models!

Python 8,094 725 Updated Mar 25, 2025

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 573 21 Updated Mar 17, 2025
Python 29 Updated Mar 22, 2025

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,363 51 Updated Jan 12, 2025

[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Python 202 11 Updated Dec 27, 2024

Official PyTorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)

Python 219 12 Updated Jan 22, 2025

Official repository of In-Context LoRA for Diffusion Transformers

1,722 86 Updated Dec 20, 2024

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 259 10 Updated Feb 28, 2025

Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022

Python 1,102 88 Updated May 15, 2024

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 1,268 76 Updated Mar 25, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,783 228 Updated Mar 25, 2025

Inference-time scaling of diffusion-based image and video generation models.

Python 118 9 Updated Mar 5, 2025

[CVPR 2025] A benchmark for evaluating video generative models in generating short stories

Python 11 Updated Mar 8, 2025

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

263 14 Updated Feb 28, 2025

Explore the Multimodal “Aha Moment” on 2B Model

Python 531 18 Updated Mar 18, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,178 75 Updated Mar 17, 2025

s1: Simple test-time scaling

Python 6,061 708 Updated Mar 6, 2025

A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.

Jupyter Notebook 112 7 Updated Feb 17, 2025

Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening

Python 52 4 Updated Feb 21, 2025

Inference-Time Alignment in Protein Diffusion Models

Jupyter Notebook 23 1 Updated Jan 20, 2025

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 850 60 Updated Feb 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,101 979 Updated Mar 24, 2025

R1-onevision, a visual language model capable of deep CoT reasoning.

467 15 Updated Mar 18, 2025

[CVPR 2024] LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation

Python 278 14 Updated Apr 22, 2024

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,888 176 Updated Mar 10, 2025
Python 481 49 Updated Mar 24, 2025

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Python 415 24 Updated Mar 5, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,336 266 Updated Mar 24, 2025