Stars
A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.
A curated list of awesome Multimodal studies.
Official PyTorch implementation for "Large Language Diffusion Models"
Paper List of Inference/Test Time Scaling/Computing
Fully open data curation for reasoning models
PromptBERT: Improving BERT Sentence Embeddings with Prompts
E5-V: Universal Embeddings with Multimodal Large Language Models
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! ๐ฆฅ
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the siโฆ
An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.
Janus-Series: Unified Multimodal Understanding and Generation Models
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Fully open reproduction of DeepSeek-R1
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Code for ALBEF: a new vision-language pre-training method
KURE: ๊ณ ๋ ค๋ํ๊ต์์ ๊ฐ๋ฐํ, ํ๊ตญ์ด ๊ฒ์์ ํนํ๋ ์๋ฒ ๋ฉ ๋ชจ๋ธ
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
State-of-the-art CLIP/SigLIP embedding models finetuned for the fashion domain. +57% increase in evaluation metrics vs FashionCLIP 2.0.
Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"
Retrieval and Retrieval-augmented LLMs
CUDA integration for Python, plus shiny features
Everything about the SmolLM2 and SmolVLM family of models
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)