Skip to content
View sylee96's full-sized avatar

Block or report sylee96

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

Rust 4,891 381 Updated Feb 4, 2025
Python 1,347 52 Updated Nov 21, 2024

A curated list of awesome Multimodal studies.

HTML 164 16 Updated Mar 19, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 1,304 94 Updated Mar 13, 2025

Paper List of Inference/Test Time Scaling/Computing

Python 123 3 Updated Mar 19, 2025

Fully open data curation for reasoning models

Python 1,573 135 Updated Mar 16, 2025

s1: Simple test-time scaling

Python 6,037 705 Updated Mar 6, 2025

PromptBERT: Improving BERT Sentence Embeddings with Prompts

Python 333 35 Updated Nov 22, 2023

E5-V: Universal Embeddings with Multimodal Large Language Models

Python 236 8 Updated Dec 23, 2024

Unified Reinforcement Learning Framework

Python 708 65 Updated Sep 6, 2024

Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! ๐Ÿฆฅ

Python 35,482 2,728 Updated Mar 22, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the siโ€ฆ

TypeScript 14,839 1,520 Updated Mar 14, 2025

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 493 55 Updated Mar 21, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,821 2,204 Updated Feb 1, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,215 236 Updated Mar 17, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 42,709 5,853 Updated Mar 21, 2025

Fully open reproduction of DeepSeek-R1

Python 23,154 2,107 Updated Mar 22, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,233 226 Updated Mar 22, 2025

Code for ALBEF: a new vision-language pre-training method

Python 1,622 206 Updated Sep 20, 2022

KURE: ๊ณ ๋ ค๋Œ€ํ•™๊ต์—์„œ ๊ฐœ๋ฐœํ•œ, ํ•œ๊ตญ์–ด ๊ฒ€์ƒ‰์— ํŠนํ™”๋œ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ

Python 153 7 Updated Feb 28, 2025

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

1,924 152 Updated Feb 24, 2025

State-of-the-art CLIP/SigLIP embedding models finetuned for the fashion domain. +57% increase in evaluation metrics vs FashionCLIP 2.0.

Python 82 8 Updated Sep 20, 2024

Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"

Python 128 15 Updated Oct 19, 2023

Retrieval and Retrieval-augmented LLMs

Python 9,061 651 Updated Mar 20, 2025

CUDA integration for Python, plus shiny features

Python 1,911 290 Updated Feb 7, 2025

Everything about the SmolLM2 and SmolVLM family of models

Python 2,041 113 Updated Mar 21, 2025

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,893 319 Updated Jun 12, 2024
Next
Showing results