laetokang

Follow

laetokang

Follow

15 followers · 64 following

Korea University
Seoul/Korea
https://laetokang.tistory.com/

Achievements

Achievements

Stars

davidmartinrius / speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Python 240 23 Updated Jun 10, 2024

yangwenmai / github-trending-backup

Github trending backup by everyday.

Go 377 52 Updated Mar 26, 2025

underlines / awesome-ml

Curated list of useful LLM / Analytics / Datascience resources

2,235 189 Updated Feb 21, 2025

Text-to-Audio / AudioLCM

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 962 142 Updated Dec 19, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,383 5,544 Updated Mar 26, 2025

anthropics / courses

Anthropic's educational courses

Jupyter Notebook 9,712 864 Updated Nov 26, 2024

harlanhong / awesome-talking-head-generation

1,651 122 Updated Feb 8, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 25,825 2,487 Updated Mar 27, 2025

maszhongming / Multi-LoRA-Composition

Repository for the Paper "Multi-LoRA Composition for Image Generation"

Python 466 48 Updated Mar 31, 2024

tin2tin / Pallaidium

PALLAIDIUM - a generative AI movie studio integrated in the Blender Video Editor.

Python 1,100 88 Updated Mar 11, 2025

kongzhecn / OMG

[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models

Python 682 45 Updated Jul 2, 2024

pedrolisboa / poseidon

A toolkit for sonar signal processing

Python 3 2 Updated Oct 10, 2022

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 43,090 5,923 Updated Mar 26, 2025

nateraw / stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Python 4,550 438 Updated Sep 21, 2024

pixeli99 / SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

Python 676 66 Updated Jul 25, 2024

sagiodev / stable-video-diffusion-img2vid

Jupyter Notebook 167 18 Updated Mar 3, 2024

tykim0507 / Motion-LoRA

Learning Motion from Low-Rank Adaptation

Python 44 2 Updated Jun 15, 2024

teddylee777 / machine-learning

머신러닝 입문자 혹은 스터디를 준비하시는 분들에게 도움이 되고자 만든 repository입니다. (This repository is intented for helping whom are interested in machine learning study)

Jupyter Notebook 2,708 872 Updated Apr 5, 2024

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,031 105 Updated Mar 14, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 8,121 728 Updated Mar 26, 2025

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,479 234 Updated Jun 14, 2024

breizhn / DTLN-aec

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Python 295 72 Updated Apr 26, 2022

quic / ai-hub-models

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 648 102 Updated Mar 26, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 40,680 4,491 Updated Mar 26, 2025

yzhuoning / Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

1,184 57 Updated Jun 28, 2024

asheeshcric / awesome-contrastive-self-supervised-learning

A comprehensive list of awesome contrastive self-supervised learning papers.

1,263 128 Updated Sep 10, 2024

wenet-e2e / speech-recognition-papers

Towards hot directions in industrial end to end speech recognition

326 40 Updated Nov 30, 2021

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 5,376 610 Updated Mar 26, 2025

k2-fsa / sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…

C++ 1,239 171 Updated Jan 6, 2025

k2-fsa / sherpa

Speech-to-text server framework with next-gen Kaldi

C++ 641 113 Updated Mar 24, 2025