🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Python 240 23 Updated Jun 10, 2024

Daily backup of GitHub trending.

Go 377 52 Updated Mar 26, 2025

Curated list of useful LLM / Analytics / Datascience resources

2,235 189 Updated Feb 21, 2025

PyTorch implementation of AudioLCM (ACM-MM'24): efficient and high-quality text-to-audio generation with a latent consistency model.

Python 962 142 Updated Dec 19, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,383 5,544 Updated Mar 26, 2025

Anthropic's educational courses

Jupyter Notebook 9,712 864 Updated Nov 26, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 25,825 2,487 Updated Mar 27, 2025

Repository for the Paper "Multi-LoRA Composition for Image Generation"

Python 466 48 Updated Mar 31, 2024

PALLAIDIUM - a generative AI movie studio integrated in the Blender Video Editor.

Python 1,100 88 Updated Mar 11, 2025

[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models

Python 682 45 Updated Jul 2, 2024

A toolkit for sonar signal processing

Python 3 2 Updated Oct 10, 2022

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 43,090 5,923 Updated Mar 26, 2025

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Python 4,550 438 Updated Sep 21, 2024

Stable Video Diffusion Training Code and Extensions.

Python 676 66 Updated Jul 25, 2024
Jupyter Notebook 167 18 Updated Mar 3, 2024

Learning Motion from Low-Rank Adaptation

Python 44 2 Updated Jun 15, 2024

This repository is intended to help machine learning beginners and those preparing for a machine learning study group.

Jupyter Notebook 2,708 872 Updated Apr 5, 2024

[CSUR] A Survey on Video Diffusion Models

2,031 105 Updated Mar 14, 2025

Enjoy the magic of Diffusion models!

Python 8,121 728 Updated Mar 26, 2025

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,479 234 Updated Jun 14, 2024

This repository contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Python 295 72 Updated Apr 26, 2022

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python 648 102 Updated Mar 26, 2025

Making large AI models cheaper, faster and more accessible

Python 40,680 4,491 Updated Mar 26, 2025

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

1,184 57 Updated Jun 28, 2024

A comprehensive list of awesome contrastive self-supervised learning papers.

1,263 128 Updated Sep 10, 2024

Towards hot directions in industrial end-to-end speech recognition

326 40 Updated Nov 30, 2021

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 5,376 610 Updated Mar 26, 2025

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…

C++ 1,239 171 Updated Jan 6, 2025

Speech-to-text server framework with next-gen Kaldi

C++ 641 113 Updated Mar 24, 2025