- Cambridge, MA
- http://people.csail.mit.edu/clai24/
- @jefflai108
- audiolm-pytorch Public
  Forked from lucidrains/audiolm-pytorch
  Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
  Python, MIT License, Updated Feb 4, 2025
- espnet Public
  Forked from espnet/espnet
  End-to-End Speech Processing Toolkit
  Python, Apache License 2.0, Updated Aug 21, 2024
- unit_info_align Public
  Forked from jacobandreas/info_align
  HTML, Apache License 2.0, Updated Mar 11, 2023
- VGNSL Public
  Forked from ExplorerFreda/VGNSL
  [ACL 2019] Visually Grounded Neural Syntax Acquisition
- fairseq Public
  Forked from facebookresearch/fairseq
  Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
  Python, MIT License, Updated Feb 11, 2023
- ASSERT Public
  JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT).
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
- Self-Supervised-Speech-Pretraining-and-Representation-Learning Public
  Forked from s3prl/s3prl
  The S3PRL speech toolkit: self-supervised pre-training and representation learning of Mockingjay, TERA, A-ALBERT, APC, and more to come. With easy-to-use standard downstream evaluation scripts incl…
  Python, MIT License, Updated Oct 31, 2020
- PPLM Public
  Forked from uber-research/PPLM
  Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
  Python, Apache License 2.0, Updated Apr 3, 2020
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
- Contrastive Predictive Coding for Automatic Speaker Verification
- tf-kaldi-speaker Public
  Forked from entn-at/tf-kaldi-speaker
  Neural speaker recognition/verification system based on Kaldi and Tensorflow
  Python, Apache License 2.0, Updated Sep 26, 2019
- Attentive-Filtering-Network Public
  University of Edinburgh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.
- DIM Public
  Forked from rdevon/DIM
  Deep InfoMax (DIM), or "Learning Deep Representations by Mutual Information Estimation and Maximization"
  Python, BSD 3-Clause "New" or "Revised" License, Updated Apr 25, 2019
- tacotron2 Public
  Forked from nii-yamagishilab/tacotron2
  An implementation of Tacotron and Tacotron2
  Python, BSD 3-Clause "New" or "Revised" License, Updated Apr 15, 2019
- self-attention-tacotron Public
  Forked from nii-yamagishilab/self-attention-tacotron
  An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960
  Python, BSD 3-Clause "New" or "Revised" License, Updated Apr 12, 2019
- pytorch_GAN_zoo Public
  Forked from facebookresearch/pytorch_GAN_zoo
  A mix of GAN implementations including progressive growing
  Python, BSD 3-Clause "New" or "Revised" License, Updated Apr 11, 2019
- PytorchWaveNetVocoder Public
  Forked from kan-bayashi/PytorchWaveNetVocoder
  WaveNet-Vocoder implementation with pytorch
  Shell, Apache License 2.0, Updated Apr 9, 2019