drawar/rat-duorat-sql

About

I experimented with combining simpler embedding models (DistilBERT and TinyBERT) with the original RAT-SQL architecture and its ablated variant, DuoRAT. The experiments show that these simpler architectures required significantly fewer training iterations over the dataset to converge close to the original paper's results. The results also point to the efficacy of DistilBERT and TinyBERT in nearly matching the performance of BERT on external tasks despite a significant reduction in complexity.

The RAT-SQL and DuoRAT code was forked from the original open-source implementations (see the Resources section) and modified for these experiments. For BERT, DistilBERT and TinyBERT, the open-source implementations from https://huggingface.co/ were used.
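
To give a rough sense of the reduction in encoder complexity described above, the sketch below estimates parameter counts from architecture hyperparameters alone, without downloading any weights. It is an illustration, not code from this repository: `EncoderConfig` and `count_parameters` are hypothetical helpers, and the three configurations are assumed to be the publicly documented base-size BERT, DistilBERT and 4-layer TinyBERT. The checkpoints actually used in the experiments may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EncoderConfig:
    """Architecture hyperparameters of a BERT-style encoder (hypothetical helper)."""
    vocab_size: int
    hidden: int
    layers: int
    intermediate: int
    max_positions: int = 512
    token_types: int = 2       # 0 if the model has no segment embeddings
    has_pooler: bool = True


def count_parameters(cfg: EncoderConfig) -> int:
    """Count weights and biases of the embeddings, encoder layers and pooler."""
    h = cfg.hidden
    # Word, position and segment embeddings, plus the embedding LayerNorm.
    embeddings = (cfg.vocab_size + cfg.max_positions + cfg.token_types) * h + 2 * h
    # Query, key, value and output projections, each with a bias.
    attention = 4 * (h * h + h)
    # Two-layer feed-forward block with biases.
    ffn = h * cfg.intermediate + cfg.intermediate + cfg.intermediate * h + h
    # Each encoder layer also has two LayerNorms (scale and shift vectors).
    layer = attention + ffn + 2 * (2 * h)
    pooler = h * h + h if cfg.has_pooler else 0
    return embeddings + cfg.layers * layer + pooler


# Assumed configurations (publicly documented sizes, not taken from this repo).
BERT_BASE = EncoderConfig(vocab_size=30522, hidden=768, layers=12, intermediate=3072)
DISTILBERT = EncoderConfig(vocab_size=30522, hidden=768, layers=6, intermediate=3072,
                           token_types=0, has_pooler=False)
TINYBERT_4L = EncoderConfig(vocab_size=30522, hidden=312, layers=4, intermediate=1200)

if __name__ == "__main__":
    baseline = count_parameters(BERT_BASE)
    for name, cfg in [("BERT-base", BERT_BASE),
                      ("DistilBERT", DISTILBERT),
                      ("TinyBERT-4L", TINYBERT_4L)]:
        n = count_parameters(cfg)
        print(f"{name:12s} {n / 1e6:7.2f}M parameters ({n / baseline:.0%} of BERT-base)")
```

Under these assumptions, DistilBERT comes out at roughly 60% of the base-size BERT encoder and the 4-layer TinyBERT at under 15%. The count covers the encoder only and ignores the RAT-SQL or DuoRAT layers stacked on top.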

Resources
