Distilling Giants: Making BERT Models Lightweight and Interpretable with DIET COKE
This past semester, I embarked on an exciting NLP project: building DIET COKE — short for Decision trees Interpreting Efficient Transformers - Compression Of Knowledge Extraction. It's a project that sits at the crossroads of deep learning and classi...
blog.alves.world · 4 min read