Tags
- Topics: Artificial Intelligence, Large language model, Natural language processing, Transformer, Speed optimization, Open source
- Additional: SotA
Significance
- Drastically reduces the number of neurons an LLM must evaluate at inference time while losing as little accuracy as possible
Intuitive summary
Technical summary
- a [[BERT]] variant that matches baseline performance while engaging only 0.3% of its neurons at inference, achieved by replacing dense feedforward layers with [[fast feedforward networks]]; see the sketch below
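A minimal PyTorch sketch of the inference-time routing idea behind fast feedforward networks, assuming the usual formulation: a balanced binary tree of decision neurons sends each input to exactly one small leaf network, so only `depth` node neurons plus one leaf are evaluated instead of the whole layer width. The soft, differentiable routing used during training is omitted, and all names (`FastFeedForward`, `d_leaf`, ...) are illustrative rather than taken from the paper's code.

```python
import torch
import torch.nn as nn

class FastFeedForward(nn.Module):
    """Inference-only sketch of a fast feedforward network (FFF).

    A depth-`depth` balanced binary tree of decision neurons routes each
    input to exactly one leaf MLP, so only `depth` scalar node neurons
    plus one leaf are evaluated per input. Hyperparameter names are
    illustrative, not the paper's.
    """

    def __init__(self, d_model: int, depth: int, d_leaf: int):
        super().__init__()
        self.depth = depth
        n_nodes = 2 ** depth - 1   # internal decision neurons (heap layout)
        n_leaves = 2 ** depth      # small expert MLPs at the leaves
        self.node_weights = nn.Parameter(torch.randn(n_nodes, d_model) * d_model ** -0.5)
        self.leaf_in = nn.Parameter(torch.randn(n_leaves, d_model, d_leaf) * d_model ** -0.5)
        self.leaf_out = nn.Parameter(torch.randn(n_leaves, d_leaf, d_model) * d_leaf ** -0.5)

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (batch, d_model)
        batch = x.shape[0]
        node = torch.zeros(batch, dtype=torch.long, device=x.device)  # start at root
        for _ in range(self.depth):
            # One scalar decision neuron per visited node: go left or right.
            score = (x * self.node_weights[node]).sum(dim=-1)
            node = 2 * node + 1 + (score > 0).long()  # heap-style child index
        leaf = node - (2 ** self.depth - 1)           # tree index -> leaf id
        # Evaluate only the selected leaf MLP for each input row.
        h = torch.relu(torch.einsum('bd,bdk->bk', x, self.leaf_in[leaf]))
        return torch.einsum('bk,bkd->bd', h, self.leaf_out[leaf])

ffn = FastFeedForward(d_model=768, depth=11, d_leaf=1)
y = ffn(torch.randn(4, 768))  # 11 node neurons + 1 leaf evaluated per row
```

Because only about log2(width) node neurons plus one leaf are touched per input, the evaluated fraction of the layer shrinks exponentially with tree depth; selecting 12 out of 4095 neurons per layer is what yields the roughly 0.3% figure cited above. Actual wall-clock speedups depend on how well the conditional gathers map to hardware, which this batched sketch does not attempt to optimize.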
Main resources
Deep dive
Brainstorming
Additional resources
Related
AI
Metadata
- paper #tool #processing #long #focus