Tags

Significance

  • Drastically reducing the size of LLMs while losing as little accuracy as possible

Intuitive summary

Technical summary

  • A [[BERT]] variant that needs only 0.3% of its neurons to reach the same performance, by using [[fast feedforward networks]]
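
To make the 0.3% figure concrete, here is a minimal sketch of how a fast feedforward layer might evaluate only one root-to-leaf path of a binary tree of neurons. Everything in it is an assumption for illustration: the tree depth of 11 (giving 4095 neurons, of which 12 are visited), the heap-style node layout, the GELU activation, the sign-based branching rule, and the names `fff_forward`, `w_in`, and `w_out`. It is not taken from the note or from any reference implementation.

```python
import math
import random


def gelu(x):
    # Exact GELU via the error function.
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def fff_forward(x, w_in, w_out, depth):
    """Evaluate one root-to-leaf path in a balanced binary tree of neurons.

    Assumed layout: node i has children 2*i + 1 and 2*i + 2.
    w_in[i] and w_out[i] are the input and output weight vectors of node i.
    Returns the output vector and the number of neurons actually evaluated.
    """
    out = [0.0] * len(w_out[0])
    node = 0
    visited = 0
    for _ in range(depth + 1):
        pre = sum(a * b for a, b in zip(x, w_in[node]))
        act = gelu(pre)
        for j, w in enumerate(w_out[node]):
            out[j] += act * w
        visited += 1
        # Assumed branching rule: positive pre-activation goes to the right child.
        node = 2 * node + (2 if pre > 0 else 1)
    return out, visited


random.seed(0)
depth = 11                        # assumed depth, chosen so the tree has 4095 neurons
n_nodes = 2 ** (depth + 1) - 1    # 4095
dim = 8                           # small hidden size, for illustration only

w_in = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_nodes)]
w_out = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_nodes)]
x = [random.gauss(0, 1) for _ in range(dim)]

out, visited = fff_forward(x, w_in, w_out, depth)
fraction_used = 100 * visited / n_nodes
print(f"{visited} of {n_nodes} neurons used ({fraction_used:.1f}%)")
```

Under these assumptions, each input touches 12 of 4095 neurons, which is where a figure of roughly 0.3% would come from. The full set of weights still has to be stored, so the saving in this sketch is in neurons evaluated per input rather than in parameters kept.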

Main resources

Deep dive

Brain storming

Additional resources

AI

Metadata

  • paper #tool #processing #long #focus