Insights from NVIDIA: Creating Compact Language Models through Pruning and Knowledge Distillation

Jan 12, 2025 · 7m 21s
Insights from NVIDIA: Creating Compact Language Models through Pruning and Knowledge Distillation
Description

This episode analyzes the research paper "**Compact Language Models via Pruning and Knowledge Distillation**" authored by Saurav Muralidharan, Sharath Turuvekere Sreenivas, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan...

show more
This episode analyzes the research paper "**Compact Language Models via Pruning and Knowledge Distillation**" authored by Saurav Muralidharan, Sharath Turuvekere Sreenivas, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, and Pavlo Molchanov from **NVIDIA**, published on November 4, 2024. It explores NVIDIA's strategies for reducing the size of large language models by implementing structured pruning and knowledge distillation techniques. The discussion covers how these methods enable the derivation of smaller, efficient models from a single pre-trained model, significantly lowering computational costs and data requirements. Additionally, the episode highlights the development of the **MINITRON** family of models and their performance improvements, such as a **16% increase** in MMLU scores compared to similarly sized models trained from scratch, demonstrating the effectiveness of these approaches in creating scalable and resource-efficient language technologies.

This podcast is created with the assistance of AI, the producers and editors take every effort to ensure each episode is of the highest quality and accuracy.

For more information on content and research relating to this episode please see: https://arxiv.org/pdf/2407.14679
show less
Information
Author James Bentley
Organization James Bentley
Website -
Tags

Looks like you don't have any active episode

Browse Spreaker Catalogue to discover great new content

Current

Podcast Cover

Looks like you don't have any episodes in your queue

Browse Spreaker Catalogue to discover great new content

Next Up

Episode Cover Episode Cover

It's so quiet here...

Time to discover new episodes!

Discover
Your Library
Search