Insights from NVIDIA: Creating Compact Language Models through Pruning and Knowledge Distillation

Description
This episode analyzes the research paper "**Compact Language Models via Pruning and Knowledge Distillation**" authored by Saurav Muralidharan, Sharath Turuvekere Sreenivas, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan...
This podcast is created with the assistance of AI. The producers and editors make every effort to ensure each episode is of the highest quality and accuracy.
For more information on content and research relating to this episode please see: https://arxiv.org/pdf/2407.14679
Information
Author | James Bentley |
Organization | James Bentley |