ZeRO Memory Optimizations: Toward Training Trillion Parameter Models

The paper introduces ZeRO (Zero Redundancy Optimizer), a novel approach to optimizing memory usage when training massive language models. Its two components, ZeRO-DP and ZeRO-R, eliminate memory redundancy across data-parallel devices and reduce residual memory consumption, respectively; in the authors' experiments this enabled efficient training of models with up to 170 billion parameters. The technique shows superlinear scalability, requires little change to user code, and has the potential to democratize large-model training in AI research.

Read full paper: https://arxiv.org/abs/1910.02054

Tags: Systems and Performance, Deep Learning, Natural Language Processing
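The memory savings of ZeRO-DP follow from a simple accounting argument given in the paper: with mixed-precision Adam, each of the Ψ model parameters costs 2 bytes for fp16 weights, 2 bytes for fp16 gradients, and K = 12 bytes of optimizer state (fp32 weights, momentum, and variance). The three cumulative ZeRO-DP stages partition these terms across the Nd data-parallel devices. A minimal sketch of that memory model (function name and structure are illustrative, not from the paper):

```python
def zero_dp_bytes_per_device(psi, nd, stage=0, k=12):
    """Per-device bytes of model-state memory for a psi-parameter model
    trained with mixed-precision Adam across nd data-parallel devices.

    stage 0: plain data parallelism (everything replicated)
    stage 1 (P_os):     optimizer states partitioned
    stage 2 (P_os+g):   + gradients partitioned
    stage 3 (P_os+g+p): + parameters partitioned
    """
    params = 2 * psi   # fp16 parameters
    grads = 2 * psi    # fp16 gradients
    opt = k * psi      # fp32 params + Adam momentum and variance
    if stage >= 1:
        opt /= nd
    if stage >= 2:
        grads /= nd
    if stage >= 3:
        params /= nd
    return params + grads + opt


# Example from the paper's analysis: a 7.5B-parameter model on 64 GPUs
# needs 120 GB per device with plain data parallelism, but only
# 16 * psi / nd = 1.875 GB per device with all three stages enabled.
print(zero_dp_bytes_per_device(7.5e9, 64, stage=0) / 1e9)  # 120.0 GB
print(zero_dp_bytes_per_device(7.5e9, 64, stage=3) / 1e9)  # 1.875 GB
```

The stage-3 figure is what makes trillion-parameter training plausible: per-device model-state memory shrinks linearly with the number of data-parallel devices.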