AI Safety & Benchmarking: Building Trustworthy Evaluation Ecosystems

No se pudo agregar al carrito

Solo puedes tener X títulos en el carrito para realizar el pago.

Add to Cart failed.

Por favor prueba de nuevo más tarde

Error al Agregar a Lista de Deseos.

Por favor prueba de nuevo más tarde

Error al eliminar de la lista de deseos.

Por favor prueba de nuevo más tarde

Error al añadir a tu biblioteca

Por favor intenta de nuevo

Error al seguir el podcast

Intenta nuevamente

Error al dejar de seguir el podcast

Intenta nuevamente

AI Safety & Benchmarking: Building Trustworthy Evaluation Ecosystems

Escúchala gratis

Ver detalles del espectáculo

Effective AI supervision requires reliable benchmarking ecosystems. Nicholas Miailhe discusses why benchmarks matter, how they should be constructed, and what regulators need to know about safety evaluations. The conversation highlights emerging international efforts to standardise safety testing and ensure comparability across models.

Speaker: Nicholas Miailhe (PRISM Eval)

Interviewer: Doaa Abu Elyounes, Programme Specialist, Ethics of AI Unit, UNESCO

Hosted on Ausha. See ausha.co/privacy-policy for more information.

Todavía no hay opiniones