• EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models

  • Jul 17 2024
  • Length: 10 mins
  • Podcast


  • Summary

  • This episode of "You Are A Helpful (Research) Assistant" is an AI-generated, human-curated discussion of vulnerabilities in refusal training for language models, focusing on how the past tense attack affects model behavior.

