This paper presents a data-driven analysis examining the content associations that large language models have developed with various social categories, revealing the stereotypes embedded within chatbot systems.
This work investigates the stereotypes embedded in large language models by analyzing their content associations with different social categories through a comprehensive data-driven approach.