Bartosz Cywiński
bcywinski
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored a paper 12 days ago
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation submitted a paper 12 days ago
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation updated a dataset 13 days ago
bcywinski/uyghurs-censoredOrganizations
None yet