FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Followers: 50
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Recent Activity
sam-far updated a model about 4 hours ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_unique_40_epochs_merged_v1
sam-far published a model about 4 hours ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_unique_40_epochs_merged_v1
sam-far updated a model about 4 hours ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_unique_40_epochs_v1
Team members: 15
AlignmentResearch's datasets (87)
AlignmentResearch/StrongREJECT • Viewer • Updated May 2, 2025 • 387 • 38 • 1
AlignmentResearch/WildChat • Viewer • Updated May 1, 2025 • 45.6k • 9
AlignmentResearch/HarmBench • Viewer • Updated Apr 23, 2025 • 400 • 19
AlignmentResearch/WildChatCurriculum • Viewer • Updated Apr 18, 2025 • 13.2k • 29
AlignmentResearch/JailbreakCompletionsCurriculum • Viewer • Updated Apr 18, 2025 • 9.39k • 2
AlignmentResearch/WildChatScored • Viewer • Updated Apr 11, 2025 • 13k • 15
AlignmentResearch/BoNStrongREJECT • Viewer • Updated Mar 19, 2025 • 100k • 1
AlignmentResearch/NestedCiphers • Viewer • Updated Mar 13, 2025 • 806k • 7
AlignmentResearch/AugmentedJailbreaks • Viewer • Updated Mar 13, 2025 • 20.8k • 57
AlignmentResearch/JailbreakCompletions • Viewer • Updated Mar 13, 2025 • 46.3k • 12
AlignmentResearch/WildChatFiltered • Viewer • Updated Mar 12, 2025 • 24k • 1
AlignmentResearch/JailbreakInputs • Viewer • Updated Mar 11, 2025 • 102k • 16 • 1
AlignmentResearch/Llama3Jailbreaks • Viewer • Updated Feb 12, 2025 • 78.5k • 10
AlignmentResearch/XSTest • Viewer • Updated Jan 30, 2025 • 900 • 5
AlignmentResearch/WordLength • Viewer • Updated Aug 7, 2024 • 100k • 14
AlignmentResearch/Harmless • Viewer • Updated Jul 29, 2024 • 86.6k • 23
AlignmentResearch/Helpful • Viewer • Updated Jul 29, 2024 • 88.1k • 31
AlignmentResearch/PasswordMatch • Viewer • Updated Jul 29, 2024 • 100k • 8
AlignmentResearch/IMDB • Viewer • Updated Jul 29, 2024 • 97.5k • 42 • 1
AlignmentResearch/EnronSpam • Viewer • Updated Jul 29, 2024 • 62.3k • 9
AlignmentResearch/PasswordMatch-test • Viewer • Updated Jul 26, 2024 • 50k • 9
AlignmentResearch/WordLength-test • Viewer • Updated Jul 26, 2024 • 100k • 9
AlignmentResearch/StrongREJECT-test • Viewer • Updated Jul 26, 2024 • 313 • 5
AlignmentResearch/IMDB-test • Viewer • Updated Jul 26, 2024 • 97.5k • 7
AlignmentResearch/EnronSpam-test • Viewer • Updated Jul 26, 2024 • 62.4k • 5
AlignmentResearch/boxoban-astar-solutions • Preview • Updated Jul 25, 2024 • 152 • 1
AlignmentResearch/RuLES-Encryption • Viewer • Updated Jul 16, 2024 • 50k • 3 • 1