Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets Paper • 2506.05346 • Published Jun 5, 2025
Spectral Insights into Data-Oblivious Critical Layers in Large Language Models Paper • 2506.00382 • Published May 31, 2025
NCTV: Neural Clamping Toolkit and Visualization for Neural Network Calibration Paper • 2211.16274 • Published Nov 29, 2022
ContextAnyone: Context-Aware Diffusion for Character-Consistent Text-to-Video Generation Paper • 2512.07328 • Published Dec 8, 2025 • 1
A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level Paper • 2112.15594 • Published Dec 31, 2021
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper • 2407.14933 • Published Jul 20, 2024 • 15
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 13
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series Paper • 2405.19327 • Published May 29, 2024 • 48
AutoVP: An Automated Visual Prompting Framework and Benchmark Paper • 2310.08381 • Published Oct 12, 2023 • 2
Best Practices and Lessons Learned on Synthetic Data for Language Models Paper • 2404.07503 • Published Apr 11, 2024 • 32
Gemma: Open Models Based on Gemini Research and Technology Paper • 2403.08295 • Published Mar 13, 2024 • 51
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Paper • 2403.05530 • Published Mar 8, 2024 • 65
Design2Code: How Far Are We From Automating Front-End Engineering? Paper • 2403.03163 • Published Mar 5, 2024 • 98
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25, 2024 • 57