Read next
Large Language Models as Optimizers
Paper
• 2309.03409
• Published
• 79
Challenges and Applications of Large Language Models
Paper
• 2307.10169
• Published
• 51
Efficiently Modeling Long Sequences with Structured State Spaces
Paper
• 2111.00396
• Published
• 3
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning
Paper
• 2006.08381
• Published
Universal and Transferable Adversarial Attacks on Aligned Language Models
Paper
• 2307.15043
• Published
• 2
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Paper
• 2005.11401
• Published
• 14
The Rise and Potential of Large Language Model Based Agents: A Survey
Paper
• 2309.07864
• Published
• 8
FreeU: Free Lunch in Diffusion U-Net
Paper
• 2309.11497
• Published
• 66
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper
• 2309.12307
• Published
• 90
RMT: Retentive Networks Meet Vision Transformers
Paper
• 2309.11523
• Published
• 34
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper
• 2403.09629
• Published
• 79