Avishai Elmakies

avishai-elmakies

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion

liked a Space 2 months ago

Eliahu/Model-Atlas

authored a paper 2 months ago

Advancing Speech Understanding in Speech-Aware Language Models with GRPO

upvoted a paper about 1 month ago

DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion

Paper • 2510.20766 • Published Oct 23 • 34

upvoted a paper 2 months ago

Advancing Speech Understanding in Speech-Aware Language Models with GRPO

Paper • 2509.16990 • Published Sep 21 • 18

upvoted a paper 3 months ago

Lost in Embeddings: Information Loss in Vision-Language Models

Paper • 2509.11986 • Published Sep 15 • 27

upvoted 5 papers 4 months ago

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Paper • 2508.09983 • Published Aug 13 • 68

Speech-to-LaTeX: New Models and Datasets for Converting Spoken Equations and Sentences

Paper • 2508.03542 • Published Aug 5 • 4

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180

Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning

Paper • 2507.22565 • Published Jul 30 • 9

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 313

upvoted 2 papers 5 months ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published Jul 22 • 21

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 73

upvoted 5 papers 6 months ago

Discrete Audio Tokens: More Than a Survey!

Paper • 2506.10274 • Published Jun 12 • 32

Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games

Paper • 2506.05309 • Published Jun 5 • 16

Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation

Paper • 2506.08570 • Published Jun 10 • 33

StressTest: Can YOUR Speech LM Handle the Stress?

Paper • 2505.22765 • Published May 28 • 17

WHISTRESS: Enriching Transcriptions with Sentence Stress Detection

Paper • 2505.19103 • Published May 25 • 13

upvoted a paper 7 months ago

Fast Text-to-Audio Generation with Adversarial Post-Training

Paper • 2505.08175 • Published May 13 • 25

upvoted 4 papers 8 months ago

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23 • 29

Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models

Paper • 2504.01137 • Published Apr 1 • 21

Scaling Analysis of Interleaved Speech-Text Language Models

Paper • 2504.02398 • Published Apr 3 • 31

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published Apr 1 • 37