Raushan Turganbay's picture

Raushan Turganbay

RaushanTurganbay

·

zucchini-nlp

AI & ML interests

Generation and Multimodality

Recent Activity

new activity about 5 hours ago

llava-hf/llava-1.5-7b-hf:[Question] Why does LLaVA evaluation assert batch_size == 1 for benchmark inference?

updated a bucket 5 days ago

RaushanTurganbay/testing

published a bucket 5 days ago

RaushanTurganbay/testing

View all activity

Organizations

New activity in llava-hf/llava-1.5-7b-hf about 5 hours ago

[Question] Why does LLaVA evaluation assert batch_size == 1 for benchmark inference?

#62 opened 1 day ago by

updated a bucket 5 days ago

RaushanTurganbay/testing

published a bucket 5 days ago

RaushanTurganbay/testing

updated a model 8 days ago

RaushanTurganbay/kimi-two-layers

21B • Updated 8 days ago • 762

published a model 15 days ago

RaushanTurganbay/kimi-two-layers

21B • Updated 8 days ago • 762

upvoted an article 19 days ago

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai

•

19 days ago

• 38

upvoted a paper about 1 month ago

EXAONE 4.5 Technical Report

Paper • 2604.08644 • Published Apr 9 • 72

upvoted an article about 1 month ago

Article

Building a Fast Multilingual OCR Model with Synthetic Data

nvidia

•

Apr 17

• 33

updated a model about 1 month ago

RaushanTurganbay/audio-flamingo-3-hf-lora-finetuned

Text Generation • Updated Apr 17 • 2

upvoted an article about 1 month ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 71

upvoted a collection about 1 month ago

EXAONE 4.5

LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 43

upvoted a paper about 2 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

upvoted a paper 2 months ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 22

upvoted an article 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 157

updated 2 models 2 months ago

deepseek-community/Janus-Pro-7B

Any-to-Any • 7B • Updated Mar 18 • 434 • 3

deepseek-community/Janus-Pro-1B

Any-to-Any • 2B • Updated Mar 18 • 109k • 14

New activity in OpenGVLab/InternVL2-8B 2 months ago

Compatibility with v5

#23 opened 2 months ago by

RaushanTurganbay

New activity in OpenGVLab/InternVL2-1B 2 months ago

Compatibility with v5

#10 opened 2 months ago by

RaushanTurganbay

New activity in OpenGVLab/InternVL2-2B 2 months ago

Compatibility with v5

#7 opened 2 months ago by

RaushanTurganbay

New activity in OpenGVLab/InternViT-300M-448px-V2_5 2 months ago

Compatibility with v5

#5 opened 2 months ago by

RaushanTurganbay