Manuel Romero's picture

In a Training Loop 🔄

Manuel Romero PRO

mrm8488

·

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

upvoted a paper 2 days ago

Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context

upvoted a paper 2 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

upvoted an article 4 days ago

**LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric**

View all activity

Organizations

upvoted 2 papers 2 days ago

Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context

Paper • 2603.15653 • Published 15 days ago • 9

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 3 days ago • 45

upvoted an article 4 days ago

Article

LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric

5 days ago

•

11

liked a dataset 6 days ago

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9, 2025 • 1.2M • 10.3k • 214

upvoted a paper 10 days ago

Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards

Paper • 2603.09117 • Published 12 days ago • 9

liked a model 10 days ago

principled-intelligence/Qwen3.5-2B-text-only

Text Generation • 2B • Updated 10 days ago • 260 • 5

upvoted a collection 10 days ago

Qwen3.5-text-only

Qwen3.5-text-only • 4 items • Updated 10 days ago • 11

upvoted an article 11 days ago

Article

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?

about 1 month ago

•

19

liked a model 12 days ago

UKPLab/GritHopper-7B

Sentence Similarity • Updated Feb 2 • 24 • 7

liked 4 datasets 15 days ago

google-research-datasets/paws-x

Viewer • Updated Jan 4, 2024 • 374k • 6.6k • 50

dennlinger/eur-lex-sum

Updated Sep 11, 2024 • 1.09k • 47

HuggingFaceFW/finepdfs-edu

Viewer • Updated Nov 11, 2025 • 49.5M • 6.85k • 84

PleIAs/common_corpus

Viewer • Updated about 1 month ago • 69.9k • 185k • 387

liked 2 datasets 23 days ago

nvidia/Nemotron-Terminal-Synthetic-Tasks

Updated 27 days ago • 477 • 15

nvidia/Nemotron-Terminal-Corpus

Viewer • Updated 23 days ago • 366k • 3.04k • 101

upvoted a paper 23 days ago

Diffusion-Pretrained Dense and Contextual Embeddings

Paper • 2602.11151 • Published Feb 11 • 22

liked a dataset 27 days ago

Metacreation/GigaMIDI

Viewer • Updated Feb 6 • 3.44M • 761 • 35

upvoted a collection 29 days ago

GPT 5 Codex

Distilled models and datasets for GPT 5 Codex • 7 items • Updated Dec 20, 2025 • 5

liked a dataset 29 days ago

TeichAI/claude-4.5-opus-high-reasoning-250x

Viewer • Updated Nov 28, 2025 • 250 • 3.33k • 340

upvoted an article about 1 month ago

Article

Forge: Scalable Agent RL Framework and Algorithm

Feb 13

•

139