AI & ML interests
Enterprise AI and ML, Foundation Models, Responsible AI
Recent Activity
Papers
IndustryAssetEQA: A Neurosymbolic Operational Intelligence System for Embodied Question Answering in Industrial Asset Maintenance
Efficient Agent Evaluation via Diversity-Guided User Simulation
Articles
-
ibm-research/ttm-research-r2
Time Series Forecasting • 855k • Updated • 22k • 6 -
ibm-research/ttm-r3
Time Series Forecasting • 1.41M • Updated • 59.9k • 4 -
ibm-research/flowstate
Time Series Forecasting • 9.07M • Updated • 33.2k • 10 -
ibm-research/patchtst-fm-r1
Time Series Forecasting • 0.3B • Updated • 17.9k • 9
-
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance
Paper • 2506.03828 • Published • 20 -
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes
Paper • 2506.03278 • Published • 7 -
ibm-research/AssetOpsBench
Viewer • Updated • 467 • 765 • 25 -
AssetOpsBench
📉 4 • Evaluating Autonomous AI Agents for Industry 4.0 Tasks
-
AssetOpsBench
🚀 19 • Generate and benchmark machine learning models with ease
-
CUGA Agent
🤖 100 • Configurable Generalist Agent, leader in AppWorld Benchmark
-
ITBench-Lite-Space
🚀 7 • Develop and run interactive code notebooks with JupyterLab
-
VAKRA Leaderboard
🏆 18 • Evaluate AI agents on multi‑hop, multi‑source enterprise tasks
-
ibm-research/granite-3.2-2b-instruct-GGUF
Text Generation • 3B • Updated • 402 • 12 -
ibm-research/granite-3.2-8b-instruct-GGUF
Text Generation • 8B • Updated • 270 • 9 -
ibm-research/granite-vision-3.2-2b-GGUF
3B • Updated • 398 • 12 -
ibm-research/granite-guardian-3.2-3b-a800m-GGUF
Text Generation • 3B • Updated • 308 • 3