An open-source benchmark for enterprise use cases
Forecast evaluation benchmark
Convert document images to HTML with Docling
Generate and benchmark machine learning models with ease
Develop and run interactive code notebooks with JupyterLab
Configurable Generalist Agent, a leader on the AppWorld benchmark
In-browser tool calling with IBM Granite-4.0
Evaluating Autonomous AI Agents for Industry 4.0 Tasks