Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

OpenHands Community

community
https://github.com/OpenHands/OpenHands
Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

xingyaoww  authored a paper 1 day ago
EvoClaw: Evaluating AI Agents on Continuous Software Evolution
JustinLin610  authored a paper 26 days ago
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents
JustinLin610  authored a paper about 1 month ago
SWE-Universe: Scale Real-World Verifiable Environments to Millions
View all activity

Binyuan Hui's profile pictureXingyao Wang's profile pictureGraham Neubig's profile pictureJiaxin Wen's profile pictureBowen Li's profile pictureChao Peng's profile pictureYu Su's profile pictureJiaxin Pei's profile pictureXiang Yue's profile pictureJunyang Lin's profile pictureMartial Hue's profile pictureVaibhav Tulsyan's profile pictureGuneet Singh Kohli's profile pictureBoxuan Li's profile pictureLeo's profile pictureFrank Xu's profile picture

spaces 1

Running
38

OpenHands Evaluation Benchmark

🙌

Visualize evaluation model outputs for datasets

Nov 22, 2024

models 1

OpenHandsCommunity/CodeQwen1.5-7B-OpenDevin

Text Generation • Updated May 25, 2024 • 11 • 17

datasets 7

OpenHandsCommunity/eval-output-webarena

Updated Jul 20, 2024 • 14

OpenHandsCommunity/eval-browsing-instructions

Viewer • Updated Jul 15, 2024 • 933 • 42

OpenHandsCommunity/eval-output-miniwob

Updated Jun 10, 2024 • 27

OpenHandsCommunity/SWE-bench-devin-passed

Viewer • Updated Apr 9, 2024 • 79 • 20

OpenHandsCommunity/SWE-bench-devin-full-filtered

Viewer • Updated Apr 9, 2024 • 450 • 15 • 1

OpenHandsCommunity/SWE-bench-devin-full

Viewer • Updated Apr 9, 2024 • 570 • 17

OpenHandsCommunity/Devin-SWE-bench-output

Viewer • Updated Mar 21, 2024 • 1.14k • 29
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs