tencent/HunyuanOCR

Image-Text-to-Text
Transformers
Safetensors
multilingual
hunyuan_vl
text-generation
ocr
hunyuan
vision-language
image-to-text
1B
end-to-end
conversational
Community
21
[Bug] Hardcoded torch.bfloat16 in modeling_hunyuan_vl.py causes RuntimeError on GPUs without BF16 support (Turing architecture)

🤗 1
2
#21 opened 5 days ago by Wilt2351

vLLM fails when more than 3 concurrent connections run OCR

#20 opened 6 days ago by CHONGYOEYAT

vLLM structured output

#19 opened 6 days ago by nyust-eb210

Colab demo and testing video

🤗 1
1
#17 opened 7 days ago by ritheshSree

Questions related to inputs and outputs

3
#16 opened 9 days ago by vince62s

Running with the API is too slow

#13 opened 9 days ago by medisean

Text format preserving instructions

#12 opened 10 days ago by sirovub

Monkeypatch for the error "only one element tensors can be converted to Python scalars"

👍 🤗 2
#10 opened 10 days ago by lastmass

clean_repeated_substrings is a dirty hack

👍 3
1
#9 opened 10 days ago by PartyParrot

Local Installation Video and Testing - Step by Step

🚀 🔥 3
#4 opened 11 days ago by fahdmirzac