| --- |
| license: mit |
| base_model: |
| - google/efficientnet-b0 |
| --- |
| |
|
|
| # EfficientNet-B0 Document Image Classifier |
|
|
| This is an image classification model based on **Google EfficientNet-B0**, fine-tuned to classify input images into one of the following 16 categories: |
|
|
| 1. **bar_chart** |
| 2. **bar_code** |
| 3. **chemistry_markush_structure** |
| 4. **chemistry_molecular_structure** |
| 5. **flow_chart** |
| 6. **icon** |
| 7. **line_chart** |
| 8. **logo** |
| 9. **map** |
| 10. **other** |
| 11. **pie_chart** |
| 12. **qr_code** |
| 13. **remote_sensing** |
| 14. **screenshot** |
| 15. **signature** |
| 16. **stamp** |
| |
| ## Citation |
| If you use this model in your work, please cite the following papers: |
| |
| ``` |
| @article{Tan2019EfficientNetRM, |
| title={EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks}, |
| author={Mingxing Tan and Quoc V. Le}, |
| journal={ArXiv}, |
| year={2019}, |
| volume={abs/1905.11946} |
| } |
| |
| @techreport{Docling, |
| author = {Deep Search Team}, |
| month = {8}, |
| title = {{Docling Technical Report}}, |
| url={https://arxiv.org/abs/2408.09869}, |
| eprint={2408.09869}, |
| doi = "10.48550/arXiv.2408.09869", |
| version = {1.0.0}, |
| year = {2024} |
| } |
| ``` |