Spaces:

AI-Solutions-KK
/

Digitizer

Sleeping

App Files Files Community

Digitizer / README.md

AI-Solutions-KK

Update README.md

749088b unverified 5 months ago

preview code

raw

history blame contribute delete

6.76 kB

A newer version of the Streamlit SDK is available: 1.52.2

Upgrade

metadata

title: Diagram Digitizer
emoji: 📊
colorFrom: indigo
colorTo: blue
sdk: streamlit
sdk_version: 1.32.0
app_file: app.py
pinned: false

Diagram Digitizer 📊

An AI-powered web application that converts handwritten or image-based flowcharts into editable digital formats. Built with Streamlit and computer vision models.

🚀 Features

Smart Shape Detection: Automatically identifies rectangles, circles, diamonds, and other flowchart elements
OCR Text Extraction: Reads handwritten and printed text within shapes using EasyOCR
Visual Analysis: Real-time processing with visual feedback and bounding boxes
Multiple Export Formats: Download results as JSON or CSV for further editing
Interactive Interface: User-friendly web app with adjustable detection parameters
No Training Required: Uses pre-trained models for immediate results

🎯 Use Cases

Convert hand-drawn process flows to digital format
Digitize whiteboard diagrams from meetings
Extract flowchart data for documentation
Create editable versions of scanned flowcharts
Educational tools for diagram analysis

🛠️ Technology Stack

Frontend: Streamlit
Computer Vision: OpenCV
OCR: EasyOCR
Image Processing: PIL, NumPy
Data Handling: Pandas

🚀 Quick Start

Online Demo

Try the app instantly on Hugging Face Spaces: Diagram Digitizer Demo

Local Installation

Clone the repository

git clone https://github.com/your-username/diagram-digitizer.git
cd diagram-digitizer
# Diagram Digitizer 📊

[![Streamlit App](https://static.streamlit.io/badges/streamlit_badge_black_white.svg)](https://huggingface.co/spaces/your-username/diagram-digitizer)
[![Python](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)
[![License](https://img.shields.io/badge/license-MIT-green.svg)](LICENSE)

An AI-powered web application that converts handwritten or image-based flowcharts into editable digital formats. Built with Streamlit and computer vision models.

## 🚀 Features

- **Smart Shape Detection**: Automatically identifies rectangles, circles, diamonds, and other flowchart elements
- **OCR Text Extraction**: Reads handwritten and printed text within shapes using EasyOCR
- **Visual Analysis**: Real-time processing with visual feedback and bounding boxes
- **Multiple Export Formats**: Download results as JSON or CSV for further editing
- **Interactive Interface**: User-friendly web app with adjustable detection parameters
- **No Training Required**: Uses pre-trained models for immediate results

## 🎯 Use Cases

- Convert hand-drawn process flows to digital format
- Digitize whiteboard diagrams from meetings
- Extract flowchart data for documentation
- Create editable versions of scanned flowcharts
- Educational tools for diagram analysis

## 🛠️ Technology Stack

- **Frontend**: Streamlit
- **Computer Vision**: OpenCV
- **OCR**: EasyOCR
- **Image Processing**: PIL, NumPy
- **Data Handling**: Pandas

## 🚀 Quick Start

### Online Demo
Try the app instantly on Hugging Face Spaces: [Diagram Digitizer Demo](https://huggingface.co/spaces/your-username/diagram-digitizer)

### Local Installation

1. **Clone the repository**
```bash
git clone https://github.com/your-username/diagram-digitizer.git
cd diagram-digitizer

Install dependencies

pip install -r requirements.txt

Run the application

streamlit run app.py

Open your browser and navigate to http://localhost:8501

📖 How to Use

Upload Image: Drag and drop your flowchart image (JPG, PNG, BMP)
Adjust Settings: Fine-tune detection parameters in the sidebar
View Results: See detected shapes and extracted text with visual feedback
Export Data: Download results as JSON or CSV for further processing

📊 Supported Formats

Input Formats

JPG, JPEG
PNG
BMP

Output Formats

JSON (structured data with coordinates and text)
CSV (tabular format for spreadsheet analysis)

🎨 Example Results

The app detects various flowchart elements:

Rectangles: Process steps
Diamonds: Decision points
Circles/Ovals: Start/End points
Text: Labels and descriptions within shapes

🔧 Configuration

Adjustable parameters in the sidebar:

Minimum Shape Area: Filter out small noise (100-2000 pixels)
Text Confidence Threshold: OCR accuracy threshold (0.1-1.0)

🤝 Contributing

Contributions are welcome! Here's how you can help:

Fork the repository
Create a feature branch: git checkout -b feature-name
Commit changes: git commit -m "Add feature"
Push to branch: git push origin feature-name
Open a Pull Request

Development Setup

# Clone your fork
git clone https://github.com/your-username/diagram-digitizer.git
cd diagram-digitizer

# Install in development mode
pip install -r requirements.txt

# Run with hot reload
streamlit run app.py

📝 Roadmap

Support for more shape types (hexagons, parallelograms)
Arrow and connection line detection
Export to Word/PowerPoint formats
Batch processing for multiple images
Custom shape training interface
Integration with draw.io format

🐛 Known Issues

Complex overlapping shapes may not be detected accurately
Handwritten text with very low contrast might be missed
Very small shapes (< 100 pixels) are filtered out by default

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

EasyOCR - Optical Character Recognition
OpenCV - Computer Vision Library
Streamlit - Web App Framework

📧 Contact

GitHub Issues: Report bugs or request features
Hugging Face: View on HF Spaces

⭐ Star this repository if you find it helpful!

🔄 Recent Updates

v1.0.0: Initial release with basic shape and text detection
Added support for multiple image formats
Implemented JSON/CSV export functionality
Enhanced UI with statistics dashboard