A newer version of the Gradio SDK is available:
6.2.0
title: SpeakEdge - AI Sales Communication Coach
emoji: ๐ฏ
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
license: mit
hardware: zero-gpu
๐ฏ SpeakEdge - Professional Communication Practice Platform
Master your sales conversations with AI-powered roleplay and comprehensive feedback
SpeakEdge is an advanced platform designed to help sales professionals practice and perfect their communication skills through realistic AI-powered roleplay scenarios. Get detailed feedback on your structure, delivery, and listening skills.
โจ Features
๐ญ Realistic Roleplay Scenarios
Practice with 9 different sales situations:
- Cold call to a big shot CEO
- Cold call to the secretary of a big shot CEO
- Inbound customer discovery call
- Demo run through with a customer
- Negotiating pricing with a customer who wants unreasonable discounts
- Renewal or churn reduction with an angry customer
- Upselling to an existing customer
- Sharing update about a good week with your Head of Sales
- Sharing update about a poor week with your Head of Sales
๐ค AI-Powered Conversation
- Speech-to-Text (STT): Whisper large-v3 for accurate transcription
- Text-to-Speech (TTS): Parler-TTS with accent customization
- LLM: Mistral-7B-Instruct for dynamic, context-aware responses
๐ Comprehensive Feedback Analysis
Structure and Clarity
- Logical Flow: Ideas follow coherent sequence
- Simplicity of Language: Avoids jargon and complexity
- Conciseness: Eliminates unnecessary verbosity
Delivery and Presence
- Pace & Pauses: Optimal speaking speed (WPM)
- Confidence & Projection: Strong, definitive language
- Energy Level: Appropriate intensity for context
Listening and Interaction
- Active Listening: Questions and curiosity
- Acknowledgment: Validates others' points
- Reflection: Paraphrases and mirrors
๐ Advanced Metrics
- Words Per Minute (WPM)
- Filler words count and percentage
- Weak words detection
- Repetition analysis
- Unique AI insights (e.g., "23% of sentences started with 'And'")
๐จ Beautiful Dashboard
Professional, visual feedback dashboard with:
- Score breakdowns
- Color-coded performance indicators
- Actionable recommendations
- Unique pattern recognition
๐ Quick Start
Prerequisites
- Python 3.9+
- CUDA-capable GPU (recommended) or CPU
- Hugging Face account with token
Installation
- Clone the repository
git clone https://github.com/yourusername/SpeakEdge.git
cd SpeakEdge
- Install dependencies
pip install -r requirements.txt
- Set up Hugging Face token
export HF_TOKEN=your_hugging_face_token_here
Or create a .env file:
HF_TOKEN=your_hugging_face_token_here
- Run the application
python app.py
The app will launch on http://localhost:7860
๐ Hugging Face Spaces Deployment
This application is optimized for Hugging Face Spaces with Zero GPU (Dynamic H200 GPU Allocation).
Deploy to HF Spaces
Create a new Space
- Go to Hugging Face Spaces
- Click "Create new Space"
- Choose Gradio as the SDK
- Enable GPU (Zero GPU)
Upload files
- Upload all Python files (
app.py,models.py,scenarios.py,feedback_analyzer.py,dashboard.py,config.py) - Upload
requirements.txt - Copy content from
README_HF_SPACE.mdto the Space's README.md
- Upload all Python files (
Set secrets (optional)
- Go to Space settings
- Add
HF_TOKENas a secret (optional but recommended for better rate limits)
Configure Space
- The app will automatically start
- Zero GPU will dynamically allocate H200 when needed
๐ฎ How to Use
- Select Scenario: Choose the sales situation you want to practice
- Customize Bot:
- Select accent (American, British, Australian, Indian, Neutral)
- Choose personality (Professional, Friendly, Skeptical, etc.)
- Give your bot a name
- Start Roleplay: Click "Start Roleplay" to begin
- Engage: Speak naturally using your microphone
- Interact: The AI bot will respond with voice and text
- End Session: Click "End Roleplay" when finished
- Review Feedback: Analyze your comprehensive performance dashboard
๐ Project Structure
SpeakEdge/
โโโ app.py # Main Gradio application
โโโ models.py # STT, TTS, LLM model management
โโโ scenarios.py # Scenario definitions and prompts
โโโ feedback_analyzer.py # Feedback analysis logic
โโโ dashboard.py # HTML dashboard generation
โโโ config.py # Configuration settings
โโโ requirements.txt # Python dependencies
โโโ README.md # This file
โโโ README_HF_SPACE.md # README for Hugging Face Spaces
๐ ๏ธ Technical Details
Models Used
Speech-to-Text
- Model:
openai/whisper-large-v3 - Accuracy: High
- Latency: ~2-3 seconds per audio clip
Text-to-Speech
- Model:
parler-tts/parler-tts-mini-v1 - Features: Multi-accent support
- Quality: Natural-sounding voices
Language Model
- Model:
mistralai/Mistral-7B-Instruct-v0.3 - Purpose: Dynamic conversation & feedback generation
- Context: Last 6 messages for coherence
- Advantage: No approval needed, excellent performance
Performance Optimization
- Zero GPU Integration: Uses
@spaces.GPUdecorator for efficient GPU allocation - Lazy Loading: Models load only when needed
- Mixed Precision: FP16 for faster inference on GPU
- Model Caching: Reuses loaded models across requests
๐ฏ Feedback Categories
Structure and Clarity (3 sub-aspects)
Analyzes logical flow, language simplicity, and conciseness of communication.
Delivery and Presence (3 sub-aspects)
Evaluates pace, confidence, and energy in speech delivery.
Listening and Interaction (3 sub-aspects)
Assesses active listening, acknowledgment, and adaptive responses.
๐ก Unique Features
- Pattern Recognition: Identifies sentence starter patterns
- Repetition Detection: Highlights overused words
- Buzzword Tracking: Warns about jargon overuse
- Contextual Insights: AI-generated observations specific to your conversation
- Scenario-Specific Feedback: Tailored to the sales situation
๐ง Configuration
Edit config.py to customize:
- Model selections
- Generation parameters
- Audio settings
- Feedback thresholds
๐ค Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
๐ License
This project is licensed under the MIT License.
๐ Acknowledgments
- OpenAI Whisper for STT
- Parler-TTS for multi-accent TTS
- Mistral AI for Mistral-7B-Instruct model
- Hugging Face for hosting and Zero GPU infrastructure
๐ Support
For issues, questions, or suggestions, please open an issue on GitHub.
Made with โค๏ธ for sales professionals looking to elevate their communication game