Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
25
7
4
Scott Thornton
PRO
scthornton
Follow
3zzazakl's profile picture
cmadh's profile picture
jungletsubasa's profile picture
19 followers
·
10 following
scthornton
scthornton
AI & ML interests
AI/ML Security
Recent Activity
upvoted
an
article
4 days ago
IntentGuard: Building a Production-Grade Vertical Intent Classifier for LLM Safety
reacted
to
perfecXion
's
post
with 👍
9 days ago
# IntentGuard: Open-Source Vertical Intent Classifiers for LLM Guardrails Three models published to the Hub: - [perfecXion/intentguard-finance](https://huggingface.co/perfecXion/intentguard-finance) - [perfecXion/intentguard-healthcare](https://huggingface.co/perfecXion/intentguard-healthcare) - [perfecXion/intentguard-legal](https://huggingface.co/perfecXion/intentguard-legal) DeBERTa-v3-xsmall fine-tuned for three-way classification: **allow**, **deny**, or **abstain**. ONNX + INT8 quantized, under 80MB, p99 <30ms on CPU. Margin-based thresholds (not argmax) — uncertain queries route to clarification instead of forcing a guess. **Eval results (adversarial test sets, ~470-480 examples per vertical):** | Vertical | Accuracy | Legit-Block Rate | Off-Topic-Pass Rate | |----------|----------|------------------|---------------------| | Finance | 99.6% | 0.00% | 0.00% | | Healthcare | 98.9% | 0.00% | 0.98% | | Legal | 97.9% | 0.00% | 0.50% | ```bash docker run -p 8080:8080 ghcr.io/perfecxion/intentguard:finance-latest curl -X POST http://localhost:8080/v1/classify \ -H "Content-Type: application/json" \ -d '{"messages": [{"role": "user", "content": "What are current mortgage rates?"}]}' ``` Apache 2.0. Full pipeline + Docker configs on [GitHub](https://github.com/perfecxion-ai/intentguard). Feedback welcome on domain coverage, adversarial robustness, and multilingual demand.
reacted
to
their
post
with 👀
16 days ago
# SecureCode Dataset Family Update: 2,185 Security Examples, Framework-Specific Patterns, Clean Parquet Loading Hey y'all, Quick update on the SecureCode dataset family. We've restructured things and fixed several issues: **What changed:** - The datasets are now properly split into three repos: [unified](https://huggingface.co/datasets/scthornton/securecode) (2,185), [web](https://huggingface.co/datasets/scthornton/securecode-web) (1,378), [AI/ML](https://huggingface.co/datasets/scthornton/securecode-aiml) (750) - All repos now use Parquet format -- `load_dataset()` just works, no deprecated loading scripts - SecureCode Web now includes 219 framework-specific examples (Express, Django, Spring Boot, Flask, Rails, Laravel, ASP.NET Core, FastAPI, NestJS) - Data cards have been corrected and split sizes fixed **Why it matters:** With AI-generated code accounting for 60%+ of some codebases (Checkmarx 2025), security training data is more important than ever. Every example in SecureCode is grounded in a real CVE with 4-turn conversations that mirror actual developer-AI workflows. If you're working on code generation models, I'd love to hear how you're approaching the security angle. Are there vulnerability categories or frameworks you'd like to see covered? Paper: [arxiv.org/abs/2512.18542](https://arxiv.org/abs/2512.18542)
View all activity
Organizations
scthornton
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
3 datasets
about 1 month ago
scthornton/atlas
Viewer
•
Updated
Nov 24, 2025
•
155
•
20
•
2
scthornton/securecode-aiml
Updated
Feb 10
•
44
•
3
scthornton/securecode
Preview
•
Updated
Feb 10
•
130
•
3
liked
a dataset
2 months ago
scthornton/securecode-web
Viewer
•
Updated
19 days ago
•
1.38k
•
3.77k
•
9