moon aaa

moon005

AI & ML interests

None yet

Recent Activity

reacted to SeaWolf-AI's post with ❤️ about 23 hours ago

🚀 Introducing MARL — Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning Now available on PyPI · GitHub · ClawHub · HuggingFace AI models sense they could be wrong, but they can't actually fix what's broken. 🤗 Live A/B test: https://huggingface.co/spaces/VIDraft/MARL We evaluated 9 SOTA models (GPT-5.2, Claude Opus 4.6, Gemini 3 Pro, etc.) across 1,800 assessments in FINAL Bench and found a 39.2%p gap between "recognizing potential errors (MA=0.694)" and "actually finding and fixing them (ER=0.302)." MARL (Model-Agnostic Runtime Middleware for LLMs) was built to close this metacognitive gap. It decomposes a single LLM call into a 5-stage expert pipeline (Hypothesis → Solver → Auditor → Adversarial Verifier → Synthesizer), transforming "answer in one shot" into "think, doubt, correct, and rewrite." No weight modification — works instantly with GPT-5.4, Claude, Gemini, Llama, or any OpenAI API-compatible LLM by changing one line: base_url. Ships with 9 domain-specific emergence engines (invention, pharma, genomics, chemistry, ecology, law, and more — 5,538 expert data items) activated by a simple tag like model="gpt-5.4::pharma". pip install marl-middleware MARL is also officially registered on ClawHub, the skill marketplace of OpenClaw — an AI agent platform with 260K+ developers and 3,200+ skills. It's the first middleware in the Reasoning Enhancement category. One command — clawhub install marl-middleware — gives your AI agent a metacognition upgrade. 📝 Technical deep dive: https://huggingface.co/blog/FINAL-Bench/marl-middleware 📦 PyPI: https://pypi.org/project/marl-middleware/ 🐙 GitHub: https://github.com/Vidraft/MARL 🦀 ClawHub: https://clawhub.ai/Cutechicken99/marl-middleware #MARL #LLM #Hallucination #Metacognition #MultiAgent #AIMiddleware #FINALBench #OpenClaw #ClawHub #PyPI #AGI #HuggingFace #ReasoningAI #SelfCorrection #GlassBoxAI

liked a model 6 days ago

unsloth/Qwen3.5-2B-GGUF

liked a model 27 days ago

unsloth/gemma-3-4b-it-GGUF

View all activity

Organizations

None yet

reacted to SeaWolf-AI's post with ❤️ about 23 hours ago

Post

5363

🚀 Introducing MARL — Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Now available on PyPI · GitHub · ClawHub · HuggingFace
AI models sense they could be wrong, but they can't actually fix what's broken.

🤗 Live A/B test: VIDraft/MARL

We evaluated 9 SOTA models (GPT-5.2, Claude Opus 4.6, Gemini 3 Pro, etc.) across 1,800 assessments in FINAL Bench and found a 39.2%p gap between "recognizing potential errors (MA=0.694)" and "actually finding and fixing them (ER=0.302)."

MARL (Model-Agnostic Runtime Middleware for LLMs) was built to close this metacognitive gap. It decomposes a single LLM call into a 5-stage expert pipeline (Hypothesis → Solver → Auditor → Adversarial Verifier → Synthesizer), transforming "answer in one shot" into "think, doubt, correct, and rewrite."

No weight modification — works instantly with GPT-5.4, Claude, Gemini, Llama, or any OpenAI API-compatible LLM by changing one line: base_url. Ships with 9 domain-specific emergence engines (invention, pharma, genomics, chemistry, ecology, law, and more — 5,538 expert data items) activated by a simple tag like model="gpt-5.4::pharma".

pip install marl-middleware

MARL is also officially registered on ClawHub, the skill marketplace of OpenClaw — an AI agent platform with 260K+ developers and 3,200+ skills. It's the first middleware in the Reasoning Enhancement category. One command — clawhub install marl-middleware — gives your AI agent a metacognition upgrade.

📝 Technical deep dive: https://huggingface.co/blog/FINAL-Bench/marl-middleware
📦 PyPI: https://pypi.org/project/marl-middleware/
🐙 GitHub: https://github.com/Vidraft/MARL
🦀 ClawHub: https://clawhub.ai/Cutechicken99/marl-middleware

#MARL #LLM #Hallucination #Metacognition #MultiAgent #AIMiddleware #FINALBench #OpenClaw #ClawHub #PyPI #AGI #HuggingFace #ReasoningAI #SelfCorrection #GlassBoxAI

liked a model 6 days ago

unsloth/Qwen3.5-2B-GGUF

Image-Text-to-Text • 2B • Updated 8 days ago • 138k • 66

liked a model 27 days ago

unsloth/gemma-3-4b-it-GGUF

Image-Text-to-Text • 4B • Updated Aug 14, 2025 • 90.1k • 180

upvoted a paper about 2 months ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 112

liked a model 2 months ago

lightx2v/Qwen-Image-2512-Lightning

Text-to-Image • Updated Jan 15 • 82.9k • 191

reacted to AdinaY's post with ❤️ 2 months ago

Post

1866

Chinese open source AI in December 2025 was about the stack coming together: open, end to end, and ready to ship 🔥

https://huggingface.co/collections/zh-ai-community/december-2025-china-open-source-highlights

✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash

✨ Multimodal reasoning is now default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- Bytedance: Dolphin-v2

✨ Image & video: editable assets and real workflows
- Qwen-Image-Layered / Image-2512
- Meituan: LongCat-Image & Image Edit
- AIDC: Ovis-Image-7B
- Live-Avatar / LongCat-Video-Avatar
- HY-WorldPlay / RealVideo

✨ Audio goes edge ready
- GLM-ASR-Nano / Fun-ASR-Nano
- GLM-TTS / VoxCPM1.5
- CosyVoice 0.5B

✨ The quiet backbone: data & infrastructure
- Finch (FinWorkBench)
- Tencent ARC: TimeLens-100K
- BIGAI: TongSIM-Asset
- MiniMax: VTP-Base

✨ Also congrats on Minimax and Z.ai announced their IPOs and Moonshot announced a new $500M funding round 🔥

Like everyone else, I was OOO at the end of December, so feel free to share (in comments or PR) any I missed in this list!

liked a model 2 months ago

Wuli-art/Qwen-Image-2512-Turbo-LoRA

Text-to-Image • Updated Jan 8 • 9.13k • 207

New activity in unsloth/Qwen-Image-2512-GGUF 2 months ago

colab notebook link to test the model

#6 opened 2 months ago by

moon005

liked a model 2 months ago

unsloth/Qwen-Image-2512-GGUF

Text-to-Image • 20B • Updated Jan 6 • 43.4k • 314

New activity in unsloth/Qwen-Image-2512-GGUF 2 months ago

colab / kaggle

#5 opened 2 months ago by

moon005

if your getting very blurry outputs

#2 opened 2 months ago by

realrebelai

reacted to mahimairaja's post with 🚀 2 months ago

Post

4785

Happy New Years 2026!

For next 365 days I will be commit to work on:

- Document AI and OCR Automations
- Voice Agents
- Long Running Tasks - Durable Agents

1 reply

reacted to projectlosangeles's post with ❤️ 2 months ago

Post

2137

🔥Project Los Angeles is proud to announce the release of midisim 🔥

midisim is a SOTA Python package for calculating, searching and analyzing MIDI-to-MIDI similarity at speed and scale!

projectlosangeles/midisim
projectlosangeles/midisim-embeddings
projectlosangeles/midisim-samples

If you like midisim, please ❤️

Project Los Angeles
Tegridy Code 2025

New activity in tencent/WeDLM-8B-Instruct 2 months ago

llama cpp

🔥 4

#8 opened 2 months ago by

moon005

liked a model 2 months ago

tencent/WeDLM-8B-Instruct

Text Generation • 8B • Updated Jan 1 • 2.08k • 311

liked 3 models 3 months ago

reacted to sequelbox's post with 🔥 3 months ago

Post

3042

Two new releases today!

Firstly, our new Raiden-Mini dataset, powered by DeepSeek's newest deepseek-ai/DeepSeek-V3.2-Speciale model!
- A V3.2-Speciale reasoning showcase: the Raiden prompts test the model's creative, analytic, and general reasoning skills!
- HEAD TO HEAD: a comparison subset pits V3.2-Speciale against V3.2 with the same prompts, providing a direct look at each model's advantages!

Get the new Raiden-Mini dataset: sequelbox/Raiden-Mini-DeepSeek-V3.2-Speciale

On the model side, we've also brought Shining Valiant 3 to Ministral 3!
- Science-reasoning: sequelbox/Celestia3-DeepSeek-R1-0528 for physics, biology, chemistry, compsci, astronomy, Earth science, and information theory.
- AI to build AI: the sequelbox/Mitakihara-DeepSeek-R1-0528 dataset for high-quality reasoning performance on AI, MLOps, math and CUDA, complex adaptive and agentic systems, cognition, logic, linguistics, simulation, knowledge management, and more!
- Creative reasoning and general chat performance supplemented with sequelbox/Raiden-DeepSeek-R1

Get the newest SV3: ValiantLabs/Ministral-3-14B-Reasoning-2512-ShiningValiant3

Esper 3.1 is available for Ministral 3 as well: ValiantLabs/Ministral-3-14B-Reasoning-2512-Esper3.1

We're working hard on our next Big New Release, coming out in the next few weeks :)

Help support our releases, donations used for models and datasets: sequelbox/SupportOpenSource

Open source matters. Fight for it with us.

with love and friendship,
allegra

1 reply

reacted to branikita's post with 🚀 4 months ago

Post

3279

Proud to share the results of our engineering team’s recent work at

Robonine :

• Together, we applied advanced topology optimization to redesign critical brackets of the manipulator, achieving a 57–76% reduction in structural deflection.

• Our updated model also demonstrated a major stress decrease — from 93 MPa down to 25 MPa — all while staying within the allowed weight increase.

• Although we didn’t fully reach the target tip deviation of 0.3 mm (best achieved: 0.41 mm), the project gave us valuable insights and a solid foundation for the next design iteration.

moon aaa

AI & ML interests

Recent Activity

Organizations

moon005's activity

colab notebook link to test the model

colab / kaggle

if your getting very blurry outputs

llama cpp