GtZeng's picture

GtZeng PRO

chaoscodes

·

AI & ML interests

None yet

Recent Activity

updated a model about 3 hours ago

AgentCPT/qwen-8b-agent-sft

updated a model about 3 hours ago

AgentCPT/qwen-4b-agent-sft

published a model about 3 hours ago

AgentCPT/qwen-8b-agent-sft

View all activity

Organizations

commented a paper 7 months ago

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Paper • 2505.23604 • Published May 29, 2025 • 23 •

New activity in Satori-reasoning/Satori-7B-Round2 11 months ago

Add Github link, Transformers library, pipeline tag

#1 opened 11 months ago by

commented a paper 11 months ago

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Paper • 2502.02508 • Published Feb 4, 2025 • 22 •

New activity in TinyLlama/TinyLlama_v1.1 over 1 year ago

Chat Template

#2 opened over 1 year ago by

Discord Server

#1 opened over 1 year ago by

New activity in HuggingFaceFW/fineweb over 1 year ago

Sample dataset?

#23 opened over 1 year ago by

New activity in chaoscodes/refined_eval_tinyllama over 1 year ago

Upload folder using huggingface_hub

#1 opened over 1 year ago by

New activity in TinyLlama/TinyLlama-1.1B-intermediate-step-480k-1T almost 2 years ago

Adding `safetensors` variant of this model

#2 opened about 2 years ago by

New activity in TinyLlama/TinyLlama-1.1B-intermediate-step-715k-1.5T almost 2 years ago

Adding `safetensors` variant of this model

#4 opened about 2 years ago by

New activity in TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T about 2 years ago

Adding `safetensors` variant of this model

#2 opened about 2 years ago by

New activity in TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T about 2 years ago

Adding `safetensors` variant of this model

#1 opened about 2 years ago by

why the config point to llama-7b ?

#2 opened about 2 years ago by