Commit History
Merge pull request #25 from MotifTechnologies/fix/invalidate_cache_adamw b61425a unverified
TaehyunKim committed on
Invalidate AdamW tensor caches on load_state_dict [skip-build] 89b6099
draft commit for cpu_offload (#23) 10848ab unverified
Replace toy PP tests with real-model-based pipeline tests [skip-build] 67f7e11
Add correctness verification to PP tests using fully_shard [skip-build] a4d1f34
Remove correctness check from PP tests, focus on deadlock detection [skip-build] c0bbf2e
Add PP + dp_replicate deadlock regression tests [skip-build] cd587a6
Update fast path comment to reflect current behavior [skip-build] 7e33533
Update comment to reflect use_local_synchronization behavior [skip-build] 3f5cf49
Fix deadlock in construct_shard_mesh with PP + dp_replicate > 1 da7e5da
Apply pre-commit formatting (isort) [skip-build] 96b287c
Add MoE uneven shard test with mixed expert and non-expert params [skip-build] bdada12
Add uneven shard correctness test [skip-build] 1a97671
Add optimization docs and update implementation guide [skip-build] 14040eb
Update tests for MoE and parallel optimizations [skip-build] 81f49fe
Muon optimizer: expert batching, parallel caching, A2A overlap [skip-build] 0f37d63
Optimize pipeline: batched update, zero-copy scatter, prelaunch gather [skip-build] 2816b64
Cache AdamW placement grouping and tensor lists [skip-build] 8ca2492
Add torch.compile, CUDA graph, and compiled momentum [skip-build] e74d98f
Apply suggestions from code review cdaaf4f
TaehyunKim and Copilot committed on
Add mhc_attn, mhc_ffn, lambda_proj to skip_keys ba293d0
Remove verbose param_groups summary logging 24f0957
Support multi-component expert_keys (e.g. "experts.w1") 5a99e12
Extract is_expert_param() helper to consolidate expert key matching e615b1c
Include original (pre-normalize) FQN in is_muon logging 135fc66
Add info-level logging for param group classification (Muon vs AdamW) 1118752
Use component-level matching for expert_keys to avoid shared_experts collision f008017
Normalize parameter FQNs to handle torch.compile / checkpoint wrappers 95a620f
Merge pull request #17 from MotifTechnologies/optimal-ns-coefficients b220459 unverified
Apply pre-commit formatting (yapf) [skip-build] bf30b9b
Add max_iter cap and non-finite checks to _optimal_quintic [skip-build] 206b280
Apply pre-commit formatting (yapf, isort) [skip-build] aff01db
Add comment explaining _coeffs_list and Polar Express vs former NS [skip-build] abaa449
Replace hardcoded NS coefficients with analytically optimal ones [skip-build] 573242f
Refactor pipeline to async generator pattern (#16) 33929c0 unverified
Support mHC (#15) ae32572 unverified
Update arxiv URL fa059da
Support param group with various placements (#13) e2b41e5 unverified
Merge pull request #14 from MotifTechnologies/fix_bug_in_fsdp 5458c82 unverified
TaehyunKim committed on
Add built binary [skip-build] 6ec5093
github-actions[bot] committed on
Fix bug in FSDP 811726c
feat(workflow): add Slack notifications for build start, success, and failure [skip-build] (#12) 0b8d958 unverified
Merge pull request #11 from MotifTechnologies/ca1207-patch-1 53deea3 unverified
TaehyunKim committed on
Add built binary [skip-build] de5bead
github-actions[bot] committed on
Update torch-ext/optimizer/muon.py b0230e7 unverified
TaehyunKim committed on
Update torch-ext/optimizer/muon.py ff2fcfb unverified
TaehyunKim committed on
Update muon.py c16b438 unverified
TaehyunKim committed on
Merge pull request #10 from MotifTechnologies/fix_a2a_gs_assert 4f71bc9 unverified
TaehyunKim committed on