ivnle committed on
Commit 41df100 · verified · 1 Parent(s): b39bd7a

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -36,6 +36,7 @@ Naming convention: `{regime}_{config}_h{N}_{objective}[_recon-init]`
  |------------|--------|-----|------|-----|
  | `vision_base_h0_lm` | Vision base | 3.60 | Direct | 5.08 |
  | `vision_base_h0_lm_recon-init` | Vision base | 3.60 | From recon | 5.06 |
+ | `text_ctx277_h0_lm` | Text ctx277 (Truncation) | 3.60 | Direct | 5.02 |
  | `meanpool_w4s4_h0_lm_recon-init` | Meanpool w4s4 | 3.97 | From recon | 5.02 |
  | `conv1d_t250_h0_lm_recon-init` | Conv1D t250 | 3.97 | From recon | 4.96 |
 
@@ -43,6 +44,7 @@ Naming convention: `{regime}_{config}_h{N}_{objective}[_recon-init]`
 
  - **Architecture**: DeepSeek-OCR with vision encoder
  - **Vision checkpoints**: Trained encoder, 768x768 (base)
+ - **Text checkpoints**: Truncation baseline (no vision encoder), context=277 tokens
  - **Meanpool checkpoints**: Frozen encoder, window=4, stride=4
  - **Conv1D checkpoints**: Trained hierarchical encoder, target=250 tokens
  - **Dataset**: 510k samples from FineWiki
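
The naming convention in the hunk headers, `{regime}_{config}_h{N}_{objective}[_recon-init]`, can be split mechanically into its fields. The following is a minimal sketch, not part of the repository: the `CheckpointName` type and the `parse_checkpoint_name` helper are hypothetical names, and the pattern assumes that no individual field contains an underscore, which holds for the checkpoint names listed in the table. The diff does not say what `h{N}` means, so the sketch only extracts the integer.

```python
import re
from typing import NamedTuple

# Assumed pattern for `{regime}_{config}_h{N}_{objective}[_recon-init]`.
# Assumption: each field is lowercase alphanumeric with no underscores.
NAME_RE = re.compile(
    r"^(?P<regime>[a-z0-9]+)"
    r"_(?P<config>[a-z0-9]+)"
    r"_h(?P<h>\d+)"
    r"_(?P<objective>[a-z0-9]+)"
    r"(?P<recon>_recon-init)?$"
)


class CheckpointName(NamedTuple):
    """Hypothetical container for the parsed fields."""
    regime: str
    config: str
    h: int            # the N in h{N}; its meaning is not stated in the diff
    objective: str
    recon_init: bool  # True when the optional `_recon-init` suffix is present


def parse_checkpoint_name(name: str) -> CheckpointName:
    """Split a checkpoint name into its fields, or raise ValueError."""
    match = NAME_RE.match(name)
    if match is None:
        raise ValueError(f"name does not follow the convention: {name!r}")
    return CheckpointName(
        regime=match["regime"],
        config=match["config"],
        h=int(match["h"]),
        objective=match["objective"],
        recon_init=match["recon"] is not None,
    )


if __name__ == "__main__":
    for name in (
        "vision_base_h0_lm",
        "text_ctx277_h0_lm",
        "meanpool_w4s4_h0_lm_recon-init",
    ):
        print(parse_checkpoint_name(name))
```

Under these assumptions, the row added in this commit, `text_ctx277_h0_lm`, parses to regime `text`, config `ctx277`, `h` of 0, objective `lm`, and no recon initialization.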