ivnle commited on
Commit
b39bd7a
·
verified ·
1 Parent(s): f76f705

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -28,6 +28,7 @@ Naming convention: `{regime}_{config}_h{N}_{objective}[_recon-init]`
28
  |------------|--------|-----|-----|
29
  | `vision_base_h0_recon` | Vision base | 3.60 | 1.03 |
30
  | `meanpool_w4s4_h0_recon` | Meanpool w4s4 | 3.97 | 1.04 |
 
31
 
32
  ### Language Modeling
33
 
@@ -36,12 +37,14 @@ Naming convention: `{regime}_{config}_h{N}_{objective}[_recon-init]`
36
  | `vision_base_h0_lm` | Vision base | 3.60 | Direct | 5.08 |
37
  | `vision_base_h0_lm_recon-init` | Vision base | 3.60 | From recon | 5.06 |
38
  | `meanpool_w4s4_h0_lm_recon-init` | Meanpool w4s4 | 3.97 | From recon | 5.02 |
 
39
 
40
  ## Model Details
41
 
42
  - **Architecture**: DeepSeek-OCR with vision encoder
43
  - **Vision checkpoints**: Trained encoder, 768x768 (base)
44
  - **Meanpool checkpoints**: Frozen encoder, window=4, stride=4
 
45
  - **Dataset**: 510k samples from FineWiki
46
 
47
  ## Usage
 
28
  |------------|--------|-----|-----|
29
  | `vision_base_h0_recon` | Vision base | 3.60 | 1.03 |
30
  | `meanpool_w4s4_h0_recon` | Meanpool w4s4 | 3.97 | 1.04 |
31
+ | `conv1d_t250_h0_recon` | Conv1D t250 | 3.97 | 1.00 |
32
 
33
  ### Language Modeling
34
 
 
37
  | `vision_base_h0_lm` | Vision base | 3.60 | Direct | 5.08 |
38
  | `vision_base_h0_lm_recon-init` | Vision base | 3.60 | From recon | 5.06 |
39
  | `meanpool_w4s4_h0_lm_recon-init` | Meanpool w4s4 | 3.97 | From recon | 5.02 |
40
+ | `conv1d_t250_h0_lm_recon-init` | Conv1D t250 | 3.97 | From recon | 4.96 |
41
 
42
  ## Model Details
43
 
44
  - **Architecture**: DeepSeek-OCR with vision encoder
45
  - **Vision checkpoints**: Trained encoder, 768x768 (base)
46
  - **Meanpool checkpoints**: Frozen encoder, window=4, stride=4
47
+ - **Conv1D checkpoints**: Trained hierarchical encoder, target=250 tokens
48
  - **Dataset**: 510k samples from FineWiki
49
 
50
  ## Usage