introvoyz041 commited on
Commit
015a757
·
verified ·
1 Parent(s): 41b44d3

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +54 -0
README.md ADDED
@@ -0,0 +1,54 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ tags:
5
+ - mlx
6
+ - apple-silicon
7
+ - liquidai
8
+ - lfm2
9
+ - moe
10
+ - transformer
11
+ - long-context
12
+ - instruct
13
+ - quantized
14
+ - 8bit
15
+ - Mixture of Experts
16
+ - coding
17
+ - mlx
18
+ - mlx-my-repo
19
+ pipeline_tag: text-generation
20
+ library_name: mlx
21
+ license: other
22
+ license_name: lfm1.0
23
+ license_link: LICENSE
24
+ base_model: mlx-community/LFM2-8B-A1B-8bit-MLX
25
+ model-index:
26
+ - name: LFM2-8B-A1B — MLX (Apple Silicon), **8-bit** (with guidance on MoE + RAM planning)
27
+ results: []
28
+ ---
29
+
30
+ # introvoyz041/LFM2-8B-A1B-8bit-MLX-mlx-8Bit
31
+
32
+ The Model [introvoyz041/LFM2-8B-A1B-8bit-MLX-mlx-8Bit](https://huggingface.co/introvoyz041/LFM2-8B-A1B-8bit-MLX-mlx-8Bit) was converted to MLX format from [mlx-community/LFM2-8B-A1B-8bit-MLX](https://huggingface.co/mlx-community/LFM2-8B-A1B-8bit-MLX) using mlx-lm version **0.28.3**.
33
+
34
+ ## Use with mlx
35
+
36
+ ```bash
37
+ pip install mlx-lm
38
+ ```
39
+
40
+ ```python
41
+ from mlx_lm import load, generate
42
+
43
+ model, tokenizer = load("introvoyz041/LFM2-8B-A1B-8bit-MLX-mlx-8Bit")
44
+
45
+ prompt="hello"
46
+
47
+ if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
48
+ messages = [{"role": "user", "content": prompt}]
49
+ prompt = tokenizer.apply_chat_template(
50
+ messages, tokenize=False, add_generation_prompt=True
51
+ )
52
+
53
+ response = generate(model, tokenizer, prompt=prompt, verbose=True)
54
+ ```