| model_info: | |
| name: anemll-Meta-Llama-3.2-1B-ctx2048 | |
| version: 0.1.2 | |
| description: | | |
| Demonstarates running Meta-Llama-3.2-1B on Apple Neural Engine | |
| Context length: 2048 | |
| Batch size: 128 | |
| Chunks: 2 | |
| license: MIT | |
| author: Anemll | |
| framework: Core ML | |
| language: Python | |
| parameters: | |
| context_length: 2048 | |
| batch_size: 128 | |
| lut_embeddings: none | |
| lut_ffn: 4 | |
| lut_lmhead: 4 | |
| num_chunks: 2 | |
| model_prefix: llama | |