File size: 565 Bytes
9e56f88 62682fa 9e56f88 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 | model_info:
name: anemll-Meta-Llama-3.2-1B-LUT8-ctx512
version: 0.1.1
description: |
Demonstarates running Meta-Llama-3.2-1B on Apple Neural Engine
Context length: 512
Batch size: 64
Chunks: 2
license: MIT
author: Anemll
framework: Core ML
language: Python
parameters:
context_length: 512
batch_size: 64
lut_embeddings: 8
lut_ffn: 8
lut_lmhead: 8
num_chunks: 2
model_prefix: llama
embeddings: llama_embeddings_lut8.mlmodelc
lm_head: llama_lm_head_lut8.mlmodelc
ffn: llama_FFN_PF_lut8.mlmodelc
|