---
language:
- en
license: mit
base_model: microsoft/Phi-3-mini-4k-instruct
tags:
- phi3
- qlora
- industrial
- anomaly-detection
- iot
- edge-ai
- fine-tuned
datasets:
- ssam17/Edge-Industrial-Anomaly-Phi3
model-index:
- name: Phi-3-Industrial-Anomaly
  results:
  - task:
      type: text-generation
    metrics:
    - name: Eval Loss
      type: loss
      value: 2.3992
    - name: Token Accuracy
      type: accuracy
      value: 0.5451
---

# 🏭 Phi-3 Mini Fine-tuned for Industrial Anomaly Detection

<div align="center">

[Base Model: Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) · [QLoRA Paper](https://arxiv.org/abs/2305.14314) · [MIT License](LICENSE)

</div>

A fine-tuned version of Microsoft's Phi-3-mini-4k-instruct, trained with **QLoRA (Quantized Low-Rank Adaptation)** for industrial IoT anomaly detection and interpretable diagnostics.

## 📋 Model Description

This model specializes in analyzing industrial sensor data and network telemetry to detect anomalies, identify potential security threats, and provide actionable insights for industrial automation systems.

**Key Features:**
- 🎯 Industrial anomaly classification
- 🔒 Security threat detection
- 📊 Sensor data interpretation
- 🚨 Real-time diagnostic recommendations
- 💡 Explainable AI responses

## 🔧 Training Details

### Base Model
- **Architecture**: Phi-3-mini-4k-instruct (3.8B parameters)
- **Context Length**: 4096 tokens
- **Quantization**: 4-bit NF4 with double quantization

### Fine-tuning Configuration
- **Method**: QLoRA (Quantized Low-Rank Adaptation)
- **LoRA Rank**: 32
- **LoRA Alpha**: 64
- **Target Modules**: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
- **Dropout**: 0.05

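For reference, the quantization and adapter settings above could be expressed with `bitsandbytes` and `peft` roughly as follows. This is a sketch of the configuration, not the actual training script (which is not included in this repo):

```python
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

# 4-bit NF4 quantization with double quantization, as listed above
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# LoRA adapters: rank 32, alpha 64, dropout 0.05, applied to all
# attention and MLP projections
lora_config = LoraConfig(
    r=32,
    lora_alpha=64,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)
```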
### Training Parameters
- **Epochs**: 5
- **Batch Size**: 4 per device
- **Gradient Accumulation**: 4 steps (effective batch size: 16)
- **Learning Rate**: 2e-5
- **Optimizer**: paged_adamw_8bit
- **Scheduler**: Cosine with warmup (100 steps)
- **Mixed Precision**: BF16

### Dataset
- **Name**: [Edge-Industrial-Anomaly-Phi3](https://huggingface.co/datasets/ssam17/Edge-Industrial-Anomaly-Phi3)
- **Training Samples**: 10,749
- **Evaluation Samples**: 1,195
- **Format**: Conversational (user/assistant format)

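As a sanity check on the numbers above, the optimizer-step count implied by this setup can be worked out (assuming a single device and one training example per sample, with no packing):

```python
# Effective batch size: per-device batch * gradient accumulation steps
per_device_batch = 4
grad_accum_steps = 4
effective_batch = per_device_batch * grad_accum_steps  # 16

# Optimizer steps per epoch over the 10,749 training samples
train_samples = 10_749
steps_per_epoch = -(-train_samples // effective_batch)  # ceiling division -> 672

epochs = 5
total_steps = steps_per_epoch * epochs  # 3,360 steps, of which 100 are warmup
print(effective_batch, steps_per_epoch, total_steps)
```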
## 📊 Evaluation Results

| Metric | Value |
|--------|-------|
| Eval Loss | 2.3992 |
| Token Accuracy | 54.51% |
| Eval Runtime | 81.12s |
| Samples/Second | 14.73 |
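The throughput row follows directly from the other two: 1,195 evaluation samples processed in 81.12 seconds:

```python
eval_samples = 1_195
eval_runtime_s = 81.12
throughput = eval_samples / eval_runtime_s
print(round(throughput, 2))  # -> 14.73 samples/second
```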

## 🚀 Usage

### Using Transformers (Recommended)

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

# Load model and tokenizer
model = AutoModelForCausalLM.from_pretrained(
    "YOUR_USERNAME/phi3-industrial-anomaly",
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True
)
tokenizer = AutoTokenizer.from_pretrained(
    "YOUR_USERNAME/phi3-industrial-anomaly",
    trust_remote_code=True
)

# Prepare input in the Phi-3 chat format
prompt = """<|user|>
Sensor Readings: Temperature: 95°C, Vibration: 5.8 m/s, Pressure: 120 kPa, Flow Rate: 6.2 L/min
<|end|>
<|assistant|>"""

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Generate response
outputs = model.generate(
    **inputs,
    max_new_tokens=150,
    temperature=0.7,
    do_sample=True,
    pad_token_id=tokenizer.eos_token_id
)

response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
```

### Using PEFT (Load Adapters Only)

```python
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer
import torch

# Load the base model and apply the LoRA adapters in one step
model = AutoPeftModelForCausalLM.from_pretrained(
    "YOUR_USERNAME/phi3-industrial-anomaly",
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True
)
tokenizer = AutoTokenizer.from_pretrained(
    "YOUR_USERNAME/phi3-industrial-anomaly",
    trust_remote_code=True
)
```

### Example Prompts

**Network Security Analysis:**
```
<|user|>
Network Telemetry: Arp.Opcode: 0.0, Icmp.Checksum: 0.0, Suspicious packet patterns detected
<|end|>
<|assistant|>
```

**Sensor Diagnostics:**
```
<|user|>
Sensor Readings: Temperature: 110°C, Vibration: 7.2 m/s, Pressure: 85 kPa, Flow Rate: 3.1 L/min
<|end|>
<|assistant|>
```
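Since the model expects this exact chat layout, a small helper keeps the special tokens consistent. `build_phi3_prompt` is a hypothetical convenience function, not part of this repo:

```python
def build_phi3_prompt(user_message: str) -> str:
    """Wrap a message in the Phi-3 chat template used in the examples above."""
    return f"<|user|>\n{user_message}\n<|end|>\n<|assistant|>"

prompt = build_phi3_prompt(
    "Sensor Readings: Temperature: 110°C, Vibration: 7.2 m/s, "
    "Pressure: 85 kPa, Flow Rate: 3.1 L/min"
)
print(prompt.splitlines()[0])  # -> <|user|>
```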

## 🎯 Use Cases

- **Industrial IoT Monitoring**: Real-time anomaly detection in manufacturing plants
- **Predictive Maintenance**: Early warning systems for equipment failure
- **Security Operations**: Network intrusion detection in OT/IT environments
- **Edge Deployment**: Lightweight inference on industrial gateways and edge devices
- **Smart Manufacturing**: Quality control and process optimization

## 🛠️ Edge Deployment

### Model Formats Available
- **PyTorch** (this repo): Full model for transformers
- **GGUF**: For llama.cpp and edge devices (see releases)
- **ONNX**: For optimized inference (convert with Optimum)

### Hardware Requirements
- **GPU Inference**: 8GB+ VRAM (with quantization)
- **CPU Inference**: 16GB+ RAM
- **Edge Devices**: Compatible with Jetson Nano, Raspberry Pi 5, and industrial PCs
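The memory figures above sit comfortably above the raw weight footprint. A back-of-the-envelope estimate for 3.8B parameters at 4 bits per weight (ignoring the KV cache, activations, and runtime overhead, which account for the rest of the budget):

```python
params = 3.8e9
bits_per_weight = 4

weight_bytes = params * bits_per_weight / 8   # 1.9e9 bytes
weight_gib = weight_bytes / (1024 ** 3)
print(round(weight_gib, 2))  # ~1.77 GiB for the quantized weights alone
```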

## 📈 Performance Considerations

- **Quantization**: The model uses 4-bit quantization for efficient memory usage
- **Inference Speed**: ~14.7 samples/second on NVIDIA RTX GPUs
- **Context Window**: 4096 tokens (sufficient for detailed sensor logs)
- **Generation**: Typical response time of 2-5 seconds on GPU

## ⚠️ Limitations

- The model may require further domain-specific fine-tuning for your particular industrial environment
- Performance is best when sensor data matches the format seen during training
- The evaluation token accuracy (54.51%) suggests room for improvement, e.g. with more training data or epochs
- Not suitable for safety-critical decisions without human oversight

## 📝 Version History

- **v1.0** (2026-01-06): Initial release
  - 5 epochs of QLoRA fine-tuning
  - LoRA rank 32, alpha 64
  - Trained on the Edge-Industrial-Anomaly-Phi3 dataset

## 📄 Citation

If you use this model, please cite:

```bibtex
@misc{phi3-industrial-anomaly-2026,
  author       = {Your Name},
  title        = {Phi-3 Mini Fine-tuned for Industrial Anomaly Detection},
  year         = {2026},
  publisher    = {Hugging Face},
  howpublished = {\url{https://huggingface.co/YOUR_USERNAME/phi3-industrial-anomaly}}
}
```

## 📜 License

This model is released under the MIT License. The base Phi-3 model is subject to Microsoft's [Phi-3 license](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct).

## 🙏 Acknowledgments

- **Microsoft Research**: for the Phi-3-mini-4k-instruct base model
- **Hugging Face**: for the transformers and PEFT libraries
- **Dataset**: ssam17/Edge-Industrial-Anomaly-Phi3

## 📬 Contact

For questions, issues, or collaboration opportunities, please open an issue in the repository or contact the model author.

---

<div align="center">
Built with ❤️ for Industrial AI
</div>