Transformers
Safetensors
Korean
English
nuclear
domain-adaptation
continual-pretraining
instruction-tuning
AtomicGPT
Instructions to use KAERI-MLP/AtomicGPT-gemma2-9B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use KAERI-MLP/AtomicGPT-gemma2-9B with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("KAERI-MLP/AtomicGPT-gemma2-9B", dtype="auto") - Notebooks
- Google Colab
- Kaggle
File size: 2,233 Bytes
b1bff7d dbe8ad5 b1bff7d dbe8ad5 b1bff7d dbe8ad5 b1bff7d dbe8ad5 b1bff7d dbe8ad5 b1bff7d dbe8ad5 b1bff7d dbe8ad5 b1bff7d dbe8ad5 b1bff7d dbe8ad5 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 | ---
license: gemma
language:
- ko
- en
base_model:
- google/gemma-2-9b
library_name: transformers
tags:
- nuclear
- domain-adaptation
- continual-pretraining
- instruction-tuning
- AtomicGPT
---
# AtomicGPT-Gemma2-9B
**AtomicGPT** is a large language model (LLM) specialized in the nuclear engineering domain, developed at the Korea Atomic Energy Research Institute (KAERI). Based on [Gemma2-9B](https://huggingface.co/google/gemma-2-9b), AtomicGPT deeply understands various nuclear technologies, theories, and terminology, including reactor design, radiation shielding, the nuclear fuel cycle, and nuclear safety and regulations. With this expertise, AtomicGPT delivers precise answers to technical and specialized questions in the nuclear domain.
## Model Details
| | |
|---|---|
| **Base Model** | google/gemma-2-9b |
| **Training** | Continual Pre-training (CPT) + Instruction Tuning (IT) |
| **Domain** | Nuclear Engineering |
| **Languages** | Korean, English |
| **License** | Gemma |
## Performance
| Model | Multiple-Choice (EM) | Short-Answer (F1, %) | Descriptive (1–10) |
|---|---|---|---|
| Gemma2-9B (base) | 23 | 12.16 | 3.65 |
| **AtomicGPT-Gemma2-9B (ours)** | **40** | **19.72** | **4.67** |
| GPT-4 | 48 | 31.29 | 7.70 |
All evaluations were conducted under a zero-shot setting.
## How to Use
```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
model_id = "KAERI-MLP/AtomicGPT-Gemma2-9B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
model_id,
device_map="auto",
)
model.eval()
input_text = "Query about Nuclear (Atomic Energy)"
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")
outputs = model.generate(**input_ids, max_new_tokens=1024)
print(tokenizer.decode(outputs[0]))
```
## Citation
If you use this model in your research, please cite:
```bibtex
@article{atomicgpt2026,
title={AtomicGPT: A Domain-Adapted Large Language Model for Secure On-Premise Applications in Nuclear Engineering},
journal={Nuclear Engineering and Technology},
year={2026}
}
```
## About
Developed by the MLP (Multimodal Language Processing) team at the Korea Atomic Energy Research Institute (KAERI).
|