File size: 2,233 Bytes
b1bff7d
 
 
 
 
 
 
 
dbe8ad5
 
 
 
 
 
 
b1bff7d
dbe8ad5
b1bff7d
dbe8ad5
b1bff7d
dbe8ad5
b1bff7d
dbe8ad5
 
 
 
 
 
 
b1bff7d
dbe8ad5
 
 
 
 
 
 
 
 
 
 
 
 
b1bff7d
 
 
dbe8ad5
b1bff7d
 
 
 
 
 
 
 
dbe8ad5
b1bff7d
 
 
dbe8ad5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
---
license: gemma
language:
- ko
- en
base_model:
- google/gemma-2-9b
library_name: transformers

tags:
- nuclear
- domain-adaptation
- continual-pretraining
- instruction-tuning
- AtomicGPT
---
# AtomicGPT-Gemma2-9B

**AtomicGPT** is a large language model (LLM) specialized in the nuclear engineering domain, developed at the Korea Atomic Energy Research Institute (KAERI). Based on [Gemma2-9B](https://huggingface.co/google/gemma-2-9b), AtomicGPT deeply understands various nuclear technologies, theories, and terminology, including reactor design, radiation shielding, the nuclear fuel cycle, and nuclear safety and regulations. With this expertise, AtomicGPT delivers precise answers to technical and specialized questions in the nuclear domain.

## Model Details

| | |
|---|---|
| **Base Model** | google/gemma-2-9b |
| **Training** | Continual Pre-training (CPT) + Instruction Tuning (IT) |
| **Domain** | Nuclear Engineering |
| **Languages** | Korean, English |
| **License** | Gemma |

## Performance

| Model | Multiple-Choice (EM) | Short-Answer (F1, %) | Descriptive (1–10) |
|---|---|---|---|
| Gemma2-9B (base) | 23 | 12.16 | 3.65 |
| **AtomicGPT-Gemma2-9B (ours)** | **40** | **19.72** | **4.67** |
| GPT-4 | 48 | 31.29 | 7.70 |

All evaluations were conducted under a zero-shot setting.

## How to Use

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "KAERI-MLP/AtomicGPT-Gemma2-9B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
)
model.eval()

input_text = "Query about Nuclear (Atomic Energy)"
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")
outputs = model.generate(**input_ids, max_new_tokens=1024)
print(tokenizer.decode(outputs[0]))
```

## Citation

If you use this model in your research, please cite:

```bibtex
@article{atomicgpt2026,
  title={AtomicGPT: A Domain-Adapted Large Language Model for Secure On-Premise Applications in Nuclear Engineering},
  journal={Nuclear Engineering and Technology},
  year={2026}
}
```

## About

Developed by the MLP (Multimodal Language Processing) team at the Korea Atomic Energy Research Institute (KAERI).