	---
	license: gemma
	language:
	- ko
	- en
	base_model:
	- google/gemma-2-9b
	library_name: transformers

	tags:
	- nuclear
	- domain-adaptation
	- continual-pretraining
	- instruction-tuning
	- AtomicGPT
	---
	# AtomicGPT-Gemma2-9B

	AtomicGPT is a large language model (LLM) specialized for nuclear engineering, developed at the Korea Atomic Energy Research Institute (KAERI). Built on [Gemma2-9B](https://huggingface.co/google/gemma-2-9b), it is adapted to the technologies, theories, and terminology of the field, including reactor design, radiation shielding, the nuclear fuel cycle, and nuclear safety and regulation. It is intended to answer technical and specialized questions in the nuclear domain more accurately than its base model.

	## Model Details

	| | |
	|---|---|
	| Base Model | google/gemma-2-9b |
	| Training | Continual Pre-training (CPT) + Instruction Tuning (IT) |
	| Domain | Nuclear Engineering |
	| Languages | Korean, English |
	| License | Gemma |

	## Performance

	| Model | Multiple-Choice (EM) | Short-Answer (F1, %) | Descriptive (1–10) |
	|---|---|---|---|
	| Gemma2-9B (base) | 23 | 12.16 | 3.65 |
	| AtomicGPT-Gemma2-9B (ours) | 40 | 19.72 | 4.67 |
	| GPT-4 | 48 | 31.29 | 7.70 |

	All evaluations were conducted under a zero-shot setting.
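
The table reports exact match (EM) for multiple-choice questions and F1 for short answers, but this card does not say how the scores were computed. The sketch below shows one common way to compute both, assuming SQuAD-style normalization and whitespace tokenization. The function names `normalize`, `exact_match`, and `token_f1` are illustrative and are not taken from the AtomicGPT evaluation code.

```python
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop ASCII punctuation, and collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a prediction and a reference answer."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Whitespace tokenization is a simplification here. Korean answers would likely need a morphological or subword tokenizer for the F1 score to be meaningful.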

	## How to Use

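	```python
	import torch
	from transformers import AutoTokenizer, AutoModelForCausalLM

	model_id = "KAERI-MLP/AtomicGPT-Gemma2-9B"

	tokenizer = AutoTokenizer.from_pretrained(model_id)
	model = AutoModelForCausalLM.from_pretrained(
	    model_id,
	    device_map="auto",  # place weights on the available device(s)
	    torch_dtype=torch.bfloat16,  # assumption: bf16-capable hardware; omit to use the default dtype
	)
	model.eval()

	# Replace the placeholder with a nuclear engineering question.
	input_text = "Query about Nuclear (Atomic Energy)"
	# Move inputs to the model's device instead of hard-coding "cuda".
	inputs = tokenizer(input_text, return_tensors="pt").to(model.device)
	outputs = model.generate(**inputs, max_new_tokens=1024)
	print(tokenizer.decode(outputs[0], skip_special_tokens=True))
	```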
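
The snippet above sends a free-form query. For multiple-choice questions like those in the Performance section, a zero-shot prompt can be built and the chosen letter parsed from the output. This card does not document the prompt format used in the evaluation, so the template, the `LETTERS` constant, and the helpers `build_mc_prompt` and `extract_choice` below are all assumptions made for illustration.

```python
import re
from typing import Optional, Sequence

LETTERS = "ABCDEFGH"  # hypothetical option labels, not from the model card


def build_mc_prompt(question: str, options: Sequence[str]) -> str:
    """Format a question and its options as a single zero-shot prompt."""
    if not 2 <= len(options) <= len(LETTERS):
        raise ValueError("expected between 2 and 8 options")
    lines = [f"Question: {question}"]
    lines += [f"{LETTERS[i]}. {opt}" for i, opt in enumerate(options)]
    lines.append("Answer with the letter of the correct option.")
    lines.append("Answer:")
    return "\n".join(lines)


def extract_choice(generated: str, num_options: int) -> Optional[str]:
    """Return the first standalone uppercase option letter, if any."""
    valid = LETTERS[:num_options]
    match = re.search(rf"\b([{valid}])\b", generated)
    return match.group(1) if match else None
```

The string returned by `build_mc_prompt` can be passed as `input_text` in the snippet above. `extract_choice` should be applied only to the newly generated text, not to the echoed prompt. The parser is deliberately simple: it would misread a reply that begins with the article "A", so a stricter pattern may be needed in practice.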

	## Citation

	If you use this model in your research, please cite:

	```bibtex
	@article{atomicgpt2026,
	  title={AtomicGPT: A Domain-Adapted Large Language Model for Secure On-Premise Applications in Nuclear Engineering},
	  journal={Nuclear Engineering and Technology},
	  year={2026}
	}
	```

	## About

	Developed by the MLP (Multimodal Language Processing) team at the Korea Atomic Energy Research Institute (KAERI).