> The model was trained with *conversation-style* prompts, so the same format should be used for inference.

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "ElenaSenger/DiSTER-Llama-3-8B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    torch_dtype="auto"
)

# Use the chat format as in SynTerm-fine-tuning
prompt = (
    '{"id": "test_0", "conversations": [\n'
    '  {"from": "human", "value": "Text: We used dropout regularization and AdamW optimizer to train a CNN on MRI images."},\n'
    '  {"from": "gpt", "value": "I\'ve read this text."},\n'
    '  {"from": "human", "value": "What describes (technical or scientific) terms in the text, that are relevant to the domain medical-imaging?"},\n'
    '  {"from": "gpt", "value": ""}\n'
    ']}\n'
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_new_tokens=128,
        do_sample=True,  # Enable sampling for temperature to take effect
        temperature=0.1,
        top_p=1.0
    )

decoded = tokenizer.decode(outputs[0], skip_special_tokens=True)
print("=== Decoded output ===")
print(decoded)

# Print only the model's reply
reply = decoded[len(prompt):].strip()
print("\n=== Model reply only ===")
print(reply)
```