Improve language tag

#1
by lbourdois - opened
Files changed (1) hide show
  1. README.md +111 -99
README.md CHANGED
@@ -1,100 +1,112 @@
1
- ---
2
- datasets:
3
- - iimran/Medical-Intelligence-Questions
4
- base_model:
5
- - Qwen/Qwen2.5-3B
6
- language:
7
- - en
8
- tags:
9
- - medical
10
- - text-generation-inference
11
- - transformers
12
- - unsloth
13
- ---
14
-
15
- # Qwen2.5-3B-R1-MedicalReasoner LoRA Adapter
16
-
17
- This repository contains the LoRA adapter weights and configuration for **Qwen2.5-3B-R1-MedicalReasoner**, a state-of-the-art clinical reasoning language model fine-tuned using GRPO. The adapter is designed to further optimize and customize model behavior for clinical reasoning tasks.
18
-
19
- ## Overview
20
-
21
- - **Adapter Name:** Qwen2.5-3B-R1-MedicalReasoner LoRA Adapter
22
- - **Purpose:** To modify and enhance the base model (Qwen2.5-3B-R1-MedicalReasoner) using Low-Rank Adaptation (LoRA) techniques without modifying the full model weights.
23
- - **Use Case:** Ideal for users wishing to fine-tune, experiment, or deploy the clinical reasoning model with customized parameter-efficient adaptations.
24
-
25
- ## Key Features
26
-
27
- - **Parameter-Efficient Adaptation:** LoRA allows for training a small number of additional parameters, making further fine-tuning efficient in time and resources.
28
- - **Seamless Integration:** Easily integrated with the base model using the provided tools and functions in Unsloth and vLLM.
29
- - **Optimized for Clinical Reasoning:** The adapter reinforces chain-of-thought generation and improves the clarity of diagnostic reasoning outputs.
30
-
31
- ## How to Use
32
-
33
- ### Integration with Base Model
34
-
35
- To download and load the LoRA adapter into Qwen2.5-3B-R1-MedicalReasoner:
36
-
37
- ```python
38
- from huggingface_hub import snapshot_download
39
- from unsloth import FastLanguageModel
40
-
41
- # Download the adapter weights:
42
- lora_path = snapshot_download("iimran/Qwen2.5-3B-R1-MedicalReasoner-lora-adapter")
43
- print("LoRA adapter downloaded to:", lora_path)
44
-
45
- # Load base model:
46
- model, tokenizer = FastLanguageModel.from_pretrained(
47
- model_name="iimran/Qwen2.5-3B-R1-MedicalReasoner",
48
- load_in_4bit=False,
49
- fast_inference=True
50
- )
51
-
52
- # Load the LoRA adapter:
53
- model.load_lora(lora_path)
54
- ```
55
-
56
- ## Fine-Tuning and Experimentation
57
-
58
- This adapter was originally developed and fine-tuned using GRPO with customized reward functions to enhance chain-of-thought reasoning. Researchers who wish to further optimize the behavior of the clinical reasoning model with targeted adaptations can start from these adapter weights.
59
-
60
- ## Installation Requirements
61
-
62
- * **Python Version:** 3.8 or higher
63
- * **Dependencies:**
64
- * unsloth
65
- * vLLM
66
- * huggingface-hub
67
- * Other dependencies required by the base model and LoRA integration
68
-
69
- Install the required packages using pip:
70
-
71
- ```bash
72
- pip install unsloth vllm huggingface-hub
73
- ```
74
-
75
- ## Citation
76
-
77
- If you use the LoRA adapter in your work, please cite:
78
-
79
- ```bibtex
80
- @misc{Qwen2.5-3B-R1-MedicalReasoner-lora-adapter,
81
- authors = {Imran Sarwar, Muhammad Rouf Mustafa},
82
- title = {Qwen2.5-3B-R1-MedicalReasoner LoRA Adapter},
83
- year = {2025},
84
- publisher = {Hugging Face},
85
- url = {https://huggingface.co/iimran/Qwen2.5-3B-R1-MedicalReasoner-lora-adapter}
86
- }
87
- ```
88
-
89
- ## Contributing
90
-
91
- Contributions to the LoRA adapter are welcome. If you have improvements for:
92
- * Adapter performance or efficiency
93
- * Documentation updates
94
- * Additional experiments or fine-tuning strategies
95
-
96
- Please open an issue or submit a pull request.
97
-
98
- ## Disclaimer
99
-
 
 
 
 
 
 
 
 
 
 
 
 
100
  This LoRA adapter is provided for research and educational purposes. It is intended to be used in combination with the **Qwen2.5-3B-R1-MedicalReasoner** base model. As with the base model, clinical outputs should be validated by qualified healthcare professionals before being used in any medical decision-making.
 
1
+ ---
2
+ datasets:
3
+ - iimran/Medical-Intelligence-Questions
4
+ base_model:
5
+ - Qwen/Qwen2.5-3B
6
+ language:
7
+ - zho
8
+ - eng
9
+ - fra
10
+ - spa
11
+ - por
12
+ - deu
13
+ - ita
14
+ - rus
15
+ - jpn
16
+ - kor
17
+ - vie
18
+ - tha
19
+ - ara
20
+ tags:
21
+ - medical
22
+ - text-generation-inference
23
+ - transformers
24
+ - unsloth
25
+ ---
26
+
27
+ # Qwen2.5-3B-R1-MedicalReasoner LoRA Adapter
28
+
29
+ This repository contains the LoRA adapter weights and configuration for **Qwen2.5-3B-R1-MedicalReasoner**, a state-of-the-art clinical reasoning language model fine-tuned using GRPO. The adapter is designed to further optimize and customize model behavior for clinical reasoning tasks.
30
+
31
+ ## Overview
32
+
33
+ - **Adapter Name:** Qwen2.5-3B-R1-MedicalReasoner LoRA Adapter
34
+ - **Purpose:** To modify and enhance the base model (Qwen2.5-3B-R1-MedicalReasoner) using Low-Rank Adaptation (LoRA) techniques without modifying the full model weights.
35
+ - **Use Case:** Ideal for users wishing to fine-tune, experiment, or deploy the clinical reasoning model with customized parameter-efficient adaptations.
36
+
37
+ ## Key Features
38
+
39
+ - **Parameter-Efficient Adaptation:** LoRA allows for training a small number of additional parameters, making further fine-tuning efficient in time and resources.
40
+ - **Seamless Integration:** Easily integrated with the base model using the provided tools and functions in Unsloth and vLLM.
41
+ - **Optimized for Clinical Reasoning:** The adapter reinforces chain-of-thought generation and improves the clarity of diagnostic reasoning outputs.
42
+
43
+ ## How to Use
44
+
45
+ ### Integration with Base Model
46
+
47
+ To download and load the LoRA adapter into Qwen2.5-3B-R1-MedicalReasoner:
48
+
49
+ ```python
50
+ from huggingface_hub import snapshot_download
51
+ from unsloth import FastLanguageModel
52
+
53
+ # Download the adapter weights:
54
+ lora_path = snapshot_download("iimran/Qwen2.5-3B-R1-MedicalReasoner-lora-adapter")
55
+ print("LoRA adapter downloaded to:", lora_path)
56
+
57
+ # Load base model:
58
+ model, tokenizer = FastLanguageModel.from_pretrained(
59
+ model_name="iimran/Qwen2.5-3B-R1-MedicalReasoner",
60
+ load_in_4bit=False,
61
+ fast_inference=True
62
+ )
63
+
64
+ # Load the LoRA adapter:
65
+ model.load_lora(lora_path)
66
+ ```
67
+
68
+ ## Fine-Tuning and Experimentation
69
+
70
+ This adapter was originally developed and fine-tuned using GRPO with customized reward functions to enhance chain-of-thought reasoning. Researchers who wish to further optimize the behavior of the clinical reasoning model with targeted adaptations can start from these adapter weights.
71
+
72
+ ## Installation Requirements
73
+
74
+ * **Python Version:** 3.8 or higher
75
+ * **Dependencies:**
76
+ * unsloth
77
+ * vLLM
78
+ * huggingface-hub
79
+ * Other dependencies required by the base model and LoRA integration
80
+
81
+ Install the required packages using pip:
82
+
83
+ ```bash
84
+ pip install unsloth vllm huggingface-hub
85
+ ```
86
+
87
+ ## Citation
88
+
89
+ If you use the LoRA adapter in your work, please cite:
90
+
91
+ ```bibtex
92
+ @misc{Qwen2.5-3B-R1-MedicalReasoner-lora-adapter,
93
+ authors = {Imran Sarwar, Muhammad Rouf Mustafa},
94
+ title = {Qwen2.5-3B-R1-MedicalReasoner LoRA Adapter},
95
+ year = {2025},
96
+ publisher = {Hugging Face},
97
+ url = {https://huggingface.co/iimran/Qwen2.5-3B-R1-MedicalReasoner-lora-adapter}
98
+ }
99
+ ```
100
+
101
+ ## Contributing
102
+
103
+ Contributions to the LoRA adapter are welcome. If you have improvements for:
104
+ * Adapter performance or efficiency
105
+ * Documentation updates
106
+ * Additional experiments or fine-tuning strategies
107
+
108
+ Please open an issue or submit a pull request.
109
+
110
+ ## Disclaimer
111
+
112
  This LoRA adapter is provided for research and educational purposes. It is intended to be used in combination with the **Qwen2.5-3B-R1-MedicalReasoner** base model. As with the base model, clinical outputs should be validated by qualified healthcare professionals before being used in any medical decision-making.