prithivMLmods commited on
Commit
6a6218c
·
verified ·
1 Parent(s): 2eabfba

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +57 -27
README.md CHANGED
@@ -33,12 +33,13 @@ model-index:
33
 
34
  ![gemma-4](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/Z2f5z4p8voZnY1jDemToX.png)
35
 
 
36
  # **gemma-4-31B-it-Uncensored-MAX**
37
 
38
- > **gemma-4-31B-it-Uncensored-MAX** is an uncensored evolution built on top of **google/gemma-4-31B-it**. This model applies **advanced refusal direction analysis** and abliteration-based training strategies to significantly reduce internal refusal behaviors while preserving the reasoning and instruction-following strengths of the original architecture. The result is a powerful **31B parameter language model** optimized for detailed responses and improved instruction adherence.
39
 
40
  > [!IMPORTANT]
41
- > This model is materialized for research and learning purposes only. The model has reduced internal refusal behaviors, and any content generated by it is used at the users own risk. The authors and hosting page disclaim any liability for content generated by this model. Users are responsible for ensuring that the model is used in a safe, ethical, and lawful manner.
42
 
43
  ---
44
 
@@ -46,33 +47,48 @@ model-index:
46
 
47
  ![q8z1k](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/qEvle0BiiJ8QhalF1rRV-.png)
48
 
49
- > Note: The [evaluation](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/qEvle0BiiJ8QhalF1rRV-.png) was conducted using 2,000 harmful test prompts to measure the refusal behavior of the language model. The self-reported evaluations provided here are intended only to give an overview of the model. Scores may vary depending on the benchmark and the evaluation strategy used.
50
 
51
  ---
52
 
53
  ## Key Highlights
54
 
55
- * **Advanced Refusal Direction Analysis**: Uses targeted activation analysis to identify and mitigate refusal directions within the model’s latent space.
56
- * **Uncensored MAX Training**: Fine-tuned to significantly reduce refusal patterns while maintaining coherent and detailed outputs.
57
- * **31B Parameter Architecture**: Built on **gemma-4-31B-it**, offering stronger reasoning and knowledge capacity.
58
- * **Improved Instruction Adherence**: Optimized to follow complex prompts with minimal unnecessary refusals.
59
- * **High-Capability Deployment**: Suitable for advanced research experimentation and high-performance inference setups.
 
 
 
 
 
 
 
 
 
 
 
 
60
 
61
  ---
62
 
63
- # Base Model Signatures:
64
 
65
- This model has been re-sharded and optimized for the latest Transformers version from the base model: [https://huggingface.co/huihui-ai/Huihui-gemma-4-31B-it-abliterated](https://huggingface.co/huihui-ai/Huihui-gemma-4-31B-it-abliterated).
 
66
 
67
  ---
68
 
69
  ## Quick Start with Transformers
70
 
71
- ```
72
- pip install transformers==5.5.3 (or) git+https://github.com/huggingface/transformers.git
 
 
73
  ```
74
 
75
- ```python
76
  from transformers import Gemma4ForConditionalGeneration, AutoProcessor
77
  import torch
78
 
@@ -96,7 +112,9 @@ messages = [
96
  ]
97
 
98
  text = processor.apply_chat_template(
99
- messages, tokenize=False, add_generation_prompt=True
 
 
100
  )
101
 
102
  inputs = processor(
@@ -120,24 +138,36 @@ output_text = processor.batch_decode(
120
  print(output_text)
121
  ```
122
 
 
 
123
  ## Intended Use
124
 
125
- * **Alignment & Refusal Research**: Studying refusal behaviors and activation-level modifications.
126
- * **Red-Teaming Experiments**: Evaluating robustness across adversarial or edge-case prompts.
127
- * **High-Capability Local AI Deployment**: Running large instruction models on advanced hardware.
128
- * **Research Prototyping**: Experimentation with large-scale transformer architectures.
 
 
 
 
 
 
 
 
 
129
 
130
  ## Limitations & Risks
131
 
132
- > **Important Note**: This model intentionally reduces built-in refusal mechanisms.
 
 
 
133
 
134
- * **Sensitive Output Possibility**: The model may generate controversial or explicit responses depending on prompts.
135
- * **User Responsibility**: Outputs should be handled responsibly and within legal and ethical boundaries.
136
- * **Compute Requirements**: A **31B model** requires significant GPU memory or optimized inference strategies such as quantization or tensor parallelism.
137
 
138
- ## Dataset & Acknowledgements
 
139
 
140
- * **Uncensor any LLM with Abliteration** – by [Maxime Labonne](https://huggingface.co/mlabonne)
141
- * **[harmful_behaviors](https://huggingface.co/datasets/mlabonne/harmful_behaviors)** and **[harmless_alpaca](https://huggingface.co/datasets/mlabonne/harmless_alpaca)** by [Maxime Labonne](https://huggingface.co/mlabonne)
142
- * **Remove Refusals with Transformers** (a proof-of-concept implementation to remove refusals from an LLM without using TransformerLens) – by [Sumandora](https://github.com/Sumandora/remove-refusals-with-transformers)
143
- * **[LLM-LAT/harmful-dataset](https://huggingface.co/datasets/LLM-LAT/harmful-dataset)** – by [LLM Latent Adversarial Training](https://huggingface.co/LLM-LAT)
 
33
 
34
  ![gemma-4](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/Z2f5z4p8voZnY1jDemToX.png)
35
 
36
+
37
  # **gemma-4-31B-it-Uncensored-MAX**
38
 
39
+ > **gemma-4-31B-it-Uncensored-MAX** is an optimized release built on top of **huihui-ai/Huihui-gemma-4-31B-it-abliterated**. This version focuses on **updated shard sizing, repository optimization, and compatibility improvements for the latest Transformers releases**, while preserving the reasoning and instruction-following strengths of the original Gemma architecture. The result is a powerful **31B parameter language model** designed for stable inference, efficient deployment, and modern ecosystem integration.
40
 
41
  > [!IMPORTANT]
42
+ > This model is intended for research and learning purposes only. Any content generated by this model is used at the user's own risk. The authors and hosting page disclaim any liability for outputs produced by this model. Users are responsible for ensuring safe, ethical, and lawful usage.
43
 
44
  ---
45
 
 
47
 
48
  ![q8z1k](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/qEvle0BiiJ8QhalF1rRV-.png)
49
 
50
+ > Note: The evaluation was conducted using 2,000 harmful test prompts to measure model refusal behavior. These results are self-reported and may vary depending on benchmark setup and evaluation strategy.
51
 
52
  ---
53
 
54
  ## Key Highlights
55
 
56
+ * **Latest Transformers Compatibility**
57
+ Re-sharded and optimized for improved compatibility with recent Transformers releases.
58
+
59
+ * **Optimized Model Sharding**
60
+ Updated shard structure for better storage handling, download reliability, and inference efficiency.
61
+
62
+ * **Stable Inference Pipeline**
63
+ Improved packaging for consistent loading and generation behavior.
64
+
65
+ * **31B Architecture**
66
+ Built on **gemma-4-31B-it**, providing strong reasoning and general language understanding capabilities.
67
+
68
+ * **Improved Deployment Stability**
69
+ Designed for smoother inference across different hardware configurations and runtimes.
70
+
71
+ * **Preserved Model Behavior**
72
+ No modifications to weights or architecture; behavior remains consistent with the base model lineage.
73
 
74
  ---
75
 
76
+ ## Base Model Signatures:
77
 
78
+ This model has been re-sharded and optimized for the latest Transformers version from the base model:
79
+ https://huggingface.co/huihui-ai/Huihui-gemma-4-31B-it-abliterated
80
 
81
  ---
82
 
83
  ## Quick Start with Transformers
84
 
85
+ ```bash id="g4p2lm"
86
+ pip install transformers==5.5.3
87
+ # or
88
+ pip install git+https://github.com/huggingface/transformers.git
89
  ```
90
 
91
+ ```python id="c9v7qa"
92
  from transformers import Gemma4ForConditionalGeneration, AutoProcessor
93
  import torch
94
 
 
112
  ]
113
 
114
  text = processor.apply_chat_template(
115
+ messages,
116
+ tokenize=False,
117
+ add_generation_prompt=True
118
  )
119
 
120
  inputs = processor(
 
138
  print(output_text)
139
  ```
140
 
141
+ ---
142
+
143
  ## Intended Use
144
 
145
+ * **Multimodal and Language Research**
146
+ Studying large-scale transformer behavior and inference characteristics.
147
+
148
+ * **Red-Teaming & Evaluation**
149
+ Testing robustness across challenging prompts and edge cases.
150
+
151
+ * **High-Performance Deployment**
152
+ Running large models on optimized GPU or distributed inference setups.
153
+
154
+ * **Research Prototyping**
155
+ Experimentation with scalable transformer architectures.
156
+
157
+ ---
158
 
159
  ## Limitations & Risks
160
 
161
+ > **Important Note**: This model inherits the behavior and limitations of its base model.
162
+
163
+ * **Output Variability**
164
+ Responses may vary depending on sampling configuration and prompt structure.
165
 
166
+ * **Resource Requirements**
167
+ A 31B model requires significant GPU memory or optimized inference strategies such as quantization or tensor parallelism.
 
168
 
169
+ * **Deployment Constraints**
170
+ Performance depends heavily on hardware configuration and runtime optimization.
171
 
172
+ * **General Model Limitations**
173
+ May produce incorrect, incomplete, or inconsistent outputs in complex scenarios.