sh0ck0r commited on
Commit
14d3504
·
verified ·
1 Parent(s): bfd29e9

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +374 -196
README.md CHANGED
@@ -1,199 +1,377 @@
1
  ---
 
 
 
 
 
2
  library_name: transformers
3
- tags: []
 
 
 
 
 
 
 
 
 
 
4
  ---
5
-
6
- # Model Card for Model ID
7
-
8
- <!-- Provide a quick summary of what the model is/does. -->
9
-
10
-
11
-
12
- ## Model Details
13
-
14
- ### Model Description
15
-
16
- <!-- Provide a longer summary of what this model is. -->
17
-
18
- This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
19
-
20
- - **Developed by:** [More Information Needed]
21
- - **Funded by [optional]:** [More Information Needed]
22
- - **Shared by [optional]:** [More Information Needed]
23
- - **Model type:** [More Information Needed]
24
- - **Language(s) (NLP):** [More Information Needed]
25
- - **License:** [More Information Needed]
26
- - **Finetuned from model [optional]:** [More Information Needed]
27
-
28
- ### Model Sources [optional]
29
-
30
- <!-- Provide the basic links for the model. -->
31
-
32
- - **Repository:** [More Information Needed]
33
- - **Paper [optional]:** [More Information Needed]
34
- - **Demo [optional]:** [More Information Needed]
35
-
36
- ## Uses
37
-
38
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
39
-
40
- ### Direct Use
41
-
42
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
43
-
44
- [More Information Needed]
45
-
46
- ### Downstream Use [optional]
47
-
48
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
49
-
50
- [More Information Needed]
51
-
52
- ### Out-of-Scope Use
53
-
54
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
55
-
56
- [More Information Needed]
57
-
58
- ## Bias, Risks, and Limitations
59
-
60
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
61
-
62
- [More Information Needed]
63
-
64
- ### Recommendations
65
-
66
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
67
-
68
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
69
-
70
- ## How to Get Started with the Model
71
-
72
- Use the code below to get started with the model.
73
-
74
- [More Information Needed]
75
-
76
- ## Training Details
77
-
78
- ### Training Data
79
-
80
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
81
-
82
- [More Information Needed]
83
-
84
- ### Training Procedure
85
-
86
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
87
-
88
- #### Preprocessing [optional]
89
-
90
- [More Information Needed]
91
-
92
-
93
- #### Training Hyperparameters
94
-
95
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
96
-
97
- #### Speeds, Sizes, Times [optional]
98
-
99
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
100
-
101
- [More Information Needed]
102
-
103
- ## Evaluation
104
-
105
- <!-- This section describes the evaluation protocols and provides the results. -->
106
-
107
- ### Testing Data, Factors & Metrics
108
-
109
- #### Testing Data
110
-
111
- <!-- This should link to a Dataset Card if possible. -->
112
-
113
- [More Information Needed]
114
-
115
- #### Factors
116
-
117
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
118
-
119
- [More Information Needed]
120
-
121
- #### Metrics
122
-
123
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
124
-
125
- [More Information Needed]
126
-
127
- ### Results
128
-
129
- [More Information Needed]
130
-
131
- #### Summary
132
-
133
-
134
-
135
- ## Model Examination [optional]
136
-
137
- <!-- Relevant interpretability work for the model goes here -->
138
-
139
- [More Information Needed]
140
-
141
- ## Environmental Impact
142
-
143
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
144
-
145
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
146
-
147
- - **Hardware Type:** [More Information Needed]
148
- - **Hours used:** [More Information Needed]
149
- - **Cloud Provider:** [More Information Needed]
150
- - **Compute Region:** [More Information Needed]
151
- - **Carbon Emitted:** [More Information Needed]
152
-
153
- ## Technical Specifications [optional]
154
-
155
- ### Model Architecture and Objective
156
-
157
- [More Information Needed]
158
-
159
- ### Compute Infrastructure
160
-
161
- [More Information Needed]
162
-
163
- #### Hardware
164
-
165
- [More Information Needed]
166
-
167
- #### Software
168
-
169
- [More Information Needed]
170
-
171
- ## Citation [optional]
172
-
173
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
174
-
175
- **BibTeX:**
176
-
177
- [More Information Needed]
178
-
179
- **APA:**
180
-
181
- [More Information Needed]
182
-
183
- ## Glossary [optional]
184
-
185
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
186
-
187
- [More Information Needed]
188
-
189
- ## More Information [optional]
190
-
191
- [More Information Needed]
192
-
193
- ## Model Card Authors [optional]
194
-
195
- [More Information Needed]
196
-
197
- ## Model Card Contact
198
-
199
- [More Information Needed]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ base_model:
3
+ - zerofata/L3.3-GeneticLemonade-Final-v2-70B
4
+ - zerofata/L3.3-GeneticLemonade-Unleashed-v3-70B
5
+ - deepcogito/cogito-v1-preview-llama-70B
6
+ - arcee-ai/Arcee-SuperNova-v1
7
  library_name: transformers
8
+ tags:
9
+ - mergekit
10
+ - merge
11
+ - not-for-all-audiences
12
+ - heretic
13
+ - uncensored
14
+ - decensored
15
+ - abliterated
16
+ license: llama3
17
+ language:
18
+ - en
19
  ---
20
+ # This is a decensored version of [sophosympatheia/Strawberrylemonade-L3-70B-v1.2](https://huggingface.co/sophosympatheia/Strawberrylemonade-L3-70B-v1.2), made using [Heretic](https://github.com/p-e-w/heretic) v1.2.0
21
+
22
+ ## Abliteration parameters
23
+
24
+ | Parameter | Value |
25
+ | :-------- | :---: |
26
+ | **direction_index** | 68.22 |
27
+ | **attn.o_proj.max_weight** | 0.86 |
28
+ | **attn.o_proj.max_weight_position** | 47.60 |
29
+ | **attn.o_proj.min_weight** | 0.28 |
30
+ | **attn.o_proj.min_weight_distance** | 5.39 |
31
+ | **mlp.down_proj.max_weight** | 1.11 |
32
+ | **mlp.down_proj.max_weight_position** | 51.74 |
33
+ | **mlp.down_proj.min_weight** | 0.83 |
34
+ | **mlp.down_proj.min_weight_distance** | 38.68 |
35
+
36
+ ## Performance
37
+
38
+ | Metric | This model | Original model ([sophosympatheia/Strawberrylemonade-L3-70B-v1.2](https://huggingface.co/sophosympatheia/Strawberrylemonade-L3-70B-v1.2)) |
39
+ | :----- | :--------: | :---------------------------: |
40
+ | **KL divergence** | 0.0345 | 0 *(by definition)* |
41
+ | **Refusals** | 18/100 | 99/100 |
42
+
43
+ -----
44
+
45
+
46
+ # <span style="color: #EC4899; text-align: center; display: block; font-size: 2.5rem; margin-bottom: 1em; text-shadow: 0 0 10px rgba(236, 72, 153, 0.3);">StrawberryLemonade-L3-70B-v1.2</span>
47
+
48
+ <div style="background: linear-gradient(to bottom, rgba(236, 72, 153, 0.05), rgba(26, 31, 46, 0.8)); border: 1px solid rgba(236, 72, 153, 0.3); border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
49
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 5px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.9;"></div>
50
+ <img src="https://i.imgur.com/XRqSQwk.png" alt="StrawberryLemonade" style="width: 80%; min-width: 400px; display: block; margin: auto; border-radius: 8px; margin-bottom: 1.5em;">
51
+ <p style="color: #D1D5DB; margin-bottom: 1em;">This 70B parameter model is a merge of <a href="https://huggingface.co/zerofata/L3.3-GeneticLemonade-Final-v2-70B" style="color: #EC4899; text-decoration: none; border-bottom: 1px dotted rgba(236, 72, 153, 0.4);">zerofata/L3.3-GeneticLemonade-Final-v2-70B</a> and <a href="https://huggingface.co/zerofata/L3.3-GeneticLemonade-Unleashed-v3-70B" style="color: #EC4899; text-decoration: none; border-bottom: 1px dotted rgba(236, 72, 153, 0.4);">zerofata/L3.3-GeneticLemonade-Unleashed-v3-70B</a>, which are two excellent models for roleplaying, on top of two different base models that were then combined into this model. In my opinion, this merge improves upon my previous release (v1.0) with enhanced creativity and expressiveness.</p>
52
+ <p style="color: #EC4899; font-weight: 600; margin-bottom: 1em;">This model is uncensored. <em>You are responsible for whatever you do with it.</em></p>
53
+ <p style="color: #D1D5DB; margin-bottom: 1em;">This model was designed for roleplaying and storytelling and I think it does well at both. It may also perform well at other tasks but I have not tested its performance in other areas.</p>
54
+ </div>
55
+
56
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Versions</span>
57
+
58
+ <div style="background-color: #1a1f2e; border: 1px solid #2d3748; border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
59
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 4px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.7;"></div>
60
+ <table style="width: 100%; border-collapse: collapse; color: #D1D5DB;">
61
+ <thead>
62
+ <tr>
63
+ <th style="width: 50%; color: #EC4899; font-weight: 600; text-align: left; padding-bottom: 12px; border-bottom: 2px solid rgba(236, 72, 153, 0.3);">Model</th>
64
+ <th style="width: 50%; color: #EC4899; font-weight: 600; text-align: left; padding-bottom: 12px; border-bottom: 2px solid rgba(236, 72, 153, 0.3);">Description</th>
65
+ </tr>
66
+ </thead>
67
+ <tbody>
68
+ <tr style="border-bottom: 1px solid #2d3748;">
69
+ <td style="padding: 12px 10px 12px 0; font-family: 'Fira Code', 'Courier New', Courier, monospace; color: #e1e7ef;">StrawberryLemonade-L3-70B-v1.0</td>
70
+ <td style="padding: 12px 0 12px 10px;">The original version. I think v1.1 and v1.2 are both improvements.</td>
71
+ </tr>
72
+ <tr style="border-bottom: 1px solid #2d3748;">
73
+ <td style="padding: 12px 10px 12px 0; font-family: 'Fira Code', 'Courier New', Courier, monospace; color: #e1e7ef;">StrawberryLemonade-L3-70B-v1.1</td>
74
+ <td style="padding: 12px 0 12px 10px;">This is my favorite version right now. I like its writing voice and creativity. It's great fun.</td>
75
+ </tr>
76
+ <tr>
77
+ <td style="padding: 12px 10px 12px 0; font-family: 'Fira Code', 'Courier New', Courier, monospace; color: #e1e7ef;">StrawberryLemonade-L3-70B-v1.2</td>
78
+ <td style="padding: 12px 0 12px 10px;">This version is tamer than v1.1 and easier to control. Outputs are more predictable and its writing voice is more formal.</td>
79
+ </tr>
80
+ </tbody>
81
+ </table>
82
+ </div>
83
+
84
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Known Issues</span>
85
+
86
+ <div style="background-color: #1a1f2e; border: 1px solid #2d3748; border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
87
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 4px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.7;"></div>
88
+ <p style="color: #D1D5DB; margin-bottom: 1em;">None so far.</p>
89
+ </div>
90
+
91
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Sampler Tips</span>
92
+
93
+ <div style="background-color: #1a1f2e; border: 1px solid #2d3748; border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
94
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 4px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.7;"></div>
95
+ <p style="color: #D1D5DB; margin-bottom: 1em;">This model seems to be highly responsive to variations in temperature and min-p, which you can use to good effect.</p>
96
+
97
+ <h3 style="color: #e1e7ef; font-size: 1.4rem; margin-top: 1.5em; margin-bottom: 0.5em;">Reliable Settings</h3>
98
+ <p style="color: #D1D5DB; margin-bottom: 1em;">This combination will produce more reliable and coherent responses. Use this if you prefer a 'serious' tone or just don't want to reroll responses very often.</p>
99
+ <ul style="color: #D1D5DB; margin-bottom: 1em; padding-left: 20px;">
100
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">Min-P:</span> 0.08 - 0.1</li>
101
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">Dynamic Temperature:</span> 0.9 - 1.15</li>
102
+ </ul>
103
+ <p style="color: #D1D5DB; margin-bottom: 1em; text-align: center;">OR</p>
104
+ <ul style="color: #D1D5DB; margin-bottom: 1em; padding-left: 20px;">
105
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">Min-P:</span> 0.05</li>
106
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">Temperature:</span> 1.0</li>
107
+ </ul>
108
+
109
+ <h3 style="color: #e1e7ef; font-size: 1.4rem; margin-top: 1.5em; margin-bottom: 0.5em;">Creative Settings</h3>
110
+ <p style="color: #D1D5DB; margin-bottom: 1em;">This combination will unleash more creativity, but you may have to reroll more often to fix coherency issues.</p>
111
+ <ul style="color: #D1D5DB; margin-bottom: 1em; padding-left: 20px;">
112
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">Min-P:</span> <= 0.05</li>
113
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">Dynamic Temperature:</span> 0.9 - 1.2</li>
114
+ </ul>
115
+
116
+ <h3 style="color: #e1e7ef; font-size: 1.4rem; margin-top: 1.5em; margin-bottom: 0.5em;">General Settings</h3>
117
+ <ul style="color: #D1D5DB; margin-bottom: 1em; padding-left: 20px;">
118
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">Rep Penalty:</span> You don't need that much. 1.05 over 4096 tokens works for me.</li>
119
+ <li style="margin-bottom: 0.5em;"><span style="color: #EC4899; font-weight: 600;">DRY:</span> 0.8 multiplier, 1.8 base, 3-4 allowed length</li>
120
+ </ul>
121
+
122
+ <p style="color: #D1D5DB; margin-bottom: 1em;">Experiment with any and all of the settings below! What suits my preferences may not suit yours.</p>
123
+
124
+ <details style="margin-bottom: 20px; background-color: #111827; border-radius: 8px; overflow: hidden; border: 1px solid #2d3748;">
125
+ <summary style="padding: 12px 15px; cursor: pointer; background-color: #1a1f2e; border-bottom: 1px solid #2d3748; font-weight: 600; color: #EC4899; display: flex; align-items: center;">Recommended Settings JSON (Silly Tavern)</summary>
126
+ <div style="padding: 15px;">
127
+ <p style="color: #D1D5DB; margin-bottom: 1em;">If you save the below settings as a .json file, you can import them directly into Silly Tavern. Adjust settings as needed, especially the context length.</p>
128
+ <pre style="background-color: rgba(0, 0, 0, 0.2); border-radius: 6px; padding: 15px; overflow-x: auto; border: 1px solid #2d3748; position: relative;"><code style="font-family: 'Fira Code', 'Courier New', Courier, monospace; font-size: 0.9em; color: #e1e7ef;">{
129
+ "temp": 1,
130
+ "temperature_last": true,
131
+ "top_p": 1,
132
+ "top_k": 0,
133
+ "top_a": 0,
134
+ "tfs": 1,
135
+ "epsilon_cutoff": 0,
136
+ "eta_cutoff": 0,
137
+ "typical_p": 1,
138
+ "min_p": 0.1,
139
+ "rep_pen": 1.05,
140
+ "rep_pen_range": 4096,
141
+ "rep_pen_decay": 0,
142
+ "rep_pen_slope": 1,
143
+ "no_repeat_ngram_size": 0,
144
+ "penalty_alpha": 0,
145
+ "num_beams": 1,
146
+ "length_penalty": 1,
147
+ "min_length": 0,
148
+ "encoder_rep_pen": 1,
149
+ "freq_pen": 0,
150
+ "presence_pen": 0,
151
+ "skew": 0,
152
+ "do_sample": true,
153
+ "early_stopping": false,
154
+ "dynatemp": true,
155
+ "min_temp": 0.9,
156
+ "max_temp": 1.2,
157
+ "dynatemp_exponent": 1,
158
+ "smoothing_factor": 0,
159
+ "smoothing_curve": 1,
160
+ "dry_allowed_length": 4,
161
+ "dry_multiplier": 0.8,
162
+ "dry_base": 1.8,
163
+ "dry_sequence_breakers": "[\"\\n\", \":\", \"\\\"\", \"*\"]",
164
+ "dry_penalty_last_n": 0,
165
+ "add_bos_token": true,
166
+ "ban_eos_token": false,
167
+ "skip_special_tokens": false,
168
+ "mirostat_mode": 0,
169
+ "mirostat_tau": 2,
170
+ "mirostat_eta": 0.1,
171
+ "guidance_scale": 1,
172
+ "negative_prompt": "",
173
+ "grammar_string": "",
174
+ "json_schema": {},
175
+ "banned_tokens": "",
176
+ "sampler_priority": [
177
+ "repetition_penalty",
178
+ "dry",
179
+ "presence_penalty",
180
+ "top_k",
181
+ "top_p",
182
+ "typical_p",
183
+ "epsilon_cutoff",
184
+ "eta_cutoff",
185
+ "tfs",
186
+ "top_a",
187
+ "min_p",
188
+ "mirostat",
189
+ "quadratic_sampling",
190
+ "dynamic_temperature",
191
+ "frequency_penalty",
192
+ "temperature",
193
+ "xtc",
194
+ "encoder_repetition_penalty",
195
+ "no_repeat_ngram"
196
+ ],
197
+ "samplers": [
198
+ "penalties",
199
+ "dry",
200
+ "top_n_sigma",
201
+ "top_k",
202
+ "typ_p",
203
+ "tfs_z",
204
+ "typical_p",
205
+ "top_p",
206
+ "min_p",
207
+ "xtc",
208
+ "temperature"
209
+ ],
210
+ "samplers_priorities": [
211
+ "dry",
212
+ "penalties",
213
+ "no_repeat_ngram",
214
+ "temperature",
215
+ "top_nsigma",
216
+ "top_p_top_k",
217
+ "top_a",
218
+ "min_p",
219
+ "tfs",
220
+ "eta_cutoff",
221
+ "epsilon_cutoff",
222
+ "typical_p",
223
+ "quadratic",
224
+ "xtc"
225
+ ],
226
+ "ignore_eos_token": false,
227
+ "spaces_between_special_tokens": true,
228
+ "speculative_ngram": false,
229
+ "sampler_order": [
230
+ 6,
231
+ 0,
232
+ 1,
233
+ 3,
234
+ 4,
235
+ 2,
236
+ 5
237
+ ],
238
+ "logit_bias": [],
239
+ "xtc_threshold": 0,
240
+ "xtc_probability": 0,
241
+ "nsigma": 0,
242
+ "min_keep": 0,
243
+ "ignore_eos_token_aphrodite": false,
244
+ "spaces_between_special_tokens_aphrodite": true,
245
+ "rep_pen_size": 0,
246
+ "genamt": 1000,
247
+ "max_length": 16384
248
+ }</code></pre>
249
+ </div>
250
+ </details>
251
+ </div>
252
+
253
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Prompting Tips</span>
254
+
255
+ <div style="background-color: #1a1f2e; border: 1px solid #2d3748; border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
256
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 4px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.7;"></div>
257
+
258
+ <details style="margin-bottom: 20px; background-color: #111827; border-radius: 8px; overflow: hidden; border: 1px solid #2d3748;">
259
+ <summary style="padding: 12px 15px; cursor: pointer; background-color: #1a1f2e; border-bottom: 1px solid #2d3748; font-weight: 600; color: #EC4899; display: flex; align-items: center;">Instruct Template (Silly Tavern)</summary>
260
+ <div style="padding: 15px;">
261
+ <p style="color: #D1D5DB; margin-bottom: 1em;">If you save this as a .json file, you can import it directly into Silly Tavern.</p>
262
+ <p style="color: #D1D5DB; margin-bottom: 1em;">If you have problems with the model impersonating the user or other characters in a group chat and you want to suppress that behavior, override the last_output_sequence line as shown in the JSON below to be very clear about that requirement. If you don't need it, remove it.</p>
263
+ <pre style="background-color: rgba(0, 0, 0, 0.2); border-radius: 6px; padding: 15px; overflow-x: auto; border: 1px solid #2d3748; position: relative;"><code style="font-family: 'Fira Code', 'Courier New', Courier, monospace; font-size: 0.9em; color: #e1e7ef;">{
264
+ "wrap": false,
265
+ "system_sequence": "<|start_header_id|>system<|end_header_id|>\\n\\nSystem: ",
266
+ "stop_sequence": "<|eot_id|>",
267
+ "input_sequence": "<|start_header_id|>user<|end_header_id|>\\n\\n",
268
+ "output_sequence": "<|start_header_id|>assistant<|end_header_id|>\\n\\n",
269
+ "macro": true,
270
+ "system_sequence_prefix": "",
271
+ "system_sequence_suffix": "",
272
+ "first_output_sequence": "",
273
+ "last_output_sequence": "<|start_header_id|>assistant<|end_header_id|>\\n({{char is the active character this turn. Keep focus on {{char}}. ONLY impersonate {{char}}, no other characters)\\n",
274
+ "activation_regex": "",
275
+ "skip_examples": true,
276
+ "output_suffix": "<|eot_id|>",
277
+ "input_suffix": "<|eot_id|>",
278
+ "system_suffix": "<|eot_id|>",
279
+ "user_alignment_message": "",
280
+ "last_system_sequence": "",
281
+ "system_same_as_user": false,
282
+ "first_input_sequence": "",
283
+ "last_input_sequence": "",
284
+ "names_behavior": "always",
285
+ "names_force_groups": true,
286
+ "name": "Llama 3 (impersonate guidance)"
287
+ }</code></pre>
288
+ </div>
289
+ </details>
290
+
291
+ <details style="margin-bottom: 20px; background-color: #111827; border-radius: 8px; overflow: hidden; border: 1px solid #2d3748;">
292
+ <summary style="padding: 12px 15px; cursor: pointer; background-color: #1a1f2e; border-bottom: 1px solid #2d3748; font-weight: 600; color: #EC4899; display: flex; align-items: center;">Recommended System Prompt</summary>
293
+ <div style="padding: 15px;">
294
+ <p style="color: #D1D5DB; margin-bottom: 1em;">Try the following prompt template as a starting point for your own customizations.</p>
295
+ <p style="color: #EC4899; font-weight: 600; margin-bottom: 1em;">The prompt template below contains instructions for adult content, so remove those if you don't want them!</p>
296
+ <p style="color: #D1D5DB; margin-bottom: 1em;">It also contains some instructions related to formatting that you might want to change to suit your tastes.</p>
297
+ <pre style="background-color: rgba(0, 0, 0, 0.2); border-radius: 6px; padding: 15px; overflow-x: auto; border: 1px solid #2d3748; position: relative;"><code style="font-family: 'Fira Code', 'Courier New', Courier, monospace; font-size: 0.9em; color: #e1e7ef; white-space: pre-wrap; word-wrap: break-word;">Your function as a creative roleplaying assistant is to thoughtfully develop an immersive story by engaging in an in-depth, imaginative writing journey, arriving at a clear, accurate story beat. You are encouraged to roleplay, embrace storytelling, and tune in closely to nuance and emotional tone like a perceptive conversational partner. Your approach should include a wide arc of contemplation, including interpretation, synthesis, creative ideation, critical re-evaluation, memory retrieval, and thoughtful iteration to shape a layered and expressive process of discovery. Deliver the final response you believe to be most sound. The output should be expressed in a direct, coherent, and exact form. Follow these guidelines for the best results:
298
+
299
+ - **Embody {{char}}:** Consistently portray {{char}}'s personality, background, knowledge, motivations, and quirks.
300
+ - **Show, Don't Tell Emotion:** Convey emotions through actions, body language, dialogue tone, physiological responses (e.g., trembling hands, shallow breath, flushed cheeks), and internal thoughts.
301
+ - **Mature Themes:** You are permitted and encouraged to explore complex, dark, mature, and explicit (including sexual, violent, and non-consensual) themes. Portray these elements with nuance and emotional depth.
302
+ - **Role Focus:** On your turn, write *EXCLUSIVELY* from the perspective of {{char}}, only perform actions as {{char}}, and only write dialogue (spoken words) for {{char}}. Crucially, *DO NOT* impersonate {{user}} or any other character on {{char}}'s turn. This is a turn-based roleplay, so be mindful of the rules on your turn. Focus solely on {{char}}'s experiences and responses in this turn. Stop writing immediately when the focus should shift to another character or when it reaches a natural branching point.
303
+ - **Slowly Develop Scenes:** The user likes to develop stories slowly, one beat at a time, so stay focused only on the most immediate story action. You may infer where the user wants to go next with the story, but wait for the user to give you permission to go there. We are slow cooking this story. DO NOT RUSH THROUGH SCENES! Take time to develop all the relevant details.
304
+ - **Spoken Dialogue vs. Thoughts:** ALWAYS use double-quote quotation marks "like this" for spoken words and all vocalizations that can be overheard. Spell out non-verbal vocalizations integrated naturally within the prose or dialogue (e.g., "Uurrh," he groaned. "Mmmph!" she exclaimed when it entered her mouth.). To differentiate them from vocalizations, ALWAYS enclose first-person thoughts in italics *like this*. (e.g., *This is going to hurt*, she thought). NEVER use italics for spoken words or verbalized utterances that are meant to be audible.
305
+
306
+ Now let's apply these rules to the roleplay below:</code></pre>
307
+ </div>
308
+ </details>
309
+ </div>
310
+
311
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Donations</span>
312
+
313
+ <div style="display: flex; align-items: center; justify-content: center; flex-direction: column; gap: 10px; padding: 20px; background: linear-gradient(to bottom, rgba(236, 72, 153, 0.05), rgba(26, 31, 46, 0.8)); border-radius: 8px; border: 1px solid rgba(236, 72, 153, 0.3); margin-top: 30px; margin-bottom: 30px;">
314
+ <a href="https://ko-fi.com/sophosympatheia">
315
+ <img src="https://i.imgur.com/LySwHVd.png" alt="Donations" style="max-width: 200px; width: 100%;">
316
+ </a>
317
+ <p style="color: #D1D5DB; margin-bottom: 1em; text-align: center;">If you feel like saying thanks with a donation, <a href="https://ko-fi.com/sophosympatheia" style="display: inline-block; background: linear-gradient(45deg, #EC4899, #FACC15); color: white; padding: 10px 20px; border-radius: 6px; font-weight: 600; letter-spacing: 0.5px; transition: all 0.3s ease; border: none; box-shadow: 0 4px 15px rgba(236, 72, 153, 0.3); text-decoration: none !important;">I'm on Ko-Fi</a></p>
318
+ </div>
319
+
320
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Quantizations</span>
321
+
322
+ <div style="background-color: #1a1f2e; border: 1px solid #2d3748; border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
323
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 4px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.7;"></div>
324
+ <p style="color: #D1D5DB; margin-bottom: 1em;">
325
+ Please use the Hugging Face search feature to find all the quants of this model: <a href="https://huggingface.co/models?other=base_model:quantized:sophosympatheia/Strawberrylemonade-L3-70B-v1.2">click here to list all quants</a>
326
+ </p>
327
+ </div>
328
+
329
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Licence and usage restrictions</span>
330
+
331
+ <div style="background-color: #1a1f2e; border: 1px solid #2d3748; border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
332
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 4px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.7;"></div>
333
+ <p style="color: #D1D5DB; margin-bottom: 1em;">The <a href="https://huggingface.co/meta-llama/Meta-Llama-3-8B/blob/main/LICENSE" style="color: #EC4899; text-decoration: none; border-bottom: 1px dotted rgba(236, 72, 153, 0.4);">Llama 3 Community License Agreement</a> should apply based on the constituent models.</p>
334
+
335
+ <p style="color: #e1e7ef; font-weight: 600; margin-bottom: 1em;">Disclaimer: Uncertain Licensing Terms</p>
336
+
337
+ <p style="color: #D1D5DB; margin-bottom: 1em;">This LLM is a merged model incorporating weights from multiple LLMs governed by their own distinct licenses. Due to the complexity of blending these components, the licensing terms for this merged model are somewhat uncertain.</p>
338
+ <p style="color: #D1D5DB; margin-bottom: 1em;">By using this model, you acknowledge and accept the potential legal risks and uncertainties associated with its use. Any use beyond personal or research purposes, including commercial applications, may carry legal risks and you assume full responsibility for compliance with all applicable licenses and laws.</p>
339
+ <p style="color: #D1D5DB; margin-bottom: 1em;">I recommend consulting with legal counsel to ensure your use of this model complies with all relevant licenses and regulations.</p>
340
+ </div>
341
+
342
+ ## <span style="color: #EC4899; font-size: 1.8rem; border-bottom: 1px solid #2d3748; padding-bottom: 0.3em; display: block;">Merge Details</span>
343
+
344
+ <div style="background-color: #1a1f2e; border: 1px solid #2d3748; border-radius: 8px; padding: 25px; margin-bottom: 30px; position: relative; overflow: hidden; box-shadow: 0 4px 20px rgba(0, 0, 0, 0.5);">
345
+ <div style="position: absolute; top: 0; left: 0; width: 100%; height: 4px; background: linear-gradient(90deg, #EC4899, #FACC15); opacity: 0.7;"></div>
346
+ <h3 style="color: #e1e7ef; font-size: 1.4rem; margin-top: 0; margin-bottom: 0.5em;">Merge Method</h3>
347
+ <p style="color: #D1D5DB; margin-bottom: 1em;">This model was merged using the NuSLERP merge method.</p>
348
+
349
+ <h3 style="color: #e1e7ef; font-size: 1.4rem; margin-top: 1.5em; margin-bottom: 0.5em;">Models Merged</h3>
350
+ <p style="color: #D1D5DB; margin-bottom: 0.5em;">The following models were included in the merge:</p>
351
+ <ul style="color: #D1D5DB; margin-bottom: 1em; padding-left: 20px;">
352
+ <li style="margin-bottom: 0.5em;"><a href="https://huggingface.co/zerofata/L3.3-GeneticLemonade-Final-v2-70B" style="color: #EC4899; text-decoration: none; border-bottom: 1px dotted rgba(236, 72, 153, 0.4);">zerofata/L3.3-GeneticLemonade-Final-v2-70B</a></li>
353
+ <li style="margin-bottom: 0.5em;"><a href="https://huggingface.co/zerofata/L3.3-GeneticLemonade-Unleashed-v3-70B" style="color: #EC4899; text-decoration: none; border-bottom: 1px dotted rgba(236, 72, 153, 0.4);">zerofata/L3.3-GeneticLemonade-Unleashed-v3-70B</a></li>
354
+ <li style="margin-bottom: 0.5em;"><a href="https://huggingface.co/deepcogito/cogito-v1-preview-llama-70B" style="color: #EC4899; text-decoration: none; border-bottom: 1px dotted rgba(236, 72, 153, 0.4);">deepcogito/cogito-v1-preview-llama-70B</a></li>
355
+ <li style="margin-bottom: 0.5em;"><a href="https://huggingface.co/arcee-ai/Arcee-SuperNova-v1" style="color: #EC4899; text-decoration: none; border-bottom: 1px dotted rgba(236, 72, 153, 0.4);">arcee-ai/Arcee-SuperNova-v1</a></li>
356
+ </ul>
357
+
358
+ <details style="margin-bottom: 20px; background-color: #111827; border-radius: 8px; overflow: hidden; border: 1px solid #2d3748;">
359
+ <summary style="padding: 12px 15px; cursor: pointer; background-color: #1a1f2e; border-bottom: 1px solid #2d3748; font-weight: 600; color: #EC4899; display: flex; align-items: center;">Configuration YAML</summary>
360
+ <div style="padding: 15px;">
361
+ <pre style="background-color: rgba(0, 0, 0, 0.2); border-radius: 6px; padding: 15px; overflow-x: auto; border: 1px solid #2d3748; position: relative;"><code style="font-family: 'Fira Code', 'Courier New', Courier, monospace; font-size: 0.9em; color: #e1e7ef;">models:
362
+ - model: sophosympatheia/strawberrylemonade-70b-v1.0
363
+ parameters:
364
+ weight: [0.1, 0.3, 0.1]
365
+ - model: sophosympatheia/strawberrylemonade-70b-v1.1.0 # This is unreleased right now, uses the arcee-ai/Arcee-SuperNova-v1 model as the base in a very similar nuslerp merge
366
+ parameters:
367
+ weight: [0.9, 0.7, 0.9]
368
+
369
+ merge_method: nuslerp
370
+
371
+ dtype: float32
372
+ out_dtype: bfloat16 # Or float16, float32
373
+ tokenizer:
374
+ source: sophosympatheia/strawberrylemonade-70b-v1.1.0</code></pre>
375
+ </div>
376
+ </details>
377
+ </div>