⚠️ Add bias warning - true bias-free score is ~3.80%
Browse files
README.md
CHANGED
|
@@ -29,6 +29,18 @@ model-index:
|
|
| 29 |
verified: false
|
| 30 |
---
|
| 31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 |
# Verantyx V6 — Rule-Based Symbolic Reasoning System
|
| 33 |
|
| 34 |
**HLE Score: 6.84% (171/2500) — No GPU, No API, No LLM**
|
|
|
|
| 29 |
verified: false
|
| 30 |
---
|
| 31 |
|
| 32 |
+
> ⚠️ **WARNING: This score includes statistical biases**
|
| 33 |
+
>
|
| 34 |
+
> - **Position Prior**: Letter-frequency bias (B>D>C>A based on HLE training data stats)
|
| 35 |
+
> - **Fallback Prior**: Default answer B→D→C→A when no reasoning path found
|
| 36 |
+
> - **General Detectors**: Hardcoded answers for specific known problems
|
| 37 |
+
>
|
| 38 |
+
> **True bias-free score: ~3.80%** (95/2500)
|
| 39 |
+
> **Clean implementation**: https://github.com/Ag3497120/verantyx
|
| 40 |
+
|
| 41 |
+
---
|
| 42 |
+
|
| 43 |
+
|
| 44 |
# Verantyx V6 — Rule-Based Symbolic Reasoning System
|
| 45 |
|
| 46 |
**HLE Score: 6.84% (171/2500) — No GPU, No API, No LLM**
|