I break the Xbox One/Series. Featured on OSGWiki. Former Xbox MVP. Previously InfoSec at Apple, then SRE at DreamBox Learning, now looking for a new opportunity. Artificial Intelligence LLM enthusiast, wannabe expert. They/Them. 🏳️‍🌈
glaiveai/reasoning-v1-20m. After training for 1,000 steps over 48 hours on my poor overworked Tesla P40, I was able to produce merged FP16 weights, the LoRA adapter, and a Q8 quantization. Check out the readme.md for an example CoT.
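For anyone curious about the merge step, here is a minimal sketch of folding a LoRA adapter into the base model to get plain FP16 weights using peft. The model and adapter paths are placeholders, not my actual setup, and the Q8 pass is only hinted at in a comment.

```python
# Minimal sketch: merge a LoRA adapter into the base model and save merged FP16 weights.
# BASE_MODEL and LORA_ADAPTER are placeholder paths, not the real training artifacts.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_MODEL = "path/to/base-model"       # placeholder
LORA_ADAPTER = "path/to/lora-adapter"   # placeholder

base = AutoModelForCausalLM.from_pretrained(BASE_MODEL, torch_dtype=torch.float16)
model = PeftModel.from_pretrained(base, LORA_ADAPTER)

# Fold the LoRA deltas into the base weights so the result is a plain FP16 checkpoint.
merged = model.merge_and_unload()
merged.save_pretrained("merged-fp16")
AutoTokenizer.from_pretrained(BASE_MODEL).save_pretrained("merged-fp16")

# The Q8 weights would come from a separate quantization pass over "merged-fp16"
# (e.g., a GGUF convert + quantize step with llama.cpp).
```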
reacted to WizardLM's post almost 2 years ago
Auto Evol-Instruct uses an iterative process to automatically optimize the initial Evol-Instruct V1 evolving method into an optimal one. The pipeline consists of two critical stages: Evol Trajectory Analysis, where the optimizer LLM analyzes the issues and failures exposed in the instruction evolution performed by the evol LLM, and Evolving Method Optimization, where the optimizer LLM addresses those issues to progressively develop a more effective evolving method. The optimal evolving method is then used to convert the entire instruction dataset into more diverse and complex forms, facilitating improved instruction tuning.
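Conceptually, the loop looks something like the sketch below. `evol_llm`, `optimizer_llm`, and the prompt strings are illustrative stand-ins, not the paper's actual implementation.

```python
# Rough sketch of the two-stage Auto Evol-Instruct loop described above.
def auto_evol_instruct(seed_instructions, evolving_method, evol_llm, optimizer_llm, rounds=3):
    for _ in range(rounds):
        # Evolve a batch of instructions with the current evolving method.
        trajectories = [
            evol_llm(f"{evolving_method}\n\nInstruction: {inst}")
            for inst in seed_instructions
        ]

        # Stage 1: Evol Trajectory Analysis -- the optimizer LLM inspects the
        # evolved outputs and reports issues/failures in the evolution.
        feedback = optimizer_llm(
            "Identify issues and failures in these instruction evolutions:\n"
            + "\n".join(trajectories)
        )

        # Stage 2: Evolving Method Optimization -- the optimizer LLM rewrites
        # the evolving method to address the reported issues.
        evolving_method = optimizer_llm(
            f"Current evolving method:\n{evolving_method}\n\n"
            f"Feedback:\n{feedback}\n\n"
            "Rewrite the evolving method to fix these issues."
        )

    # The final optimized method is then applied to the full instruction dataset.
    return evolving_method
```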
2. Scaling Evol-Instruct with Arena Learning
With Auto Evol-Instruct, the evolutionary synthesis data of WizardLM-2 has scaled up from the scope of WizardLM-1 to dozens of domains, covering tasks across all aspects of large language models. This allows Arena Learning to train on and learn from a nearly unlimited pool of high-difficulty instruction data, fully unlocking the potential of Arena Learning.