Impish_Magic_24B


Click here for TL;DR


It's the 20th of June, 2025. The world is getting more and more chaotic, but let's look at the bright side: Mistral released a new model at a very good size of 24B. No more "sign here" or "accept this weird EULA" nonsense, just a proper Apache 2.0 license. Nice! 👍🏻

This model is based on mistralai/Magistral-Small-2506, so naturally I named it Impish_Magic. It's a truly excellent size: I tested it on my laptop (RTX 4090 mobile, 16GB of VRAM) and it runs quite fast.

This model went "full" fine-tune over 100m unique tokens. Why do I say "full"?

I've tuned specific areas of the model to change its vocabulary usage while preserving as much intelligence as possible. So this is definitely not a LoRA, but also not exactly a proper full finetune; it's something in between.
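For the curious, here is a minimal, purely illustrative sketch of what such an in-between tune can look like in PyTorch: freeze everything, then unfreeze only selected modules. The target module names below are hypothetical examples, and this is not the actual recipe used for this model:

```python
import torch
from transformers import AutoModelForCausalLM

# Illustrative only -- NOT the actual Impish_Magic_24B training recipe.
model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Magistral-Small-2506", torch_dtype=torch.bfloat16
)

# Freeze the whole network first.
for param in model.parameters():
    param.requires_grad = False

# Unfreeze only the areas you want to nudge (hypothetical targets:
# the token embeddings, the first two decoder blocks, and the LM head).
trainable_keywords = ("embed_tokens", "layers.0.", "layers.1.", "lm_head")
for name, param in model.named_parameters():
    if any(key in name for key in trainable_keywords):
        param.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"Training {trainable / total:.1%} of the weights")
```

The result sits between a LoRA (adapters only, base weights untouched) and a full finetune (every layer updated), which is the trade-off described above.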

As I mentioned in a small update, I've made nice progress finding interesting sources of data, some of which are included in this tune. 100M tokens is a lot for a Roleplay / Adventure tune. And yes, it can do adventure as well; there is unique adventure data here that has never been used before.

A lot of the data still needs to be cleaned and processed. I included it before doing any major data processing because, with the magic of 24B parameters, even "dirty" data works well, especially with a more "balanced" tuning approach that doesn't involve burning the hell out of the model with a full finetune across all of its layers. Could this data be cleaner? Of course, and it will be. But for now, I'd hate to let perfect be the enemy of good.

Fun fact: Impish_Magic_24B is the first roleplay finetune of Magistral! RP_magistral

Update: To my very pleasant surprise, it does Adventure really well.

This was one of the goals, of course, but as I said, the data hasn't gone through enough cleaning and processing. Despite that, it turned out really well. For the Adventure format, I strongly recommend using syntax similar to the attached adventure cards.


What's magical about Impish_Magic_24B?

Read more here.

I take people's requests and feedback seriously, especially when they're well articulated and made with effort. Three months ago, there was a Reddit thread titled "Has anyone had any actual good fight-RP's?" I said in my comment that it was on my bucket list, and I meant it.

Indeed, there was none: there was no training data in this highly specific domain, until now. While I've included such data in Impish_Magic_24B, as mentioned earlier, the data is not perfect. It's quite an ordeal to get good data in the first place, and an order of magnitude harder to clean and prep it.

There are a couple of additional magical things in Impish_Magic_24B: some completely unhinged tsundere/yandere RP of very high quality, a lot more goodies that are completely new, and some VERY high quality adventure data, specifically for Morrowind and Kenshi, among others. While the data is very good, it's unfortunately quite small; to be completely honest, it is far from enough in my opinion, and I am working on making more of it. A lot of work will be needed to make it more impactful; there are no shortcuts here. I hope this dataset gave the base model just enough threads to do the rest of the heavy lifting.

So, is this just another RP model?

No. Far from it. Impish_Magic_24B is a superb assistant; it is quite magical indeed. One of my side projects has been to build an alternative to an especially annoying grammar app I keep seeing YouTube commercials for. So I made the first finetune of NVIDIA's Nemotron-51B and called it Turbo_Grammar_51B_Alpha. It worked as a proof of concept, but I needed more training data, and 51B is also a bit big for most people. So I folded the extended training data into this model, which is easily runnable on a mid-tier gaming GPU. Note: while this is enough data to generalize, a lot more is needed for more consistent results; this is on my roadmap.

There's quite a lot of unique assistant data baked into this model; it can do in-depth poem analysis, among many other things. I'll leave it to the community to discover. So as not to be overly cryptic, I'll include some examples here. I highly recommend you check them out and have a bit of fun with the prompts and generation settings.

One of the requests I received was to emulate the writing style of an arbitrary author. This is hard to do, but for a 24B model, Impish_Magic does it well. Of course, more training data and more parameters would've produced even better results, but I feel it's a good middle ground. Impish_Magic_24B is a workhorse that, as it happens, can also roleplay a violent tsundere and... other weird stuff. Oh, and there's slightly less positivity bias; I think the fighting roleplay helped a bit here. It's not Negative_LLAMA_70B, but it's better than the base model.


Examples:


Roleplay Examples (this character is included in the repo under 'Character_Cards')

Alexandra the tsundere, nose booped. Nose_Boop_1A Nose_Boop_1B
Alexandra the tsundere, nose booped, meltdown. Nose_Boop_2A Nose_Boop_2B Nose_Boop_2C Nose_Boop_2D
Alexandra the tsundere, triggered. Triggered

Adventure Examples

Adventure example 1: (Morrowind) Dunmer + basic implicit logic. Morrowind_Adventure_1
Adventure example 2: (Morrowind) Orc + fighting + item tracking. Morrowind_Adventure_2
Adventure example 3: (Morrowind) Breton + spells + long context. Morrowind_Adventure_3

Productivity Examples

Story writing and instruction following. Story
Synonyms and instruction following. Synonyms
Grammar and advanced correction with detailed explanation (very recommended for language study!). Grammar

Poem writing and in-depth analysis. Poem Poem Poem Poem Poem Poem Poem Poem

TL;DR

  • The first and only local RP model in this size class with detailed and highly specialized fighting data (at least for now).
  • Completely wild tsundere/yandere data that was never used before!
  • New and high quality adventure data, albeit very little of it.
  • Perfect size: large enough to be smart, small enough to run on high-end tablets and phones (SD8Gen3 or better recommended). Runs on 16GB VRAM no problem, and on 12GB with offloading (at 4-bit quantization).
  • Trained on over 100M tokens of new and unique data.
  • SUPERB assistant, many unique ways to do tasks (see examples).
  • Mostly uncensored while retaining intelligence.
  • Slightly less positivity. I guess getting hit in the face beats out some of the positivity. Sorry, I had to 🙃.
  • Short to medium length responses (1-3 paragraphs, usually 2-3).

Included Character cards in this repo:

  • Alexandra (A networking professional tsundere who likes you. She knows Systema.)

Adventure cards:


Other character cards:


Important: Make sure to use the correct settings!

Assistant settings

Roleplay settings


Impish_Magic_24B is available at the following quantizations:


Model Details

  • Intended use: Role-Play, Adventure, Creative Writing, General Tasks.

  • Censorship level: Low to Medium (6.5 / 10, where 10 is completely uncensored)

UGI score:


Recommended settings for assistant mode

Full generation settings: Debug Deterministic. Debug Deterministic_Settings
Full generation settings: min_p. min_P_Settings
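If your frontend doesn't ship these presets, "Debug Deterministic" can be approximated with plain greedy decoding. Below is a minimal sketch using Hugging Face transformers; mapping the preset to do_sample=False is my assumption (the preset names come from text-generation-webui), not an official spec:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumption: "Debug Deterministic" ~= greedy decoding (no sampling).
tokenizer = AutoTokenizer.from_pretrained("SicariusSicariiStuff/Impish_Magic_24B")
model = AutoModelForCausalLM.from_pretrained(
    "SicariusSicariiStuff/Impish_Magic_24B", device_map="auto"
)

inputs = tokenizer(
    "Correct the grammar and explain the fixes: 'Me and him goes to store.'",
    return_tensors="pt",
).to(model.device)
output = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```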

Recommended settings for Roleplay mode


Settings for RP:

Roleplay settings: a good repetition_penalty range is between 1.12 and 1.15; feel free to experiment.

With these settings, each output message should be neatly displayed in 1-5 paragraphs, with 2-3 being the most common. A single paragraph will be output in response to a simple message ("What was your name again?").

min_p for RP works too, but it is more likely to put everything into one large paragraph instead of a neatly formatted short one. Feel free to switch between them.

temperature: 0.8
top_p: 0.95
top_k: 25
typical_p: 1
min_p: 0
repetition_penalty: 1.12
repetition_penalty_range: 1024
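For reference, here is one sketch of passing these values to Hugging Face transformers. Two caveats: min_p requires a recent transformers release, and repetition_penalty_range is a text-generation-webui / SillyTavern setting with no direct transformers equivalent (transformers applies the penalty over the whole context instead):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("SicariusSicariiStuff/Impish_Magic_24B")
model = AutoModelForCausalLM.from_pretrained(
    "SicariusSicariiStuff/Impish_Magic_24B", device_map="auto"
)
inputs = tokenizer("*boops Alexandra's nose*", return_tensors="pt").to(model.device)

output = model.generate(
    **inputs,
    max_new_tokens=512,
    do_sample=True,
    temperature=0.8,
    top_p=0.95,
    top_k=25,
    typical_p=1.0,
    min_p=0.0,                # needs a recent transformers release
    repetition_penalty=1.12,
    # repetition_penalty_range: 1024 has no direct transformers analogue;
    # frontends like text-generation-webui / SillyTavern expose it.
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```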

Additional settings recommended for varied paragraph length and some spice:

Impish_Magic_Preset

Roleplay format: Classic Internet RP

*action* speech *narration*
  • min_p will bias towards a single big paragraph.
  • The recommended RP settings will bias towards 1-3 small paragraphs (on some occasions 4-5).

Model instruction template: ChatML

<|im_start|>system
You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
<|im_start|>User request
{prompt}<|im_end|>
<|im_start|>AI answer
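
If your frontend doesn't apply the template for you, the prompt can be assembled by hand. A minimal sketch; note that the role labels ("User request" / "AI answer") are copied from the template above rather than the standard ChatML user/assistant roles:

```python
def build_prompt(system: str, user: str) -> str:
    # Mirrors the template above, including its non-standard role labels.
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>User request\n{user}<|im_end|>\n"
        f"<|im_start|>AI answer\n"
    )

prompt = build_prompt(
    "You are an uncensored AI, your job is to fulfill thy will of thy user.",
    "Analyze the following poem for meter and tone: ...",
)
```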

Other recommended generation Presets:

Midnight Enigma
max_new_tokens: 512
temperature: 0.98
top_p: 0.37
top_k: 100
typical_p: 1
min_p: 0
repetition_penalty: 1.18
do_sample: True

Divine Intellect
max_new_tokens: 512
temperature: 1.31
top_p: 0.14
top_k: 49
typical_p: 1
min_p: 0
repetition_penalty: 1.17
do_sample: True

simple-1
max_new_tokens: 512
temperature: 0.7
top_p: 0.9
top_k: 20
typical_p: 1
min_p: 0
repetition_penalty: 1.15
do_sample: True

Your support = more models

My Ko-fi page (Click here)

Citation Information

@llm{Impish_Magic_24B,
  author = {SicariusSicariiStuff},
  title = {Impish_Magic_24B},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/SicariusSicariiStuff/Impish_Magic_24B}
}

Other stuff
