mirau-agent-base-oai

Introduction

mirau-agent-base is a large language model specifically optimized for Agent scenarios, fine-tuned based on Qwen2.5-14B-Instruct. This model focuses on enhancing multi-turn tool calling capabilities, enabling autonomous planning, task execution, and exception handling in complex interactive environments.

Although named "base", this does not refer to a foundation model that has only undergone pre-training, but rather a "cold-start" version that has been through supervised fine-tuning (SFT) and direct preference optimization (DPO), providing a high-quality policy initialization foundation for subsequent reinforcement learning training. We also hope the community can further enhance it through RL training based on this foundation.

The oai version is compatible with OpenAI's function definitions, with no other differences from the previous mirau-agent-14b-base.

Online Demo

ModelScope Studio - mirau-agent-base-demo

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for eliuakk/mirau-agent-base-oai

Base model

Qwen/Qwen2.5-14B
Finetuned
(193)
this model