Tian-Fantasea/test123

Papers related to each model:

7B

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration
TiFRe: Text-guided Video Frame Reduction for Efficient Video Multi-modal Large Language Models
Searching Meta Reasoning Skeleton to Guide LLM Reasoning
Post-Training Quantization of OpenPangu Models for Efficient Deployment on Atlas A2
OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs
Structured Episodic Event Memory
Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification
CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns
Beyond Static Question Banks: Dynamic Knowledge Expansion via LLM-Automated Graph Construction and Adaptive Generation
Your Models Have Thought Enough: Training Large Reasoning Models to Stop Overthinking
EvoOpt-LLM: Evolving industrial optimization models with large language models
Accelerating OpenPangu Inference on NPU via Speculative Decoding
V-CAGE: Context-Aware Generation and Verification for Scalable Long-Horizon Embodied Tasks
Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions
Differentially Private and Communication Efficient Large Language Model Split Inference via Stochastic Quantization and Soft Prompt
A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs
Multi-source Heterogeneous Public Opinion Analysis via Collaborative Reasoning and Adaptive Fusion: A Systematically Integrated Approach
Characterize LSM-tree Compaction Performance via On-Device LLM Inference
Traceable Cross-Source RAG for Chinese Tibetan Medicine Question Answering
A Comprehensive Evaluation of LLM Reasoning: From Single-Model to Multi-Agent Paradigms
Structured Self-Consistency: A Multi-Task Evaluation of LLMs on VirtualHome
ArkEval: Benchmarking and Evaluating Automated CodeRepair for ArkTS
FMBench: Adaptive Large Language Model Output Formatting
DScheLLM: Enabling Dynamic Scheduling through a Fine-Tuned Dual-System Large language Model
Holmes: An Evidence-Grounded LLM Agent for Auditable DDoS Investigation in Cloud Networks
Cochain: Balancing Insufficient and Excessive Collaboration in LLM Agent Workflows
How Few-shot Demonstrations Affect Prompt-based Defenses Against LLM Jailbreak Attacks

1B

Post-Training Quantization of OpenPangu Models for Efficient Deployment on Atlas A2
VisionPangu: A Compact and Fine-Grained Multimodal Assistant with 1.7B Parameters
Neuromem: A Granular Decomposition of the Streaming Lifecycle in External Memory for LLMs
CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns
A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs
AGZO: Activation-Guided Zeroth-Order Optimization for LLM Fine-Tuning
Combining Adam and its Inverse Counterpart to Enhance Generalization of Deep Learning Optimizers
Characterize LSM-tree Compaction Performance via On-Device LLM Inference
PanguMotion: Continuous Driving Motion Forecasting with Pangu Transformers
FMBench: Adaptive Large Language Model Output Formatting
Cochain: Balancing Insufficient and Excessive Collaboration in LLM Agent Workflows
How Few-shot Demonstrations Affect Prompt-based Defenses Against LLM Jailbreak Attacks
Adaptive Confidence Gating in Multi-Agent Collaboration for Efficient and Optimized Code Generation

7B-V1.1

SpatialText: A Pure-Text Cognitive Benchmark for Spatial Understanding in Large Language Models
Efficient Reasoning with Balanced Thinking
OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs
EvoOpt-LLM: Evolving industrial optimization models with large language models
Accelerating OpenPangu Inference on NPU via Speculative Decoding
CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis
TypePro: Boosting LLM-Based Type Inference via Inter-Procedural Slicing
UniPruning: Unifying Local Metric and Global Feedback for Scalable Sparse LLMs
Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
DScheLLM: Enabling Dynamic Scheduling through a Fine-Tuned Dual-System Large language Model
VAR-MATH: Probing True Mathematical Reasoning in LLMS via Symbolic Multi-Instance Benchmarks

1B-V1.1

Silent Inconsistency in Data-Parallel Full Fine-Tuning: Diagnosing Worker-Level Optimization Misalignment
AdapShot: Adaptive Many-Shot In-Context Learning with Semantic-Aware KV Cache Reuse
Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference

Detailed list

Searching Meta Reasoning Skeleton to Guide LLM Reasoning

Papers for Tian-Fantasea/test123

Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

Paper • 2602.08324 • Published 18 days ago

AdapShot: Adaptive Many-Shot In-Context Learning with Semantic-Aware KV Cache Reuse

Paper • 2605.03644 • Published 28 days ago

Searching Meta Reasoning Skeleton to Guide LLM Reasoning

Paper • 2510.04116 • Published Apr 16

A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs

Paper • 2604.09752 • Published Apr 15

TypePro: Boosting LLM-Based Type Inference via Inter-Procedural Slicing

Paper • 2604.02702 • Published Apr 3