GLM-5.2 Local Inference — Dropday.ai

Source post

GLM-5.2 Local Inference

GLM-5.2 Local Inference

GLM-5.2 Local Inference

Running GLM-5.2 locally on Mac Studio M2 Ultra with hardware acceleration.

Built with

GLM-5.2NEW

Strong

The creator explicitly names the model and provides terminal output as evidence of the local inference.

The X post title explicitly states: 'GLM-5.2 running locally on Mac Studio M2 Ultra at 17-19 TPS'.

Build evidence

Moderate

The creator provides terminal performance logs as proof of the build, though no open-source repo or playable demo link is provided.

Creator

とのけん3 @Tono_Ken3

Shipped

2h ago · model from Jun 16, 2026

A successful deployment and performance test of the GLM-5.2 reasoning model running locally on Apple hardware. The project demonstrates inference speeds of ~17-19 tokens per second on an M2 Ultra, utilizing Metal for GPU offloading.

#llm #local #inference #optimization

Timeline

Teaser

Video

Playable

Product

Loading…

Similar

Unsloth MTP Implementation

Unsloth MTP ImplementationApps & Tools

ShardApps & Tools

TracerApps & Tools