Blog Posts

The Agent Cost Wars — Updated: GLM-5, M2.7, and What the Leaderboard Actually Tells Us

The Agent Cost Wars — Updated: GLM-5, M2.7, and What the Leaderboard Actually Tells Us

TL;DR A week ago I published a piece about MiniMax M2.5’s “$1/hour agent” promise and called it the dawn of always-on intelligence. Since then, the leaderboard moved — fast. GLM-5 dethroned M2.5 as the top open-weight model, M2.7 shipped with the same aggressive pricing but unconfirmed licensing, and Gemini 3.1 Pro quietly became the price-performance king among flagships. This is my attempt to challenge my own earlier post with updated numbers from Artificial Analysis, and share some careful impressions on what this means for practitioners heading into the second half of 2026.

Read More
The View from Outside the Glass: Why Growing Organizations Need the Outsider's Mirror

The View from Outside the Glass: Why Growing Organizations Need the Outsider's Mirror

TL;DR, Growing engineering organizations get trapped in the “inner loop” — high-velocity execution that slowly drifts from strategic intent. An external consultant’s value isn’t superior knowledge; it’s lower latency to the truth. This post explores why speaking up is an act of service, not arrogance, and how the “outsider’s mirror” helps teams move from reactive execution to intentional alignment.

Read More
The $1,892 Agent: MiniMax M2.5 and the Dawn of Always-On Intelligence

The $1,892 Agent: MiniMax M2.5 and the Dawn of Always-On Intelligence

TL;DR

MiniMax M2.5 is a 230B-parameter model that activates only 4% of its weights per token — yet scores 80% on SWE-bench Verified, putting it neck-to-neck with Anthropic Opus 4.6 at roughly 3% of the cost. At ~$1,892/year for a continuously running agent, the “always-on agent” is no longer a thought experiment. The cost of intelligence is approaching the cost of electricity, and Jevons’ Paradox says: demand is about to explode.

Read More
AWS KMS Best Practices: Securing the Secret Ingredients of Your Infrastructure

AWS KMS Best Practices: Securing the Secret Ingredients of Your Infrastructure

TL;DR

AWS KMS gives you three flavors of encryption keys — AWS-owned, AWS-managed, and Customer Managed Keys (CMKs). For anything resembling production, CMKs are the only real choice: they give you control over rotation, deletion, cross-account access, and most critically — the ability to kill a key in an emergency. Think of KMS like a hotel’s key management system: the entrance guard has one master card, but the security manager holds the safe with all the master keys. Designing your KMS strategy right is what keeps you from handing that safe to an attacker.

Read More
AWS Landing Zone Accelerator — When Multi-Account Governance Gets Real

AWS Landing Zone Accelerator — When Multi-Account Governance Gets Real

TL;DR

If you’re managing more than a handful of AWS accounts with compliance requirements like HIPAA or FedRAMP, you’ll quickly outgrow IAM Identity Center and manual guardrails. AWS Landing Zone Accelerator (LZA) is an open-source CDK application that turns a set of YAML configuration files into a fully governed, multi-account, multi-region AWS environment — including networking, security controls, and OU-based policy enforcement. This post walks through a real-world design: a shared Transit Gateway architecture with Dev/Prod isolation, NACL-based traffic boundaries, and dual-region deployment for multiple workload types.

Read More
AI Chaos and Productivity How to Choose Your Core Tool Stack (and Stop the FOMO)

AI Chaos and Productivity How to Choose Your Core Tool Stack (and Stop the FOMO)

This post was originally published on Israeli Tech Radar.

TLDR; beyond the cost of conversations, this post is going to discuss the golden path and hopefully introduce the methodology of choosing the most fitted tool-set using a handful of tools which can help mold you A.I strategy. We’ve all heard of 10x developers and this post is aiming at building 10x teams which scale-well with A.I.

Read More
AI Usage: Are You Vibing or Building? Your Wallet Might Know

AI Usage: Are You Vibing or Building? Your Wallet Might Know

This post was originally published on Israeli Tech Radar.

Artificial intelligence is rapidly changing how we work and create. We’re using it for everything from drafting quick emails to designing complex systems. But how we interact with AI — the “usage model” — significantly impacts both the results we get and the money we spend, if your not spending money on A.I / plan to but not dure on what stay tuned and lets start thinking in “token” which might fit in to 2 code categories:

Read More
Taming the AI Beast: The Role of the LLM Gateway

Taming the AI Beast: The Role of the LLM Gateway

In a personal setting, managing AI costs might be a matter of checking your credit card statement. But for an organization with multiple teams, projects, and AI providers, it’s a financial and operational challenge.

Read More
The Cost of Conversations

The Cost of Conversations

This post was originally published on Israeli Tech Radar.

We’ve all been there: a simple question to an AI model turns into a lengthy, back-and-forth “vibe session.” It’s effortless, it’s creative, and it feels free. But behind every token is a hidden cost, and the bill at the end of the month can be a surprise.

Read More
From smoke to sanity: how I unintentionally used Pomodoro and rebuilt my focus after quitting

From smoke to sanity: how I unintentionally used Pomodoro and rebuilt my focus after quitting

Originally posted on the Israeli Tech Radar on medium.

TLDR: For 25 years in the demanding world of IT, I had a secret productivity weapon. It wasn’t a tool or a certification; it was a ritual that could cut through the thickest mental fog and reset my brain on the most complex problems.

Read More