Blog

Technical posts from the FutureSearch team.

LiteLLM made us accidentally make our product free for a week

June 10, 2026·

In LiteLLM v1.83.10, the allow_client_tags security feature started silently stripping caller-supplied request_tags unless an admin opts the key or team in. The request still returns 200, with no error and no caller-side signal, so the tags our billing pipeline depends on vanished from LiteLLM_SpendLogs. For six and a half days every paid task and conversation deducted $0. Here is how the tag stripping happened, why our spend alerts missed it, and what we changed.

Blog

LiteLLM made us accidentally make our product free for a week

DeepSeek V4 Pro vs GPT-5.5: What the Benchmarks and Forecasts Say

GitHub Copilot Per-Token Pricing: The Median Seat Still Costs $19

Intel Crescent Island: Why No Major Cloud Will Deploy It by 2027

Microsoft's GPQA Diamond Score in 2026: Forecasting a Top Four Lab

Some rare examples of AIs being underconfident

History doesn't repeat itself as often as LLMs think

Agents sometimes catastrophize

Run agents twice for fun and profit

AGI Timeline Predictions: How Top Forecasters Updated, 2023 to 2026

AI 2027 Update: A One Year Timeline Check

Forecasting Polymarket Questions with AI

Catching the LiteLLM PyPI Attack: The Full Claude Code Transcript

LiteLLM Hack: Were You One of the 47,000?

litellm 1.82.8 Supply Chain Attack on PyPI (March 2026)

How a Poisoned litellm Package Compromised an MCP Server in Cursor

Why new Date() Parses Almost Any String: V8 and the Implementation Defined Trap

The Self-Optimizing SEO Pipeline: Claude Code Agents on Google Search Console Data

How We Built a Marketing Pipeline with Claude Code

How to Stop MCP Servers From Leaving Orphaned Docker Containers

How to Debug AI Agents by Analyzing Their Own Traces with LLMs

Caution: Read the Docs for Claude 4.6's Effort Parameter

How to Run Claude Code as a Kubernetes CronJob

What Is a Claude Code Workflow? Running Pipelines as Markdown

Unleashing AI forecasters on Kalshi prediction markets

Can AI Beat Kalshi? Simulating a Prediction Market Portfolio

MCP structuredContent: How to Return Large Results Without Flooding the Context Window

OpenAI Responses vs Chat Completions API: Why Structured Outputs Differ

How to Upload Large Files to an MCP Server Without Filling the Context Window

LLM API Differences That Break Your Code: Anthropic vs OpenAI vs Google

Ask LLM Agents to Classify Problems Before Starting

How Much Does Deep Research Cost? A Model-by-Model Breakdown

Using LLMs for Data Cleaning At Scale

How AI Finds Fuzzy Duplicates in Large Datasets

How LLM Agents Solve the Table Merging Problem

Do Founder-Led Companies Outperform? The S&P 500 Returned 118% vs 59%

How to Rank S&P 500 Companies by Risk of Management Turnover

Top Frontier AI Labs and Models in 2026: Who Is Leading the AI Race

AI 2027 Six Months Later: Karpathy, Kokotajlo, and Shifting AGI Timelines

A Guide for LLM Assisted Web Research

Superhuman Coders in AI 2027 - Not So Fast

How Tariffs Will Increase Prices on American-Made Products: Cost Impact Analysis

Apple's Plan to Power Siri with ChatGPT was a Predictable Failure

How Deep Research Agents Fail: Lessons from OpenAI, Gemini, and Perplexity

OpenAI Deep Research: Honest Analysis and Real Limitations in 2025

The Death and Life of Prediction Markets at Google

How to Integrate AI Into Forecasting

The Human v Bots Forecasting Tournament