<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>PromptZone - Leading AI Community for Prompt Engineering and AI Enthusiasts: HAESEONG JEON</title>
    <description>The latest articles on PromptZone - Leading AI Community for Prompt Engineering and AI Enthusiasts by HAESEONG JEON (@haeseong_jeon_bfa6d48ed8d).</description>
    <link>https://www.promptzone.com/haeseong_jeon_bfa6d48ed8d</link>
    <image>
      <url>https://promptzone-community.s3.amazonaws.com/uploads/user/profile_image/25840/e4def429-8ca5-4456-9b01-21741490cf6b.png</url>
      <title>PromptZone - Leading AI Community for Prompt Engineering and AI Enthusiasts: HAESEONG JEON</title>
      <link>https://www.promptzone.com/haeseong_jeon_bfa6d48ed8d</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://www.promptzone.com/feed/haeseong_jeon_bfa6d48ed8d"/>
    <language>en</language>
    <item>
      <title>Spanlens</title>
      <dc:creator>HAESEONG JEON</dc:creator>
      <pubDate>Fri, 03 Jul 2026 10:26:36 +0000</pubDate>
      <link>https://www.promptzone.com/haeseong_jeon_bfa6d48ed8d/spanlens-2k6f</link>
      <guid>https://www.promptzone.com/haeseong_jeon_bfa6d48ed8d/spanlens-2k6f</guid>
      <description>&lt;p&gt;Spanlens is an open-source (MIT) LLM observability platform that lets developers monitor every call their application makes to OpenAI, Anthropic, Gemini, Mistral, OpenRouter, Azure OpenAI, or a local Ollama model. Integration takes one line: swap your client's baseURL to the Spanlens proxy, or run "npx @spanlens/cli init" and the wizard rewrites your code automatically. From that moment, every request is recorded with its model, token counts, latency, cost, and full prompt and response body, with streaming responses reconstructed automatically.&lt;/p&gt;

&lt;p&gt;The dashboard turns that raw log into operational insight. Cost tracking breaks spend down per request, per model, and per end user, and parses prompt-cache tokens separately so you see real cache savings rather than sticker price. Agent tracing visualizes multi-step workflows as Gantt waterfalls and node-and-edge graphs, highlighting the critical path so you can find the slowest dependency chain in a fan-out. Anomaly detection flags 3-sigma deviations in latency, cost, or error rate against a rolling 7-day baseline with root-cause hints. Alerts on budget, error rate, and p95 latency are delivered to Email, Slack, or Discord.&lt;/p&gt;

&lt;p&gt;Spanlens goes beyond passive logging. A regex-based PII and prompt-injection scanner inspects request and response bodies and can block injections at the proxy. The savings engine spots calls that match a cheaper model's profile (for example, a gpt-4o call that looks like a classification task) and estimates the monthly saving from switching. Prompt versioning with A/B experiments compares versions on latency, cost, and error rate using Welch's t-test for statistical significance, and an LLM-as-judge evaluation framework (judge with OpenAI, Anthropic, or Gemini) scores outputs against rubric anchors, with human agreement measured by Pearson r or Cohen's kappa. Reusable datasets power offline evals and regression checks.&lt;/p&gt;

</description>
      <category>llm</category>
      <category>claude</category>
      <category>openai</category>
      <category>gemini</category>
    </item>
  </channel>
</rss>
