OpenAI APIs
Comparisons of the OpenAI service offering with that of Anthropic. Includes context window, rate limits and model optimization.

My notes on the design of Anthropic's APIs and some general design considerations for provider-based APIs and SDKs. Covers rate limiting, service tiers, SSE flow and some of the REST API endpoints.

Notes on the evolution of Java's Unsafe and off-heap memory access. Touches on Unsafe, the FFM API and the Agrona DirectBuffer.

Some notes on false sharing and cache line padding.

Some notes on the basics of CPU caches - covers locality, write policies, hierarchy and inclusion policies.

Notes on "CAPO: Cost Aware Prompt Optimization" (June 2025) from the Munich Center for Machine Learning.

Notes on HTTP/2 and web server threading models. Covers servlets, web server/container architecture (Jetty), HTTP/2, HTTP/2 streams (prioritization, flow control) and SSE.

Notes on memory barriers and fencing.

Notes on AI/LLM guardrails and safety patterns from the book "Agentic Design Patterns" by Antonio Gulli, one of Google's Distinguished Engineers.

Some introductory notes I took on Aeron whilst reading its documentation. Covers flow and congestion control, status messages and the transport protocol.
