Ollama Basics
· 5 min read
The basics of Ollama, with details on quantization, vector DBs, model storage, model formats and Modelfiles.
A comparison of OpenAI's service offering with Anthropic's. Covers context windows, rate limits and model optimization.
My notes on the design of Anthropic's APIs and some general design considerations for provider-based APIs and SDKs. Covers rate limiting, service tiers, SSE flow and some of the REST API endpoints.
Some notes on the high-level data API in TensorFlow.