This article fixture describes simple page fetching, HTML parsing, and predictable content extraction for baseline crawls.