What is HTTP 401 Unauthorized?
HTTP 401 Unauthorized is the standard HTTP status code indicating that a request lacks valid authentication credentials for the target resource. In scraping pipelines, a 401 rarely means just a wrong password; it is more often the symptom of an expired session token, a missing Authorization header, or a cookie jar that failed to persist across redirects. When a pipeline hits a 401, the extraction job typically cannot proceed until it re-authenticates (for example, by logging in again or refreshing the token) and retries the request.
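As a rough illustration of recovering from a 401, the sketch below retries a request after refreshing the token, with a cap on the number of refreshes so a persistently rejected credential cannot loop forever. It is a minimal sketch under stated assumptions, not a reference implementation: `fetch_with_reauth`, `send`, `refresh_token`, `fake_send`, and `fake_refresh` are hypothetical names, and the `(status, body)` tuple stands in for whatever response object a real HTTP client returns.

```python
from typing import Callable, Tuple

# Hypothetical stand-in for a real HTTP response: (status code, body).
Response = Tuple[int, str]


def fetch_with_reauth(
    send: Callable[[str], Response],
    refresh_token: Callable[[], str],
    token: str,
    max_refreshes: int = 1,
) -> Tuple[Response, str]:
    """Send a request; on 401, refresh the token and retry a bounded number of times.

    Returns the final response and the token that was used to obtain it.
    """
    response = send(token)
    refreshes = 0
    while response[0] == 401 and refreshes < max_refreshes:
        token = refresh_token()  # re-establish auth state (assumed to return a new token)
        refreshes += 1
        response = send(token)
    return response, token


# Simulated server for illustration: only the token "fresh" is accepted.
def fake_send(token: str) -> Response:
    if token == "fresh":
        return (200, "payload")
    return (401, "")


def fake_refresh() -> str:
    return "fresh"


result, used_token = fetch_with_reauth(fake_send, fake_refresh, token="expired")
print(result, used_token)
```

In a real pipeline, `send` would wrap the HTTP client call and `refresh_token` would perform the login or token-refresh step. If the response is still 401 after the allowed refreshes, the function returns it unchanged so the caller can decide whether to fail the job.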