What is JSON Lines Format?
JSON Lines format (often called NDJSON) is a text-based data delivery standard where each line is a valid, standalone JSON object. For high-volume scraping pipelines, it replaces monolithic JSON arrays by allowing records to be streamed, appended, and parsed incrementally without loading the entire dataset into memory. If your pipeline crashes mid-write, a JSON array is corrupted; a JSON Lines file simply ends at the last valid record.