Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
WarcParser: Improve compatibility with ARC variants
This makes it so we can read the warcio example.arc file and the example in the ARC file format reference. * Ignore up to 3 spurious linefeeds at the start of ARC records. * Accept ARC records with the trailing linefeed missing. * Accept (but currently ignore) the extra URL-record-v2 fields. * Accept "0" in the ARC IP address field. Fixes #82
- Loading branch information