WARCRecordFormat uses the value of the metadata key _response.headers_ to include the HTTP headers in the WARC representation. The WARCTypeValue would then be 'response' instead of 'resource'.
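To make the idea concrete, here is a rough sketch of how the record type could be chosen from the metadata. It is not the actual WARCRecordFormat code: the class `WarcTypeSketch`, its methods, and the use of a plain `Map` for the metadata are all assumptions made for illustration. Only the key name `_response.headers_` and the 'response' / 'resource' values come from the description above.

```java
import java.nio.charset.StandardCharsets;
import java.util.Map;

// Hypothetical helper, not part of the project: picks the WARC record type
// depending on whether verbatim HTTP response headers were stored.
class WarcTypeSketch {
    // Metadata key named in the issue description.
    static final String HEADERS_KEY = "_response.headers_";
    static final String CRLF = "\r\n";

    // 'response' when headers are available, 'resource' otherwise.
    static String warcType(Map<String, String> metadata) {
        String headers = metadata.get(HEADERS_KEY);
        return (headers == null || headers.isEmpty()) ? "resource" : "response";
    }

    // Record block: headers, an empty line, then the body. Without stored
    // headers the block is just the body.
    static byte[] recordBlock(Map<String, String> metadata, byte[] content) {
        String headers = metadata.get(HEADERS_KEY);
        if (headers == null || headers.isEmpty()) {
            return content;
        }
        // Normalise trailing line breaks so exactly one blank line separates
        // the headers from the body.
        String head = headers.replaceAll("\\s+$", "") + CRLF + CRLF;
        byte[] headBytes = head.getBytes(StandardCharsets.ISO_8859_1);
        byte[] block = new byte[headBytes.length + content.length];
        System.arraycopy(headBytes, 0, block, 0, headBytes.length);
        System.arraycopy(content, 0, block, headBytes.length, content.length);
        return block;
    }
}
```

The fallback to 'resource' keeps records valid for protocol implementations that do not store the headers.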
Storing the request headers is not easy to do with HttpClient, as the user agent info is not accessible from the HttpGet object.
jnioche changed the title from "HTTP protocol : store request and response headers verbatim in metadata" to "HTTP protocol : store response headers verbatim in metadata" on Jul 21, 2016
In case it's useful in the future, I think this is how Heritrix does it, i.e. it wraps the input and output streams at the socket level and records what happens so it can be picked apart afterwards.
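For illustration, a minimal sketch of that wrapping idea, assuming nothing about Heritrix's real classes: `CapturingInputStream` and `headerBlock` are hypothetical names. The wrapper copies every byte read from the underlying stream into a buffer, so the raw exchange can be split into headers and body afterwards.

```java
import java.io.ByteArrayOutputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical wrapper: records everything that is read through it.
class CapturingInputStream extends FilterInputStream {
    private final ByteArrayOutputStream captured = new ByteArrayOutputStream();

    CapturingInputStream(InputStream in) {
        super(in);
    }

    @Override
    public int read() throws IOException {
        int b = super.read();
        if (b != -1) {
            captured.write(b);
        }
        return b;
    }

    // read(byte[]) in FilterInputStream delegates here, so bytes are only
    // recorded once.
    @Override
    public int read(byte[] buf, int off, int len) throws IOException {
        int n = super.read(buf, off, len);
        if (n > 0) {
            captured.write(buf, off, n);
        }
        return n;
    }

    byte[] capturedBytes() {
        return captured.toByteArray();
    }

    // Picks the header block out of a recorded HTTP message: everything
    // before the first empty line.
    static String headerBlock(byte[] raw) {
        String text = new String(raw, StandardCharsets.ISO_8859_1);
        int end = text.indexOf("\r\n\r\n");
        return end < 0 ? text : text.substring(0, end);
    }
}
```

The same approach applied to the output stream would capture the request as it is actually sent. The sketch ignores `skip`, `mark` and `reset`, and keeps everything in memory, so a real implementation would need a size limit or a spill to disk.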
Similarly, we'll need to store the request for https://github.com/DigitalPebble/sc-warc/issues/1.