oops - the last comment only addresses _item_ name collisions in 24hrs, which is certainly required, but the warc _filename_ will also need a timestamp to protect against restart collisions (which resets the serial number). so we'll need something more like:
oops - the last comment only addresses _item_ name collisions in 24hrs, which is certainly required, but the warc _filename_ will also need a timestamp to protect against restart collisions (which resets the serial number). so we'll need something more like:
WIDE-5- 20101213120000/ WIDE-2010121312 0000-00395. warc.gz
or
{item_name} /{warc_ filename}
item_name:
{job} -{jobnode} -{timestamp}
warc_filename:
{job} -{timestamp} -{serialno} .warc.gz
hopefully that's still helpful.