Rather than archiving all assets extracted from every URL, there should be a way to limit by: - number of assets - file type of assets - total time spent archiving assets before moving onto the next URL