Skip to content
This repository has been archived by the owner on Mar 9, 2021. It is now read-only.

Commit

Permalink
Fixes pagination detection in Tumblr likes.
Browse files Browse the repository at this point in the history
Fixes the pagination detection in the Tumblr likes crawler for blogs that contain a dash in the name.
  • Loading branch information
johanneszab committed Oct 11, 2018
1 parent 56066a8 commit e4a6c7d
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -215,7 +215,7 @@ private long ExtractNextPageLink(string document)
// <a id="next_page_link" href="/liked/by/wallpaperfx/page/5/1457139681" class="next button chrome blue">Next</a></div></div>

long unixTime = 0;
var pagination = "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.]+/([0-9]+)/([A-Za-z0-9]+))\"";
var pagination = "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.-]+/([0-9]+)/([A-Za-z0-9]+))\"";
long.TryParse(Regex.Match(document, pagination).Groups[3].Value, out unixTime);
return unixTime;
}
Expand Down

0 comments on commit e4a6c7d

Please sign in to comment.