c# - How can I prevent my web crawler from slowing down over time? -
i made web crawler in c#. starts 1 url, finds urls in url , visits other urls, , on...
i add urls string array pre-defined size , dictionary can check if url has been crawled (i use dictionary's containskey() method because it's faster linear array search).
it fast when starts working, on time gets painfully slow. reason dictionary's contains() method takes lot of time when dictionary big (100k+ urls, example), , means web crawler slowing down on time.
what can this? have check if url has been added already, , dictionary lookup fastest way, way gets slow after dictionary gets large enough.
Comments
Post a Comment