Unable to Crawl At All #96
The exact same settings work for me here. I suppose you've ticked a checkbox for the post kind you want to download from your blog in the details window, as shown on the front page (e.g. Download images)? Which Windows version are you on, and is any proxy or VPN in use? Otherwise I probably cannot really help you, since downloading works here with the exact same settings and any tips would just be guessing. |
Oh, it might help to mention I'm running off an Intel Core 2 Quad Q9650, 8 GB of DDR2 and a Gigabyte EP43-US3L motherboard. I dunno how relevant any of that is, but I know that sometimes running on older hardware can cause problems, so yeah. |
It's not hardware related, and every Windows version above Windows XP (with .NET 4.5) should be working fine. As I've already said, I cannot provide you any more useful help without error/debug messages. You might want to install Visual Studio, download the code and debug it, but I agree that this is kinda overkill. There might be some information in the Event Viewer, but I highly doubt it. If I were you, I'd try it with as direct a connection to the internet as possible. Do you have any Windows installation around without all this crap installed? If that's working, then there is at least a hint. But I'm not sure if the connection is the reason at all. Another guess is that you might not be able to access the Tumblr API. You could try this version here: #33 Let other people here know what you did if you could figure it out. |
Hmm... I think I can make a virtual machine and see if the program works inside that. I can also try installing Visual Studio and debugging the code, but yeah, would prefer to not do that if I don't have to. Considering how many artists' blogs I try to keep up with though, (in addition to other solutions that I'm aware of being in disrepair or simply not doing the fullest of their job ever), I'm kinda really needing this program to work, so I'm willing to do whatever within my abilities to get it working. I'll comment again when I've had a look into the VM and/or the debugging-in-Visual-Studio method. |
Ok, I've attempted the Visual Studio method, since my VM isn't working at the moment. When attempting to build the solution in order to debug, this is what the Output tab spat out.
Just in case it's useful, 'cause I'm not familiar enough with Visual Studio to totally understand what's useful or not, here's what the Error List came up with as well. To note also, since my last comment I've been trying to run the program on my HP Elitebook 8440p laptop, which uses an Intel Core i5 580M and is also running Windows 8.1 Pro, but installed straight from 8.1 Pro installation media rather than the 8.1 upgrade. Same results despite completely different hardware. Is any of this helpful to you? If not, is there anything else I can do on my end to figure out what's wrong? |
Looks like those two Microsoft .dlls are part of the Blend for Visual Studio SDK for .NET, listed under individual components in the VS 2017 Community installer. Either you can rerun the setup from the Control Panel -> Programs and Features -> VS 2017 -> Change, or you might try to install the .dlls using a NuGet package within Visual Studio itself, as described in this Stack Overflow question. Using the installer has the advantage that the version number will match your Visual Studio, whereas downloading a random .dll might not work if there is a version mismatch. The missing .dlls are also part of the TumblThree.zip from the release page. Thanks for testing all this out. I'll update the instructions accordingly to make it easier for the next one to test this out. That .dll is used for the preview, to switch between the movie and image control depending on the input. |
The important messages are these:
but there is nothing wrong with them. The LoadLibrary:End is a debug message which I've added to check the exit of the LoadLibrary method. Since TumblThree is now properly async code, there are a lot of threads which start and do things concurrently. One of them may have loaded the library. An exit code of 0 usually means success. So, you might have to set breakpoints next to the code line numbers at various points. You can start with the Crawl Task in the CrawlerController of the Applications assembly and work inwards from there. That's the command used to start the Crawl (i.e. hitting the Crawl button on the user interface). Edit: I've never seen the messages on the screenshot. It's also helpful to always check the exception window, not just the Debug window. Clicking exceptions within the exception window usually brings you directly to the right spot. |
Frustratingly, I wasn't able to find any exceptions in any tests I've run besides the TaskCanceledException thingy you indicated as a normal exception. Additionally, I'm doing all this with little understanding of the actual code, as I don't have any experience in C# or any of the other things that TumblThree uses to function, so it's entirely possible that I'm missing something that would be clearer to someone else. I'm attempting again to try and get a virtual machine working with Windows 8.1 and Windows 7 respectively to see if it's a problem with the operating system my computers are using. For now, I'm stuck with the only theory being that Windows 8 and TumblThree simply don't like each other. That said, the other crawler I was using, referred to simply as Tumblr Image Downloader, had stopped working for me too, (which is what prompted me to find this program to begin with,) and I think it works a similar way that TumblThree currently does. Maybe there was an update or a missing dependency on my computers running Windows 8 that knocked out functionality that the two require to function? |
I don't think so. Since Tumblr Image Downloader is written in Java (and not C# as TumblThree is) and all other internet-related things seem to work, I cannot think of any way that would be possible. Is the blog in the manager (left side) actually shown as Online in the Status row of TumblThree? If so, then TumblThree could already connect to the Tumblr API. The scanning is basically doing the exact same thing, just with different URLs. If it's shown as offline, did you actually try the v.1.0.5.16 release? |
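For anyone who wants to rule out the connection without installing Visual Studio, a small probe like the following might help. It is only a minimal sketch in Python, not TumblThree's own code: the function name `probe` and the example blog URL are hypothetical, and it checks a URL you supply rather than whichever Tumblr endpoint TumblThree actually requests, which this thread does not state.

```python
import urllib.error
import urllib.request


def probe(url, opener=urllib.request.urlopen, timeout=10):
    """Send one GET request to url and return (ok, detail).

    `opener` is injectable so the function can be exercised offline;
    by default it is the standard library's urlopen.
    """
    try:
        with opener(url, timeout=timeout) as response:
            status = response.status
            return 200 <= status < 300, f"HTTP {status}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (e.g. 403, 404).
        return False, f"HTTP {exc.code}"
    except OSError as exc:
        # Covers URLError, DNS failures, refused connections and timeouts.
        return False, f"connection failed: {exc}"


# Hypothetical usage (replace the placeholder with a blog you follow):
# print(probe("https://some-blog-name.tumblr.com/"))
```

If this reports a connection failure from the same machine and network where the Crawl button does nothing, a proxy, VPN, or firewall is a more likely culprit than the application itself. A successful result does not prove that TumblThree's own requests get through.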
I'll close this but if you have any more questions, I'll still try to answer them. |
I would have loved to give a positive review. However, this app would not function at all. |
Just to contribute my two cents; I'm not having much trouble with the application myself, currently. I've been using it non-stop since tumblr's announcement of their policy change on NSFW/Adult content on their site, with only one or two specific blogs making the app crash when attempting to crawl. While it doesn't automatically add links anymore for whatever reason, (maybe I've not read something in the changelogs?), all the buttons in the main window work perfectly fine for me as of version 1.0.8.65 with .NET framework 4.0.30319.42000 64 Bit. All things considered, I think I've basically inadvertently stress-tested it with little to no problems with a roster of somewhere over a thousand blogs. As such, I for one have to give it a positive response. |
Ever since I found this program, across the last two to three updates, I've been trying every way I could think of to get it to crawl blogs. It seemingly accepts tumblr URLs just fine, and the location I've set the program to download to seems to be ok too, but whenever I hit the 'Crawl' button the program just... sits there.
I've even left the program sitting there for 3-5 hours and it still does nothing. Authentication doesn't seem to affect anything (unless there's more to that than simply logging in to your tumblr account like any user would), and deleting the appdata settings and starting fresh hasn't helped either.
These are my connection settings for the application, if that helps at all.
Between failed efforts and a bunch of Googling, I'm feeling rather frustrated that the one crawler that seems to do what others fail to do doesn't even crawl. I'm feeling kinda desperate at this point, so any help would be greatly appreciated.