Too much duplicate data #366
Comments
Unscientifically, turning off eagerly sending small blocks may help? I'm not sure. However, it only reduced duplicate data from 76% to 45%. And it seems like duplicate data becomes more of a problem as the query runs.
Scientifically, that option shouldn't make a difference because I only made the change on my own node. But the nodes I'm downloading from should mostly be old.
I've tried removing the "don't have timeout" and I'm not seeing any improvement. However, it really looks like this problem gets worse and worse as the download progresses. NOTE: I may have messed up when disabling the "don't have timeout"s. You should double check these results.
This was caused by two issues:
Ok, I found the bug in pin. We were wrapping the dagService and FetchGraph was doing a type assertion.
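For anyone hitting something similar, here is a minimal sketch of how wrapping a service can silently defeat a type assertion on its concrete type. This is not the actual pin or FetchGraph code: `NodeGetter`, `dagService`, `wrappedService`, and `usesOptimizedPath` are hypothetical stand-ins, and the assumption is only that the fetcher picks a different code path when the assertion fails.

```go
package main

import "fmt"

// NodeGetter is a hypothetical stand-in for the interface the fetcher accepts.
type NodeGetter interface {
	Get(key string) string
}

// dagService is a hypothetical concrete service that the fetcher knows
// how to optimize for.
type dagService struct{}

func (d *dagService) Get(key string) string { return "node:" + key }

// wrappedService embeds the interface, so callers still get a valid
// NodeGetter, but the concrete *dagService is hidden behind the wrapper.
type wrappedService struct {
	NodeGetter
}

// usesOptimizedPath mimics a fetcher that takes its optimized path only
// when the argument's dynamic type is exactly *dagService.
func usesOptimizedPath(ng NodeGetter) bool {
	_, ok := ng.(*dagService)
	return ok
}

func main() {
	ds := &dagService{}
	wrapped := wrappedService{NodeGetter: ds}

	fmt.Println(usesOptimizedPath(ds))      // true
	fmt.Println(usesOptimizedPath(wrapped)) // false: assertion sees wrappedService
	fmt.Println(wrapped.Get("a"))           // still works, so nothing looks broken
}
```

The wrapper keeps working as a `NodeGetter`, which is why this kind of bug produces no error and only shows up as degraded behavior. Asserting on a small interface that the wrapper can also implement, rather than on the concrete type, is one way to avoid it.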
@dirkmc thanks for investigating this. This looks fixed.
When testing on the live network, I'm getting 76% duplicate data when pinning dist.ipfs.io.
This should have been fixed so something is going very wrong here.