-
Notifications
You must be signed in to change notification settings - Fork 20.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
geth 1.8.6 sync fails to complete #16603
Comments
yes. I face the same issue too. I have been running for 5 days now. The sync is close but not over.
Geth Sync fail because of dropped peer...
|
I still have the same issue... the sync is not catching up.. always ~100 blocks behind.
|
Same here... |
Mate, my geth is syncing about 35 days till start. Now i have about 158 million pulled states and it is not ended yet. 💩 |
This is expected behaviour, and not an issue. For an explanation of what is happening, along with a small FAQ, see this comment.
There are also many identical issues open on this bug tracker, too. ;) Suggest closing as not a bug. |
Hi, all. If I correctly understood, in the end it's syncing 'states' above all, not 'blocks'. Because 'blocks' almost synced. (1.8.6-stable-12683fec). Geth is syncing about 7 days. (ubuntu 16, 4vCPU and 16GbRam) |
@dpredkel A node that's synced today shows me 135141789 states right before it's done. I think there was an issue opened recently for the second question, but I can't find it. Your When the node seems to stall on initial sync, I'd generally recommend restarting the node - without shredding the database, of course! - so it can set a newer block as a pivot. The machine that I did the sync on has less memory than yours, but very good network connectivity (no-NAT IPv4, 1 GiB/s), and more-or-less dedicated SSD storage. From start to finish, the sync took ~25 hours (with no restarts). Same |
@veox Thanks for answer. Can you say how to correctly restart node without shredding the database? |
@dpredkel What I meant is don't do a |
@veox Oh, ok, I will just restart client. Thanks a lot :) |
@veox hello again, I am again facing the Do you have any thoughts on how to prevent this issue? I have 16GB RAM, 4 cores, and 0.5GiB/s IOPS on Ubuntu 16.04. Does restarting really help? I have a suspicion that some stochastic process causes corruption of the data and prevents the node from behaving properly. I was hoping after 3 weeks, that this would be a stable Geth version.... thanks for your help! |
@GeeeCoin I think you meant to comment in #16539. :) EDIT: In short, though: no, I have no useful thoughts on that. I'm experiencing something similar on a high-load node, but I expect it's being dropped by peers due to its high message round-trip time. |
Same here, constantly around 90 blocks behind the actual chain. I've tried with higher server specs but to no avail. |
@veox ahh I remember that comment well:) Still seems to be unresolved. Have you had success with static-nodes or addmin.addPeer? I've tried both approaches numerous times, but the requested peers don't show up in my peer list. I grabbed fairly recent peers from the eth network site. I think a good contingency would to use Infura as a backup switch. The only problem is how to incorporate my geth wallet with an Infura provider without having to recode my project with the extracted private key (which leads to low level calls required in web3). There's a new library similar to HDWalletProvider, but for private key as opposed to pnemonic https://github.com/rhlsthrm/truffle-hdwallet-provider-privkey. Looks promising, any thoughts? ...Or perhaps Infura has a public enode ID they're willing to share;) |
So did anybody end up resolving this ? Im facing the same issue too. Using geth v1.8.3-stable. Have plenty of system resources available and using an SSD with only 400/3000 iops used. Sync to ropsten works fine but neither fast sync nor full sync to mainnet works. Still at last 100 blocks for the past 2 days. Is there any point to continue running it n it will sync or do we need to look into another method ? Could this be resolved by running an older / different version of geth ? Thanks |
@troowala if you're working on something, you can continue working on Infura until this gets resolved. |
@CD0x23 i wish it was an option. it's not just publishing contracts from remix. the application we use requires communicating with a geth node which has the wallet file residing in the geth node and the application unlocks the file and then executes a multitude of contracts and scripts. So if we want to go the infura route we will have to significantly modify this third party application in an unspported manner to accomodate transferring private key to infura or myetherapi nodes, hence need the geth node to sync with the mainnet. So are you involved with this issue resolution by any chance ? and does that mean no newly provisioned geth nodes are syncing with the mainnet ? thanks |
Okay solution is to ensure that the SSD has atleast 1000 IOPS or more. In case of AWS EC2 instances the cheapest solution is to use GP2 volumes that are 350GB or above as amazon gives you 3 IOPS per GB. Smaller EBS volumes will not sync as they do not have enough IOPS unless you choose IO1 Provisioned IOPS and set it to 1000 IOPS or more which is a lot more expensive way of achieving the same end result. |
@troowala That number might be AWS-specific. |
@veox Darn ! i feel a little ripped off if AWS is hustling like that ... Changing Disk IOPS is the only configuration change i had to make to get the eth mainnet to sync. If you dont mind posting your disk speeds and geth startup parameters that would be a helpful insight. Mine for the blockchain storage volume are Read Speeds /dev/xvdf: Write Speeds $ sync; dd if=/dev/zero of=/store/testspeed bs=1M count=1024; sync Geth Startup parameters ./geth --datadir=/store/mainnet --rpcapi "personal,web3,eth,net,db,debug" --rpc --rpcaddr "0.0.0.0" --rpccorsdomain "10.10.0.0/16,172.21.0.0/24" Thanks |
@troowala Actually, the provider's spec sheet no longer shows "IOPS" anywhere. :/ I've also re-purposed the "350 IOPS" machine to Ropsten earlier this month, and downgraded the plan, so even if I took the measurements, they'd be for something else. The best I can do is show these graphs from the provider's panel: Since the start of December 2017 'till mid-May 2018, it's been running with some variant of:
The peaks on the "write" side represent the occasions of Note that most of the read activity is from serving light peers. It's very unrepresentative regarding a do-nothing A different (KVM-virtualised) machine I'm currently running seems to measure ~ 800 IOPS with |
Closing this because it's an old issue. Many improvements to blockchain sync have been implemented since this issue was opened. If you are still having sync issues, please open a new issue. |
I've tried several clean resyncs, removed the chain database, in attempt to test the new release of geth, 1.8.6. All attempts result in the sync stalling at the very end, about 100-79 blocks from the end:
using command-line: geth --cache=1024 --syncmode "fast"
eth.syncing
{
currentBlock: 5516185,
highestBlock: 5516264,
knownStates: 72045381,
pulledStates: 72029865,
startingBlock: 5516011
}
again 15 min later
eth.syncing
{
currentBlock: 5516254,
highestBlock: 5516326,
knownStates: 74102284,
pulledStates: 74087290,
startingBlock: 5516011
}
I never reach false. This process above has been running for 3 days.
System information
Geth version:
1.8.6-stable-12683fec
OS & Version: OSX High Sierra 10.13.3
Ethereum wallet: 0.10.0
Expected behaviour
sync should complete and final balance should show in wallet
Actual behaviour
sync never completes, wallet never shows balance
Steps to reproduce the behaviour
brew install latest geth, 1.8.6
There are several open tickets, some even list more tickets, for this same issue on other related projects. but for reference:
Ethereum wallet will not sync past the last 65 blocks
ethereum/mist#3738
Ethereum Wallet doesn't sync
ethereum/mist#3836
Issues related with Syncing
ethereum/mist#3097
After syncing, Mist doesn't update info (but stays syncd)
ethereum/mist#3837
Is it possible this change got left out of the latest release?
add 'ps.lock.Unlock()' before return #16360
#16360
Going back to the geth release, 1.8.2, packaged with the wallet 0.10.0, removed chain data, the full sync, not even with the fast mode setting, completes in a few hours. There may be a regression in the 1.8.6 release, or least it may warrant to look into the issue. Let me know if you need any further info, or would like me to try something else out. THanks.
The text was updated successfully, but these errors were encountered: