WIP: Push scaling #3080


Merged · 15 commits · Jan 14, 2017
Conversation

flovilmart
Contributor

flovilmart commented Nov 20, 2016

This PR attempts to solve the scaling issues related to sending large amounts of pushes.

What's in there?

  • Makes the _PushStatus updates increment-based instead of replacing the full status.
  • Introduces PushQueue and PushWorker

The responsibility of the PushQueue is to split the work (configurable by batchSize) and publish it so it's available to the PushWorker.
The responsibility of the PushWorker is to consume messages published by the PushQueue. All the work is represented by a PushWorkItem.
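The splitting step can be sketched roughly like this (hypothetical helper names; the real PushQueue batches database query results rather than plain arrays):

```javascript
// Illustrative sketch of splitting a push audience into work batches of
// `batchSize` items; hypothetical helper, not the actual PushQueue code.
function createBatches(installationIds, batchSize) {
  const batches = [];
  for (let i = 0; i < installationIds.length; i += batchSize) {
    batches.push(installationIds.slice(i, i + batchSize));
  }
  return batches;
}

// Each batch would then be published as one PushWorkItem for a worker.
const batches = createBatches(['a', 'b', 'c', 'd', 'e'], 2);
// batches is [['a', 'b'], ['c', 'd'], ['e']]
```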

A new adapter is introduced, ParseMessageQueue. It shares the same API as ParsePubSub, but is 1..1 instead of 1..* (fan-out). Messages in the ParseMessageQueue should be consumed by the worker only once (otherwise the push would be sent twice).

It is also possible to disable the push worker on parse-server and defer execution to the message queue. The intent is to enable integrations such as SQS/Lambda or Google Message Service / Cloud Functions.

The default ParseMessageQueue implementation is based on an EventEmitter, where only the last subscriber is active.

Fixes #2977

@facebook-github-bot

@flovilmart updated the pull request - view changes


@adirgan

adirgan commented Nov 29, 2016

What is missing so that this is available in the master?

@flovilmart
Contributor Author

@adirgan probably testing and deploying to SNS/Lambdas, making sure no regressions exist, rebasing, testing with Google Cloud as well (PubSub + GCF), etc.

flovilmart added this to the 2.3.0 milestone Dec 1, 2016

flovilmart removed this from the 2.3.0 milestone Dec 7, 2016
@adirgan

adirgan commented Jan 15, 2017

@flovilmart I do not think it is a problem that causes instability; it is only a problem with the status itself.

@flovilmart
Contributor Author

I can see that when at least one recipient is marked, the push is properly marked as sent.
Do some of your pushes, the ones with 0, really have an empty audience? Or is it something else?

@adirgan

adirgan commented Jan 15, 2017

@flovilmart It's not that it has no audience. The notifications are per user; if a user does not want to receive a given kind of alert, they are excluded by placing the codes of the alerts they opted out of in a list. The server still sends the push even though the user does not want to receive it, because the platform that issues the alerts is completely separate from Parse. As you can see in the image, the ones with no audience stay stuck in "sending".

@adirgan

adirgan commented Jan 15, 2017

@flovilmart But it also happens when you have many users to send to and some do not get the push for some reason, such as having uninstalled the application, etc.

[screenshot: 2017-01-15 at 12:18:46 AM]

@adirgan

adirgan commented Jan 15, 2017

@flovilmart Before updating, with the previous version of parse-server without this scaling work, the status always ended in "sent" when sending a push, even for those with no audience.

@flovilmart
Contributor Author

I have a pretty good idea why it may happen, I'll check for a fix.

@jeacott1

jeacott1 commented Jan 15, 2017

It's not clear to me from browsing this PR, but is the applicationId sent with each push request into the queue adapter? It doesn't look like it is.

@flovilmart
Contributor Author

flovilmart commented Jan 15, 2017

It does:

https://github.com/ParsePlatform/parse-server/blob/master/src/Push/PushQueue.js#L53

Each item enqueued has:

```js
const pushWorkItem = {
  body,
  query,
  pushStatus: { objectId: pushStatus.objectId },
  applicationId: config.applicationId
}
```

@jeacott1

doh - yeah just found it myself. excellent, thx.

@jeacott1

This works pretty well at not locking up the server, but there's no backpressure on push, so if the publisher is held up, or it's all running with the default setup and the sending backs up, RAM still just grows.
Perhaps replacing this with Rx streams could fix it?

@flovilmart
Contributor Author

Not sure what you mean. Also, we can't solve all dimensioning problems. Node being single-threaded, it's quite impossible to 'nice' the push sending process or reduce its priority.

I'm not sure how streams would solve the issue, if they solve anything.

From what I see in what I wrote, we could limit the number of 'concurrent' messages being processed at once in the PushWorker with an internal logic queue that would just hold the received messages, dequeuing and processing them one at a time. This way, it would be guaranteed that only one batch hits memory at any given time.

What do you think?
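The idea could look roughly like this (a simplified synchronous sketch with hypothetical names; the real worker would await each batch's send before dequeuing the next):

```javascript
// Sketch: buffer incoming messages and process them one at a time, so
// only one batch's worth of work is handled at any given moment.
class SerialQueue {
  constructor(processFn) {
    this.processFn = processFn;
    this.pending = [];
    this.draining = false;
  }
  enqueue(message) {
    this.pending.push(message);
    if (this.draining) return; // already processing: just buffer it
    this.draining = true;
    while (this.pending.length > 0) {
      this.processFn(this.pending.shift());
    }
    this.draining = false;
  }
}

// Messages received while one is being processed are buffered, not
// processed concurrently.
const handled = [];
const queue = new SerialQueue((m) => handled.push(m));
queue.enqueue('batch-1');
queue.enqueue('batch-2');
// handled is ['batch-1', 'batch-2']
```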

@jeacott1

jeacott1 commented Jan 17, 2017

I think reactive streams could solve the issue because they support backpressure; even the JS implementation does, and almost every other language also has one. Just holding the received messages isn't enough: the rate at which the database is read (well, the messages generated) needs to be throttled. For even modest installation counts, the input side can overwhelm RAM as it builds up in the queue faster than it can be processed. Throwing it all directly to SQS via the adapter is likely to solve the issue, but that doesn't help the many folks who don't run in a cloud and don't want to run their own queue.
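One way to picture the backpressure idea (a hypothetical pull-based sketch, not a concrete proposal for parse-server): the sender requests the next bounded chunk only after the previous one has been fully handled, so unread work stays in the database instead of piling up in process memory.

```javascript
// Sketch of pull-based backpressure: the consumer controls the read
// rate by asking the producer for one bounded batch at a time.
function drain(producer, consumer) {
  let batch = producer.nextBatch();
  while (batch.length > 0) {
    consumer.send(batch);         // fully handle this batch first...
    batch = producer.nextBatch(); // ...then pull the next chunk
  }
}

// Toy producer that serves a fixed list in chunks of a given size.
function makeProducer(items, chunkSize) {
  let cursor = 0;
  return {
    nextBatch() {
      const batch = items.slice(cursor, cursor + chunkSize);
      cursor += batch.length;
      return batch;
    },
  };
}

const sent = [];
drain(makeProducer([1, 2, 3, 4, 5], 2), { send: (b) => sent.push(b) });
// sent is [[1, 2], [3, 4], [5]]
```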

@dvanwinkle

@flovilmart Any documentation on this?

@flovilmart
Contributor Author

Not at the moment (unfortunately). This is just an internal re-implementation; we'll follow up with examples of queues and consumers at a later time.

@dvanwinkle

@flovilmart Can I assume that this should work for nearly 8 million installations? I just updated to version 2.3.2 and I'm still unable to send pushes to any large number of installations.

@acinader
Contributor

acinader commented Feb 7, 2017

@dvanwinkle are you on AWS?

If so, you can give this a try; I've tested it, and if you want to try it, I can document it and answer any questions/fix any issues for you.

@dvanwinkle

@acinader Unfortunately, we're on Azure. While it isn't a bad service, most of the people on it aren't doing OSS. I may take your linked PR and see if I can repurpose it for one of Azure's queue services.

@flovilmart
Contributor Author

That would be awesome! I'm planning to implement it for Gcloud pub/sub, which would give the holy trinity of cloud services.

The next 'big' step would be the lightweight sender (that can report the status), which would not require parse-server but just a client.

@jjdp

jjdp commented Feb 14, 2017

This PR broke https://www.npmjs.com/package/parse-server-onesignal-push-adapter.

I'm getting a new unsubscribed user with no device on the OneSignal dashboard, so it seems the deviceToken is not getting sent somehow.

@mihai-iorga
Contributor

mihai-iorga commented Mar 8, 2017

This PR broke my push adapters; I cannot update parse-server with a custom push adapter. I will investigate when I have time to look over all the changes.

Minor changes should not break existing code.

@flovilmart
Contributor Author

@mihai-iorga you're 100% right: the changes should be backwards compatible, and they are designed to be.

Can you open a proper issue, filling in the issue template, so we can have a look?

@mihai-iorga
Contributor

@flovilmart Yes, as soon as I have some spare time to investigate, I will do that. Thanks

@mortizbey

Do you have, or plan to have, an adapter implementation for use with AWS SQS?

@dvanwinkle

dvanwinkle commented Apr 5, 2017

@ortimanu This works well for me... #3080 (comment)

Edit: I should probably note, it's also on NPM

@mortizbey

@dvanwinkle awesome! thanks!

@kontextbewusst

Sounds like a big improvement we've been waiting on for a long time... is there an estimate of when we will see this included in a release version?

@dvanwinkle

@kontextbewusst This is in the current version. You just have to also install the plugin that's relevant to you.

@kontextbewusst

@dvanwinkle thank you, is there any kind of documentation for the right config? Also, which plugin are you referring to?
