I need to implement microservice which is fairly simple in terms of logic and architecture, but needs to handle around 305k requests per second.
All it's going to do is to ingest JSON data, validate it according to simple rules and record to Google Cloud Storage as JSON files. There are lots of Google Cloud services and APIs available, but it's hard for me to pick proper stack and pipeline because I have not had much experience with them as well as with highload.
There is an example I'm looking at https://cloud.google.com/pubsub/docs/pubsub-dataflow
The flow is the following:
PubSub > Dataflow > Cloud Storage
It does exactly what I need (except date validation) but looks like Dataflow is limited to Java and Python, and I'd rather use PHP.
Another relevant example is https://medium.com/google-cloud/cloud-run-using-pubsub-triggers-2db74fc4ac6d
It uses Cloud Run, with supports PHP, and PubSub to trigger Cloud Run workload. So it goes like:
PubSub > Cloud Run
and working with Cloud Storage in Run looks pretty simple.
Am I on a right way? Can something like mentioned above work for me or do I need something different?