Building an eCommerce platform at scale is a deep endeavour and requires deeper engineering efforts across the spectrum of Performance Engineering, Site Reliability & Incident Management. This section describes our approach to building and operating large scale retail platforms on JC
We start by drawing a simulation models of all possible user journeys, funnels & conversion percentage for an eCommerce Store-front
The traffic model is then augmented with scenarios which simulate the actual user behavior
The example shows how these techniques are applied by our SRE Teams. We use the traffic model to simulate synthetic traffic executed against a replica of the production environmentScenario: Checkout flow Server side Key Performance metrics
Our Monitoring tool-chain helps us to watch and understand system’s state using predefined set of metrics
Our metrics pipeline is central to our approach and uses open source proven components which record real time metrics at scale. This helps teams stay on top of issues and get pro-active feedback.
The Alerting tool-chain notifies teams about critical events or exceeding threshold limits
We’ve also created a custom blueprint helps in faster analysis and identify the exact issue between multiple points in the Infrastructure. Here’s an example -
We define Production Readiness as the process that ensures sure each feature push is production ready for live customers. We follow a checklist which audits each platform service and the cloud infrastructure. A high level PR process is mentioned below -
Make sure systems are scaled enough to handle desired traffic
Explore our Platform or request a demo from our Professional Services Team