
2x concurrent users validated
Performance Testing · Retail · Capacity Planning
An architectural change, an unknown performance impact, and a three-week window. spriteCloud found the platform's breaking point and the CPU capacity hiding behind it.
An international eCommerce platform serving markets in Europe, Africa and South America planned an architectural change. Multiple vendors had built parts of the application and infrastructure, and none of them could say what the change would do to performance under load.
This engagement built on a partnership of over seven years and more than 40 projects together.
The client needed answers before onboarding more customers: how many concurrent users can the platform take, where does it break, and what does that mean for scaling the Azure environment up or down without overspending?
The timeline left no slack. Scripting, workload modelling, multiple test scenarios, execution, monitoring and analysis, all inside three weeks.
One spriteCloud performance engineer ran the project, with a test manager presenting strategy and results to stakeholders. The first step was building a workload model that matched reality: we analysed production traffic in Google Analytics on peak days, per country organisation, and recreated that pattern in OctoPerf. New Relic, already in place at the client, handled monitoring.
Baseline at 100 percent of peak load to verify the new architecture. Then 150 percent, validated against pre-agreed KPIs. The same again with back-office load running in parallel, to measure degradation. Then stress tests, first per country and then combined, pushing concurrent users up until the platform broke. Front-end page load behaviour was analysed throughout.
The stress test found the breaking point: throughput saturated and the application stopped processing additional requests. The cause sat in CPU utilisation, and inside that, in garbage collection. 35 percent of CPU time went to garbage collection and only 65 percent to actual application work.
We recommended a G1 GC configuration change to claw that capacity back, flagged the slowest queries and Java methods as bottlenecks under peak load, and added front-end fixes such as image compression and resizing.
Performance regressions now surface in the development pipeline, not on a peak sales day.
The spriteCloud team helped us regain confidence in our current eCommerce solution and has provided us with the insights needed to continue our customer onboarding plans.
International eCommerce platform
Every team has a regression problem. Let’s fix yours in 8 weeks.
Start the conversationSee more case studies