~30%
Reduction in Aurora IO costs
3 years
Zombie database found and killed
10%
Reseller markup eliminated on day one
Boost your global cloud visibility and control with Pump. Share your email for details!
By submitting your email, you agree to opt in to marketing emails.
Overview
"We had a database running for three years that nobody knew about. The Pump team found it on a live call, and I killed it right there. That's the kind of thing you just can't find on your own."

Bhavani Yanamadala
Head of Engineering & CTO
ForUsAll builds 401(k) plans for startups and small businesses with a mission to give everyday Americans access to the same investment options historically reserved for the wealthy. Their Alt 401(k) was the first retirement plan to offer in-plan cryptocurrency and alternative investment access, and today the company manages over multiple billion in retirement assets for 80,000 participants. A six-person engineering team split between the Bay Area and Medellín, Colombia runs the entire platform on AWS, with an RDS-heavy infrastructure of multiple Aurora clusters across production, staging, and pre-production accounts powering the backend.
Industry
Fintech / 401(k) & Retirement
Integrations

Location
Bay Area, CA & Medellín, Colombia
Pump services
Pump Save
Use Case 1
A Solutions Architecture Investigation That Uncovered What Dashboards Can't
Savings plans cover the predictable spend, but ForUsAll's bill had layers of waste that only a hands-on infrastructure investigation could find. The Pump team assigned a dedicated solutions architect to ForUsAll's account who requested read-only IAM access to their AWS environment and laid out a structured investigation plan across three specific cost categories: AWS Backup, OpenSearch Service, and charged backup usage.
The investigation was methodical. For AWS Backup, the SA audited every backup plan, reviewing the rules for frequency, start windows, lifecycle to cold storage, and retention periods. They checked which resource types were attached to each plan (EBS, RDS, EFS, DynamoDB) and looked for cross-region or cross-account copy rules, which silently double or triple storage costs without appearing as a separate line item. The goal was to determine whether retention was longer than the business actually needed, whether anything cold-eligible was sitting in warm storage, and whether cross-region copies were inflating the bill.
For OpenSearch, the SA requested 30 days of CloudWatch metrics across eight dimensions: CPUUtilization (average and maximum), JVMMemoryPressure (average and maximum), FreeStorageSpace (minimum), SearchRate (average), IndexingRate (average), SearchLatency (average and p99), ClusterStatus (yellow/red maximum), and ThreadpoolWriteQueue/SearchQueue (maximum). These metrics together tell you whether a cluster is over-provisioned on compute, whether memory pressure leaves room for downsizing, how much storage is actually in use versus provisioned, and whether there is any queue pressure that would block a downsize. The SA also requested index-level data from the OpenSearch Dev Tools console to identify large or stale indexes that could move to UltraWarm or be deleted entirely.
For charged backup usage, the SA wrote and shared a CLI script that cross-referenced active EBS volume IDs against existing snapshots to surface orphans: snapshots tied to volumes that no longer existed but were still incurring daily storage charges. ForUsAll's DevOps engineer in Medellín ran the script and cleaned up a batch of orphaned snapshots immediately. The SA also reviewed RDS backup retention periods on each database instance and checked for manual DB snapshots, which persist indefinitely until explicitly deleted and are one of the most common sources of silent cost accumulation.
The zombie database discovery came directly out of this investigation. While drilling into RDS spend at the resource level, the SA noticed an Aurora Serverless v2 operations cluster in the pre-production account showing sustained IO activity and consistent daily spend that didn't correspond to any active workload. The cluster had been provisioned nearly three years earlier as part of a previous build cycle and had been running continuously ever since. At the account level, the spend blended in with legitimate pre-production costs. Only when filtered down to individual cluster ARNs did the orphaned instance stand out. Bhavani stopped it on the call. AWS keeps the data intact on a stopped cluster for seven days before requiring a decision, so there was no risk of data loss. The instance turned out to be completely unused.
Beyond finding waste, the SA provided architectural recommendations. ForUsAll had been restoring their production database into the staging environment by dumping it to S3 and re-uploading through GitHub Actions, burning both compute and CI/CD minutes on every refresh. The Pump SA documented a strategy for using Aurora read replicas and cross-account snapshot sharing to accomplish the same thing natively within AWS, eliminating the S3 intermediary and the GitHub Actions overhead entirely.
Use Case 2
Switching Aurora to IO-Optimized Storage and Cutting IO Costs by 30%
ForUsAll's heaviest and fastest-growing cost line was IO on their Aurora clusters. As the team built a new customer-facing platform on Aurora Serverless v2, IO operations spiked significantly. The increase was expected given the development activity, but the per-operation cost structure was not. By default, Aurora charges for IO on a pay-per-request basis, meaning every read and write operation against the storage layer incurs an individual charge. For workloads with heavy IO this adds up fast and creates unpredictable monthly bills.
This was not something ForUsAll would have found through a savings plan or a dashboard. It required the Pump solutions architect to navigate into the RDS console during a live call with Bhavani, inspect the cluster configuration, and identify a setting that most teams never encounter: Aurora IO-Optimized. This storage configuration replaces the per-request IO pricing with a flat monthly rate baked into the storage cost, effectively functioning like a reserved instance for IO operations. AWS introduced it as an option for Aurora clusters, but it is not the default, not prominently surfaced in the console, and requires a manual modify operation on each cluster individually.
For ForUsAll's workload profile, where IO was already the dominant cost driver, the switch was estimated to reduce that entire cost line by roughly 30% with zero performance impact. It is a billing-layer change only: no failover, no downtime, and no changes to application code or connection strings. The SA identified the same pattern in ForUsAll's pre-production account as well, where a separate cluster was showing a similar IO cost profile. Bhavani applied the change across both environments during the call.
"Some of this stuff is crazy. Like, how would you even know?" - Bhavani Yanamadala, Head of Engineering & CTO
Use Case 3
Layered Commitment Strategy Across Compute, RDS, and OpenSearch
While the solutions architecture investigation uncovered hidden waste, the Pump team also built a commitment strategy to reduce the cost of ForUsAll's baseline workloads. When ForUsAll switched to Pump in March 2026, the first win was immediate: the 10% reseller markup from their previous provider disappeared. Within the first week, the Pump team analyzed ForUsAll's workload patterns across EC2, RDS, and OpenSearch to determine the right mix of compute and database savings plans. Rather than over-committing, the team sized each plan to cover ForUsAll's always-on workloads while leaving headroom for the variable spend tied to their new platform build.
The database savings plan was particularly well-timed. AWS had recently extended database savings plan eligibility to cover OpenSearch Service, which meant a single commitment could now reduce costs across both ForUsAll's Aurora clusters and their OpenSearch domain. The Pump team caught this policy change and folded it into the initial proposal, something ForUsAll's previous reseller had never surfaced. As workloads stabilized over the following weeks, the team layered in additional reserved instance and savings plan commitments calibrated to ForUsAll's actual utilization data, not estimates. Each round was scoped conservatively enough that coverage could grow as the infrastructure matured without locking in spend that might go unused.
"There were several optimizations that the Pump team was able to propose, which we implemented. It was multi-tiered." - Bhavani Yanamadala, Head of Engineering & CTO
Pump’s impact
Within three months of switching to Pump, ForUsAll went from paying a 10% reseller markup with no optimization to running a multi-layered savings strategy backed by dedicated solutions architecture support. The impact came in two distinct layers.
The first layer was commitment optimization: compute savings plans, database savings plans covering both RDS and OpenSearch, and additional reserved instances layered in as workloads stabilized. This addressed the predictable, always-on spend that ForUsAll's previous reseller had never touched.
The second layer, and the one Bhavani described as the real differentiator, was the solutions architecture engagement. The Pump SA didn't just review dashboards. They requested IAM access, pulled 30 days of CloudWatch metrics across multiple dimensions, wrote CLI scripts for the engineering team to run, analyzed index-level OpenSearch data, audited backup retention policies across every plan and vault, and provided documented architectural recommendations for cross-account database restores. This is the kind of work that typically requires a dedicated FinOps engineer or an expensive third-party consultant. For ForUsAll, it came as part of their Pump engagement.
The results speak to why this matters. A three-year-old zombie database was found and stopped. IO-optimized storage mode was identified and applied across two environments, cutting IO costs by an estimated 30%. Orphaned EBS snapshots were surfaced and cleaned up. A production-to-staging database restore workflow was redesigned to eliminate S3 and GitHub Actions overhead. None of these would have been caught by a savings plan engine or a cost dashboard alone. They required someone inside the infrastructure, understanding the architecture, and knowing where AWS buries the configurations that most teams never find.
For a six-person engineering team with no dedicated FinOps function, this level of support effectively extended ForUsAll's infrastructure capabilities without adding headcount. Bhavani described it clearly: "For some of the usage benefits, we did need that consultation with the solutions architect." That consultation is what separated the savings ForUsAll gets with Pump from the zero optimization they received from their previous reseller.
ForUsAll is now building its next-generation platform on Aurora Serverless v2 and expanding its use of AI tools for development. Bhavani has expressed interest in LLM compute reservation strategies and is exploring Pump View for unified visibility across AWS and AI tool spend as that footprint grows.

