Payscale Casestudy

About PayScale

Headquartered in Seattle and launched in January 2002, PayScale has been giving light to a once dark area by providing salary, benefits and compensation information. The service not only helps job candidates be informed of what they are worth but also helps companies make competitive salary offers. PayScale has collected the world’s largest salary information database with over 50 million individual salary profiles. The service works by enabling individuals to submit their job and salary profiles and be compared to other individuals in the market. With this volume of information, the company has been able to statistically induce and cater real-time salary information.

The Challenge

PayScale hosts several large Microsoft SQL Server databases with high I/O workload. It is imperative to have measures in place to protect data and be able to recover from any failure without compromising performance of production databases. The company wanted to update their Business Continuity and Disaster Recovery Plan for the databases in the event of a full data center outage and needed a backup site with the infrastructure that could not only keep up with demand and can easily scale but also secure. This is where Amazon Web Services (AWS) was clearly the right choice.

Why Amazon Web Services

Due to the workload of the databases and other existing technologies in place, it was determined using native SQL Server Database Replication would provide the least intrusive way of copying data to the DR site. Amazon EC2 became the natural choice for the DR server. Although Amazon RDS service is available, having a Microsoft SQL Server on EC2 allowed a deeper level of database administration needed to setup this particular replication.

In terms of security, the Microsoft SQL Server DR instance is placed inside a VPC. Amazon Virtual Private Cloud (VPC) is a logically isolated virtual network in AWS in which instances and other resources can be securely provisioned. The VPC service allowed staff to connect to the Microsoft SQL Server DR instance via their existing VPN while protecting the instance from unauthorized access.

Data is replicating asynchronously from the datacenter to AWS over VPN connection. A staging instance was setup to be the publisher for the DR databases. The figure below shows the resulting database DR architecture.

Payscale database DR architecture

Figure 1: Payscale database DR architecture

Benefits

Aside from providing a DR site for the databases, AWS also became a playground for doing database development and load testing. It became easy to launch a test environment by launching a SQL Server instance with the EBS snapshot of the DR instance. PayScale also has realized upfront cost savings by using AWS Reserved Instance as opposed to the infrastructure on hardware. AWS has helped the company’s agility with cost savings and business continuity.

Cascadeo continues to support and provide MS SQL advice to PayScale to keep the systems running.