Cloud Zone is brought to you in partnership with:

Kathiravelu Pradeeban is an Open Source Evangelist. He is a postgraduate student of the Erasmus Mundus European Master in Distributed Computing, a Master of Science joint degree from Instituto Superior Técnico, Lisbon, Portugal, and KTH Royal Institute of Technology, Stockholm, Sweden. He holds a Bachelor of the Science of Engineering (Hons) degree, majoring Computer Science & Engineering, with a first class from the University of Moratuwa, Sri Lanka [Batch 2010]. He is also an old boy of Royal College, Colombo [A/L 2005]. He is highly interested in FOSS development, and is an active participant of the Google Summer of Code (GSoC) program since 2009 - with AbiWord (2009 as a student and since 2011 as a mentor), a light weight word processor, and with (OGSA-DAI) Open Grid Services Architecture Data Access and Integration (2010 as a student), an innovative solution for distributed data access and management, mentored by OMII-UK. His research interests include Distributed Computing and Data mining. Kathiravelu is a DZone MVB and is not an employee of DZone and has posted 25 posts at DZone. You can read more from them at their website. View Full User Profile

Configuring WSO2 Load Balancer for Auto Scaling

  • submit to reddit
This post assumes that the reader is familiar at configuring the WSO2 Load Balancer without autoscaling, and has configured the system already with the load balancer. Hence this post focuses on setting up the load balancer with autoscaling. If you are a newbie to setting up WSO2 Servers proxied by WSO2 Load Balancer, please read the blog post, How to setup WSO2 Elastic Load Balancer to configure WSO2 Load Balancer without autoscaling.


The autoscaling configurations are defined from CARBON_HOME/repository/deployment/server/synapse-configs/tasks/autoscaler.xml

1) Task Definition
In WSO2 Load Balancer, the autoscaling algorithm to be used is defined as a Task. ServiceRequestsInFlightEC2Autoscaler is the default class that is used for the autoscaler task.
<task xmlns=""  
2) loadbalancer.xml pointed from autoscaler.xml

This property points to the file loadbalancer.xml for further autoscaler configuration.
<property name="configuration" value="$system:loadbalancer.xml"/>
3) Trigger Interval

The autoscaling task is triggered based on the trigger interval that is defined in the autoscaler.xml. This is given in seconds.
<trigger interval="5"/>

Autoscale Mediators

autoscaleIn and autoscaleOut mediators are the mediators involved in autoscaling as we discussed above. As with the other synapse mediators, the autoscaling mediators should be defined in the main sequence of the synapse configuration, if you are going to use autoscaling. Load Balancer-1.0.x comes with these mediators defined at the main sequence, which can be found at $CARBON_HOME/repository/deployment/server/synapse-configs/sequences/main.xml. Hence you will need to modify main.xml, only if you are configuring the load balancer without autoscaling.

autoscaleIn mediator is defined as an in mediator. It gets the configurations from loadbalancer.xml, which is the single file that should be configured for autoscaling, once you have already got a system that is set up for load balancing.
<autoscaleIn configuration="$system:loadbalancer.xml"/>
Similarly autoscaleOut mediator is defined as an out mediator.


loadbalancer.xml contains the service cluster configurations for the respective services to be load balanced and the load balancer itself. Here the service-awareness of the load balancer makes it possible to manage the load across multiple service clusters. The properties given in loadbalancer.xml is used to provide the required configurations and customizations for autoscaling and load balancing. These configurations can also be taken from the system properties as shown below.

1) Properties common for all the instances

1.1) ec2AccessKey

The property 'ec2AccessKey' is used to provide the EC2 Access Key of the instance.
<property name="ec2AccessKey" value="${AWS_ACCESS_KEY}"/>
1.2) ec2PrivateKey
The certificate is defined by the properties 'ec2PrivateKey'.
<property name="ec2PrivateKey" value="${AWS_PRIVATE_KEY}"/> 
1.3) sshKey
The ssh key pair is defined by 'sshKey'.
<property name="sshKey" value="stratos-1.0.0-keypair"/>
1.3) sshKey
The ssh key pair is defined by 'sshKey'.
<property name="sshKey" value="stratos-1.0.0-keypair"/>
1.4) instanceMgtEPR'instanceMgtEPR' is the end point reference of the web service that is called for the management of the instances.
<property name="instanceMgtEPR" value=""/>
1.5) disableApiTerminationThe 'disableApiTermination' property is set to true by default, and is recommended to leave as it is. This prevents terminating the instances via the AWS API calls.
<property name="disableApiTermination" value="true"/>
1.6) enableMonitoringThe 'enableMonitoring' property can be turned on, if it is preferred to monitor the instances.
<property name="enableMonitoring" value="false"/>

2) Configurations for the load balancer service group

These are defined under
<loadBalancer> .. </loadBalancer>
2.1) securityGroups
The service group that the load balancer belongs to is defined by the property 'securityGroups'. The security group will differ for each of the service that is load balanced as well as the load balancers. Autoscaler uses this property to identify the members of the same cluster.
<property name="securityGroups" value="stratos-appserver-lb"/>
2.2) instanceType
'instanceType' defines the EC2 instance type of the instance - whether they are m1.small, m1.large, or m1.xlarge (extra large).
<property name="instanceType" value="m1.large"/>
2.3) instancesThe property, 'instances' defines the number of the load balancer instances. Multiple load balancers are used to prevent the single point of failure - by providing a primary and a secondary load balancer.
<property name="instances" value="1"/>
2.4) elasticIPElastic IP address for the load balancer is defined by the property, 'elasticIP'. We will be able to access the service, by accessing the elastic IP of the load balancer. The load balancer picks the value of the elastic IP from the system property ELASTIC_IP.
<property name="elasticIP" value="${ELASTIC_IP}"/>
In a public cloud, elastic IPs are public (IPV4) internet addresses, which is a scarce resource. Hence it is recommended to use the elastic IPs only to the load balancer instances that to be exposed to the public, and all the services that are communicated private should be associated to private IP addresses for an efficient use of this resource. Amazon EC2 provides 5 IP addresses by default for each customer, which of course can be increased by sending a request to increase elastic IP address limit.
2.5) availabilityZoneThis defines in which availability zone the spawned instances should be.
<property name="availabilityZone" value="us-east-1c"/>
2.6) payload
The file that is defined by 'payload' is uploaded to the spawned instances. This is often a zip archive, that extracts itself into the spawned instances.
<property name="payload" value="/mnt/"/> contains the necessary files such as the public and private keys, certificates, and the launch-params (file with the launch parameters) to download and start a load balancer instance in the spawned instances.
The launch-params includes the details for the newly spawned instances to function as the other instances. More information on this can be found from the EC2 documentations.
Sample Launch Parameters
Given below is a sample launch-params, that is used in StratosLive by the load balancer of the Application Server service.

We will look more into these launch-params now.
CredentialsThe credentials - access key ID and the secret access key are given to access the aws account.
S3 LocationsThe service zip and the common modifications or patches are stored in an S3 bucket. The locations are given by a few properties in the launch-params shown above.
  • PRODUCT_MODIFICATIONS_PATH_S3 - Points to the product specific changes, files, or patches are uploaded to a specific location.
  • COMMON_MODIFICATIONS_PATH_S3 - Points to the patches and changes common to all the servers.
  • PRODUCT_PATH_S3 - Points to the location where the relevant Stratos service zips are available.
  • STARTUP_DELAY - Given in seconds. Provides some time to start the service that is downloaded on the newly spawned instance, such that it will join the service cluster and be available as a new service instance.
Apart from these, PRODUCT_NAME, SERVER_NAME, HTTP_PORT, and HTTPS_PORT for the application are also given.

3) Configurations for the application groups

These are defined under
<services> .. </services>
These too should be configured as we configured the properties for the load balancers above.We define the default values of the properties for all the services under
<defaults> .. </defaults>
Some of these properties - such as the payload, host, and domain - will be specific to a particular service group, and should be defined separately for each of the services, under
<service> .. </service>
Properties applicable to all the instancespayload, availabilityZone, securityGroups, and instanceType are a few properties that are not specific to the application instances. We have already discussed about these properties when setting the load balancer properties above.
Properties specific to the application instances
These properties are specific to the application clusters, and are not applicable to the load balacer instances. We will discuss about these properties now.
3.1) minAppInstancesThe property 'minAppInstances' shows the minimum of the application instances that should always be running in the system. By default, the minimum of all the application instances are set to 1, where we may go for a higher value for the services that are of high demand all the time, such that we will have multiple instances all the time serving the higher load.
<property name="minAppInstances" value="1"/>
3.2) maxAppInstances'maxAppInstances' defines the upper limit of the application instances. The respective service can scale up till it reaches the number of instances defined here.
<property name="maxAppInstances" value="5"/>
3.3) queueLengthPerNodeThe property 'queueLengthPerNode' provides the maximum length of the message queue per node.
<property name="queueLengthPerNode" value="400"/>
3.4) roundsToAverageThe property 'roundsToAverage' indicates the number of attempts to be made before the scaling the system up or down. When it comes to scaling down, the algorithm makes sure that it doesn't terminate an instance that is just spawned. This is because the spawned instances are billed for an hour. Hence, even if we don't have much load, it makes sense to wait for a considerable amount (say 58 minutes) of time before terminating the instances.
<property name="roundsToAverage" value="10"/>
3.5) instancesPerScaleUpThis defines how many instances should be scaled up for each time. By default, this is set to '1', such that a single instance is spawned whenever the system scales. However, this too can be changed such that multiple instances will be spawned each time the system scales up. However it may not be cost-effective to set this to a higher value.
<property name="instancesPerScaleUp" value="1"/>
3.6) messageExpiryTimemessageExpiryTime defines how long the message can stay without getting expired.
<property name="messageExpiryTime" value="60000"/>
Properties specific to a particular service groupProperties such as hosts and domain are unique to a particular service group, among all the service groups that are load balanced by the given load balancer. We should note that we can use a single load balancer set up with multiple service groups, such as Application Server, Enterprise Service Bus, Business Process Server, etc.
Here we also define the properties such as payload and availabilityZone, if they differ from the default values provided under
<defaults> .. </defaults>
Hence these properties should be defined under
<service>.. </service>
for each of the services.
3.7) hosts'hosts' defines the hosts of the service that to be load balanced. These will be used as the access point or url to access the respective service.Multiple hosts can be defined under
<hosts> .. </hosts>
Given below is a sample hosts configurations for the application server service
3.8) domainLike the EC2 autoscaler uses the security groups to identify the service groups, 'domain' is used by the load balancer (ServiceDynamicLoadBalanceEndpoint) to correctly identify the clusters of the load balanced services.
Once you have configured the load balancer as above, with the product/service instances, you will have the system that dynamically scales.  


Published at DZone with permission of Kathiravelu Pradeeban, author and DZone MVB.

(Note: Opinions expressed in this article and its replies are the opinions of their respective authors and not those of DZone, Inc.)