Tech Work: April 2013

As per Wiki, Capacity planning is the process of determining the production/serving capacity needed by an organization to meet changing requirements for its products/solutions. This process strategically assess requirements for new solution, additional network capacity and underlying IT architectures. The information provided by capacity planning helps to :

Characterize the solution workloads more accurately
Analyze the performance of various modules
Model contention for application servers, and ensure scalability
Model and plan for communications infrastructure
Forecast and cope with peak demand
Project the impact of agent technologies and non-PC devices.

Capacity planning is more cost-effective and efficient if done prior to deployment. Performance problems resulting from a lack of capacity are more complex and costly to resolve after deployment. However, in post deployment scenarios, it is possible to identify the impact on:

Code changes required
Existing Java Heap footprint
Native Heap footprint
CPU utilization or other negative side effect

Benefits of this exercise:

Avoid losing customers due to site crashes
Performance modeling and capacity planning for infrastructure
Build and analyze customer behavior models
Plan to avoid frequent upgrades and migrations.
Identify potential bottlenecks in the architecture

There are various methodologies and proven theories available to conduct this exercise. Some of them are:

Discrete-event simulation,
Mean value analysis of product-form net-works,
Analytical identification of bottleneck resources in multiclass environments, and
Workload characterization with fuzzy clustering.

Above methodologies in detail are complex and out of scope to discuss here. Instead let us look at the easier way to understand the whole process.

The first and foremost step here is to understand the IT requirements as explained below.

Data Gathering :

To properly determine resource requirements for an application, it requires architectural information, along with a functional description of anticipated usage. The completeness and accuracy of the sizing depends on the quality of the information received. When portions of information are unknown or missing, the risk factor for incorrect sizing increases. Following type of information are required to drive the capacity planning analysis:

Percentage of new function supplied by solution
Percentage of new data elements created
Business-use scenarios
Data transferred to and fro
Peak load users size
Solution architecture definition:

Business function, scenarios and supporting models
Data architecture, solution architecture — business and deployment architecture diagrams
Technical architecture and schematic (client, server, network, Web)

Once the information is collected in standard templates, it is time to apply right calculations considering each and every criteria. When standard benchmark data is available, the analysis need to be performed and results gets documented.

The components that need to be considered or validated are given below. This might not be the complete list and depending upon the IT system in design it could differ.

Operating systems
Application servers
Network protocols
Data access services:DB systems
Programming languages
UI/Client Frameworks: AJAX, Java scripts etc.
Distribution services: NFS, DFS, Kerberos
Systems management: SNMP, AntiVirus, ADSM, TME
Application interface with legacy data/systems
Peak load: Data throughput.

Afterthe initial assessment of architecture sizing has been completed, it is time for coding and application development. Once the development is over and solution is in deployable stage, test-based sizing process can be started. This process provides the validation sizing analysis that need to be performed and results are documented in this stage.

Hope this helps to start with and I'm thinking to post some case studies and sample reports in my next article.

Tech Work

Sunday, April 28, 2013

Capacity Planning for J2EE applications

Anti Patterns for Data Integration Hub