Contributors

Sergey Gordeychik*
Jeremiah Grossman
Michael Sutton
Mandeep Khera
Peter Ahearn
Brian Martin
Simone Onofri
Matt Latinga
Chris Wysopal

*Project Leader



Download PDF version

Purpose

The Web Application Security Consortium (WASC) is pleased to announce the WASC Web Application Security Statistics Project 2007. This initiative is a collaborative industry wide effort to pool together sanitized website vulnerability data and to gain a better understanding about the web application vulnerability landscape. We ascertain which classes of attacks are the most prevalent regardless of the methodology used to identify them. Industry statistics such as those compiled by Mitre CVE project provide valuable insight into the types of vulnerabilities discovered in open source and commercial applications, this project tries to be the equivalent for custom web applications.

Goals

  1. Identify the prevalence and probability of different vulnerability classes
  2. Compare testing methodologies against what types of vulnerabilities they are likely to identify.

Methodology

The statistics was compiled from web application security assessment projects which were made by the following companies in 2007 (in alphabetic order):

Booz Allen Hamilton
BT
Cenzic with Hailstorm and ClickToSecure
dblogic.it
HP Application Security Center with WebInspect
Positive Technologies with MaxPatrol
Veracode with Veracode Security Review
WhiteHat Security with WhiteHat Sentinel

Identified vulnerabilities for assessment technologies have been aggregated using the Web Security Threat Classification as a baseline.

The statistics includes 2 different data sets: automated testing results and security assessment results made using black and white box methodology.

Automated testing results contain data about the scanning of hosting provider sites without any customizing the settings (with standard profile). While analyzing this data it is recommended to consider that not every site tested uses interactive elements. Additional customized settings made by an expert would improve vulnerability detection effectiveness by automated scanners.

It is also recommended to consider automatic scanning type 1 and 2 errors: the scanner might miss the vulnerability or suggest the vulnerability which does not exist. A manual expert assessment allows to eliminate type 2 errors and to minimize type 1 errors (but not eliminate them).

Black and white box security assessment statistics contain manual and automatic analysis results. The analysis includes scanning with preliminary settings followed by manual analysis, manual search for vulnerabilities which cannot be detected by automated scanner, and source code analysis.

Consequently 3 data sets were obtained:

  1. Overall statistics
  2. Automated scanning statistics
  3. Black and and white box methods security assessment statistics

The overall statistics includes analysis results of 32,717 sites and 69,476 vulnerabilities of different degrees of severity. The detailed information can be found in Statistics chapter.

Data analysis

Data analysis shows that more than 7% of analyzed sites can be compromised automatically. About 7.72% applications had a high severity vulnerability detected during automated scanning (P. 1). Detailed manual and automated assessment using white and black box methods shows that probability to detect high severity vulnerability reaches 96.85%.

So automated scanning represents data for an average Internet site and black and white box methods results refer to interactive corporate web applications.

!!!

P. 1 Probability to detect vulnerabilities of different risk degree

The most prevalent vulnerabilities are Cross-Site Scripting, Information Leakage, SQL Injection and Predictable Resource Location (P. 2, P. 3). As a rule, Cross-Site Scripting and SQL Injection vulnerabilities appears due to system design errors, Information Leakage and Predictable Resource Location are often connected with improper system administration (for example, weak access control).

!!!

P. 2 The most prevalent vulnerabilities

!!!

P. 3 Vulnerability frequency by types

While detailed system analysis with BlackBox and WhiteBox methods appreciable percentage of sites are vulnerable to Content Spoofing, Insufficient Authorization and Insufficient Authentication (P. 4, P. 5). With this approach to security assessment the probability to detect SQL Injection reaches 25%.

!!!

P. 4 The most prevalent vulnerabilities (BlackBox & WhiteBox)

!!!

P. 5 Vulnerability frequency by types (BlackBox & WhiteBox)

In terms of Web Application Consortium Threat Classification version 1 classes (T. 1 and P. 6) the most prevalent classes of vulnerabilities are Client-side Attacks, Information Disclosure and Command Execution. The detailed analysis shows the popularity of Authentication and Authorization classes (P. 7).

T. 1 The probability distribution of vulnerabilities detection according to WASC TCv1 classes

% ALL% Scans% Black & WhiteBox
Authentication1.17%0.02%20.82%
Authorization 1.28%0.07%19.01%
Client-side Attacks33.13%31.17%69.37%
Command Execution8.15%7.32%27.85%
Information Disclosure31.78%30.42%56.54%
Logical Attacks0.90%0.20%13.92%
!!!

P. 6 The probability distribution of vulnerabilities detection according to WASC TCv1 classes

!!!

P. 7 The probability distribution of vulnerabilities detection according to WASC TCv1 classes (BlackBox & WhiteBox)

The Comparison of security assessment methods

While compared automated scanning with detailed Blackbox and Whitebox analysis methods, it is evidently clear that detailed analysis is much more effective to detect Authorization and Authentication class vulnerabilities and logic flaws (T. 2, P. 8).

T. 2 Automated scanning vs Blackbox and Whitebox analisys (% Sites)

Threat Classification Scans vs Black & WhiteBox
Content Spoofing 18.30%
Insufficient Authorization 14.15%
Insufficient Authentication 12.95%
SQL Injection 8.68%
!!!

P. 8 The difference in probability of vulnerabilities detection using different methods

As mentioned above (P 1), the probability to detect high risk degree vulnerability using detailed analysis is 12.5 times higher than using automated scanning. According to the number of vulnerabilities detected for a site (T. 3 and P. 9) the detailed analysis allows to detect on average 9 high risk degree vulnerabilities per site while automated scanning allows to detect only 2.3 vulnerabilities of this rank.

T. 3 Number of vulnerabilities per site

AllScansBlack&WhiteBox
Low3.152.961.11
Med2.352.042.65
High4.222.338.91
All2.121.6113.11
!!!

P. 9 Number of vulnerabilities per site

Additional notes

Web Application Security Consortium Threat Classification version 1 was used in this research. Therefore some types of vulnerabilities are not included into the overall results. We plan to use WASC TC version 2 in future.

The most prevalent vulnerability Cross-Site Request Forgery in this statistics is not on top because it is difficult to detect in automatically and because a lot of experts take its existence for granted.

Vulnerabilities which existence depends on platform are also not included into the statistics (for example, buffer overflow in Apache).


Contributors

WASC would like to thank the following organizations for making this initiative possible. Each organization is responsible for contributing sanitized data from web application security projects which was then combined to produce aggregated statistics.

Statistics

Overall Data

T. 4 General statistics

Threat ClassificationN of VulnsN of Sites% Vulns% Sites
Abuse of Functionality 169990.24%0.30%
Brute Force 2911250.42%0.38%
Buffer Overflow171190.25%0.06%
Content Spoofing 13992132.01%0.65%
Credential/Session Prediction 79460.11%0.14%
Cross-site request forgery9931261.43%0.39%
Cross-site Scripting287691029741.41%31.47%
Denial of Service55440.08%0.13%
Directory Indexing 281870.40%0.27%
Fingerprinting120600.17%0.18%
Format String Attack104120.15%0.04%
HTTP Response Splitting7492651.08%0.81%
Information Leakage 22156761431.89%23.27%
Insufficient Anti-automation 2881150.41%0.35%
Insufficient Authentication3562290.51%0.70%
Insufficient Authorization3432180.49%0.67%
Insufficient Process Validation117380.17%0.12%
Insufficient Session Expiration200910.29%0.28%
LDAP Injection 21110.03%0.03%
OS Commanding 2590.04%0.03%
Path Traversal1781450.26%0.44%
Predictable Resource Location 533133497.67%10.24%
Session Fixation 183650.26%0.20%
SQL Injection642025679.24%7.85%
SSI Injection 185400.27%0.12%
URL Redirectors 2101950.30%0.60%
Weak Password Recovery Validation 164310.24%0.09%
WSDL Exposure60200.09%0.06%
XPath Injection59160.08%0.05%
Total6947632717

T. 5 Vulnerabilities distribution by risk

Threat rankN of VulnsN of Sites% Vulns% Sites
Low24433776035.17%23.72%
Med295751259642.57%38.50%
High13765326319.81%9.97%

T. 6 Vulnerabilities distribution by WASC TCv1 classes

WASC ClassesN of VulnsN of Sites% of Vulns% Sites
Authentication8113841.17%1.17%
Authorization 8054181.16%1.28%
Client-side Attacks321201084046.23%33.13%
Command Execution6985266510.05%8.15%
Information Disclosure281261039840.48%31.78%
Logical Attacks6292950.91%0.90%

Automatic scans

T. 7 General statistics

Threat ClassificationN of VulnsN of Sites% Vulns% Sites
Abuse of Functionality 530.01%0.01%
Brute Force 330.01%0.01%
Buffer Overflow 0.00%0.00%
Content Spoofing 33210.06%0.07%
Credential/Session Prediction 440.01%0.01%
Cross-site request forgery87530.17%0.17%
Cross-site Scripting19171965137.29%30.26%
Denial of Service30220.06%0.07%
Directory Indexing 91120.18%0.04%
Fingerprinting 0.00%0.00%
Format String Attack 0.00%0.00%
HTTP Response Splitting1821610.35%0.50%
Information Leakage 21157711541.16%22.31%
Insufficient Anti-automation 37380.07%0.12%
Insufficient Authentication220.00%0.01%
Insufficient Authorization6110.01%0.03%
Insufficient Process Validation 0.00%0.00%
Insufficient Session Expiration330.01%0.01%
LDAP Injection 0.00%0.00%
OS Commanding 1930.04%0.01%
Path Traversal1181160.23%0.36%
Predictable Resource Location 477232139.28%10.07%
Session Fixation 330.01%0.01%
SQL Injection5301229810.31%7.21%
SSI Injection 180370.35%0.12%
URL Redirectors 1951820.38%0.57%
Weak Password Recovery Validation 110.00%0.00%
WSDL Exposure 0.00%0.00%
XPath Injection410.01%0.00%
Total5140431891

T. 8 Vulnerabilities distribution by risk

Threat rankN of VulnsN of Sites% Vulns% Sites
Low21736735242.28%23.05%
Med244521201247.57%37.67%
High5736246311.16%7.72%

T. 9 Vulnerabilities distribution by WASC TCv1 classes

WASC ClassesN of VulnsN of Sites% of Vulns% Sites
Authentication660.01%0.02%
Authorization 16210.03%0.07%
Client-side Attacks19668994138.26%31.17%
Command Execution5504233610.71%7.32%
Information Disclosure26138970150.85%30.42%
Logical Attacks72630.14%0.20%

Black Box & White Box

T. 10 General statistics

Threat ClassificationN of VulnsN of Sites% Vulns% Sites
Abuse of Functionality 114661.05%7.99%
Brute Force 148661.37%7.99%
Buffer Overflow110.01%0.12%
Content Spoofing 6461525.96%18.40%
Credential/Session Prediction 25100.23%1.21%
Cross-site request forgery74130.68%1.57%
Cross-site Scripting641848059.26%58.11%
Denial of Service25220.23%2.66%
Directory Indexing 60390.55%4.72%
Fingerprinting80510.74%6.17%
Format String Attack420.04%0.24%
HTTP Response Splitting447644.13%7.75%
Information Leakage 6393395.90%41.04%
Insufficient Anti-automation 16140.15%1.69%
Insufficient Authentication2041071.88%12.95%
Insufficient Authorization1871171.73%14.16%
Insufficient Process Validation77180.71%2.18%
Insufficient Session Expiration17160.16%1.94%
LDAP Injection 110.01%0.12%
OS Commanding 660.06%0.73%
Path Traversal60290.55%3.51%
Predictable Resource Location 4991214.61%14.65%
Session Fixation 120221.11%2.66%
SQL Injection8792098.12%25.30%
SSI Injection 530.05%0.36%
URL Redirectors 11110.10%1.33%
Weak Password Recovery Validation 13100.12%1.21%
WSDL Exposure000.00%0.00%
XPath Injection55150.51%1.82%
Total10831826


T. 11 Vulnerabilities distribution by risk

Threat rankN of VulnsN of Sites% Vulns% Sites
Low4524084.17%49.39%
Med154958414.30%70.70%
High712780065.80%96.85%


T. 12 Vulnerabilities distribution by WASC TCv1 classes

WASC ClassesN of VulnsN of Sites% of VulnsWASC Classes
Authentication3651723.37%20.82%
Authorization 3491573.22%19.01%
Client-side Attacks759657370.13%69.37%
Command Execution9512308.78%27.85%
Information Disclosure133846712.35%56.54%
Logical Attacks2321152.14%13.92%

First Round

This initial round of statistics was compiled from data provided by four vendors - Whitehat Security, SPI Dynamics, Positive Technologies and Cenzic in 2006. The detailed information can be found here.

Participation

If you represent an organization that performs vulnerability assessments on websites, particular in those in custom web applications, through a manual or automated process and would like to participate please let us know. Once statistics are compiled, a report will be distributed, and all contributors will receive a logo on the project pages as well as on other deliverables in appreciate of their contribution. Please contact Sergey Gordeychik. Statistics will be collected once per year one month after December 31.






Search this site
Home :: About Us :: Projects :: Mailing Lists :: Library :: News :: Links :: Contact Us
© Copyright 2005, Web Application Security Consortium. All rights reserved.