How Verizon analyzes security-breach data with R
Analyzing 200,000 records may not seem like a big task. But when those records are security incidents with potentially hundreds of attributes each — types of bad actors, assets affected, category of organization and more — it starts getting a little complex for a spreadsheet. So Verizon’s annual security report, which was initially done in Excel, is now generated “soup to nuts” in R.
In fact, the Verizon Data Breach Report is somewhat of “a love letter to R,” Bob Rudis, managing principal and senior data scientist at Verizon Enterprise Solutions, told the EARL (Effective Applications of the R Language) Boston conference earlier today.