Scared Straight? Reviewing Excel Files in the Wake of Wells Fargo

The Wells Fargo inadvertent disclosure episode provides a high-profile reminder that attorneys who are responsible for reviewing and producing client documents must thoroughly understand those documents.  There were several process breakdowns that led to the inadvertent disclosure of thousands of confidential records, and not all of them were very well explained.  (The affirmation of the producing attorney is available on the New York e-filing website, Index No. 652025/2017 )(see docket entry 36).  But however it happened, as reported by the New York Times, the inadvertently produced material “included copious spreadsheets with customers’ names and Social Security numbers, paired with financial details like the size of their investment portfolios and the fees the bank charged them.”  This might have been a pure eDiscovery mistake–i.e., an accidental production of documents due to tagging errors or miscommunication with the vendor.  But I suspect some Excel ineptitude was at play as well: the producing lawyer may have affirmatively decided to produce certain spreadsheets without realizing that they contained this confidential data.  Excel features such as filters and hidden columns often mean that there is data “hidden in plain sight” that escapes the attention of document reviewers.

That possibility should give us all pause–might we make the same kind of mistakes?  As the producing party, the consequences of failing to review documents thoroughly go beyond just bad publicity. At one end of the spectrum you may fail to produce relevant information, and face an embarrassingly meritorious (not to mention expensive) motion to compel. At the other end, you may inadvertently produce a “treasure trove” of sensitive information that should never have been produced.  Aside from potentially landing you on the pages of the New York Times, that kind of mistake may jeopardize your case, and the representation itself.  If the information contains Social Security numbers or other PII, the disclosure may run afoul of state and federal privacy laws as well.  Of course, as the receiving party, failing to comprehend the evidence produced by the other side means that you may be overlooking evidence that could win the case for you.

The key to avoiding such mistakes is to thoroughly understand the relevant features of Excel.  This can be a challenge because the users–our clients’ employees–are very skilled users of Excel!  They use it all day to create fancy spreadsheets.  It’s their job.  These files are not dumbed down for us when the eDiscovery vendor collects and hosts them for us to review.  That means we’ve got to roll up our sleeves and get our hands dirty–whether we “like” using Excel is not relevant.

Is your document review team up to the task?   Here is a list of skills that I think every reviewer must have to competently review spreadsheets in the eDiscovery context:

  • Reviewing the entire workbook, not just the current tab
    “In my Father’s house are many mansions . . . ” (John 14:2).  So, too, are there multiple worksheets, a/k/a tabs, in a typical Excel file. Some complex accounting files can have more than 20 worksheets!  Reviewers have to understand that every worksheet must be reviewed for relevant information, not just the worksheet that first pops up when you open the file.  It’s not always obvious how many worksheets there are.  In the example below, it looks like there are only 5 worksheets.Multitab-1

    But looks can be deceiving, and they are here.  You can launch a list of all the worksheets in an Excel file by right-clicking on the group of arrows in the lower left-hand corner–the area shown in green below. Doing that here reveals that there are three more worksheets, called Additional Data, Update, and Corrected Data–those sound important!

    Multitab-2

    Meanwhile, to search for specific information or keywords throughout a multi-tab workbook, you can tweak Excel’s Find feature to search for content across all the worksheets, rather than only the current worksheet, which is the default. To do this, click on Options in the Find dialog box, and select “Within –> Workbook”; then choose “Find All” instead of “Find Next.”

  • Understanding Filters
    Filters allow a user to selectively display certain rows of a worksheet with certain characteristics.  In the example below, a spreadsheet of wire payments is filtered to display only records where the customer name contains ‘Fujitsu’–there are thousands of other records in the spreadsheet that are not currently displayed, but can be revealed by clearing or turning off the filter (e.g., by the receiving party).FilterRedact-fujitsu

    Business professionals love
    filters, and use them extensively.  As a consequence, many native files open up with filters already applied, making a lot of the information temporarily invisible.  As a document reviewer, you fail to understand filters at your peril.   I discussed this issue in a recent presentation at the Harris County Law Library in Houston; the relevant (8-minute) video clip is viewable here.
  • Identifying hidden rows, columns, and worksheets
    Relativity is not your friend here.  In the default view you probably won’t recognize hidden material, much less be able to unhide it.   You should really be downloading Excel files natively and reviewing within Excel to make sure you’re seeing everything. That said, your eDiscovery vendor may be able to create and populate a metadata field to flag Excel files with hidden content.  Once inside, there are several tricks to find and unhide it. (I discuss one of the tricks in this post.) Remember: there is nothing necessarily nefarious about hiding content in Excel–it is usually a matter of convenience.  And when the last user of the file saves it with hidden content, it will still be hidden when you open the file to review it.
  • Identifying & Deciphering formulas
    Numbers that show up in an Excel worksheet may be just typed in, or they may be generated by a formula, a link to other cells, or both.  The distinction can be extraordinarily important in a litigation.   Sometimes values are hard-wired when they should be generated by a formula, and sometimes formulas and links are fudged to perpetrate a fraud.  Identifying which kind of cell is which is critical. Once you identify the formulas, you’ll want to figure out how they work.   You may need to call in the experts, but a basic knowledge of formula syntax can go a long way.  I discuss some tips for finding and analyzing formulas, including detecting inconsistent formulas, in this post.
  • Understand the Freeze Panes display feature
    Sometimes a worksheet is bizarrely difficult to navigate, and the reason is Freeze Panes–a way of locking the display to keep certain rows and columns visible.  Some users may also have unusual Freeze Panes settings that can obscure large parts of the worksheet.  These settings are easy to turn on and off from the “View” ribbon:unfreeze-panes
  • Finding and Expanding Truncated Text
    Sometimes certain cells contain reams of text that aren’t immediately visible.  Take a look at the spreadsheet screen-capped below–it’s derived from publicly available data on 1,651 Chicago restaurant inspections between 2010 and 2015. (You can download this data from the Chicago Data Portal.)   Column N is a field called Violations; each cell appears to have a number and then a brief description.overflowtext-1When you click on one of those cells, you discover that there is a lot more text in it than what appears in the too-small cell.  The actual contents appear (kind of) in the Formula Bar up above (shown in the green box).

    overflowtext-2

    But wait, there’s more!  In fact, this cell contains several paragraphs of text, and the default Formula Bar only displays the first line.  When you drag the lower edge of the Formula Bar down to make it bigger, you finally see the entire Violation description.

    overflowtext-3

    Would your document reviewers know that they need to stretch out the Formula Bar to see all the text?  Suppose your litigation involves faulty ice machines? Lead paint? Sneeze guards? How do you make sure that you’re identifying all the relevant entries in this workbook, when the relevant text may not appear initially?  There are a number of Excel techniques beyond the scope of this post to plumb the depths of a text-heavy spreadsheet like this.  Suffice it to say you do not want to leave the proper review of this file to chance.

  • Identify the boundaries of a worksheet 
    You may think you are at the bottom or the edge of a set of data, but how can you be sure there isn’t more stuff down there? There are ways.  For example, to jump to the very last used cell on a worksheet, hit the F5 key to launch the Go To dialog; then click Special–>Last cell.GoToLastCell
  • Finding each worksheet’s print formatting
    Reviewing which data a user chose to print, and what kind of titles she added, can provide valuable insight into the case.  For example, an otherwise innocuous-looking list of foreign exchange trades might have a footer that says “Fake Trades to Process”–you’d be surprised how dumb some criminals are.   Fortunately, there are ways to review this information without actually printing each worksheet.  One is to review the print formatting in the “Name Manager” ribbon; another is to use Print Preview.   

Excel Esquire offers an interactive (and potentially CLE-accredited) workshop that illustrates the most important aspects of legal Excel review in depth, with real spreadsheets.  To learn about organizing a workshop for your document review team, please get in touch with Ben Kusmin of Excel Esquire.  Thanks for reading!  Feel free to comment, link, or share.

39-cropped-featured-image-class

Posted in CLE, eDiscovery, formulas, Lit Support, Native File Review, Practice Management | 1 Comment

Excel Filters and the Duty of Technology Competence

The Harris County Law Library in Houston, Texas recently hosted my presentation “Excel Essentials for the Practice of Law.”  Among other topics, I explained how filters work in Excel, and how they present pitfalls for the unwary lawyer.  When lawyers responsible for reviewing documents–either incoming or outgoing–are unaware of how filters work, all kinds of things can go wrong.  Learn more by watching this 8-minute video excerpt:

In the ongoing conversation about what constitutes “technology competence” for lawyers, I think more attention should be paid to Microsoft Excel.  Because our clients’ employees use it, and our adversaries’ employees use it, we ignore it at our peril.

I hope you enjoyed this post and video.  If you would like to learn how to host an Excel workshop at your law library, bar association, law firm, company, or public interest organization, please get in touch with Ben Kusmin of Excel Esquire.

Posted in CLE, eDiscovery, Lit Support, Native File Review, Practice Management | Leave a comment

Upcoming CLE Event in Houston, TX (with livestream)

On July 13th, I will give a presentation called “Excel Essentials for the Practice of Law” at the Harris County Law Library in Houston.  The presentation is designed for lawyers, paralegals, and other legal professionals.  Participants will learn how to manipulate text with Find & Replace, print worksheets with a custom footer, analyze data with Excel’s Filter feature, and create and copy formulas.  If you’re not lucky enough to be in the Lone Star State, you can watch the livestream!

More details below from HCLL’s website:

Date: Thursday, July 13, 2017

Time: 12pm – 1pm

Class Format: Classroom & Livestreaming

Where: Harris County Law Library — Harris County Attorney Office Conference Center, 1019 Congress Street, Houston, Texas 77002

Description: Microsoft Excel is an important tool for the modern legal profession. It can assist in supervising document reviews, calculating damages, and managing client information. Because so many business clients use Excel, spreadsheets often become evidence in their own right, and attorneys need to be able to review and understand Excel documents. In this interactive program, Ben J. Kusmin, a commercial litigator and the founder of Excel Esquire, will demonstrate the most important features of Microsoft Excel using concrete examples and realistic data. Attendees will have the option to download the data used in the presentation to follow along on their own computers.

CLE: 1.0 hour CLE in Texas (pending)

To learn more and register, please follow this link: HCLL

Posted in CLE, eDiscovery, Everybody, formulas, Lit Support, Practice Management | 1 Comment

Spelling Bee Fundraiser for Harlem Hospital CCHP

Hello readers!  Your humble narrator and Excel blogger will compete in a spelling bee fundraiser to benefit the Center for Comprehensive Health Practice, a terrific hospital serving needy residents in Harlem.  They are facing a difficult rent hike and this fundraiser is critically important.  And if you’re in New York, please come to the spelling bee – it is a lot of fun! (The featured photo *might* be me in last year’s spelling bee.)   Please follow the link to learn more about the event or make a donation.

CCHP_pic

With any luck they will ask me to spell CONCATENATE.

Posted in eDiscovery, Everybody | Tagged , , | 1 Comment

Nassau County Bar Association CLE Event – March 8

The Nassau Academy of Law, the educational arm of the NCBA, will host an Excel Esquire CLE lunch program called “Excel Essentials for the Practice of Law” on Wednesday, March 8.  Lunch is served at 12:30, and the one-hour program will start at 1 pm.  NCBA is the largest suburban bar association in the country, and we’re expecting a great turnout. The program will feature lots of great Excel content for both beginners and experienced users.  See below for more details:

ncba_flyer

Click here to go to the registration page for this program.  Hope to see you there!

 

 

 

Posted in CLE, eDiscovery, Everybody, formulas | Tagged , , | Leave a comment

Highlight Every Other Row of a Privilege Log (or other spreadsheet!)

Happy New Year, readers! Today’s tip is an old favorite, and one that can save you a lot of tedious work.   It exploits Excel’s conditional formatting feature to quickly highlight every other row of your privilege log, or any other spreadsheet.  Feel free to download my dummy privilege log if you would like to follow along.

Here is the original, unadorned privilege log:

privlog_unhighlighted

In order to highlight alternating rows, take the following steps:

First, select all the cells containing the privilege log entries, but not the Header Row.  Also, don’t select the entire rows–just the cells to be highlighted. You can do this by clicking on cell A2 and dragging the cursor to the last cell on the bottom right of the log.  (Pro tip: with cell A2 selected, hit Ctrl+Shift+Down Arrow, then Ctrl+Shift+Right Arrow (perhaps a couple times) to highlight all the cells.)

privlog_selectrows

Next, from the Home ribbon, click the drop-down menu under Conditional Formatting and select New Rule:

privlog_choosenewrule

In the dialog box, choose the option “Use a formula to determine which cells to format”; then, in the formula bar, type the following (don’t forget the “=”):

=ISODD(ROW($A2))

. . . then hit the Format… button to set up the highlighting.

privlog_createformula

Choose Fill, pick a nice color, and hit OK:

privlog_choosecolor

Now you can hit OK on the Conditional Formatting dialog box:

privlog_complete

Voilà!  Alternating rows of the privilege log will be highlighted in the color you chose:

 

privlog_highlighted2

The formula analyzes every cell in the selection and applies the formatting to every cell in an odd-numbered row.  That’s why the entries in rows 3, 5, 7, etc. are highlighted.  You can delete this highlighting, if necessary, by selecting all the cells and choosing Clear Rules from the conditional formatting drop-down menu.

Please share, forward, or like this post, and subscribe to the Excel Esquire blog!

To learn about hosting a CLE-approved Excel workshop at your law firm, company, public interest organization, or bar association, click here or email Ben Kusmin from Excel Esquire.

Posted in CLE, eDiscovery, Everybody, formulas, Lit Support, Practice Management, Privilege Logs | Tagged , , , , , , | 1 Comment