The 
University of Arizona

Step 5 - Install Cornell Spider (if applicable)

Install the search tool unless your IT staff has done it for you.

The software application Spider, which was developed at Cornell University, was selected to help you search for personal information. This open-source tool is available to you free of charge.

Spider's purpose is to identify files that may contain personal information. It scans a collection of files, searching for patterns of numbers or letters that resemble Social Security Numbers, payment card numbers or driver’s license numbers. Spider creates a log that lists all the files that may contain personal information. You can then look through the log, examine each of the files listed, and take steps to eliminate or protect any files that prove to contain personal information.

Spider can be used for computers and jump drives.

You must complete this step unless you’ve been informed that your IT staff is doing it for you.

To perform this step, you must have administrator rights, or be assisted by IT staff to download and install the necessary application.


For Windows:

  • Go to the Cornell University Spider web site.
  • Click Download under Spider3 for Windows.
  • Click Open on the File Download Box.
  • Double click on Spider_Release.zip.
  • Double click on Spider3.msi to install on your computer.
  • If you receive an error message, you may need to install .NET 1.1 or later before installing Spider. Click here to install .NET.
  • Click the link below and add the UA-specific registry settings.

Add UA Specific Registry Settings


For Macintosh:

Download Spider to your computer by double clicking here. The following screen will appear.

To start the installation, indicate your acceptance of the terms of the license by clicking the “Agree” button. The following screen will appear.

  • Using your mouse, drag the Spider_OSX folder to the Applications folder.
  • Click on the Applications folder to open it.
  • Double-click on Spider_OSX.


For Unix:

  • Go to the Cornell University Spider web site.
  • Click Download under Spider Engine for UNIX, source.
  • Install Spider on your computer.
  • Install the required development packages using apt-get, yum, or any other tool that will resolve any dependencies for you:
Debian Etch/Ubuntu/etc
- build-essential
- libssl-dev
- libmagic-dev
- libzzip-dev
- libbz2-dev
- libz-dev
- libexpat-dev
- libpcre3-dev
Centos 5/RHEL5/maybe SuSE
- gcc
- make
- expat-devel
- openssl-devel
- bzip2-devel
- pcre-devel
- zziplib-devel from the Dag Wieers Repository, or rpm - Uhv
  • Type make.
  • Type mkdir /etc/spider.
  • Copy SSNlin.xml to /etc/spider/.
  • Download spider.conf.
  • Copy spider.conf to /etc/spider.
  • Adjust the startdir line if you have files outside of /home.

 

Step 6 - Run Cornell Spider