Options for Data Storage

Box.com – Document and Image storage

  • Box.com is a cloud storage service similar to Dropbox, Google Drive or Microsoft OneDrive.
  • Files are stored on a cluster of servers hosted by Box.com and are either uploaded/downloaded through the Box.com website or synchronized with a folder on one or more computers using the Box Sync client.
  • Fully funded by DoIT with a contract negotiated to make sure the security and rights are appropriate for use on campus.
  • Every NetID automatically gets 50 GB. Group folders can be added for free and can grow to 250+ GB.
  • Great for document and image storage and collaboration. Sharing is easy thanks to NetID integration, and older versions of documents are saved automatically as updates are made.
  • No VPN needed, files are available via the Internet.
  • Box.com will (probably) always be online and never lose documents. It is highly redundant, distributed and maintained by many professionals.

External Hard Disk Drives – Desktop backups, personal data

  • Files are stored on a piece of hardware which labs/users are responsible for.
  • Bad for primary data: low “mean time to failure” and no built-in redundancy.
  • Very inexpensive and flexible.
  • Sizes range from a few GB to, currently, 5 TB.
  • Limits Genetics IT’s ability to help; labs are solely responsible for their data.

Departmental File Server – Large files, critical files, downstream analysis, microscopy images

  • Files are stored on a server with enterprise class drives and vendor support.
  • Monitored, maintained and protected by Genetics IT (nightly backups)
  • The server can scale up to hundreds of terabytes
  • Basic service paid for by departmental funds with additional storage available on chargeback.
  • Server is located on premises, resulting in very fast transfer rates to experimental storage, Biotech storage and campus desktops.
  • VPN needed to connect from off campus.
  • Monthly maintenance windows to perform updates.

Research Data Service – Large files, sequence data, data that will be or has been analyzed or processed

  • Files are stored on a cluster of commodity servers that appears as a single file system.
  • Each file exists on two servers, so the complete catastrophic loss of a server will result in no downtime or file loss.
  • Files are not backed up; accidental deletions/overwrites are permanent.
  • System can easily scale to thousands of terabytes
  • Storage is only accessible through departmentally managed servers.  Any work done on Deep Thought or HT1 will be saved on this platform, but data must be uploaded/downloaded via SCP or WGET.
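
As a rough illustration of the SCP step described above, the sketch below builds an scp command for uploading or downloading a file. The hostname, username and paths are placeholders (assumptions, not real Genetics IT values), and the helper names are hypothetical. Ask Genetics IT for the actual server address. Because files on this platform are not backed up, the sketch defaults to a dry run that only prints the command, so the destination can be checked before anything is overwritten.

```python
import shlex
import subprocess

# Placeholder values (assumptions): substitute the login server and
# account that Genetics IT provides for Deep Thought or HT1.
HOST = "server.example.edu"
USER = "netid"


def build_scp_command(local_path, remote_path, user=USER, host=HOST, download=False):
    """Return the scp argument list for an upload, or a download if requested."""
    remote = f"{user}@{host}:{remote_path}"
    if download:
        # Copy from the storage platform to the local machine.
        return ["scp", "-p", remote, local_path]
    # Copy from the local machine to the storage platform.
    # -p asks scp to preserve modification times and modes.
    return ["scp", "-p", local_path, remote]


def transfer(local_path, remote_path, download=False, dry_run=True):
    """Print the command (dry run) or execute it and return scp's exit code."""
    cmd = build_scp_command(local_path, remote_path, download=download)
    if dry_run:
        print(shlex.join(cmd))
        return 0
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # Dry run: shows the command that would upload a hypothetical file.
    transfer("reads.fastq.gz", "/data/mylab/reads.fastq.gz")
```

Calling transfer(..., dry_run=False) runs the real copy. An existing file at the destination is replaced without warning, so it is worth confirming the remote path first.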