Skip to content

Latest commit

 

History

History
63 lines (37 loc) · 6.04 KB

File metadata and controls

63 lines (37 loc) · 6.04 KB

Storage

Directory Structure

To promote long term consistency, the S3DF directory structure provides immutable paths, independent from the underlying file system organization and technology:

  • /sdf: Root mount point.

  • /sdf/home/<u>/<username>: Home directories. Space quotas in effect for all users.

  • /sdf/sw/<package>/<version>: For general purpose software not installed on each node, e.g., EPICS, Matlab, matplotlib, GEANT4, etc. Not meant for software that is used by only one group. Space quotas in effect.

  • /sdf/group/<organization>/<groupname> or /sdf/group/<groupname>: For group/project specific software (e.g., lcls/psdm, ad/hla, etc.). Space quotas in effect.

  • /sdf/data/<facility>/…: For science data (as opposed to code, documents, etc), including raw, calibrated data, and results. Space quotas set according to storage space purchased by the facility. Some examples:

    • LCLS experimental: /sdf/data/lcls/<instrument>/<experiment>
    • LCLS accelerator: /sdf/data/lcls/accel/<bld|bsa|ca|...>
    • FACET experimental: /sdf/data/facet/<instrument>/<experiment>
    • FACET accelerator: /sdf/data/facet/accel/
    • CryoEM: /sdf/data/cryoem/<YYYYMM>/<experiment>
  • /sdf/scratch/<facility>/…: 3 months retention on a best effort basis (actual retention can be shorter or longer depending on actual usage. NOTE: as of September 2025, the auto-purge policy is not in effect.)

?> Access to AFS, GPFS, and Lustre from S3DF is described in this reference section on legacy file systems.

Policies

  • Home directory permissions will be delegated to each user. By default, home directories will be readable by everyone, though you can change that by changing UNIX permissions on one or more of your directories. Everyone will be able to list /sdf/home/<u>.

  • General purpose software will go under sw. Group specific software will go under /sdf/group and will be maintained by each group.

  • Some groups may decide to logically hold all their information under /sdf/group/<groupname>. Such a structure may be implemented by each group via symlinks. The actual mount points and relative backup and archive policies will be based on the structure shown above.

?> TODO Desktop/endpoint access to S3DF file systems will likely be via authenticated NFS v4. This is currently a future implementation topic.

Backup and Archiving

  • Everything under /sdf/{home, sw, group} will be backed up. We currently use snapshots (point-in-time copies) taken at regular intervals (e.g., a few times a day) that users can access with no involvement from system administrators. A subset of the snapshots will be copied to tape at a lower rate (usually once a day). Snapshots for
    /sdf/{home, sw, group}/<dir>
    can be found at
    /sdf/{home, sw, group}/.snapshots/<GMT_time>/<dir> GMT_time indicates the time the snapshot directory was created. Choose a time that corresponds to the file versions you want and simply copy back the files.

  • Files/objects under /sdf/data will be backed up or archived according to a data retention policy defined by the facility. Facilities will be responsible for covering the media costs and overhead required by their policy. Similar to the /sdf/home area, you can also check in /sdf/data/<facility>/.snapshots to see if snapshots are enabled for self-service restores.

  • The scratch spaces under /sdf/scratch and all directories named "nobackup" (located anywhere in any /sdf path) will not be backed up or archived. Please use as many "nobackup" subdirectory locations as required for any files that do not need backup. That can save significant tape and processing resources.

  • A subset of users in some groups will be able to access archiving software (called HPSS) for the purpose of archiving/retrieving data to/from tape. Unlike backups, which will be automatically performed by the storage team within SCS, archiving will be the responsibility of each group (contact SCS for assistance).

?> The current and target backup and archiving policies are summarized in this reference section on data backup.

Change to AFS Tape Backup Retention Policy

March 31, 2025

Executive Summary: due to the upcoming retirement of our legacy tape libraries, the tape backup retention policy for the legacy AFS file system will be reduced from one year to one month (31 days), effective May 14, 2025 (the deadline date). This policy change means any future request to restore accidentally deleted files must be created within one month (rather than one year) of the deletion event. In addition, if a user deleted an AFS file more than one month prior to the deadline date and needs it restored, they have until the deadline date to create a restore request via email to s3df-help@slac.stanford.edu.

Details: the AFS file system used for legacy Unix home directory and group file storage originally started with a one-year tape backup retention policy. This meant that the tape backup system used to store backup copies of files had the ability to go back approximately one year and restore files that were accidentally deleted.

The legacy AFS file system will be retired before the end of 2025. Since a restore request for an AFS file requires the AFS file system itself to be online, the current retention policy would require the AFS file system to remain running for one year beyond its public retirement date just to satisfy restore requests. Since that will not be possible, the tape backup retention policy for legacy AFS files is being reduced from one year to one month (defined as 31 days) effective May 14, 2025 (the deadline date).

This policy change means any request to restore accidentally deleted files must be created within one month (rather than one year) of the deletion event. In addition, if a user deleted an AFS file more than one month prior to the deadline date and needs it restored, they have until the deadline date to create a restore request via email to s3df-help@slac.stanford.edu. After the deadline date, the backup system will be able to restore an AFS file only if it was deleted within the last one month.