UC Berkeley Year 2000 Information Departmental and Personal Computers: Find and Resolve Y2K Problems
Home | Overview | Readiness Checklists | Computer Advisories | Software Tools | Specific Issues | Recharge Services | Peer Help | Search | Site Map | UCB Y2K Home

This page was last updated early during the year 2000 and some or all of its content may thus no longer be current or accurate.
Y2K error digits graphic
Image copyright (c) 1998 by PhotoDisc, Inc. Go
Used under license.

Data Files: Finding & Resolving Y2K Problems



Overview: About Y2K problems in data files Go
Keeping Y2K problems out of your data files Go
Finding Y2K problems in data files Go
Resolving Y2K problems in data files Go
Related documents
Checking your data for Y2K problems Go
(article appearing in the November-December 1999 issue of Berkeley Computing and Communications Go)
Why You Should Enter Dates With Four-Digit Years Whenever Possible Go
Data Sharing Methods: Finding & Resolving Y2K Problems Go
Commercial Off-the-Shelf Applications: Finding & Resolving Y2K Problems Go

Overview: About Y2K problems in data files

Some data files may harbor Y2K-related and other date-related problems. Some of these problems might first surface when the year 2000 arrives or when dates falling in the year 2000 and beyond are first used.

Examples of files which could potentially be at risk include:

Keeping Y2K problems out of your data files

To help keep Y2K-related problems (and other date-related problems) out of your data files, we strongly recommend these four practices:

  1. Configure your operating system to display full four-digit years.

    When you do this, your application programs which use this display format will also start displaying dates with four-digit years. This can help you in three ways:


    For information on how to make this change - and for an important discussion of some problems that can result from making this change - see Changing Your Operating System's Default Date Format To Use Four-Digit Years Go .

  2. Always enter dates with four-digit years.

    If you enter dates with century-ambiguous two-digit years, such as "05/13/29", which could reasonably fall within either the 20th or 21st centuries, most spreadsheet and database programs, as well as some other types of programs, will use a set of rules to guess which century you "probably meant" when you entered this date.

    Unfortunately, sometimes your programs will guess wrong, assigning certain dates to an incorrect century. This could cause incorrect data to be stored in your data files. You can avoid this problem altogether by always entering dates with four-digit years, such as "05/13/1929" or "2029-05-13".

    For more details, see Why You Should Enter Dates With Four-Digit Years Whenever Possible Go .

  3. Exercise caution when using built-in date functions in spreadsheet and database programs.

    Date functions such as Microsoft Excel's Microsoft Excel's =DATE(), Microsoft Access's =DateSerial(), and Lotus 1-2-3's @YEAR() may not always return the results you're expecting when using dates which fall within the year 2000 and beyond. These date functions can be especially problematic when you use them with 21st century dates containing only two-digit years, such as "00" or "01".

    If you use these functions without fully understanding how they handle 21st century dates, you could introduce incorrect data into your spreadsheets or database files.

    For more details, see the Finding Y2K problems in data files Go section of this document, Data Files: Finding & Resolving Y2K Problems Go .

  4. Exercise caution when importing dates into your data files.

    When you import data containing dates into your files, make sure these dates are represented in an unambiguous manner.

    For example, if the data you're importing contains dates stored as text, make sure that these dates either:


    This is especially critical in the case of data imported from text files, pasted in from the Clipboard, or read in via programmatic data streams. If this data contains century-ambiguous dates with two-digit years, such as "05/13/29", your program could potentially interpret such dates as falling into an incorrect century, thus causing incorrect data to be stored in your data files.

    For more details, see Data Sharing Methods: Finding & Resolving Y2K Problems Go .

Finding Y2K problems in data files

There are two primary ways of finding date-related problems in data files: you can manually scan these files or you can use automated software tools to assist you.

Whether you're manually scanning your data files or using a tool to assist you, you might begin your search for potential Y2K problems by looking for any instances of the following:

For additional suggestions about what types of problems to look for in your data files, you might start by reviewing articles such as the following:

Patrick O'Beirne
Year 2000 and Spreadsheets Go

Patrick O'Beirne
Year 2000 issues in PC Database packages Go

Dan Haught, FMS, Inc.
Solving the Year 2000 Problem in Microsoft Desktop Application Programs Go

Luke Chung, FMS, Inc.
Year 2000 Issues in Microsoft Access Go

Resolving Y2K problems in data files

Caution! Before installing any updates or patches which may alter your computer's operating system software or application programs, and before making any changes to your critical documents (data files), be sure that you have a complete, current backup of your computer's data.

For general backup advice, see the Berkeley Computing and Communications articles
Ask Dr. Micro: How can I back up the files on my computer Go
and
Simplify your life with the UCBackup service Go.

There is no "magic bullet" that will automatically resolve all of your date-related problems in your critical data files.

As noted above, automated software tools Go can assist you in this task by scanning your data files and helping you identify potential problems. Most tools will be able to tell you which spreadsheet cells, or which database records and fields, have potential date-related problems, and provide you with information about the nature of these problems. Some more sophisticated tools may even be able to open your spreadsheet or database files for you, and allow you to directly edit the cells or fields in which potential problems were detected.

However, even the best tools still primarily rely on human judgment when fixing these problems. For example, a tool might be able to detect a problematic use of a date-related function in particular cell in a spreadsheet document, but it will still be the responsibility of the the author or maintainer of that spreadsheet to determine how best to fix this problem.

Start early

We've seen reports from a number of sites indicating that large numbers of spreadsheet and database files at those sites have been identified as having potentially moderate to serious date-related problems. (These problems were typically spotted by data file scanning tools and then a sampling of these problems were confirmed by manual inspection.)

This experience might potentially be the case with at least some computers used for campus business at UC Berkeley. For this reason, it is important that you begin the task of scanning key data files for date-related problems relatively early in your departmental or personal Y2K efforts. If you have tens, hundreds, or even thousands of data files which contain or process dates, and many of these files have date-related problems, it could take a long time to resolve these problems.

Some tips for managing the task of checking and fixing data files

  1. Separate active files from those which are no longer being used.

    When starting out, it's often very helpful to perform some cleanup. You might separate out files which:


    You might move these files to a special folder or directory to help you avoid wasting effort scanning these files.

  2. Focus repair work on just the highest priority files.

    Work to resolve problems first on the files which are of the highest importance to your department's or the campus's mission, then tackle other files when you have time to do so.

  3. Use your tool's capabilities (if any) to help you identify the highest priority problems in your files.

    Some data file scanning tools will rank potential date-related problems by assigning a (tool-calculated) priority or severity level to entire files or to individual problems identified within these files. Use this feature to help you focus on the problems which most urgently require fixing.

  4. Focus on date functions and calculations.

    If you're checking spreadsheets for Y2K problems, focus your efforts on formulas which contain built-in date functions or otherwise process dates.

    The same advice is equally appropriate for any types of data files which include custom programming, such as user-defined functions, scripts, or macros in spreadsheets and scripts or programs associated with databases. Concentrate your efforts on just those areas of your data files which use built-in date functions or otherwise explicitly process dates.

  5. Don't waste time "fixing" dates which are visually displayed with two-digit years.

    Dates which are displayed with century-ambiguous two-digit years, such as 05/13/29, are often flagged as problematic by data file scanning tools. However, even if many dates are visually displayed with two-digit years, such dates are often internally stored in spreadsheets or database files in a manner which accurately preserves four-digit year information. (In some types of spreadsheets, for example, dates are typically stored as "serial dates": the number of days past a starting or "epoch" date.) These dates can be safely used in date calculations, regardless of how they are displayed.

    It is true that dates which are stored internally with four-digit year information but are displayed with only two-digit years could potentially cause problems. Most commonly, these problems will occur if these dates they are exported as text or via the Clipboard. This is because some programs use the as-displayed format, rather than the internal storage format, when exporting data via these methods. However, it's not all that common to share spreadsheet or database data using text files or the Clipboard. More typically, spreadsheet and database files are passed around via disks or e-mail, or shared via file servers.

    In addition, 'dates' in spreadsheets or database files which are stored in text or numeric formats and are century-ambiguous could cause a variety of date-related problems. However, you won't encounter these problems unless these 'non-date dates' are actually used in date calculations.



Find something unclear? Missing? Incomplete? Inaccurate? Or even praiseworthy? Send us feedback about this Web site!

This site is provided by the campus Year 2000 Departmental Computers and Administrative Equipment Subcommittee at the University of California, Berkeley.

Copyright 1999 by the Regents of the University of California.
Disclaimer: The University assumes no liability if the information on this page is used for other than University purposes.