Back to Basics: Explore the Edge Cases or Date Math will Get You
 Disclaimer: I don't work for the Zune team and I don't know anyone on the team. I think that Z2K9 was a bummer, but I don't have any inside knowledge. Everything here came from the public interweb.
 Disclaimer: I don't work for the Zune team and I don't know anyone on the team. I think that Z2K9 was a bummer, but I don't have any inside knowledge. Everything here came from the public interweb.
Dates will get you every time. Further more, it's all about Edge Cases. This is one of the things you'll think about when doing Test Driven Development and it's one of the things that everyone learns in Computer Science 101. You really have to hit those edge cases, be they dates, or number overflows, or buffer overflows.
The news reports:
"A bug in the internal clock driver related to the way the device handles a leap year affected Zune users," said the company in a statement. "That being the case, the issue should be resolved over the next 24 hours as the time change moves to January 1, 2009."
Disclaimer*2: I have no idea if the following is 100% true, only that it seems quite plausible. I present it for educational purposes, nothing else. It's very interesting. Again, I'm just a caveman.
A Zune Fan poked around in the source code from the vendor (Freescale Semiconductor) that made the real time clock in the Zune (The vendor's source for rtc.c is here) and with the benefit of hindsight, noted that there's the opportunity to get stuck in an infinite loop.
The Zune 30 shows the date and time, as do many devices, as the number of days and seconds since January 1st, 1980 at midnight.
year = ORIGINYEAR; /* = 1980 */
while (days > 365)
{
if (IsLeapYear(year))
{
if (days > 366)
{
days -= 366;
year += 1;
}
}
else
{
days -= 365;
year += 1;
}
}
The days variable is read out of the memory location managed by the Real Time Clock (RTC). When the value of days == 366, you break out of the inner loop, but you can never get out of the outer loop.
A number of folks have blogged about the bug, their analysis and how they'd fix it. Programming Phases has a good post and folks have twittered suggestions). The basic problem is that since there are 366 days remaining when the calculations are reached for the year 2008, there will never be another subtraction to bring the total below 365, so the loop continues. The value of days is stuck at 366. It IS a leap year, but days is not > 366, so the loop continues.
In working with banking software for years, I can tell you that dates'll get you. When dealing with dates and date math you can't underestimate the value of really good code coverage. Also, even if you have 100% coverage, as I learned in my interview with Quetzal Bradley, 100% coverage just tells you a line of code DID run, it doesn't tell you that it ran correctly.
I noticed also when I visited my Live home page on Dec-31 that it said today was Jan-1, likely a Time Zone glitch. However, when I clicked the date, I was taken to a page with historical facts about Dec-31. ;)

Dates, especially when TimeZones are added in, are notoriously hard.
To this day there are a half-dozen bugs in DasBlog where we have a devil of a time with Time Zones. Omar spent weeks fighting with them before he just gave up. We have to reconcile the local time of the visitor, the time on the server, and GMT time (the time we store everything in). It usually works, but when the Server isn't on GMT we get into all sorts of trouble. We also have problems with clients that call the XML-RPC APIs, some of which use UTC (GMT) time and some use local time. Other than keeping an internal table to wrong-headed clients, there are no good solutions.
In 2007 CNN talked to a Major General about a bug in some F-22's that caused them to malfunction when going across the international dateline. The Unofficial Apple Weblog had an interesting post on Apple Date/Time bugs through the years. If you're interested in working with and maintaining legacy code, I recommend Michael Feathers' Working Effectively with Legacy Code.
As a random slightly-related aside, I was over at Hollywood Video buying a used copy of Mirror's Edge yesterday and some folks were talking about the Zune problem. The manager said, wow, I'm running the music in here on a Zune right now, and produced his brown 30G Zune from behind the counter. He either hadn't turned it on today fresh or the clock was wrong so he was able to weather the whole day without an incident. He seemed pretty pleased about his "survival."
I wonder how many other less widespread devices are running this real time clock and if any of them had trouble as well.
Do you have any personal Date/Time stories, Dear Reader? Please share in the comments!
About Scott
Scott Hanselman is a former professor, former Chief Architect in finance, now speaker, consultant, father, diabetic, and Microsoft employee. He is a failed stand-up comic, a cornrower, and a book author.
 
                         
                        
About Newsletter
The important point about this is that MSFT got more recognition for Zune from this event than all the other marketing activity they've done combined. Now people actually know what a Zune is.
Makes me think I should plan what bugs should be in the code to help market the product.
When the value of days == 366, you break out of the inner loop, but you can never get out of the outer loop.
Nitpicking, but the inner part isn't a loop :)
I guess the classic case of this is the y2k hysteria. Sweden has something like social security numbers that include the birth date but only with dual digit years. So I spent better part of 1st of january 2000 at my old job fixing calculations for peoples' ages.
My personal date and time issue is related to my new laptop. When day light savings time came aroudn, first, Vista didn't seem to have the new date for this all set. I manually adjusted the clock and COULD NOT get it to stick with the new time... I changed the time in Windows, changed the time in the BIOS, synced with a NTP server... it would stay changed until I rebooted, or even just set back seemingly at random. I was grumbling about Microsoft and then i found, I had the exact same problem with Linux on the system! And then I was messing with time zones and I ended up with my times off by an hour here, two hours there... an hour back, two ahead...
Finally I got it set right. Then, I went on a trip to another time zone... and was foolish enough to mess with the time. Now, Windows is an hour behind, Linux is an hour ahead, and I don't even know what the BIOS thinks the time is.
This had caused leap years to be incorrectly computed (1BC and 5BC are leap years, not 4BC).
Luckily these cases have only ever appeared in unit tests.
http://www.hanselman.com/blog/DontUnderestimateThePowerOfToStringIFormatProvider.aspx
Related code uses time stamps that are modulo 24 hours (to fit in 17 bits) which must be converted (based on some other hints and properties of the system) into epoch times. Complicating all this is the combination of system (server) time, user preferred time zone, and the time zone of the end user's computer. This often came to grief around daylight savings switches, but I may have finally nailed it.
The fundamental issue in our implementation was that while the standard C library has localtime() and gmtime() which convert an epoch time to a tm structure (with elements for year, month, day, etc.) there is only mktime() to go the other way around, which converts using the local machine settings. We needed one that didn't do this; effectively mkgmtime(). Microsoft provides mkgmtime(), but the software was originally cross-platform. Hopefully next April we will see this problem gone for good.
Other than that, no problem.
In the days of the WORLD wide web, a date such as 10/12/08 can mean October 12th 2008 in some parts of the world, and 10th December 2008 in other parts of the world. Even if some developers use the local date format others do NOT and therefore you can never be quite sure (unless it is after the 12th of the month).
I suggest that developers use a format such as Oct-10-2008 or 10-Dec-2008. Of course this does not quite satisfy everyone as the three letter month format is in English. In Germany it is different, and in France it is different again. However, if someone now uses the local settings it is obvious and the date is no longer ambigous. Perhaps a little flag could be displayed beside the date to signal the culture being used.
It seems noone [at least not media] noticed, but some Sony Ericsson mobile phones died in a similar fashion on new year's eve. My wife's phone was one of them. It wouldn't boot so she bought a new one thinking the old one was bricked. Lo an behold, comes 2009 and the old phone works again... I wonder if Sony Ericsson used the same clock/driver as MSFT...?
This gets even more confusing when you're dealing with fiscal calculations against a service that is offerered on a calendar year basis.
When the daylightsavingstime is switched you get some nice results.
When 2AM becomes 3AM, and you start the call at 1:59AM and end it at 2:01 (which did not occur), that has in the meantime become 3:01 you get to pay for a call of 1hour and 2 minutes.
When 3AM becomes 2AM, and you start a call at "the first" 2:50 AM and end it at "the second" 2:55 AM, you made a call of 1 hour 5 minutes, but only have to pay for 5 minutes. Even more funny: When you start the call at "the first" 2:50AM, and end it at "the second" 2:05. you made a call of 15 minutes, but actually never have to pay anything. The software "thinks" you made a call with negative duration, so it gets filtered by the rating engine. Actually, if it wouldn't get filtered, you would be paid, so make sure you call a 0900 or so... unless they fixed the bug by now.
http://www.theregister.co.uk/2009/01/07/oracle_leap_second/
Comments are closed.

