

Velocity 2011 - Yahoo! homepage failed
source link: https://www.otsukare.info/2011/06/15/velocity-2011-yahoo-homepage-failed
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Lessons from Yahoo’s Homepage: 5 tips for High availability
Jake Loomis is in charge at Yahoo! of VP. The most visited Web site in USA. over 600 millions global users. 300k+ servers, 100pb+ storage. The Yahoo! home page is the entry point. around 40,000 requests per second. When Yahoo! went down it made a lot of news. Amazon or Playstation outages have business consequences
Tip #1 Redundancy for everything
Down to the code, application features. Understanding your system’s failure points is very important. Everything can and will fail at one point. Redundancy at servers, network devices, colo loses power, user database. ad lookup fails, page fails completely (show a static page instead).
Tip #2 Error proofing change
Make changes in a safe environment. Practice how you play. Test also in environment which is very close from the production environment. QA environement has to be treated the same way that the production environment. Same logs, same type of catching issues. etc. Fork production traffic in the staging environment. We did that for Le Grand Club at RDS. You have to be able to recover quickly.
Tip #3: Global Load balancing
Route traffic to different places in the world. Ability to serve any market from any data center. If your site depends on one coloc, then there is an issue. By being close to the user, it offloads traffic, part of the performance issues.
Tip #4: Monitor everything
There are huge traffics you can prepare for such as a big planned event, like a royal wedding, but there are news coming by chance. These can generate stress on the infrastructure. At Pheromone, we were planning the days where the players of LNH are exchanged. Unplugging some features, etc.
Tip #5: Fallback plans in case of failure
What is happening and how do you handle in case of accidents? Isolate failure for not impacting users. You may want to drop features to add capacity. Learn from failures and followup.
Bonus Tip: Talent
“Having people who are talented is key. ” Well… lapalissade.
Recommend
-
12
Mobile Web and HTML5 Performance Optimizationotsukare Thoughts after a day of workMobile Web & HTML5 Per...
-
11
Know Your Engines: How to Make Your JavaScript Fast (the slides contains a lot of materials) David Mandelin (Mozilla) In 5 years some javas...
-
10
Mobile Web Performance It turns it is basically a webkit talk… sigh Webkit relies on JavascriptCore or V8. Two important parts are Render...
-
14
Performance and CSS3 - beyond the hype The situation has changed a lot with regards to CSS3. Browsers have made huge progress on this area. There is a pl...
-
9
Change = Mass x Velocity, and Other Laws of Infrastructure Mark Burgess is starting with a “Lapalisse” comment: “We are living in an exciting time.” The...
-
10
Opening Remarks - Jesse Demonstrates Caffeine IV John Allspaw is being graduated as a co-chair of the program of the conference. (sponsors thanks).
-
7
Advanced Postmortem Fu and Human Error 101otsukare Thoughts after a day of workAdvanced Postmortem Fu and Hu...
-
13
Velocity 2011 - Big Data, Scale Dirty Mar 14 juin 2011by Karl Dubost (Worked at...
-
6
Velocity 2011 - Performance Tools Mar 14 juin 2011by Karl Dubost (Worked at Opera f...
-
3
Turning into Bing in both look and quality — Google.com tests a news-filled homepage, just like Bing and Yahoo The mobile app has a "Google Discover" newsfeed, so the homep...
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK