what happend (FC down part of Saturday & yesterday)? | Page 2 | FerrariChat

what happend (FC down part of Saturday & yesterday)?

Discussion in 'Ferrari Discussion (not model specific)' started by NYJETSFAN, May 16, 2004.

This site may earn a commission from merchant affiliate links, including eBay, Amazon, Skimlinks, and others.

  1. thecarreaper

    thecarreaper F1 World Champ
    Silver Subscribed

    Sep 30, 2003
    18,130
    Savannah



    BIG GRIN!!!!!!!!!!!!!! DROOL DROOL!
     
  2. wax

    wax Five Time F1 World Champ
    Lifetime Rossa

    Jul 20, 2003
    52,481
    SFPD
    Full Name:
    Dirty Harry
    Good job going to the root of the problem, Rob.
    Folks - this sort of thing has happened to many a type of database/server/administrator. I'm on another site that is run by one of the brightest, experienced, computer, internet, technically savvy people on Earth. Absolutely brilliant. Site still crashes 4 times a year.
     
  3. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
    Well, the site going down Saturday night was the first major problem since Nov. 1st of last year and yesterday afternoon was the second major problem. So I guess that's about a 4 times a year average. :) Here's part of the explanation for you nerds...

    Saturday was a hardware/disk failure. When you try to hit the site and it can't be found by the browser at all, then that's usually a hardware, network, or domain resolution problem. I was on vacation and didn't have easy access to a computer, so that's the only reason it took longer than usual to fix. Once Omar called me, I went home and confirmed it was out of my hands, then I called my admins and it took them about 2 hours of work on the actual box in Ohio to fix. Root cause fix: hardware just fails, but with the next server upgrade I'm definitely going with higher quality hardware and maybe even consider fail over designs.

    Yesterday's off and on problems where you got the system message there was a database problem and to contact me. Well, that was all MySQL. The server was up and vBulletin was working, it just couldn't hit the database. MySQL has scripts to auto-restart, but it couldn't because there were some corrupted tables. That's usually an easy fix as you can just run an auto scan/fix tool that MySQL has, however, the table that was corrupted was a HEAP table and the MySQL scan/fix doesn't work on it. Root cause fix: This is the first time this has happened, so next time it at least won't take as long to fix, because I know exactly what to do. The HEAP table in question is the session table and as far as I know, is just used to keep track of active users browsing the site at that time. It's really not needed for us as I have the 'Who's logged in' display turned off. I'm going to inquire with vB if there's a way to shut this table off all together.

    One last note. I lost all my email from about 2pm to 8pm CST yesterday. If you sent me something that needs a response and I haven't responded, please resend or PM.

    Thank you,
    rob
     
  4. richard_wallace

    richard_wallace Formula 3

    Feb 6, 2004
    1,957
    Cincinnati, Ohio
    Full Name:
    Richard Wallace
    Rob,

    You might want to consider running some monitors (or have your hosting company) like BMC or Sitescope - that can check the heart beat of the DB and HW - plus give you the heads up / page/call if there is an issue.

    The other consideration - down the road as you gain more volume - will be to upgrade to Sun or Linux for the DB engine and use Oracle... (not that I am bashing MS or anything). Volume becomes a consideration once you hit a certain point. Also utilizing Oracle you also get a better toolset with DBtools, etc.

    Just a thought - from one of your F-chat Consultants :)
     
  5. milstanselnino

    milstanselnino Formula Junior

    Jan 8, 2004
    573
    MN
    Full Name:
    Jon P.
    Oh, Toto, I have a feeling Rob's not in Kansas anymore.

    BTW Thanks for fixing it quickly
     
  6. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
    We have cron jobs and scripts for almost everything, but there's always something new that falls through.

    I agree Oracle would be a more stable environment, but it's a little out of my price range right now. We are on Redhat right now though, I prefer the Unix OS.
     
  7. Artherd

    Artherd F1 Veteran

    Jun 19, 2002
    6,588
    Bay Area, CA
    Full Name:
    Ben Cannon
    What version of mysql are you running? It should be fairly solid, but AFAIK mysql still dosen't run truely joined queries, one of the reasons I do not like it. Postgre does, and it is also freeware.


    Oracle is great if you're making (real) money with the system. their business practices and CEO leave much to be desired however. Prices they charge are highway robbery.

    RAID will *SOLVE* your disk problems. Highly recomended (get a standalone unit and bootable FC <qlogic> card for the machine.)
     
  8. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
    I have the latest version of MySQL.

    I'll get a RAID drive for the new server.

    Now what is this bootable FC <qlogic> card all about? I don't know anything about that.

    Thanks,
    rob
     
  9. wax

    wax Five Time F1 World Champ
    Lifetime Rossa

    Jul 20, 2003
    52,481
    SFPD
    Full Name:
    Dirty Harry
    Aw, shoot. I guess you didn't see the email about how I had 5 million tax-free US dollars available for you only between the hours of 4 and 6 pm that very day.
     
  10. normhuff

    normhuff Formula Junior

    Dec 14, 2003
    716
    Peoria, IL
    Full Name:
    J. Norman Huff, Esq.
  11. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
  12. Artherd

    Artherd F1 Veteran

    Jun 19, 2002
    6,588
    Bay Area, CA
    Full Name:
    Ben Cannon
    Rob- this is just so you boot the machine off the RAID array as well. No sense having a great redundant data drive, only to be screwed when your boot disk goes out.

    Solution: just boot off the RAID :)
     
  13. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
    Ben and Richard,

    I'm going to start working with my host soon to design the new server(s), please give me any other advice.

    Right now we're using about 500 Gb of bandwidth a month (growing 5-10% a month) and taking up 20 Gb of drive (growing 5-10% a month).

    So I already have a RAID drive w/bootable FC <qlogic> card on the list.

    Multiple processors? Which ones?

    Anything else?
     
  14. trasus

    trasus Rookie

    Feb 9, 2004
    22
    Phoenix, Az.
    Finally, something I can contribute :)

    Essentially, there are a few ways to go about this, but knowing how colo facilities charge and the volume you're looking at, there's really only one way to go.

    RAID is essential. You'll want to go RAID 5E (RAID with hot spare disk) for the database, and mirrored boot disks. Since your bandwidth and page-per-second serving requirements are relatively light, you could consider a single RAID5E logical drive for both booting and DB, especially with RedHat and MySQL. This will keep your costs low and colo profile reasonable. Personally, I'd go with an IBM x345 or HP DL380 (avoid Dell at all costs!). This would give you the added benefit of more disks spinning (equals faster seek times in DBs) and greater fault tolerance / redundancy. If you use 15K RPM drives, you'll allow yourself plenty of headroom for growth as well.

    Multiprocessor won't do anything for you on an Intel platform, because no Wintel/Lintel server on the market today can dynamically disable a processor without going down for a restart. You can beat that by implementing a VMWare virtualization effort, but that is undo cost and complication for your scenario.

    Some technologies from HP / IBM do offer fault tolerance at the RAM level beyond the traditional ECC you'd find 3 years ago (essentially these allow you to perform RAID with your RAM). Honestly, you'll have memory failures at least 100X more often than processors, so this is a worthy investment.

    Redundant power is a great thing as well, and in server-dense farms like colos, you'll also want redundant fans to cope with what are often poorly ventilated, over-extended server rooms. In these environments, a simple fan failure can lead to multiple failed components.

    Rob, I could rant and rave about these things all day. This is what I do, sometimes just for fun :) If you have any questions, let me know by PM, I'd be happy to help.
     
  15. shelbee

    shelbee Guest

    Thanks for all the hard work ROB!! Glad to be on Fchat
     
  16. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
    Thank you very much for the great advice, much of this is way over my head, but when I get the system designed, I'll share it on here first before I get it.

    Don't I want multiple processors just for the speed, if not redundancy?
     
  17. richard_wallace

    richard_wallace Formula 3

    Feb 6, 2004
    1,957
    Cincinnati, Ohio
    Full Name:
    Richard Wallace
    The above posts are on target... Not sure what Iron you are running on... If it were Blue - I would be going with the IBM 365's rack mounted (Every Maker Compaq/Dell, etc have equilvalant.- you will get your redunant power, etc. Stick with Raid 5 - hot swappable - Disk is cheap - so shouldn't cost you a ton. As mentioned mulitprocessor on Intel will not buy you much - but I would go with a 2 or 4 proc - you will get some basic peroformance gains over a 1 proc. You are still pretty small as far as DB size - 20Gb - SQL will still handle this - Oracle 9i will be the way to go once you double/triple.. I am surprised that your hosting company does not have you on EMC for the DB - like Clarion or Symmetrix (sp) - That way you are utilizing high power disk resource for your DB - and have the Web/App tier run local disk and DB on EMC or Sharkbites, etc. (however this is a higher end solution - may not be an option with the hosting provider).

    Ok that is enough geek talk - PM me or give me a call if you need any more suggestions.

    Rich
     
  18. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
    Right now I'm on a leased server... 1 proc, 2 Gb RAM, 80 GB drive.

    Should I still get a leased solution or should I CoLo? Not sure I want to get into buying my own hardware. Every time I get a server I think it will meet my needs for 3 years and I'm already replacing it after 9 months. This is like the 5th upgrade I've done in under 4 years.

    With the host I'm with, a 1U rack rental and the bandwidth are almost similar to what I'm paying now for all that leased. Doesn't make sense to CoLo in my mind.

    However, if leased, they'll always try to get the cheapest hardware I think.
     
  19. trasus

    trasus Rookie

    Feb 9, 2004
    22
    Phoenix, Az.
    Rich is right on with his analysis. Given the opportunity, 4-way Xeon attached to a SAN is the way to go but it is EXPENSIVE. I recently did a "low-end" implementation like this and it ran to about $78K in hardware alone (using M$ software didn't help the costs either :) ). For maximum scalability I'm completely behind Rich on this one, but the price and requirements don't bear out the expense. You'll see huge improvements in performance just by moving to a RAID5 config (much faster DB lookups) and SCSI (sounds like your provider is serving your pages off a desktop motherboard and hard drive stuffed into a 1U rack mount case, which is very common). Xeons have hyperthreading, which basically makes one processor work like 2. You can offload your Apache onto the strong portion of the CPU and relegate the disk intensive database work to the remaining thread. I've found this to be very effective on sites with larger requirements.

    I think the best middle ground for speed and redundancy might be to go with a dual Xeon 1U server (pizza box for all you other geeks out there) attached to a dual ESM external SCSI array, like the IBM EXP400 (I'm a fan of Big Blue). That way, you'd get the benefits of lots of disks (up to 14 platters) for fast database actions and give yourself some nice headroom because the EXP400 would allow you to attach a second server later on and do a cluster. Lower up front costs than SAN, and plenty of expandability in the future. Good solution.

    There is also the ebay route. Scoop up a 4-way xeon back off-lease from one of the big ASPs and run with that. Total cost will be less than the solution I proposed, and with some upgrades and tweaks, you'll have more performance than you ever dreamed you'd need.

    One more thing. Definitely colo or managed dedicated hosting on YOUR server. There is no substitute as far as I'm concerned.
     
  20. TestShoot

    TestShoot F1 World Champ
    Silver Subscribed

    Sep 1, 2003
    12,319
    Beverly Hills
    I would like to take this time to apologize for my denial of service attack on Lambochat, which ironically points to...

    I will say that it was most likely a mysql fart. It is free, but it is a ram hog.

    I might be able to loan a 2way 2ghz xeon box with a gb of ram and raid5 based out of USC (minimal hops to anyplace on earth) and plug in some space on our SQL server. Perl is running on the box, so is compression to nip that transfer size in the butt.

    One could argue Unix vs Intel like Ferrari vs Lambo (in no specific order), point is stick with what you are comfy with.

    My suggestion would be to allow img code so you are not jamming your self up with image transfers as your b/w hog.
     
  21. richard_wallace

    richard_wallace Formula 3

    Feb 6, 2004
    1,957
    Cincinnati, Ohio
    Full Name:
    Richard Wallace
    Rob - I would lease don't buy - that way you can get out and upgrade sooner as needed - and once you run power through the box - it is not worth much... But I would tell them what you want to lease - or lease your own and co-lo.

    Rich
     
  22. rob lay

    rob lay Administrator
    Staff Member Admin Miami 2018 Owner Social Subscribed

    Dec 1, 2000
    64,305
    Southlake, TX
    Full Name:
    Rob Lay
    Let's not forget about budget here. If I work too many years on this "hobby" at 10 cents an hour, then I might be single pretty soon. :) I'm roughly thinking a $1,000 per month budget for the server and bandwidth. That might change your recommendations. $75k is WAY out of the picture right now. Thank you.
     
  23. TestShoot

    TestShoot F1 World Champ
    Silver Subscribed

    Sep 1, 2003
    12,319
    Beverly Hills
    rob i have a 4u server here that can handle dual 550 p3's with onboard raid, i'll donate a few scsi drive to the cause. maybe if nothing else it can help handle some of the load by running just the php and the other box can do the db.

    I have an ibm drvs 9gb drive that is really fast just for the os, and two 18.4 gb cheetahs, a few ide drives, no ram or cpus, but those are cheap. the motherboard is an intel 440glx+
     
  24. vraa

    vraa F1 Rookie
    Rossa Subscribed

    Oct 31, 2003
    4,599
    Texas
    Full Name:
    Mr. A
    Instead of one very fast server why not two fast servers? Maybe something along the lines of two dual AMD's, one for mail, pictures, whatever and the other for the main F-Chat itself?

    I'm trying to get another major forum to do something like this and optimize a lot of their forum stuff because it's just hogged down. vBull isn't the nicest thing once you start hitting a large amount of members. It always needs work.
     
  25. trasus

    trasus Rookie

    Feb 9, 2004
    22
    Phoenix, Az.
    No kidding :). The scenario I mentioned should be less than $700 with bandwidth. Certainly no more than $1000. $1000 per month should carry you to 20,000 users realistically.

    Have a great weekend!
     

Share This Page