Master of the obvious....that's me!!!
One thing I hadn't realized until I started researching HA/DR incidents for recent stats is just how often failures happen. Maybe I am a bit isolated due to the excellent services our IT org does .....but some interesting tidbits I learned:
I would have thought by now that as smart as all of us IT folks are that outages would have been diminishing ....or at least much less often. With hardware reliability at a fairly high compared to a decade (or two) ago and especially with SSD's eliminating moving parts..... But, alas, we still have software to blame things on.
The reason I was doing this research is that in my job as a Product Manager for ASE, I was tasked with focusing on the ASE Always-On option that we are releasing with ASE 16sp02. While it has been under development for sometime, it was fun (and sometimes exasperating) to put my hands on it and take it for a test spin. One of the considerations behind Always-On is to address two of those problems above - the Data Center failures and Upgrades.
With regard to the former, Always-On is based on typical HADR synchronous replication. This allows you to take a hit on the primary and failover to the standby with minimal latency and zero data loss. While the last is key, the former was really surprising at how well we did even at higher volumes. While the solution is based on SAP Replication Server, even under heavy load, engineering has done such a good job of pretuning it that one customer was able to get ~20K rows/sec from primary to standby with <5secs latency. Internally (and with better HW) we were able to get much better.... And to think I remember categorizing 5K rows/sec as high volume a few years ago...
With respect to the upgrades, ASE Always-On will be just that - always on - even during upgrades. Yes, even when you upgrade to ASE 17 or whatever the next major release is. The current installer installs the software and then manages the failover/failbacks in such a way that overall processing is never interrupted.
Not only that, but failover times were quite impressive. I spent a lot of time recently doing kill -9's on my dataservers to simulate a crash. When the project originally started, there were some fairly long times discussed as a possibility (which influenced the JDBC api a bit). However, under testing (including customer tests), the speed is much faster than OS clustering and competing with ASE Cluster Edition in some cases.
Now, doing this did take a few changes to both ASE and SRS. For example, we all are familiar with "near zero" downtime approaches in which you get ready to failover and at the appointed time, you kill all the applications (or have them log out), flip to the new system and restart the applications. With luck and enough staff, it can be done in minutes. But ASE 16sp02 added a new soft quiesce feature that even eliminates that - as a result, planned outages can happen without disrupting the applications.
There were some other modifications made. On next Wednesday, I am hosting a webcast on the ASE 16sp02 Always-On option. If you would like to hear more - join me.
Join us for a webcast September 30th, 10:00 EDT: SAP ASE 16 Always-On – The ultimate in zero downtime/zero data loss. I’ll be discussing the new SAP ASE 16 always-on feature that leverages synchronous log replication and SAP’s leading database replication technologies to create a solution for customers that is a comprehensive solution for high availability (HA) and disaster recovery (DR) requirements. This new feature will allow zero downtime during major upgrades to SAP ASE or key hardware components (such as a complete storage upgrade) while being impervious to data storage failures. Offering zero data loss in HA configuration, it helps to reduce application RTO objectives while providing transparent application failover for both planned and unplanned outages.
Register now: https://event.on24.com/eventRegistration/EventLobbyServlet?target=registration.jsp&eventid=1037729&s...
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
User | Count |
---|---|
11 | |
10 | |
10 | |
10 | |
9 | |
8 | |
8 | |
7 | |
7 | |
6 |