2014-07-14T10:35:00Z

Moving from ESXi Essentials to Plus: key install or a reinstall? What are my options for a redundant active/passive SAN?

DG
PeerSpot user

5 Answers

DG
Real User
Leaderboard
2014-10-13T12:05:36Z
Oct 13, 2014

The bad block issue was indeed the SAN. After chasing mysterious corruption that looked as if a bad query was being run, I finally received one hardware error from the LSI RAID card in the SAN. The hex code was almost unreadable, but after some research I found a chart that defined it: "Drive not ready slot 9" was what the controller was telling me. To my surprise, the drive was good; the fault turned out to be the slot itself. My unbreakable SAN with redundant everything had a bad backplane, which randomly caused the drive in slot 9 to fail and write wrong data to the array.
Once this occurred, the domino effect was massive. Even after we eliminated the slot, the corruption spread from server to server. We shut down and cleaned the individual guest OS filesystems and validated the databases, but the corruption persisted. Finally, needing a stable, safe environment, I chose to move all servers to local storage. Each host had a RAID 5 with 2.7 TB, so we divided up the servers to balance the disk I/O, moved them, and repaired any corruption. We saw no performance hit and had stability again.
With everything off the SAN, and needing to understand what had caused this global corruption, I retraced the events and the steps taken. I found the root cause, and I found why the problem continued after so much work: I had missed one step. I had cleaned VMware and the guest OS of every server install. I had validated every database. But I had not cleaned the filesystem of the SAN itself, and consistency checks of the volumes do not look at the filesystem, only at whether the blocks are intact. The bad slot had corrupted the SAN's filesystem just enough that it started writing data to the wrong blocks. Since application servers and front-end servers are static for the most part, they were the last to see corruption. Only the ever-changing database servers were affected at first.
This made troubleshooting even more difficult, giving the illusion of a stable SAN when in fact it was not. The variables were huge and changing. We now have a repaired SAN, and after analysis and discovery of the issues we are reworking the array: we are going to add a RAID 10 of SSD drives and reinitialize the RAID 6, then place the database servers on the RAID 10 for speed and the rest on the cost-effective RAID 6. No matter how much you may read or sell or install, you never truly own and know a technology until you have a very odd issue and you successfully wrestle it to the ground.
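
For anyone planning a similar move to local storage, here is a rough sketch of one way to divide servers across hosts by disk I/O. It is not the procedure actually used above, just a simple greedy approach: place the busiest servers first, each on the host with the least I/O so far that still has room. The `Host` class, the `place_vms` function, the server names, sizes, and IOPS figures are all hypothetical, and 2.7 TB is rounded to 2700 GB.

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    """A host with local storage (hypothetical model, not a VMware API)."""
    name: str
    capacity_gb: float
    vms: list = field(default_factory=list)
    used_gb: float = 0.0
    iops: float = 0.0


def place_vms(vms, hosts):
    """Greedily assign VMs to hosts.

    vms is a list of (name, size_gb, avg_iops) tuples. The busiest VMs are
    placed first, each on the host with the lowest I/O load so far that
    still has enough free space.
    """
    for name, size_gb, iops in sorted(vms, key=lambda v: v[2], reverse=True):
        candidates = [h for h in hosts if h.used_gb + size_gb <= h.capacity_gb]
        if not candidates:
            raise ValueError(f"no host has room for {name}")
        target = min(candidates, key=lambda h: h.iops)
        target.vms.append(name)
        target.used_gb += size_gb
        target.iops += iops
    return hosts


if __name__ == "__main__":
    # Made-up inventory: two hosts with roughly 2.7 TB of RAID 5 each.
    hosts = [Host("host-a", 2700), Host("host-b", 2700)]
    vms = [("db1", 500, 900), ("db2", 400, 800),
           ("app1", 200, 100), ("web1", 100, 50)]
    for h in place_vms(vms, hosts):
        print(h.name, h.vms, f"{h.used_gb:.0f} GB", f"{h.iops:.0f} IOPS")
```

Real placement would also need to weigh CPU, memory, and which servers should not share a host, so treat this only as a starting point for the I/O side.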

Dan Gillman

it_user132501 - PeerSpot reviewer
Consultant
2014-07-16T17:53:56Z
Jul 16, 2014

I assume you are migrating your Oracle data. Can you confirm? If you are, which migration tool are you using? The wrong migration tool will result in bad-block-related events.

it_user86838 - PeerSpot reviewer
Consultant
2014-07-15T06:04:02Z
Jul 15, 2014

1. Just apply the new VMware Enterprise Plus license. That's all.

By the way, my suggestion is to assign a dedicated physical management server, workstation, or desktop for vCenter. Do not install it in a VM; otherwise you'll be in trouble when a failure occurs.

2. I need to understand the current design of your virtual environment and SAN. Could you please share your design with me? It would make it easier for me to help you in this regard. In any case, after purchasing VMware Enterprise Plus, you'll get Storage vMotion, Storage I/O Control, Storage DRS, and the Storage APIs for Data Protection. Additionally, I suggest you consider buying VMware Virtual SAN as well. It will help consolidate your storage, whether that is your SAN, your DAS, or both.

3. I'm not sure, but my instinct says you should check for compatibility issues between your Open-E storage OS and Oracle.

Thanks.

it_user131937 - PeerSpot reviewer
Vendor
2014-07-14T12:42:32Z
Jul 14, 2014

We are using ESXi 5.1 Enterprise. Whenever the license expires, vCenter drops to standard license mode with some features disabled until the new key is entered; the additional Enterprise features then become available without a reinstall.
As for your other problem with bad blocks, did the problem arise after you moved to virtualization? If so, check for features such as High Availability that provide fault tolerance in case of a virtual disk failure.

it_user113166 - PeerSpot reviewer
Consultant
2014-07-14T12:18:53Z
Jul 14, 2014

You do not have to reinstall to apply a license. License management is in the vCenter console.
I cannot give you any input on the other questions.
