Recovering / reinstalling SRM (Site Recovery Manager) 4.1.1 after suffering a host failure

 

I’ve been meaning to write a post about recovering / reinstalling SRM 4.1 after having to rebuild one when a client suffered a host failure but never got the chance to until this weekend.  The incident happened during a planned datacenter move a few weeks ago where the environment had SRM 4.1 collocated with vCenter 4.1 on a physical server and someone decided to perform firmware upgrades during the move which resulted in the vCenter server continuously bluescreen-ing after the upgrade.  The priority during that when the host failed was obviously not the recovery of SRM because vCenter was more important but I ended up going in to recover SRM a few days later.  What I noticed as I started the recovery was that I wasn’t able to find a public KB from VMware that clearly outlined steps for situations such as these and the closest KB article I was able to find was the:

Migrating an SRM server to run on a different host
http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=1008426

 

image

 

So armed with this KB, I went on reinstall SRM 4.1 onto the production vCenter 4.1 server (the protected site). 

Assumptions

The following are the assumptions for the environment:

  1. There have been no changes made to the SAN replication and it is still in working order.
  2. You are using the same vCenter version prior to the failure.
  3. You have a backup of the SRM database.
  4. SRM is using Microsoft SQL server for the database service.

Downloading the SRA (Storage Replicator Adapters)

Before proceeding to reinstall SRMs, you should download the SRA for the SAN so proceed with opening up a web browser and navigate to:http://www.vmware.com/download/srm:

image 

Click on the Show Details link to expand the list of downloads:

 

image

 

Proceed with scrolling down the list of downloads to the one for your SAN:

 

image

 

image

 

Download and install Microsoft SQL server for SRM 4.1

The next step for the recovery process is to install Microsoft SQL and I can’t help but to vent that I’ve come across way too many environments with the incorrect Microsoft SQL server installed.  While I have yet to see an install cease to function because an unsupported Microsoft SQL server was used, I still prefer to stick with what VMware has listed in the SRM Compatibility Matrix 4.x (srm_compat_matrix_4_x.pdf) so please refer to the following list for the support SQL Server editions:

image

For the purpose of this demonstration, I will be using SQL Server 2005 Express Edition SP2 which can be downloaded here:

http://www.microsoft.com/download/en/details.aspx?displaylang=en&id=22625

image

Proceed with the install by running the executable:

 

image

 

Configuring Microsoft SQL Server for SRM

Once SQL server has been installed, open the SQL Server 2005 Surface Area Configuration for the instance:

 

image

 

Navigate to Instance Name –> Database Engine –> Remote Connections and change Local connections only to Local and remote connections with the option Using TCP/IP only:

image image

Clicking on the Apply button will prompt you with a warning message that the changes will not apply until you restart the database services but you won’t need to restart the services just yet as there are still changes required to be made:

image

Proceed with opening SQL Server Configuration Manager and navigate to SQLServer 2005 Network Configuration (32bit) –> Protocols for SQLEXPRESSand enable TCP/IP:

image

image

You will again be prompted with a warning that you will need to restart the services for the changes to take effect but there are still change required to be made for a restart so proceed with the next steps:

image

If you’re using SQL Server Express as shown in this demonstration, you will need to remove the dynamic ports that the default installation sets so open SQL Server Configuration Manager and navigate to SQL Server 2005 Network Configuration (32bit) –> Protocols for SQLEXPRESS right click on TCP/IPand choose Properties:

image

 

image

 

Navigate to the IP Addresses tab and make sure you change all of the TCP Portto 1433 (default for Microsoft SQL) and TCP Dynamic Ports to the value of 0:

 

image

 

image
image

 

Applying the changes will once again warn you that a service restart will be required for the changes to take affect but we’re not done with the changes so proceed on with the next steps:

image

There is no need for Shared MemoryNamed Pipes or VIA to enabled so disable the protocols if they’re still enabled:

image

With all of these changes made, proceed with restarting the services either in the service console:

image

… or the SQL Server 205 Surface Area Configuration:

image

Restoring SRM Database

Proceed with launching the Microsoft SQL Server Management Studioadministration console:

image

From here, you have the options of:

  1. Restore an .bak file of your SRM database from a previous backup.
  2. Re-attach the .mdf and .ldf files for your SRM database.

In my situation, the client had the .mdf and .ldf files stored on a separate LUN so all I had to do was reattach the LUN to the server and reattach the database.  With that being said, if you don’t intend on restoring the master database to this SQL Express server such as what I’m doing here, the security logins for the server will be missing so prior to reattaching or restoring the database, you will need to configure the security account used for the DSN connection on the SQL server instance first.

——————————————————————————————————————————————————————-

If you’re going to restore the master database, you can ignore the following step:

Navigate to localhost\SQLEXPRESS –> Security then right click on Logins and select New Login.  Within the Login – New window, select the account you used to connect to the SRM database prior to the reinstall:

image

Once you’ve added the login configured, proceed with clicking the OK button and confirm that the login is now listed under the Logins node:

image

——————————————————————————————————————————————————————-

With the service account created under the SQL Express database’s login, proceed with restoring your SRM database.  The following demonstration will use the Attach feature:

image

image

 

image

 

With the database restored, we will now proceed with configuring the other requirements required for the SRM database outlined in the deployment guide:

image

Open up a new SQL query and execute the following command:

CREATE SCHEMA VMW_SRM

Note that VMW_SRM is the database in this demonstration and is not a requirement to be named that way.

image

With the schema with the same name as the service account used for the DSN accessing the database created, open up the SRM database’s properties and configure the Default schema with the schema we created:

image

image

 

image

 

With the databases’ configuration complete, proceed with opening the properties of the service account you’re using for the DSN connection and give itbulkadmin and public roles:

image

Map the account to the SRM database:

image

Configuring the 32-bit SRM DSN

With the configuration for the database and user account completed, proceed with creating the 32-bit DSN for the SRM database.  I won’t go into too much detail but for more information, please refer to one of my vCenter / Update Manager posts (use the 32-bit instructions):

http://terenceluk.blogspot.com/2011/02/creating-vcenter-update-manager-41-sql.html

cd\Windows\syswow64

odbcad32.exe

image

image

Installing SRM 4.1

Now that all of the prerequisites have been installed and configured, proceed with running the installation binaries for VMware Site Recovery Manager:

image

image

Note that you’ll be warned that your production vCenter server already has an extension registered for SRM during the vCenter server registration section and since you’re recovering from a host failure, proceed with selecting Yes:

image

Note that it is important that you fill in the field Local Site Name with the same site name you used for the SRM site you are recovering or you’ll receive the an error when you’ve completed the recovery:

image

image

Make sure you select the Use existing database option:

image

image

image

Reinstalling the SRA (Storage Replicator Adapters)

With SRM reinstalled, proceed with installing the SRA you downloaded earlier:

image

image

image

image

image

Download and install vCenter Site Recovery Manager

With the SRA installed, proceed with launching vCenter and install the vCenter Site Recovery Manager plug-in:

image

image

image

image

image

Launch Site Recovery Manager

With the plug-in for SRM installed and enabled, proceed with opening the Site Recovery plug-in:

image

Run the installcreds utility to register account credentials on the new host with the old DSN

Open up the command prompt as an administrator and change the directory to:

C:\Program Files (x86)\VMware\VMware vCenter Site Recovery Manager\bin>

… within the directory above, execute the following:

installcreds.exe -key db:srm -u domain\vmw_srm

For this demonstration, the database user name is a domain account named VMW_SRM so please change that to the appropriate domain and user account for your environment.

Run the srm-config utility to establish an authenticated connection to the local VirtualCenter server

Open up the command prompt as an administrator and change the directory to:

C:\Program Files (x86)\VMware\VMware vCenter Site Recovery Manager\bin>

… within the directory above, execute the following:

srm-config.exe -cmd updateuser -cfg ..\config\vmware-dr.xml -u VMW_SRM

For this demonstration, the database user name is VMW_SRM so please change that to the appropriate user account for your environment.

 

image

 

Review Protection Groups

Proceed with logging into the Site Recovery plug-in and verify that your protection groups are in good health:

image

… and we’re done!  I ran into more errors after bringing the protected site up but will separate those errors into other blog posts instead.

Measure
Measure
 
 

Read more

How to document Home Lab and Network

運維機房和跨域的網路,會遇到各式需求與問題,用對工具才能分析問題,個人覺得最重要的是使用能處理問題的工具。 推薦目前想學和正在使用的平台與軟體,協助將公司/家用機房文件化 佈告欄任務管理 Focalboard 白板可管理任務指派 網路架構文件編寫 netbox 精細管理網路設備與連接線路 IP 資源管理 phpipam 專注網路IP分配 邏輯塊文件編寫 draw.io 視覺化概念圖 機房設備管理 ITDB 管理設備生命週期與使用者

By Phillips Hsieh

如何在Raspberry Pi4上安裝Proxmox for ARM64

第一步 準備好Raspberry Pi 4 / CM4 4GB RAM,這裡要留意CM4如果是買有內建eMMC storage會限制不能使用SD卡開機而限制本地空間容量,如果沒有NAS外接空間或使用USB開機的話,建議買CM4 Lite插上大容量SD卡 第二步 去Armbian官網下載最小化Debian bookworm image https://www.armbian.com/rpi4b/ Armbian 25.2.2 Bookworm Minimal / IOT 然後寫入SD/USB開機碟,寫入方法參考官方文件 https://github.com/raspberrypi/usbboot/blob/master/Readme.md Note: 官方提供的預先設定系統方法,可以在Armbian初次啟動自動化完成系統設定。連結在此 https://docs.armbian.com/User-Guide_Autoconfig/

By Phillips Hsieh

世界越快心越慢

在晚飯後的休息時間,我特別享受在客廳瀏灠youtube上各樣各式創作者的影音作品。很大不同於傳統媒體,節目多是針對大多數族群喜好挑選的,在youtube上我會依心情看無腦的動畫、一些旅拍記錄、新聞時事談論。 尤其在看了大量的Youtube的分享後,我真的感受到會限制我的是我的無知,特別是那些我想都沒想過的實際應用,在學習後大大幫助到我的生活和工作層面。 休息在家時,我喜歡想一些沒做過的菜,動手去設計生活和工作上的解決方案,自己是真的很難閒著沒事做。 如創作文章,陪養新的習慣都能感覺到成長的喜悅,是不同於吃喝玩樂的快樂的。 創作不去限制固定的形式,文字是創作、影像聲音也是創作,記錄生活也是創作,我想留下的就是創造—》實現—》回憶,這樣子的循環過程,在留下的足跡面看到自己一路上的成長、失敗、絕望、重新再來。 雖然大部份的時候去做這些創作也不明白有什麼特別的意義,但不去做也不會留下什麼,所以呀不如反事都去試試看,也許能有不一樣的水花也許有意想不到的結果,投資自己永遠不會是失敗的決定,不是嗎?先問問自己再開始計畫下一步,未來沒人說得準。 像最近看youtube仍大一群人在為DOS開

By Phillips Hsieh

知識管理的三個步驟:一小時學會把知識運用到生活上

摘錄瓦基「閱讀前哨站」文章作為自己學習知識管理的內容 Part1「篩選資訊」 如何從海量資訊中篩選出啟發性、實用性和相關性的精華,讓你在學習過程中不再迷失方向。 1. 實用性 2. 啟發性 Part2「提高理解」 如何通過譬喻法和應用法,將抽象的知識與日常生活和工作緊密結合,建立更深刻的理解。 1. 應用法 2. 譬喻法 Part3「運用知識」 如何連結既有知識,跟自己感興趣的領域和專案產生關聯,讓你在運用知識的路途上游刃有餘。 1. 跟日常工作專案、人際活動產生連結 # 為什麼要寫日記? * 寫日記是為了忘記,忘卻瑣碎事情,保持專注力 * 寫日記就像在翻譯這個世界,訓練自己的解讀能力 * 不只是透過日記來記錄生活,而是透過日記來發展生活 #如何寫日記? * 不要寫流水帳式的日記,而是寫覆盤式的日記 當我們試著記錄活動和感受之間的關聯,有助於辦認出真正快樂的事 日記的記錄方式要以過程為主,而非結果 * 感恩日記的科學建議,每日感恩的案例

By Phillips Hsieh