How to troubleshoot frozen media on UNIX and Windows

Problem

DOCUMENTATION: How to troubleshoot frozen media on UNIX and Windows

Solution

Modification
When troubleshooting frozen media issues, it is important to understand the following:

 

  • Media must be unfrozen one at a time.
  • Frozen media is not necessarily defective. Freezing media is a safety measure taken by the NetBackup application to help prevent further errors, drive damage, or possible data loss.
  • Investigate whether there is any pattern in the media IDs, tape drives, or media servers involved when media are frozen.
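
One way to look for such a pattern is to tally freeze events by media ID, drive, and media server. The following is a minimal sketch, not a NetBackup tool: the record layout and the sample values (including the server name "dallas") are hypothetical and would be collected by hand from the logs listed below.

```python
from collections import Counter

# Hypothetical freeze records gathered by hand from bptm logs:
# (media ID, drive index, media server). Values are illustrative only.
freeze_events = [
    ("E00109", 2, "denton"),
    ("E00110", 2, "denton"),
    ("E00231", 0, "dallas"),
    ("E00109", 2, "denton"),
]

def tally(events):
    """Count freezes per media ID, per (server, drive) pair, and per server."""
    by_media = Counter(media for media, _, _ in events)
    by_drive = Counter((server, drive) for _, drive, server in events)
    by_server = Counter(server for _, _, server in events)
    return by_media, by_drive, by_server

by_media, by_drive, by_server = tally(freeze_events)

# The most frequent (server, drive) pair is a candidate for closer inspection.
print(by_drive.most_common(1))
```

If one drive or one media server accounts for most of the freezes, the drive or its path is a more likely cause than the individual tapes.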

The following logs are useful when troubleshooting frozen media:

 

UNIX:
 

  • The bptm log from the media servers that froze the media: /usr/openv/netbackup/logs/bptm
  • The messages or syslog from the OS
  • The file /usr/openv/netbackup/db/media/errors on the media server

 

Windows:
 

  • The bptm log from the media servers that froze the media: <Install_dir>\VERITAS\NetBackup\logs\bptm
  • The Windows Event Viewer System Log
  • The Windows Event Viewer Application Log
  • The log file <Install_dir>\VERITAS\NetBackup\db\media\errors

 

Note:  It is preferable to set bptm logging to a verbosity of 5 when troubleshooting media and drive issues. The bptm process log does not tend to consume excessive disk space or resources, even at an elevated verbosity. When media is frozen, the bptm log may explain why in more detail than the Activity Monitor or the Problems Report. Verbosity for bptm must be increased on each media server individually by changing its logging levels under Host Properties in the Administration Console.

The following status codes can cause, or result from, frozen media:

Status Code                          Reason
84 – Media Write Error               If the tape unit cannot read or write to the tape correctly, this status code can occur when media are frozen.
86 – Media Position Error            If the tape unit cannot read or write to the tape correctly, this status code can occur when media are frozen.
96 – Unable to allocate new media    If media continue to become frozen, the backup job may end in a Status 96 because no more media are available to mount.

 

 

 

The following are five common situations in which media become frozen:

1.   The same media has excessive errors during backup
 

 

FREEZING media id E00109, it has had at least 3 errors in the last 12 hour(s)
 

 

Common causes and resolutions for this include:
 

  • Dirty drives. Clean the drives that are freezing media. One of the first symptoms seen with a dirty drive is often frozen media. Drive cleaning should be done according to the manufacturer's suggestions.
  • There may be an issue with the drive itself. Check the OS system logs mentioned above for any errors regarding tape devices or errors reported by the driver for the tape device. If any are found, follow the hardware manufacturer's recommendations for this type of error.
  • There may be an issue with communication at the SCSI or Host Bus Adapter (HBA) level. Check the OS system logs mentioned above for any errors regarding SCSI or HBA devices or errors reported by their driver. If any are found, follow the hardware manufacturer's recommendations for this type of error.
  • Ensure that the tape drives appear on the hardware compatibility list as supported for NetBackup. See related links below. 
  • Ensure that the media is supported for use with the tape drive by the tape drive vendor.
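
When many such messages appear in the bptm log, it can help to extract the media ID and error counts programmatically. The sketch below assumes the message wording matches the example quoted above for this situation; other NetBackup versions may word it differently, and the function name is hypothetical.

```python
import re

# Pattern modelled on the freeze message quoted above (an assumption:
# the exact wording may vary between NetBackup versions).
EXCESSIVE_ERRORS = re.compile(
    r"FREEZING media id (?P<media_id>\w+), it has had at least "
    r"(?P<errors>\d+) errors in the last (?P<hours>\d+) hour\(s\)"
)

def parse_excessive_errors(line):
    """Return media ID, error count and time window, or None if no match."""
    match = EXCESSIVE_ERRORS.search(line)
    if match is None:
        return None
    return {
        "media_id": match.group("media_id"),
        "errors": int(match.group("errors")),
        "hours": int(match.group("hours")),
    }

sample = "FREEZING media id E00109, it has had at least 3 errors in the last 12 hour(s)"
result = parse_excessive_errors(sample)
print(result)
```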

2.   An unexpected media is found in the drive

Incorrect media found in drive index 2, expected 300349, found 200244, FREEZING 300349
 

 

This can occur under the following circumstances:
 

  • If NetBackup requests that a media ID be mounted in a drive and the media ID physically recorded on the tape differs from the requested NetBackup media ID, the media is frozen. This can happen if the robot needs to be inventoried, if barcodes have been physically changed on the media, or if the media was previously written to by another NetBackup installation with different barcode rules.
  • The drives in the robot are not configured in order within NetBackup, or are configured with the wrong tape paths. Configuring drives with the correct Robot Drive Number is important for media to be mounted and used properly. The Robot Drive Number, commonly set by correlating the drive serial number with the drive serial number information from the robotic library, should be determined and validated before the device configuration is considered complete.
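
To see whether the mismatches point at one drive (which would suggest a drive numbering or path problem rather than a barcode problem), the message for this situation can be broken into its fields. This is a sketch that assumes the wording of the example message above; the function name is hypothetical.

```python
import re

# Pattern modelled on the "Incorrect media found" message quoted above
# (an assumption: the wording may differ in other NetBackup versions).
INCORRECT_MEDIA = re.compile(
    r"Incorrect media found in drive index (?P<drive>\d+), "
    r"expected (?P<expected>\w+), found (?P<found>\w+), "
    r"FREEZING (?P<frozen>\w+)"
)

def parse_incorrect_media(line):
    """Return drive index and media IDs from the message, or None."""
    match = INCORRECT_MEDIA.search(line)
    if match is None:
        return None
    return {
        "drive_index": int(match.group("drive")),
        "expected": match.group("expected"),
        "found": match.group("found"),
        "frozen": match.group("frozen"),
    }

sample = ("Incorrect media found in drive index 2, expected 300349, "
          "found 200244, FREEZING 300349")
mismatch = parse_incorrect_media(sample)
print(mismatch)
```

Note that the frozen media ID is the one NetBackup expected, not the one it found in the drive.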

3.  The media contain a non-NetBackup format
 

 

FREEZING media id 000438, it contains MTF1-format data and cannot be used for backups
FREEZING media id 000414, it contains tar-format data and cannot be used for backups
FREEZING media id 000199, it contains ANSI-format data and cannot be used for backups
 

 

These are usually tapes written outside of NetBackup that have found their way into the library. By default, NetBackup writes only to blank media or to other NetBackup media. Other media types (DBR, TAR, CPIO, ANSI, MTF1 and recycled Backup Exec BE-MTF1 media) are frozen as a safety measure. This behavior can be changed with the following procedure:
 

 

1.  From the Administration Console, proceed to Host Properties | Media Server
 

2.  Open the properties for the media server in question
 

3.  Select the Media tab
 

 

The Allow Media Overwrite property overrides the NetBackup overwrite protection for specific media types. To disable overwrite protection, select one or more of the listed media formats.
 

 

Stop and restart the NetBackup services for the changes to take effect.
 

 

Caution:  Do not select a foreign media type for overwriting unless it is certain that this media type should be overwritten. For more details on what each media type is, see the NetBackup System Administrator's Guide.
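
Before changing the overwrite setting, it may be useful to see which foreign formats are actually causing freezes. The sketch below groups media IDs by format, assuming the message wording matches the examples quoted in this section; the function name is hypothetical.

```python
import re
from collections import defaultdict

# Pattern modelled on the foreign-format freeze messages quoted in this
# section (an assumption: wording may vary between NetBackup versions).
FOREIGN_FORMAT = re.compile(
    r"FREEZING media id (?P<media_id>\w+), "
    r"it contains (?P<fmt>\S+?)-format data"
)

def group_by_format(lines):
    """Map each reported format to the list of media IDs frozen for it."""
    groups = defaultdict(list)
    for line in lines:
        match = FOREIGN_FORMAT.search(line)
        if match:
            groups[match.group("fmt")].append(match.group("media_id"))
    return dict(groups)

log_lines = [
    "FREEZING media id 000438, it contains MTF1-format data and cannot be used for backups",
    "FREEZING media id 000414, it contains tar-format data and cannot be used for backups",
    "FREEZING media id 000199, it contains ANSI-format data and cannot be used for backups",
]
formats = group_by_format(log_lines)
print(formats)
```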
 

 

4.  Media was intentionally frozen

 

Media can be frozen manually with the bpmedia command for a variety of administrative reasons. If frozen media are encountered and there is no record of a specific job freezing the media, the media may have been frozen manually.
 

 

5.  Media is physically write protected

 

If the media has a write protect switch that is set for write protection, this will prevent any writing to the media and NetBackup will freeze the volume.
 

 

 

Unfreezing frozen media:

To unfreeze frozen media, use the bpmedia command with the following syntax:

For UNIX 
/usr/openv/netbackup/bin/admincmd/bpmedia -unfreeze -m <mediaID> -h <name of media server that froze media>

For Windows
<Install_path>\VERITAS\Netbackup\bin\admincmd\bpmedia -unfreeze -m <mediaID> -h <name of media server that froze media>

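When several media need to be unfrozen, the command line can be assembled from the media ID and media server name. The following sketch only builds the argument list and does not run anything; it uses the UNIX path and the -unfreeze, -m and -h options shown above. The function name is hypothetical, and the sample media ID and server are taken from the bpmedialist example later in this article.

```python
# Default UNIX location of bpmedia, as given in this article.
BPMEDIA_UNIX = "/usr/openv/netbackup/bin/admincmd/bpmedia"

def unfreeze_command(media_id, media_server, bpmedia=BPMEDIA_UNIX):
    """Build the bpmedia argument list for one media ID (does not execute it)."""
    if not media_id or not media_server:
        raise ValueError("both a media ID and a media server name are required")
    return [bpmedia, "-unfreeze", "-m", media_id, "-h", media_server]

cmd = unfreeze_command("div008", "denton")
print(" ".join(cmd))
```

On Windows, the full path to bpmedia under the installation directory can be passed as the bpmedia argument instead.
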
If it is not known which media server froze the media, run the bpmedialist command and note the "Server Host" value listed in the output:

For UNIX 
/usr/openv/netbackup/bin/admincmd/bpmedialist -m <mediaID>

For Windows
<Install_path>\VERITAS\Netbackup\bin\admincmd\bpmedialist -m <mediaID>
 

 

The sample output below shows bpmedialist run for the frozen media div008. The Server Host line shows that the media server "denton" froze this media.
 

 

C:\Program Files\VERITAS\NetBackup\bin\admincmd>bpmedialist -m div008

Server Host = denton

ID     rl  images   allocated        last updated      density  kbytes restores
          vimages   expiration       last read         <------- STATUS ------->
--------------------------------------------------------------------------------
DIV008   1      1   04/22/2005 10:12  04/22/2005 10:12   hcart          35     5
               1   05/06/2005 10:12  04/22/2005 10:25   FROZEN
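
The server name and frozen status can also be pulled out of this output with a short script. This is a minimal sketch that assumes the output layout matches the sample above; the function name is hypothetical, and real output may differ between NetBackup versions.

```python
import re

def parse_bpmedialist(output):
    """Extract the Server Host value and whether FROZEN appears in the status."""
    host = re.search(r"Server Host\s*[=:]\s*(\S+)", output)
    return {
        "server_host": host.group(1) if host else None,
        "frozen": re.search(r"\bFROZEN\b", output) is not None,
    }

# Sample text copied from the bpmedialist output shown above.
sample_output = """Server Host = denton

DIV008   1      1   04/22/2005 10:12  04/22/2005 10:12   hcart          35     5
               1   05/06/2005 10:12  04/22/2005 10:25   FROZEN
"""
info = parse_bpmedialist(sample_output)
print(info)
```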
 
