IBM System p
IBM中国技术支持中心
2009-4-10 © 2007 IBM Corporation
IBM System p
目录
– HACMP概念回顾
–常见HA架构
–日常管理
– HACMP 新功能介绍
– Q&A
2 2009-4-10 © 2003 IBM Corporation
IBM System p
Although Hardware is Now Very Reliable, Hardware Failures
Account for a Small Minority of System Outages
Several studies place the proportion between 20% and 45%
Human error, software error and planned maintenance cause the
majority of service outages
3 2009-4-10 © 2003 IBM Corporation
IBM System p
HACMP—(High Availability Cluster Multi Processing)
–为什么需要高可用性?
–什么是HACMP?
High Availability:
•系统可用性或运行时间最大化
•系统宕机时间最小化
multi-processing:
•一个cluster里的各个节点上可以运行多个应用
•共享数据或并发访问数据.
– HACMP的目的
•消除单点故障(SPOF),实现高可用
– High Availability is fault resilient
not fault tolerant
4 2009-4-10 © 2003 IBM Corporation
IBM System p
高可用& 容错
Standalone High Availability Clusters Fault puters
Solutions
Journaled File System Redundant Servers Lock Step CPUs
Dynamic CPU Deallocation works Hardened Operating System
Service Processor work Adapters Hot Swap Everything
Redundant Power Heartbeat Monitoring Continuous Restart
Availability benefits
Redundant Cooling Failure Detection
ECC Memory Failure Diagnosis
Hot Swap Adapters Automated Fallover
Dynamic Kernel Automated Reintegration
Depends,
Downtime Couple of days In theory, none
but typically 3 mins
Good as your last full
Data Availability Last transaction No loss of Data
backup
Relative Cost 1 2-3 10+
5 2009-4-10 © 2003 IBM Corporation
IBM System p
Software Layers on a HACMP node
Application
– Uses the services made highly available by
HACMP
HACMP
– Makes services highly available for applications
– Co-ordinates resource availability through the
cluster
RSCT
– Provides munication between nodes
– Co-ordination of subsystems
AIX
– Operating system services
LVM
– Logical storage management
TCP/IP
HACMP 5.x 高可用方案培训 来自淘豆网m.daumloan.com转载请标明出处.