Monitoring and Managing Microsoft Exchange Server 2003 (HP Technologies)

 < Day Day Up > 


The MOF is one of the three frameworks that form the Microsoft Enterprise Services frameworks. Each framework provides detailed information on the people, processes, and technologies required for success in the different phases of the IT life cycle. The three Enterprise Services frameworks are as follows:

Microsoft is well established as a software giant. However, a framework is not software. The three frameworks include a variety of assessment tools, best practices, case studies, courseware, deployment guides, operations guides, planning tools, solution kits, support tools, training roadmaps, and white papers.

MOF includes a set of best practices, principles, and models that promote mainframe-quality reliability, availability, supportability, and manageability for environments built on Microsoft products and technology.

Microsoft designed MOF to help IT departments design IT services to meet business goals and priorities while reducing downtime, risks, and the total cost of ownership for production systems.

There are some obvious benefits for IT organizations, but there are some equally important benefits to Microsoft. Microsoft wants to compete more effectively at the enterprise level for mission-critical production systems. To accomplish this, the company needs to negate the perception that Microsoft platforms do not have the reliability necessary for mission-critical services. During the past few years, Microsoft has focused on improving software quality and has added cluster support and Microsoft Windows Datacenter to improve availability. Hardware improvement during the same period has produced Intel-based servers with RAID controllers; storage area networks; redundant power supplies, fans, and controllers; hot swappable fans, disk drives, power supplies; and other features to improve reliability and availability. However, as shown in Figures 2.1 and 2.2, a major cause of downtime is poor operational procedures. Microsoft cannot improve its reliability image without addressing the people and processes used to manage Microsoft environments.

2.2.1 Microsoft Operations Framework design considerations

The development of MOF was an important investment for Microsoft, therefore the company incorporated several design goals to ensure its success, including the following:

MOF combines the ideas in ITIL with specific guidelines for using Microsoft technologies. MOF also extends ITIL to support distributed IT environments and current industry trends, such as application hosting and web-based systems. MOF is composed of three models: the process mode, the team model, and the risk model. These models provide guidance about people, processes, and risk management for IT service management. Each model focuses on the technologies and best practices for achieving high availability, reliability, supportability, and manageability for the Microsoft environment and provides guidance on interoperability with non-Microsoft environments.

2.2.2 Microsoft Operations Framework Process Model

Because IT operations include so many processes, procedures, and communications-all occurring simultaneously for a large collection of systems, applications, and platforms-it is impossible to create a model that captures all of the intricacies. Instead of trying to create an exact model, MOF simplifies this complexity into a framework that is more easily understood and more easily applied. The MOF Process Model is a functional model of the processes that operations teams perform to manage and maintain IT services. As such, it provides a simplified, generalized way to think about complex IT environments. The Process Model has some concepts that are keys to understanding the model. The keys are as follows:

IT infrastructures are not static; they are constantly changing. One of the primary responsibilities of the operations team is to manage these changes in a way that ensures the continued availability of critical services. A common and effective way to deal with change is to group related changes together into a series of releases, which allows for planning and managing of each group of related changes as a unit. The MOF Process Model recognizes that applications or services follow a life cycle of distinct, integrated phases. Examples of these phases include the following:

These phases, also known as quadrants, form an iterative life cycle that can be applied to any release, and they describe the processes or activities that make up each part of that life cycle. There are also four reviews described by the model:

Microsoft based the Process Model on the best practices documented in the ITIL, with the addition of some Microsoft-specific content. Most of the Microsoft-specific content is in the operating quadrant of the Process Model. Because ITIL is platform independent, it does not cover these items. Where applicable, MOF also references specific Microsoft products and features that either automate or improve the delivery of the service management functions.

Figure 2.4 illustrates the MOF process model, showing the relationship between the life cycle phases and the reviews associated with each phase.

Figure 2.4: Microsoft Operations Framework Process Model

Changing quadrant

The changing quadrant follows a Release Approved Review. This is the final review before a proposed change is released into the production environment. It reviews the readiness of the release itself, the readiness of the staff, and the potential impact of the release on other systems. If the release passes this review, then the following service management functions perform the release:

The change, configuration, and release management functions work closely with each other to ensure that the shared configuration management database is always accurate and up to date. When the release is complete, the Release Readiness Review evaluates the effectiveness of the service management functions.

Operating quadrant

Once you have completed the deployment, the service management functions in the operating quadrant are responsible for effectively and efficiently performing the daily operational tasks.

Periodically, you should perform an Operations Review, which is an inwardly focused review of the operations group's ability to maintain the service.

Supporting quadrant

No system is perfect, and problems will occur after a service is put into daily operations. The objective of the functions in the supporting quadrant is to resolve incidents, problems, and inquiries in a timely manner.

Periodically, you should perform an SLA Review to evaluate the support staff 's ability to meet the requirements defined in the SLAs. The SLA Review often results in changes to the support staff procedures. It also often influences changes to other operational processes, tools, and procedures.

Optimizing quadrant

The service management functions in the optimizing quadrant focus on future needs rather than the day-to-day management of the current environment.

The functions in the optimizing quadrant often identify changes that the IT department should implement to improve delivery of the IT services. The Release Approved Review is the final review for these proposed changes.

2.2.3 Microsoft Operations Framework Team Model

Microsoft created the Team Model on the basis of ITIL's best practice for organizational structure and process ownership, augmented by best practices used by organizations with successful IT operations. By examining the practices of these successful IT organizations, Microsoft found that these organizations shared many common attributes that were the keys to their success. These attributes drive the team and help define the Team Model.

Building successful teams requires shared principles that set guidelines for how the team functions and create a sense of common values. The primary principles and guidelines for the Team Model are:

Microsoft incorporated these shared attributes and principles into the Team Model to provide examples for how other IT operations teams can improve their own operations and service management practices. The Team Model describes the following:

The role clusters of the Team Model define six general categories of activities and processes. The role clusters are groups of activities that share common goals. They do not imply any kind of organizational chart and they are not job descriptions. They also do not imply a specific number of people to perform these roles. The number of people will vary for each organization. A small organization may choose to have a single person perform several of the roles. Larger organizations may need a team of people-or possibly a virtual team-to perform a role.

Figure 2.5 shows the MOF Team Model with these six role clusters. The Team Model shows communication at the center. Clear, effective, accurate, and timely communication is important for all roles.

Figure 2.5: Microsoft Operations Framework Team Model

The tasks and activities needed to keep production systems operational are complex. Performing those activities and processes requires organization and coordination, but the complexity of the work makes this hard to accomplish. The Team Model helps simplify the complexity and provides guidance on team roles and ways to effectively organize the team.

Release cluster

The activities in the release role cluster are responsible for identifying and tracking resources, documenting processes, and maintaining the history of all IT environmental changes. This includes activities such as change management, release engineering, configuration control, asset management, software distribution, software licensing, and quality assurance. To meet this responsibility, the people performing these activities typically use a corporate knowledge base to track changes and lessons learned and a configuration management database to track inventory and changes to the environment.

Infrastructure cluster

The activities in the infrastructure role cluster are responsible for defining the physical environment standards, managing assets, maintaining the IT infrastructure, and overseeing the evolution of the architecture. This includes activities such as capacity management, IT cost management, enterprise architecture, infrastructure engineering, resource planning, and long-range planning.

Support cluster

The support role cluster is responsible for supporting internal and external customers. This includes activities such as product support, production support, problem management, service desk (or help desk), and service level management.

Operations cluster

The activities in the operations role cluster are responsible for reliably performing daily, routine operational tasks. This includes activities such as availability management, archiving and storage management, database operations, file and print server management, messaging operations, system monitoring, and network administration.

Partner cluster

Providing IT services requires cooperation with many groups outside the IT organization and outside the enterprise. The partner role cluster manages these partnerships in mutually beneficial and cost-effective ways. The role cluster also includes the external partners who provide critical services, including environmental support groups, hardware suppliers, managed services groups, maintenance vendors, software suppliers, and trading partners.

Security cluster

The security role cluster is responsible for ensuring data confidentiality, data integrity, and data availability. This includes activities such as audit administration, compliance administration, contingency planning, intellectual property protection, intrusion detection, network security, system security, and virus protection.

2.2.4 Microsoft Operations Framework Risk Model

Even with the best processes and the best IT operations staff, you will still encounter unexpected problems. Many IT operations teams are unprepared for the unexpected problems. They perform their daily tasks with the naÔve assumption that everything will work as planned. These IT groups are usually easy to identify by the fear, panic, and finger pointing that is present when the unexpected disrupts their daily routine.

Successful IT operations teams plan for an uncertain future. They view the unexpected as a normal part of operations and proactively work to identify and control the risk. They view the risk management process as a continuous, visible, and important process. They have metrics for measuring their ability to evaluate risks and take actions that address the causes or problems, rather than just the symptoms. Microsoft based the MOF Risk Model on guiding principles that are common to these successful IT operations teams.

The risk model applies a structured, repeatable, five-step process to the daily problems that IT operations face. Figure 2.6 shows the five steps of the risk management process. Each risk goes through the complete process at least once and often cycles through several times. Because each risk goes through the process on its own schedule, it is common for multiple risks to be in each step simultaneously. The five steps in the process are as follows:

Figure 2.6: Microsoft Operations Framework risk model

  1. Identify the risk. The purpose of this step is to determine the source of the risk (technology, people, process, or external), the mode of failure (performance, cost, agility, or security), the conditions that cause the failure (e.g., server's sole power supply fails), the operational consequences of the failure (i.e., what impact will the failure have on the operations team), and the business consequences (i.e., how will the business as a whole be hurt).

  2. Analyze the risk. This step determines the risk's probability and the impact to the business (on a scale from 1 to 10) if failure occurs. The risk exposure is calculated by multiplying the probability by the impact. You can use the exposure value to prioritize risks.

  3. Plan. The purpose of this step is to define mitigations to reduce the probability and/or impact, identify trigger conditions that indicate the failure is imminent but has not yet occurred, and define contingencies to execute if you detect the trigger condition.

  4. Track. Continually gather information about how elements of the risk are changing over time.

  5. Control. Continually manage the risks. Execute the contingency plan if you detect a trigger condition. Retire the risk if you no longer need it. If risk factors change (e.g., impact or probability), restart the cycle at Step #2 to reevaluate the risk.

The risk process includes the following lists of risks:


 < Day Day Up > 

Категории