• Increase font size
  • Default font size
  • Decrease font size
Installation & Configuration


[this section is DRAFT only and based on the our articles published there, but we are trying to update all of them to take into account SharePoint 2010 changes]



Overview

Planning and installing SharePoint Farm across enterprise network is not a trivial task. SharePoint is rarely installed in an isolated environment, and usually interferes with the organization strategy and existing infrastructure. Many factors may affect farm design, performance, scalability and redundancy - from hardware devices in organization network, to network topology. As a result, leveraging and finding compromises among those factors helps to build consistent, reliable and flexible environment.
There are several whitepapers on the Microsoft TechNet portal http://technet.microsoft.com/en-us/library/cc262733.aspx describing requirements for SharePoint Farm, but most of them are either written without taking into account infrastructure scope or filled with irrelevant information that navigate the reader away from the problem scope.

In this section we describe guidelines and instructions of how to plan and configure SharePoint farm across organizations. Current practices are based on the different of SharePoint projects and can be used as step-by-step instructions helping to configure proper baseline for SharePoint farms

Table of Contents

  1. Infrastructure Planning
  2. Farm Planning
  3. Installation
  4. Configuration
  5. Logical Architecture Planning
  6. Backup and Recovery Strategy
  7. Development Environment
  8. Virtual Environment
  9. Performance

 

Introduction

Organizations adopting SharePoint face a variety of tasks - from planning, strategy, infrastructure architecture design, UI Design, migration, and to development. All these tasks imply flexible infrastructural baseline before actual work starts. However, in reality we face the outdated environment and misconfigured farms that are not ready to implement new requirements. In such cases, baseline architecture becomes foundation stone of all SharePoint projects.

Why would we care about infrastructure and not about something else, for example development? Fixing infrastructure errors is very expensive task and leads into significant changes across SharePoint farm. For example, Index Role assigned to the wrong server and incorrectly configured Search can lead to performance and redundancy issues that might require up to 3 days fix. Development errors are not so expensive and can be fixed relatively quickly, but sometime such errors, eventually become infrastructure errors that lead to changes in infrastructure design.

This article provides "best practices" of how to build and configure SharePoint farm across organizations, and some recommendations to improve existing environments. These practices were extracted from number of successful and failed projects, and can be used as guidelines to configure proper baseline for SharePoint farm. You will find the configuration recommendations regarding different SharePoint areas. All information is represented in the set of recommendations about different actions you need to undertake or pay additional attention when you install and configure your SharePoint environment. We tried to structure all section to follow the natural flow of SharePoint installation from the scratch – from pre-installing analysis requirements to post deployment actions.

We plan several whitepapers in our “Best Practices” series, and we are interested which topics you would like to see in our next SharePoint publications. Please send us your comments and suggestions via this form



Part 2 - Farm Planning

PDF

Become familiar with SharePoint farm communications

Before discussing servers’ redundancy and farm topologies let us review farm servers and how they communicate with each other. The following picture from “Planning an Extranet Environment for Office SharePoint Server” TechNet article illustrates the communication channels within a server farm and which servers handle client's request.

sharepointcommunication


When a user issues a query, the query is sent to a Web server. The Web server communicates with the query server to build a list of results, and then communicates with the computer running Microsoft SQL Server to extend the list of results with summarization text, URLs, and security trimming. In parallel, the Web Server gets page data from SQL Server and renders them on fly. This diagram will help in understanding which roles to use on farm servers.

 

Identify Farm Size

When you start new SharePoint project it's very important to understand the size of a farm you are planning to end up, because it impacts on:

  • SharePoint farm topology
    There are three farms topologies to choose between - Small, Medium and Large. Your farm topology choices are usually dictated by the number of users and content your plan to use inside SharePoint.

There are several recommendations to following the next values for the farm size (I would delete them on 5, actually)

• Small farm: Typically < 50,000 users
• Medium farm: Typically < 100,000 users
• Large farm: Typically < 500,000 users

The number of users varies depending on usage profiles, type of data being saved, and the type of hardware and network the system is deployed on.

  • SQL Server configuration
    Determine your SQL Server deployment size by using the following table:
    1. If your deployment parameters are generally < than the listed values, your deployment can be considered small - 4 GB is the minimum required memory (SQL 2005)
    2. If your deployment parameters are approximately = to the listed values, your deployment can be considered medium - 8 GB is recommended for medium size deployments
    3. If your deployment parameters are generally > than the upper limits of most of the listed values, your deployment can be considered large - 16 GB and greater above is recommended for large deployments

Metric

Value

Content database size

100 GB

Number of content databases

20

Number of concurrent requests to SQL Server 2005

200

Users

1000

Number of items in regularly accessed list

2000

Number of columns in regularly accessed list

20

  • Capacity planning
    The general rule is to plan for 1 RPS (requests per second) per 1000 users (on the minimum recommended hardware)
    • Small Farm (single WFE) can serve about 100 RPS
    • Medium Farm (two WFE) can serve about 200 RPS

 

Plan a baseline topology

Analyse the existing infrastructure and plan a SharePoint topology for redundancy.

(“The term redundancy is often misinterpreted to be synonymous with availability. While these concepts are related, they are not the same. Redundancy refers to the use of multiple servers in a load-balanced environment for any of several purposes, such as to improve farm performance, to scale out to accommodate additional users, and to improve availability” http://technet.microsoft.com/en-us/library/cc288334.aspx)

There are several different topologies - from three to six servers in farm, which can be used as a baseline. Which one to choose depends on the level of redundancy and available hardware. Not all clients can afford topology with six or ten servers in farm due to budget limitation or data centre capabilities. Finding the compromise between numbers of servers, type of hosting and servers’ roles become critical task, because this choice will affect performance and extensibility of the SharePoint farm for several years ahead.

Three servers farm
The minimum availability for the farm with few servers can be achieved with "3-servers farm" topology. In the current topology Web and Application Servers locate together on the one box and the database is on another box. The remaining, third, server gives a choice of which server role make redundant – Web server role or the database server role.

The farm with the two Web Servers provides redundancy of the Web and Application roles, improving the overall performance. A drawback of this design is that your data is not redundant (left farm).
In other case, farm with two Database Servers (cluster) provides data redundancy, increasing availability of critical data, but users might suffer from temporary access loosing (right farm).

threeserverfarm

The "3-servers farm" is one of the most questionable farms in terms of redundancy and performance. This limitation in the number of servers cannot provide redundancy of Query Server and high performance at the same time.

Redundancy can be achieved with Query Roles on both Web App servers. In this case, Database Server is the only place for Index Role, but this will hinder the overall performance. The Index Role is very CPU and HDD consuming role and that is why database servers are not very optimal place for this role. Alternative solution is to assign Index Role to the Web Server with the Query Role, but this will not work effectively, because in this case, index will not be propagated to another Query Server in farm.

If performance is one of the priorities then consider using Query Server and Index Server Roles on different Web Application Servers. This is flexible design in terms of extensibility, because with the new servers in farm changing roles of Index and Query servers is not required.

Interestingly, a TechNet article (http://is.gd/8QbS, page 26) explains that a Query Server cannot be used with Web Applications server for 3-servers farm. The reality is that, Web App and Query Role together are super common, more common than not (one of the reasons is that Query Server doesn’t use Network-Load Balancer - it uses its own algorithm). What they actually mean in the TechNet article is that having the Index Role on database server is not at all a recommended solution.

Four servers farm
Additional, forth server will add redundancy either for Data Server or for Web Server. However, it does not help much with performance. Current topology suffers from the same "3-servers farm" drawbacks – no place for Index Server with Query Role redundancy.

Five+ servers farm

fiveserversfarmThe most common and highly available server farm topology is "5+ servers farm", the farm with middle tier server.

This middle tier server solves all issues of three and four servers topology by providing the dedicated tier for Index and Application roles. Additional servers in farm will extend middle tier, by assigning new roles to those servers - Excel Calculation Services Role, and Microsoft Office Project Server 2007 Role.

 

The following table summarize farm topology:

 

Farm Servers

Performance

Redundancy

3 – 4

Index on WFE with Query on another box

App Roles on WFE

Index on Database, with Query on WFE

App Roles on WFE

5

App Roles on Middle Tier

Dedicated Index Server on Middle Tier

App Roles on WFE

Dedicated Index Server on Middle Tier

6

Dedicated Web Server for Crawling, outside NLB

Dedicated Index Server on Middle Tier

App Roles on Middle Tier in NLB

Dedicated Index Server on Middle Tier

To optimize the overall performance of five and more servers SharePoint Farm, configure a dedicated Web Server for crawling content, especially when crawling a server farm that contains more than 500 gigabytes (GB) of content or crawling content over the WAN. To ensure that user requests are not affected by content crawling, remove the dedicated Web server from the network load balancing rotation. This is especially important in global environments in which the off-peak hours of a regional farm—when crawl jobs are likely to be schedule —coincide with the peak hours of the central farm.

 

Plan extranet topology

Choose the topology based on requirements for external users. This topology will provide a basis of network extensibility for applications servers and communications between them.
The simplest topology is “Edge firewall topology”, which is represented by following diagram, from TechNet article.

edgetopology

This topology applicable for the small farms, when there is no need to separate internal services from corporate network and secure communications between server farms. All remote users are separated from farm by ISA server which plays a role of remote proxy.

For the big farms, when security of communications is a priority, the recommended topology is “Back-to-back perimeter topology”. This is very flexible and adaptable topology for network changes

perimetertopology


The main advantage of this topology is that it isolates the server farm in a separate perimeter network. Layers logically separate all servers and communications are under control – any security damages affect only specific layer, not the entire farm. External user access is isolated to the perimeter network and users can be isolated in different AD for remote and corporate access.

There are some other extranet topology variations, but mostly all of them are based on “Back-to-back perimeter topology” with some modification.
Detailed information about farm topologies can be found in the following documents:

  1. Best practices for My Sites http://technet.microsoft.com/en-us/library/cc262706.aspx
  2. Best practices for team collaboration sites http://technet.microsoft.com/en-us/library/cc850694.aspx
  3. Planning an Extranet Environment for Office SharePoint Server http://technet.microsoft.com/en-us/library/cc262400.aspx

 

Staff needs

One commonly overlooked component of a successful implementation is staffing. Architects and administrators usually do a good job creating a bill of goods for hardware and software, but they often forget to secure funding for personnel to adequately develop and maintain a new system. SharePoint Server 2007 can consume a large amount of human resources if used to its full potential. Understand what types of dedicated personnel are required in the beginning, and start getting stakeholders' support immediately.
(Note: F = One full time person and P = One part time person.)

Staff position

Small farm

Medium farm

Large farm

Multiple farms

System administrator

F

F

F F

F F

Search administrator

P

F

F P

F F F

Site designer

P

F

F F

F F F

Software developer

N/A

P F

F F

F F F

Software tester

N/A

P

F

F F

SQL DBA

P

F

F

F F

 



Digg! Technorati! Diigo! Reddit! Del.icio.us! Google! Live! Facebook! StumbleUpon! Yahoo! Joomla Portal
 

Part 1 - Infrastructure Planning

PDF

Plan your farm and network communications before starting actual installation. The first thing to start is designing SharePoint architecture across corporate network. This includes understanding network structure, examining network devices and choosing the right SharePoint topology to fit the existing infrastructure and new requirements.

Understand your environment

Start from description of the existent network design, location of all applications and system servers. Microsoft Visio 2007 and "Network Diagram" template is a good instrument for this task.
Record the location and information of corporate system servers, like Domain Controllers, File Servers, Mail Servers, Application and others. Do not forget about network services - firewalls, proxies, etc. For example, locations of ISA Servers across corporate network - IP address, list of open ports and the administrative user.
The best way to maintain "Network Diagram" document is to update the single diagram that covers topology of all domains and how they are connected.
The following diagram demonstrates the sample of server’s location across organization.

infrastructurediagram

This diagram will give a holistic view of the existing topology and ensure quick access to information across different domains.

 

Examine network devices

Gaining a detailed understanding of the network infrastructure and implementing thorough documentation are necessary components in a solid system design. It is important to know all connection points, traversal paths, and personnel responsible during your implementation. Several network infrastructure components are used in medium and enterprise-scale SharePoint Server 2007 implementations. Consult with the network administrators to understand any nuances, such as maintenance windows, that you need to be aware of. Here are some examples of network infrastructure components and considerations:

  • Switches Port speed, Duplex, virtual LANS, media type
  • Routers Restrictions, wide area network bandwidth and latency
  • Storage area networks Type, capacity, SAN HBA compatibility
  • Firewalls Restrictions, configuration control, screened subnets

Meticulous documentation of these components up front will save time and energy as you progress through your design.

Refer to the following links for the detailed information about WAN accelerators, NLB and other network devices across SharePoint farms:
1) http://technet.microsoft.com/en-us/library/cc263099.aspx
2) http://blogs.msdn.com/joelo/archive/2008/01/17/global-sharepoint-deployment-partner-solutions.aspx
3) http://blogs.msdn.com/joelo/archive/2007/01/05/nlb-network-load-balancing-and-sharepoint-troubleshooting-and-configuration-tips.aspx

 

Work closely with network admins

The IT administrator is the person who should participate in farm configuration from the very beginning. This person will be responsible for the configuration of all network servers and devices across corporate network.
Most of the SharePoint Farm topologies cross the bounds of domains and from the very beginning specific protocols and ports must be open to cross the bounds. The best way to maintain current situation is to have a separate document, shared with administrator, with the description of protocols and ports to open across network services.
Detailed information about system accounts and list of ports is available in the following articles:

 

Measure network latency

Network response time is one of the important factors that can affect SharePoint farm design. Ideally, you need to measure the latency between SharePoint servers and users in order to reorganize servers according the smallest response time.
Network latency is the key point to determine which of the proposed scenarios to implement in the current SharePoint deployment. (Latency is the time required for a packet to travel from one point on a network to another).
Use the Ping tool (ping.exe) to measure latency for:

  • users - from the client computer to the Web server on the server farm;
  • data centres that host servers of the same farm - from a Web server in the remote data centre to the database server in the primary data centre

Do not forget to divide the round-trip result by two, because all measures are one way only, not round-trip.
Compare results to the data below, and adopt environment to have latency lower those values.

 

Number of users

Concurrent users (10%)

Central Solution

Distributed solution

100-5,000

10-500

Bandwidth:   3+ Mbps (dual T1)

Latency: < 100 ms

Bandwidth:   1.5 Mbps (T1)

Latency:   <100 ms

10,000

1,000

Bandwidth:     3+ Mbps (dual T1)

Latency:   <250 ms

Bandwidth:   1.5 Mbps (T1)

Latency:   <500 ms

100,000

10,000

Bandwidth:   3+ Mbps (dual T1)

Latency:   < 250 ms

Bandwidth:   1.5 Mbps (T1)

Latency:   <500 ms

 

The critical bandwidth for any SharePoint farms is 1.5 Mbps (T1) with 500ms latency. Overstepping these values will increase the page-load times dramatically, in 4 times at least. Refer to the diagrams in the “Plan for bandwidth Requirements” document, for more details about the bandwidth and latency results under different conditions.

Available network bandwidth and latency influences geographic deployments significantly. Data transfers across WAN links that span multiple cities, states, provinces, countries, or continents requires really fast lines to provide adequate response time, so design such topologies thoroughly.
More details for bandwidth requirements available in the following article http://technet.microsoft.com/en-us/library/cc262952.aspx



Digg! Technorati! Diigo! Reddit! Del.icio.us! Google! Live! Facebook! StumbleUpon! Yahoo! Joomla Portal
 



Buy me a coffee and donuts


Disclaimer

Views and suggestions expressed on this site do not necessarily reflect the Microsoft position and we don't hold liablily for tips and recommendations. You should be aware that some information represented on site might be officially unsupported, could potentially damage your SharePoint installation, and could lead to a lack of support from Microsoft.