Web Server Software Architectures

Daniel A. Menascé • George Mason University • menasce[at]cs.gmu.edu

Web site scalability depends on several
things — workload characteristics [1],
security mechanisms [2], Web cluster
architectures [3] — as I’ve discussed in previous
issues. Another important item that can affect a
site’s performance and scalability is the Web server
software architecture.
In this column, I provide a classification of Web
server architectures, offer a quantitative analysis
of some possible software architectural options,
and discuss the importance of software contention
on overall response time.
Software Architecture Schemes
A Web server’s software architecture can affect
performance significantly. Two dimensions generally
characterize the architecture: the processing
model and pool-size behavior. The processing
model describes the type of process or threading
model used to support a Web server operation;
pool-size behavior specifies how the size of the
pool of processes or threads varies over time and
with workload intensity.
Processing Models
The main options for a processing model are
process-based, thread-based, or a hybrid of the
two. Software architectures on process-based
servers consist of multiple single-threaded processes,
each of which handles one request at a time
(see Figure 1a). We can see implementations of this
model, also called prefork in Apache, in the Unix
version of Apache 1.3 and in Apache version
2.0’s prefork multiprocessing module (MPM)
(see also http://httpd.apache.org/docs-2.0/mod/
prefork.html). In a thread-based architecture, the
server consists of a single multithreaded process;
each thread handles one request at a time (see Figure
1b). We can see an implementation of this
model in Windows NT’s version of Apache 1.3. The
hybrid model consists of multiple multithreaded
processes, with each thread of any process handling
one request at a time (see Figure 1c). The
Apache 2.0 Worker MPM implements an example
of this type of approach (see http://httpd.
apache.org/docs-2.0/mod/worker.html).
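
To make the three models concrete, here is a minimal Python sketch of the hybrid scheme, assuming a Unix system on which forked children inherit the listening socket (as Apache's prefork and worker MPMs do). The pool sizes and port are arbitrary illustration values; setting N_THREADS = 1 degenerates to the process-based model, and P_PROCESSES = 1 to the thread-based model.

# Minimal hybrid-model sketch (illustrative only): P_PROCESSES worker
# processes, each running N_THREADS threads; every thread serves one
# request at a time on a connection accepted from the shared listener.
import os
import socket
import threading

P_PROCESSES = 4   # number of worker processes (arbitrary example value)
N_THREADS = 8     # threads per worker process (arbitrary example value)

def handle(conn):
    # Serve exactly one (toy) request, then close the connection.
    with conn:
        conn.recv(4096)
        conn.sendall(b"HTTP/1.0 200 OK\r\nContent-Length: 2\r\n\r\nok")

def thread_loop(listener):
    # Each thread blocks in accept() and handles one request at a time.
    while True:
        conn, _addr = listener.accept()
        handle(conn)

def worker(listener):
    # One worker process: start N_THREADS threads sharing the listener.
    threads = [threading.Thread(target=thread_loop, args=(listener,))
               for _ in range(N_THREADS)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

if __name__ == "__main__":
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    listener.bind(("", 8080))
    listener.listen(128)
    # Fork the worker processes; up to P_PROCESSES * N_THREADS requests
    # can be in service simultaneously.
    for _ in range(P_PROCESSES):
        if os.fork() == 0:      # child process
            worker(listener)
            os._exit(0)
    for _ in range(P_PROCESSES):
        os.wait()               # parent waits for the children

With this structure, a fault in one worker process takes down only its own N_THREADS threads, which is the isolation argument developed next.
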
One advantage of a process-based architecture
is stability. The crash of any process generally does
not affect the others, so the Web server continues
to operate and serve other requests even when one
of its processes must be killed and restarted. The
architecture’s drawbacks relate to performance:
creating and killing processes overloads the Web
server, mainly because of address-space management
operations. Moreover, high-volume Web sites
require many processes, which leads to non-negligible
memory requirements and increased context-switching
overhead (see http://httpd.apache.org/
docs/misc/perf-tuning.html#preforking).
A thread-based architecture is not as stable as
a process-based one. A single malfunctioning
thread can bring the entire Web server down
because all threads share the same address space.
However, this type of server architecture’s memory
requirements are much smaller than a process-based
one’s. Spawning threads of the same process
is much more efficient than forking processes
because the new threads don’t need additional
address space. One additional advantage is that the
various threads can easily share data structures
such as caches.
The hybrid architecture combines the advantages
of both methods and reduces their disadvantages.
For example, suppose that a Web server
has p processes with n threads each.
So, up to p × n requests can execute
simultaneously. If a thread crashes, it
can bring down the process in which
it runs, but all other (p – 1) × n threads
continue to process their requests. Less
memory is required in this approach to
handle p × n concurrent requests than
to run the same number of requests in
the process-based architecture.
Pool-Size Behavior
The other dimension of Web-server
software architecture is the process or
thread’s pool-size behavior. Two
approaches are possible: static or
dynamic.
For static pools, the Web server creates
a fixed number of processes or
threads at startup time. Consider a
process-based server and assume that
p processes are created when the server
is started. If a request arrives and
finds p requests being processed, it
waits in a queue until one of the
requests completes execution (see Figure
2’s queue). As the arrival rate of
requests to a Web server increases, the
queue for processes or threads grows,
which lengthens a request’s
response time. If server load is low,
most of the processes or threads will
be idle.
For dynamic pools, the number of
processes or threads varies with load.
As load increases, so does pool size,
letting more requests be processed
concurrently and reducing queuing. In
periods of low load, the number of
processes or threads is reduced to free up
memory.
Apache provides an example of a
dynamic pool-size implementation.
The Web server’s parent process initially
forks a number p of child
processes, established as a configuration
parameter. The child processes
handle requests directly; the parent
process only monitors the load to
decide if processes should be forked
or killed. Another configuration
parameter determines the minimum
number m of idle processes (those
processes that aren’t handling
requests). As load increases, the number
of idle processes could fall below
its minimum value m; if so, the parent
process creates more processes. If the
number of idle processes exceeds a
value set in the configuration file,
Apache’s parent process kills the
excess child processes.
You can also regulate pool size in
Apache’s configuration file by specifying
the maximum number of
requests a child can process before it
dies. Once a process has reached the
parameter-specified value, it exits. You
can also specify an infinite value for
the parameter (by making the parameter
value equal to zero). An infinite or
high value is appropriate for high-volume
Web sites subject to bursty traffic,
whereas a lower value is more
appropriate for low-volume Web sites.
One advantage of limiting a process’s
lifetime is that it reduces the amount
of lost and unusable memory that
accumulates via memory leaks.
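
A rough sketch of the parent process’s regulation loop follows. The directive names mentioned in the comments (StartServers, MinSpareServers, MaxSpareServers, MaxRequestsPerChild) are Apache prefork’s configuration parameters, but the loop itself is only an illustrative approximation of the behavior described above, not Apache’s actual algorithm.

# Illustrative dynamic pool-size regulation, in the spirit of Apache
# prefork's StartServers, MinSpareServers, MaxSpareServers, and
# MaxRequestsPerChild directives (names real, logic simplified).
START_SERVERS = 5               # children forked at startup
MIN_SPARE = 5                   # minimum number of idle children (m)
MAX_SPARE = 10                  # maximum number of idle children
MAX_REQUESTS_PER_CHILD = 10000  # 0 would mean "never retire a child"

def fork_child():
    # Placeholder for forking a real worker process.
    return {"busy": False, "served": 0}

def regulate(children):
    # One pass of the parent's maintenance loop.
    idle = [c for c in children if not c["busy"]]
    if len(idle) < MIN_SPARE:
        # Load went up: fork enough children to restore the idle minimum.
        children += [fork_child() for _ in range(MIN_SPARE - len(idle))]
    elif len(idle) > MAX_SPARE:
        # Load went down: kill the excess idle children to free memory.
        for c in idle[MAX_SPARE:]:
            children.remove(c)
    if MAX_REQUESTS_PER_CHILD:
        # Retire long-lived children to bound memory lost to leaks.
        children[:] = [c for c in children
                       if c["busy"] or c["served"] < MAX_REQUESTS_PER_CHILD]
    return children

pool = [fork_child() for _ in range(START_SERVERS)]
pool = regulate(pool)           # the parent would call this periodically
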
Performance Considerations
We can decompose a Web server
request’s total response time into the
following components [4]:
• Service time, which is the total time
a request spends at the physical-resource
level (such as CPU and
disk). This time does not include
any time spent waiting to use any
of the Web server’s physical
resources.
• Physical contention, which is the
total time a request spends waiting
to use any of the Web server’s
physical resources.
• Software contention, which is the
total time a request spends waiting
in a queue for a process or
thread to become available to
handle the request.
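
In symbols (the notation is mine, not the column’s), a request’s total response time R therefore decomposes as

R = S + W_phys + W_soft,

where S is the service time at the physical resources, W_phys is the physical-contention time, and W_soft is the software-contention time.
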
Figure 1. Web server processing models: (a) process-based servers have multiple single-threaded processes, (b) thread-based servers have a single multithreaded process, and (c) hybrid servers use multiple multithreaded processes.

As Figure 2 illustrates, when a
request waits for a software resource
(such as a thread or process), it is not
using or waiting for any physical
resource. During processing, a request
typically alternates between periods in
which it is using a physical resource
and waiting to use one.
To explore some of the performance
characteristics of the architecture in
Figure 2, we use a Markov chain model
combined with a queuing
network (QN) model [5]. The QN model
computes the Web server’s throughput
X0(k), k = 1, …, p, given the incoming
requests’ service demands on physical
resources. In other words, the QN models
the Web server’s physical resources.
Next, we can use a Markov chain
model to represent the software architecture.
In this model, the state k of the
Markov chain represents the number
of requests in the system (either waiting
for a process or being handled by
one). Transitions from state k to state
(k + 1) occur at the rate at which new
requests arrive at the server. Transitions
from state k to state (k – 1) indicate
request completion. The completion
rate is given by X0(k) for states k
= 1, …, p and X0(p) for all states k > p.
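
A small numeric sketch of this birth-death chain follows. The arrival rate and the throughput function X0 used at the bottom are placeholder values chosen for illustration (the column’s actual X0(k) comes from its QN model), and the chain is truncated at a large population K so the probabilities can be normalized.

# Sketch: steady-state solution of the birth-death chain described above.
# State k : number of requests at the server (queued or in service).
# Birth   : arrival rate lam in every state.
# Death   : X0(k) for k <= p, X0(p) for k > p.
def avg_idle_processes(lam, p, X0, K=2000):
    # Unnormalized probabilities: pi[k] = pi[0] * prod_{i=1..k} lam / mu_i.
    pi = [1.0]
    for k in range(1, K + 1):
        mu = X0(min(k, p))            # completion rate in state k
        pi.append(pi[-1] * lam / mu)
    total = sum(pi)
    pi = [x / total for x in pi]      # normalize over the truncated chain
    # In state k, max(p - k, 0) of the p processes are idle.
    return sum(max(p - k, 0) * prob for k, prob in enumerate(pi))

# Placeholder throughput curve that saturates at 5 requests/sec, matching
# Figure 3's saturation point; the shape itself is invented for illustration.
X0 = lambda k: 5.0 * (1.0 - 0.7 ** k)
print(avg_idle_processes(4.5, 20, X0))  # average idle processes at 4.5 req/s
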
Figure 3 shows the average number
of idle processes (those not handling
requests) as a function of the average
arrival rate of requests for three different
values of the number of active
processes p. The figure indicates, as
expected, that for very light loads, the
number of idle processes is almost equal
to the total number of processes. As
load increases, the average number of
idle processes decreases slightly at
first and then precipitously as the
arrival rate approaches its maximum possible
value, which is equal to X0(p). This
maximum value, which is equal to five
requests per second in Figure 3, represents
the point at which the Web server
becomes unstable — that is, the queue
for processes grows without bounds.
Figure 3 also indicates how a Web
server with a dynamic pool size can
vary the size of its pool of processes as
load increases. Consider Table 1’s scenarios
and assume that we want to
have an average of 10 to 11 idle
processes to handle surges in requests
with a good response time. In scenario
1, when the arrival rate is 4.5
requests/sec, the average number of
idle processes is close to 10 when there
are 20 processes. If the arrival rate
increases to 4.8 requests/sec (scenario
2), the average number of idle processes
decreases to 5.3.

Figure 2. Software and physical contention. While a request is waiting to be processed, it doesn’t use the CPU or I/O or any other physical resources.

Figure 3. Average number of idle processes versus the average arrival rate of requests, for pools of 20, 30, and 40 processes. At low loads, almost all processes tend to be idle. At high loads, the number of idle processes falls to zero very quickly.

To keep the number of idle processes at around 10 to
11, we must increase the number of
processes to 30. Suppose now that the
average arrival rate increases to 4.9
requests/sec (scenario 4). The average
number of idle processes falls to 6.6.
We must then fork 10 additional
processes (scenario 5) to restore the
average number of idle processes to a
value close to 11.
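
In the same spirit, a dynamic controller could reuse the avg_idle_processes sketch above to pick the smallest pool size whose predicted idle count meets a target, re-evaluating whenever the measured arrival rate changes (again illustrative only):

# Pick the smallest candidate pool size predicted to keep the target
# number of idle processes (reuses avg_idle_processes and X0 from above).
def pick_pool_size(lam, X0, target_idle=10, candidates=range(10, 101, 10)):
    for p in candidates:
        if lam < X0(p) and avg_idle_processes(lam, p, X0) >= target_idle:
            return p
    return max(candidates)            # fall back to the largest candidate

# pick_pool_size(4.5, X0) versus pick_pool_size(4.9, X0) would then suggest
# when to fork additional processes as the arrival rate climbs.
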
Final Remarks
Software contention and architectures
can significantly affect Web server
performance. Poorly designed and
configured software architectures
might even generate high response
times while the physical resources display
low utilization.
Figure 3 and Table 1’s scenarios
indicate that computer systems, Web
servers included, could use analytic
performance models as a guide to
dynamically adjust configuration
parameters such as the number of
active processes [6]. This line of research
has potentially important practical
applications for complex, high-volume
Web and e-commerce sites for which
it is virtually impossible to have
humans adjust configuration parameters
to meet QoS goals.