02 - Fundamentals fo Queuing Theory

02 - Fundamentals fo Queuing Theory - OMT

Queues appear when a flow of vehicles, persons or objects wants to receive a service that implies a restriction. Typically, services have a limited capacity or are offered only with some frequency, which may lead to queues, delays and waits.

The subject that studies queues is referred to Queuing Theory (see also [[03. Variabili Aleatorie#Teoria delle code]], [[05. Teoria del Deflusso#Teoria delle code]]).

Most of the complexities of queuing theory come from the stochastic nature of the processes involved.

Components of a queuing system

Server: the restriction
Storage: where the costumer waits
Arrivals: the ones that feed the queue
Departures: the yield after service

Server

server

In a queuing system, the server represents the restriction of the system. The server is able to provide the service at a certain rate, $μ$ .

Server symbol - 02 - Fundamentals fo Queuing Theory - OMT 2024-10-26 12.17.04.excalidraw.png

Server disposition

Servers in parallel

When the servers are localised in parallel, they can work at the same time. These are differentiated based on how the queue works:

One problem that might arise with parallel service, is that the servers are not positioned in space in an efficient way. It may happen that the queue of one server blocks some other server. This could happen, for example, at a petrol station where two pumps are one after the other and there is not enough space for cars to overtake each other.

Decentralised system

Centralised queue

The centralised queue, if well managed, can maximize the capacity and efficiency of the system. In fact, in this case, no server will ever be idle, while there are other servers working.

In these kind of systems, it is important to implement adequate information system that assigns customers to servers at due time. This needs to ensure that there is no idle server at any time.

This can be challenging, especially when it takes a long time for the costumer to get to the server from the end of the queue (think of container ships waiting outside a port).

Servers in series

Tandem queues

In this kind of systems costumers need to undertake several steps to receive a complete service.

For example, in an airport, a passenger has first to check-in, thank go through security, and finally queue to board the plane. Seen as a whole, this is a tandem queue.

In this scenario, the departure from the previous server, is the arrival of the subsequent one.

A better explanation of this kind of systems is provided in the section [[#Tandem queueing system]]

Storage

Arrivals

Departures

Queuing disciplines

Queuing discipline is a non-physical component of a queuing system. It refers to the rules which determine in which order the costumers are served.

Sometimes, there can be priorities assigned to some kind of costumers, leading to a new type of discipline:

First Come First Serves - FCFS

This doesn't mean that the first costumer to enter will also be the first one to exit as, with [[#Servers in parallel]], some server may be faster than another allowing for overtakes in the system.

Last Come First Serves - LCFS

This happens when the [[#Storage]] is in a closed space and the entrance and exit are the same "door".

For example an airport shuttle bus will unload first the passengers that boarded the bus at last and viceversa.

Service in Random Order - SIRO

This system is mostly applied to industrial processes where the costumer are different items in a production line.

Round Robin - RR

Early Due Date First - EDDF

Shortest Service Time First - SSTF

Cumulative plot

cumulative plot

A cumulative plot - $(N, t)$ diagram - plots the cumulative number of customers $N$ , to cross a specific location as a function of time $t$ .

Cumulative plot generic - 02 - Fundamentals fo Queuing Theory - OMT 2024-10-26 15.09.12.excalidraw.png

The flow of vehicles or passengers

q

can be calculated starting from the cumulative plot as:

When a large number of customer is recorded, the jumps of the

N (t)

function become meaningfulness. So, the function is usually replaced by a continuous interpolation.

Input-output diagram

Notice that the horizontal distance between the 2 curves at a given number of cumulative costumers,

n_{1}

, is the time spent by that costumer in the system. This time includes:

Instead, the vertical distance between the 2 curves at a given instant

t_{1}

, is the customer accumulation in the system. This includes:

observation

The $D (t)$ curve can never go above the $A (t)$ curve since, in order to exit, a customer has to have enter first.

It the 2 curves match, it means that the system is empty of customers.

In the diagram above, other than the input-output diagram, the amount of customers in service and in queue at any given time is also shown:

A logistic curve

We will now consider an [[#Input-output diagram]] with a large number of customers, so that the curves can be seen as continuous functions.

In the input-output diagram, the Arrival curve

A (t)

usually takes the shape of a logistic curve.

The arrival rate is always relatively small at the two extremes while it picks in the middle (the arrival rate,

λ

, is given by the slope of the arrival curve). This is because, as humans, we tend to want the same service at about the same time. This pick rate is theoretically unbounded.

The departure curve instead, has a fixed maximum rate,

μ

, equal to the Capacity of the system. So, this slope is always bounded.

From a normal [[#Input-output diagram]] we can easily read the sum of the service time and delay, and the total accumulation (number of customer being served + customers queuing).

It is useful to be able to read directly the delay and the excess accumulation. This can be done through the [[#Virtual Arrivals Curve]]

3D representation of input-output diagram

If we want to visualize both vehicles trajectories and input-output diagram on the same chart, we can do so in a 3D diagram:

The projection of this diagram on the (

x, t

) plane gives the trajectories, while the projection onto the

(N, t)

plane gives the input-output diagram.

We can also look at the two projection one under the other to visualize everything on a single plane:

Virtual Arrivals Curve

virtual arrivals curve (v(t))

The Virtual Arrivals Curve is a curve obtained by moving the arrival curve ( $A (t)$ ) forward in time of an amount equal to the free flow travel time.

It can also be defined as the [[#Departures]] curve that would be measured in the absence of delay.

Virtual Arrivals curve - 02 - Fundamentals fo Queuing Theory - OMT 2024-10-26 17.14.37.excalidraw.png

Excess accumulation

The Excess accumulation can be obtained as the vertical distance in the

N - t

diagram between the [[#Virtual Arrivals Curve]] and the [[#Departures]] curve.

Average excess accumulation

average excess accumulation ($\overline{q}$)

We can calculate Average excess accumulation ( $\overset{―}{Q}$ ) in the period $t_{i}, t_{j}$ :
$\overset{―}{Q} = \frac{W_{1}}{t_{j} - t_{i}}$

The average excess accumulation $\overset{―}{Q}$ is always less or equal to the number of costumers that we see in the physical queue. Watch the example below to better understand this concept.

The diagram shows a stretch of road in 2 scenarios:

No queue: cars move freely at the same speed
- We count 2 veh in the area of interest (the one in the orange dashed box)
Queue: Some cars are queueing in the area of interest
- We count 5 cars

02 - Fundamentals fo Queuing Theory - OMT 2024-11-03 18.27.37.excalidraw.png

Notice that, of the 5 cars in the boxed area, even without queue, there would be 2. Meaning, the excess accumulation is NOT 5, but just 3.

Delay

Delay can be obtained as the horizontal distance in the

N - t

diagram between the [[#Virtual Arrivals Curve]] and the [[#Departures]] curve.

Average delay per costumer

\begin{align}
\text{Knowns: }&
\begin{cases}
V(t): \text{Measured when there are few people} \
\mu : \text{Max service flow (it's a property of the facility)}
\end{cases} \ \

\text{Unknowns: }&
\begin{cases}
\overline{w}\
\overline{Q}
\end{cases} \quad \Longrightarrow\text{We need the }D(t)
\end

\begin{cases}
D(t) < V(t) \quad \forall t \
\dot{D}(t) < \mu \quad \forall t
\end

\begin{align}
\lambda t^{} &= \mu (t^{} -R) \
\lambda t^{} &= \mu t^{} - \mu R \
(\lambda -\mu )t^{} &= -\mu R \
(\mu - \lambda )t^{} &= \mu R \
t^{*} &= \frac{\mu R}{\mu -\lambda }
\end

\overline{w} = \frac{W}{n'} = \frac{\frac{1}{2} \frac{\mu \lambda R^{2}}{\mu -\lambda }}{\lambda (R+G)} = \frac{1}{2} \frac{\mu R^{2}}{(\mu -\lambda)(R+G) }