FREE MOBILE CLOUD
COMPUTING CONCEPTS - TRAINING_MODULES_WITH_TONS_OF_VIDEOS
ip-video-phone-close-up
The basics, for those
of you, like me, that "hate" or, rather "enjoy" (sometimes!) all these new Cloud terms for the Web Cloud
Version 3.0......well, here's Johnny!
IP video is the transfer of video information in IP packet data format. Transmission
of IP video involves digitizing video, coding, addressing, transferring, receiving, decoding and converting (rendering) IP
video data into its original video form.
FREE MOBILE CLOUD COMPUTING CONCEPTS - TRAINING_MODULES_WITH_TONS_OF_VIDEOS
+++++++++
IP
video technology provides flexible, scalable, and cost-effective surveillance solutions suitable for a wide range of industries
and applications. With an IP-based video surveillance setup, users can monitor and record video remotely using an IP network
as the system's backbone.
IP video installations can be deployed in any environment, and offer many benefits previously unavailable with
analog CCTV systems.
Benefits of Network Video
IP-based video surveillance has improved
the effectiveness of video security by leaps and bounds over the analog CCTV equipment we've grown so accustomed to over the
years. Today's IP video surveillance solutions use an IP network, rather than complicated cabling setups, as the backbone
for delivering information. This allows for flexible, cost-effective installation, remote video monitoring, improved storage,
and a host of other benefits.
Remote
Video Monitoring
With
an IP-based surveillance system, users can view live network camera feeds in real-time from any computer with Internet access.
Network cameras can capture and transmit high-quality video images over any IP network or the Internet, where the footage
can be viewed remotely using a computer or, in some cases, cell phones and other handheld devices. Additionally, the recorded
surveillance footage can be stored at remote locations.
Scalability
Expanding
a network surveillance system is as simple as connecting additional IP cameras to the IP network. IP cameras can be placed
anywhere along the network, and there's no need for expensive and complicated cabling. Simply connect the camera like you
would any other network device.
Improved
Storage Capabilities
Since
network cameras capture digital video images, large amounts of footage can be stored on servers and network video recorders,
where archived video can be quickly accessed and searched. Compare this type of setup to analog systems where video was stored
on VHS cassette tapes, and it's easy to see the benefits of a digital surveillance system.
What is an IP Camera?
An IP camera captures and sends video
footage over an IP network, allowing users to view, record, store, and manage their video surveillance images either locally
or remotely over the network infrastructure. The camera can be placed wherever there's an IP network connection. It has its
own IP address and unlike a webcam, doesn't require a connection to a PC in order to operate.
Along with streaming video footage, network cameras can include
a number of additional functionalities, such as pan/tilt/zoom operation, motion detection, audio surveillance, integration
with alarms and other security systems, automated alerts, intelligent video analytics, and much more.
Many IP cameras can also send multiple streams of video,
using different compression technologies for live viewing and archiving.
IP cameras offer flexible installation, ease of use, higher-quality images, stability,
and scalability as new cameras can be added to the network at any time.
What is NVR software?
Video management software is a key component
of any video surveillance solution. It's the software that provides the tools for monitoring and analyzing surveillance footage,
as well as recording. While a standard web browser often allows for remote viewing, dedicated video management software is
required for viewing and managing multiple cameras at once.
The most basic IP video software provides live viewing, recording, and retrieving of video
footage. More advanced NVR software platforms offer simultaneous viewing of multiple cameras, and multiple recording modes
(including continuous, scheduled, and triggered recording).
Other features may include the ability to handle large image files with high frame rates,
fast search capabilities, pan/tilt/zoom control, audio support, and remote access via web browser as well as cell phones and
other handheld devices. Some software programs also support intelligent surveillance using sophisticated video analytics such
as facial recognition and advanced motion detection
This section provides
an overview of why video surveillance deployments are migrating from analog-based systems to IP-based systems. The time between
2007 and 2010 represents a market transition in the industry where sales of IP-based components began out-selling analog-based
systems.
While analog systems have a cost advantage in small deployments (sixteen cameras or less), when larger
number of cameras are deployed, IP-based systems may be more cost-effective initially and have a lower ongoing total cost
of ownership. IP-based video surveillance systems, especially the end-node (the IP camera), have several operational and technological
advantages. Why implement IP video surveillance over analog-based systems? The following subsections provide the answer.
Leveraging VoIP
Adoption
Many of the advantages
of implementing IP video surveillance are similar to those of VoIP adoption. The fundamental reason is the cost savings of
using the IP network for both voice and data. By adding the transport of video surveillance on the existing highly-available
IP network, the cost savings realized from eliminating the separate cable plant for voice extends as well to the elimination
of the separate cable plant for video.
Not only the wiring
for media transport can be eliminated, but also the cabling for electrical power.
As is the case with VoIP in
the enterprise space, where the IP phone uses PoE, so does many fixed installation IP cameras. While power to some camera
deployments continue to be a requirement (Pan-Tilt-Zoom housings, wireless cameras and cameras that require fibre connectivity
due to distance), PoE is a substantial cost savings.
IP video surveillance
cameras, once connected to the network, may be remotely configured and managed from a central command center. The installing
technician must have a laptop to focus the lens and adjust the viewpoint of the camera, but following this initial installation,
the camera configuration may be completed by a technician in a central, rather than local, facility.
Access Video Any
Time, Any Place
With IP-based systems,
video feeds are encoded into Motion JPEG or MPEG-4/H.264 formats and stored as a digital image on a computer disk array. This
provides the ability to access the video, by way of the networked digital video recorder, through the IP network at any time,
from any place.
These digital images do not degrade in quality from duplication like analog recordings on magnetic
tape. They can be replicated and posted on web servers, distributed to law enforcement as E-mail attachments, and sent to
news outlets. When analog-based systems were the norm, loss prevention/investigations staff may have to visit the location
of the incident to view the video or a tape or DVD would need to be shipped by overnight courier. These inefficiencies no
longer exist with IP-based systems and WAN connectivity to the physical location.
Intelligence at
the Camera
With IP cameras,
local processing of the video image may be done during capture and analysis like motion detection and tampering detection
logic may raise alerts by communicating with a central server. The alert may use a variety of IP protocols, SMTP (E-mail),
Syslog, File Transfer (FTP), or a TCP socket connection with a small keyword in the payload. The Cisco 4500 IP Cameras have
an additional DSP capabilities specifically designed to support real-time video analytics on the camera. |
This
option is to allow analytic vendors to develop firmware in the future to run on these resources.
Barriers to Success
While the advantages
of an IP-based system are considerable, there are some barriers to success. They mainly revolve around the human element—job
responsibilities, training, and education. Typically, the physical security manager and the network manager have no overlapping
job responsibilities and therefore have little need to interact with each other. The physical security manager has job responsibilities
targeted at loss prevention, employee and customer/visitor safety, security and crime prevention.
Because of this,
the physical security manager is more confident with a dedicated, reliable, physically separate cable plant.
Many installations
of physical security cameras and the accompanying components are solely or partially implemented by value added resellers
(VARs) who are specialists in their field, but not yet experts in IP networking. The VAR must become more fluent in internetworking
and the network manager must understand the requirements of the physical security processes and applications.
The key elements
of video surveillance is the three Rs: resolution,
retention, and reliability. For an IP video surveillance deployment to be a success on the IP network, the reliability element must have careful attention by the network manager for
the physical security manager to be successful.
Resolution, one
of the three Rs of video surveillance, directly
influences the amount of bandwidth consumed by the video surveillance traffic. Image quality (a function of the resolution)
and frame rate are functions of the amount of bandwidth required. As image quality and frame rate increase, so does bandwidth
requirements.
Analog Video Resolutions
Video surveillance
solutions use a set of standard resolutions. National Television System Committee (NTSC) and Phase Alternating Line (PAL)
are the two prevalent analog video standards. PAL is used mostly in Europe, China, and Australia and specifies 625 lines per-frame
with a 50-Hz refresh rate. NTSC is used mostly in the United States, Canada, and portions of South America and specifies 525
lines per-frame with a 59.94-Hz refresh rate.
These video standards
are displayed in interlaced mode, which means that only half of the lines are refreshed in each cycle. Therefore, the refresh
rate of PAL translates into 25 complete frames per second and NTSC translates into 30 (29.97) frames per second.
Table 4-1 Analog Video Resolutions (in
pixels)
Format
NTSC-Based
PAL-Based
QCIF
176 × 120
176 × 144
CIF
352 × 240
352 × 288
2CIF
704 x 240
704 x 288
4CIF
704 × 480
704 × 576
D1
720 × 480
720 × 576
Note that the linear
dimensions of 4CIF are twice as big as CIF. As a result, the screen area for 4CIF is four times that of CIF with higher bandwidth
and storage requirements. The 4CIF and D1 resolutions are almost identical and sometimes the terms are used interchangeably.
Note
IP camera vendors may use different video resolutions. The Cisco Video Surveillance Manager solution supports the format delivered
by the camera.
Digital Video
Resolutions
User expectations
for resolution of video surveillance feeds are increasing partially due to the introduction and adoption of high-definition
television (HDTV) for broadcast television. A 4CIF resolution, which is commonly deployed in video surveillance, is a 4/10th
megapixel resolution. The HDTV formats are megapixel or higher. Table 4-2
Digital
Video Surveillance Resolutions (in pixels)
While image quality
is influenced by the resolution configured on the camera, the quality of the lens, sharpness of focus, and lighting conditions
also come into play. For example, harshly lighted areas may not offer a well-defined image, even if the resolution is very
high. Bright areas may be washed out and shadows may offer little detail. Cameras that offer wide dynamic range processing,
an algorithm that samples the image several times with differing exposure settings and provides more detail to the very bright
and dark areas, can offer a more detailed image.
As a best practice,
do not assume the camera resolution is everything in regards to image quality. For a camera to operate in a day-night environment,
(the absence of light is zero lux), the night mode must be sensitive to the infrared spectrum. It is highly recommended to
conduct tests or pilot installations before buying large quantities of any model of camera.
Tip
Some cameras rated as megapixel cameras in Motion JPEG only offer 4CIF resolution when configured for MPEG-4.
Video Compression
CODECS
The Cisco Video
Surveillance Media Server supports IP endpoints that use Motion JPEG (MJPEG) or MPEG-4 codec technology. Both types of codecs
have advantages and disadvantages when implemented in a video surveillance system. A system administrator may choose to use
MJPEG on certain cameras and MPEG-4 or H.264 on others, depending on system goals and requirements.
A codec is a device or program that performs encoding and decoding on a digital
video stream. In IP networking, the term frame refers to a single unit of traffic across an Ethernet or other Layer-2 network.
In this guide, frame primarily refers to one image
within a video stream. A video frame can consist of multiple IP packets or Ethernet frames.
A video stream
is fundamentally a sequence of still images. In a video stream with fewer images per second, or a lower frame rate, motion
is normally perceived as choppy or broken. At higher frame rates up to 30 frames per second, the video motion appears smoother;
however, 15 frames per second video may be adequate for viewing and recording purposes.
Some of the most
common digital video formats include the following:
•Motion JPEG (MJPEG) is a format consisting of a sequence
of compressed Joint Photographic Experts Group (JPEG) images. These images only benefit from spatial compression within the
frame; there is no temporal compression leveraging change between frames. For this reason, the level of compression reached
cannot compare to codecs that use a predictive frame approach.
•MPEG-1 and MPEG-2 formats are Discrete Cosine Transform-based
with predictive frames and scalar quantization for additional compression. They are widely implemented, and MPEG-2 is still
in common use on DVD and in most digital video broadcasting systems. Both formats consume a higher level of bandwidth for
a comparable quality level than MPEG-4. These formats are not typically used in IP video surveillance camera deployments.
•MPEG-4 introduced object-based encoding, which handles
motion prediction by defining objects within the field of view. MPEG-4 offers an excellent quality level relative to network
bandwidth and storage requirements. MPEG-4 is commonly deployed in IP video surveillance but will be replaced by H.264 as
it becomes available. MPEG-4 may continue to be used for standard definition cameras.
•H.264 is a technically equivalent standard to MPEG-4
part 10, and is also referred to as Advanced Video Codec (AVC). This emerging new standard offers the potential for greater
compression and higher quality than existing compression technologies. It is estimated that the bandwidth savings when using
H.264 is at least 25 percent over the same configuration with MPEG-4. The bandwidth savings associated with H.264 is important
for high definition and megapixel camera deployments.
MJPEG
An MJPEG codec
transmits video as a sequence of Joint Photographic Experts Group (JPEG) encoded images. Each image stands alone without the
use of any predictive compression between frames. MJPEG is less computation-intensive than predictive codecs such as MPEG-4,
so can be implemented with good performance on less expensive hardware. MJPEG can easily be recorded at a reduced frame rate
by only sampling a subset of a live stream.
For example, storing every third frame of a 30-frame per second video
stream will result in a recorded archive at 10 frames per second.
MJPEG has a relatively
high bandwidth requirement compared to MPEG-4. A 640x480 VGA resolution stream running at 30 frames per second can easily
consume 5 to 10 Mbps. The bandwidth required is a function of the complexity of the image, in conjunction with tuning parameters
that control the level of compression.
Higher levels of compression reduce the bandwidth requirement but also
reduce the quality of the decoded image. Since there is no predictive encoding between frames, the amount of motion or change
in the image over time has no impact on bandwidth consumption.
MPEG-4/H.264
An MPEG-4 codec
uses prediction algorithms to achieve higher levels of compression than MJPEG while preserving image quality. Periodic video
frames called I-frames are transmitted as complete, standalone JPEG images similar to an MJPEG frame and are used as a reference
point for the predictive frames. The remaining video frames (P-frames) contain only information that has changed since the
previous frame.
To achieve compression,
MPEG-4 relies on the following types of video frames:
•I-frames (intraframes, independently decodable)—These
frames are also referred to as key frames and contain
all of the data that is required to display an image in a single frame.
•P-frames (predictive or predicted frames)—This
frame type contains only image data that has changed from the previous frame.
•B-frames (bi-directional predictive frames)—This
frame type can reference data from both preceding frames and future frames. Referencing of future frames requires frame reordering
within the codec.
The use of P-frames
and B-frames within a video stream can drastically reduce the consumption of bandwidth compared to sending full image information
in each frame. However, the resulting variance of the video frames' size contributes to the fluctuation in the bandwidth that
a given stream uses. This is the nature of most codecs because the amount of compression that can be achieved varies greatly
with the nature of the video source.
Pan-Tilt-Zoom
(PTZ)
The Cisco Video
Surveillance Manager solution supports the configuration of PTZ cameras connected to encoders or as IP cameras. In order to
support PTZ connectivity, the encoder should be able to connect to the camera through a serial interface. The Video Surveillance
Manager solution supports the following PTZ protocols:
•Bosch
•Cohu
•J2 Vision
•Pelco D
•Pelco P
The encoder
also connects through a serial cable to the analog camera. When the OM viewer requests PTZ control through the joystick, the
Media Server intercepts the request and communicates the request to the encoder. Once the request is received by the encoder,
a serial communication takes place between the encoder and the analog camera.
Figure 4-1 Pan-Tilt-Zoom
Via Encoders
Aspect Ratio
The aspect ratio
is the relationship between the number of pixels in the horizontal and vertical image dimensions.
A 4:3 (1.33:1) aspect ratio is universal
for standard definition cameras. For HDTV formats, 16:9 (1.78:1) is universal. In video surveillance deployments, the HDTV
aspect ratio is more advantageous because the pixels at the top and bottom of the image are generally of less importance than
having a wide field of view.
In other words, the width of the image is more important than the height of the image. Capturing, encoding, and transporting
bits that are of little value is a waste of bandwidth and disk space. In some instances, a single HDTV format video camera
may be able to replace two standard definition cameras.
ip-video-camera-security
ip-video-surveillance-management
Digital Video
Surveillance Resolutions (in pixels)
While image quality
is influenced by the resolution configured on the camera, the quality of the lens, sharpness of focus, and lighting conditions
also come into play. For example, harshly lighted areas may not offer a well-defined image, even if the resolution is very
high. Bright areas may be washed out and shadows may offer little detail. Cameras that offer wide dynamic range processing,
an algorithm that samples the image several times with differing exposure settings and provides more detail to the very bright
and dark areas, can offer a more detailed image.
As a best practice,
do not assume the camera resolution is everything in regards to image quality. For a camera to operate in a day-night environment,
(the absence of light is zero lux), the night mode must be sensitive to the infrared spectrum. It is highly recommended to
conduct tests or pilot installations before buying large quantities of any model of camera.
Tip
Some cameras rated as megapixel cameras in Motion JPEG only offer 4CIF resolution when configured for MPEG-4.
Video Compression
CODECS
The Cisco Video
Surveillance Media Server supports IP endpoints that use Motion JPEG (MJPEG) or MPEG-4 codec technology. Both types of codecs
have advantages and disadvantages when implemented in a video surveillance system. A system administrator may choose to use
MJPEG on certain cameras and MPEG-4 or H.264 on others, depending on system goals and requirements.
A codec is a device or program that performs encoding and decoding on a digital
video stream. In IP networking, the term frame refers to a single unit of traffic across an Ethernet or other Layer-2 network.
In this guide, frame primarily refers to one image
within a video stream. A video frame can consist of multiple IP packets or Ethernet frames.
A video stream
is fundamentally a sequence of still images. In a video stream with fewer images per second, or a lower frame rate, motion
is normally perceived as choppy or broken. At higher frame rates up to 30 frames per second, the video motion appears smoother;
however, 15 frames per second video may be adequate for viewing and recording purposes.
Some of the most
common digital video formats include the following:
•Motion JPEG (MJPEG) is a format consisting of a sequence
of compressed Joint Photographic Experts Group (JPEG) images. These images only benefit from spatial compression within the
frame; there is no temporal compression leveraging change between frames. For this reason, the level of compression reached
cannot compare to codecs that use a predictive frame approach.
•MPEG-1 and MPEG-2 formats are Discrete Cosine Transform-based
with predictive frames and scalar quantization for additional compression. They are widely implemented, and MPEG-2 is still
in common use on DVD and in most digital video broadcasting systems. Both formats consume a higher level of bandwidth for
a comparable quality level than MPEG-4. These formats are not typically used in IP video surveillance camera deployments.
•MPEG-4 introduced object-based encoding, which handles
motion prediction by defining objects within the field of view. MPEG-4 offers an excellent quality level relative to network
bandwidth and storage requirements. MPEG-4 is commonly deployed in IP video surveillance but will be replaced by H.264 as
it becomes available. MPEG-4 may continue to be used for standard definition cameras.
•H.264 is a technically equivalent standard to MPEG-4
part 10, and is also referred to as Advanced Video Codec (AVC). This emerging new standard offers the potential for greater
compression and higher quality than existing compression technologies. It is estimated that the bandwidth savings when using
H.264 is at least 25 percent over the same configuration with MPEG-4. The bandwidth savings associated with H.264 is important
for high definition and megapixel camera deployments.
MJPEG
An MJPEG codec
transmits video as a sequence of Joint Photographic Experts Group (JPEG) encoded images. Each image stands alone without the
use of any predictive compression between frames. MJPEG is less computation-intensive than predictive codecs such as MPEG-4,
so can be implemented with good performance on less expensive hardware. MJPEG can easily be recorded at a reduced frame rate
by only sampling a subset of a live stream. For example, storing every third frame of a 30-frame per second video stream will
result in a recorded archive at 10 frames per second.
MJPEG has a relatively
high bandwidth requirement compared to MPEG-4. A 640x480 VGA resolution stream running at 30 frames per second can easily
consume 5 to 10 Mbps. The bandwidth required is a function of the complexity of the image, in conjunction with tuning parameters
that control the level of compression. Higher levels of compression reduce the bandwidth requirement but also reduce the quality
of the decoded image.
Since there is no predictive encoding between frames, the amount of motion or change in
the image over time has no impact on bandwidth consumption.
MPEG-4/H.264
An MPEG-4 codec
uses prediction algorithms to achieve higher levels of compression than MJPEG while preserving image quality. Periodic video
frames called I-frames are transmitted as complete, standalone JPEG images similar to an MJPEG frame and are used as a reference
point for the predictive frames. The remaining video frames (P-frames) contain only information that has changed since the
previous frame.
To achieve compression,
MPEG-4 relies on the following types of video frames:
•I-frames (intraframes, independently decodable)—These
frames are also referred to as key frames and contain
all of the data that is required to display an image in a single frame.
•P-frames (predictive or predicted frames)—This
frame type contains only image data that has changed from the previous frame.
•B-frames (bi-directional predictive frames)—This
frame type can reference data from both preceding frames and future frames. Referencing of future frames requires frame reordering
within the codec.
The use of P-frames
and B-frames within a video stream can drastically reduce the consumption of bandwidth compared to sending full image information
in each frame. However, the resulting variance of the video frames' size contributes to the fluctuation in the bandwidth that
a given stream uses. This is the nature of most codecs because the amount of compression that can be achieved varies greatly
with the nature of the video source.
Pan-Tilt-Zoom
(PTZ)
The Cisco Video
Surveillance Manager solution supports the configuration of PTZ cameras connected to encoders or as IP cameras. In order to
support PTZ connectivity, the encoder should be able to connect to the camera through a serial interface. The Video Surveillance
Manager solution supports the following PTZ protocols:
•Bosch
•Cohu
•J2 Vision
•Pelco D
•Pelco P
below shows how
an analog camera can be connected to an IP encoder to convert its video feed to an IP video format. The encoder also connects
through a serial cable to the analog camera. When the OM viewer requests PTZ control through the joystick, the Media Server
intercepts the request and communicates the request to the encoder. Once the request is received by the encoder, a serial
communication takes place between the encoder and the analog camera.
Figure 4-1 Pan-Tilt-Zoom
Via Encoders
Aspect Ratio
The aspect ratio
is the relationship between the number of pixels in the horizontal and vertical image dimensions. A 4:3 (1.33:1) aspect ratio
is universal for standard definition cameras. For HDTV formats, 16:9 (1.78:1) is universal. In video surveillance deployments,
the HDTV aspect ratio is more advantageous because the pixels at the top and bottom of the image are generally of less importance
than having a wide field of view. In other words, the width of the image is more important than the height of the image.
Capturing, encoding, and transporting bits that are of little value is a waste of bandwidth and disk space. In some
instances, a single HDTV format video camera may be able to replace two standard definition cameras.
Camera
Placement
Camera
placement can be characterized by either overview or detail view. The camera placement influences the resolution, frame rate
and codec in use.
Overview
A
camera with an overview scene is monitoring a large area such as a parking lot or a traffic camera that is viewing vehicle
congestion or the number of cars parked in the lot. Because details are not important, standard definition cameras using a
wide-angle lens may be sufficient. The preferred codec may be MPEG-4 with a relatively low frame rate, 1-5 frames per second.
Overview cameras may
be supplemented with a detail view camera focused on a key area of interest or by a PTZ camera to provide real-time analysis
of areas of interest at a higher resolution.
Detail
View
The
detail view placement is targeted at observing a specific area of interest at a higher resolution than the overview. Detail
view is used for Point-of-sale transactions and face or license plate recognition. The detail view may have a PTZ capability,
or the camera may be close to the subject area or have a long focal length lens. Megapixel or HD cameras may be deployed to
provide a sufficient number of pixels per-foot to accurately represent the subject.
The
positioning of a camera for detail view is a function of the number of pixels per-foot required for the application.
Detection,
Recognition, Identification
Detection,
recognition, and identification are visual processes associated with the amount of detail discernable to the human eye. We
detect an object when it enters the field of view. Detection means we are aware that an object (or person) now exists where
previously it was not seen. Usually, this is due to movement of the object into the field of view of the surveillance camera.
Detection simply means we are aware of the object, but have too little details to recognize or identify the object.
As
the object moves closer, we may recognize the object from characteristics previously encountered. For example, aircraft recognition
is taught to military ground troops and airmen. All aircraft have wings, engines, a fuselage, and tail assembly. They differ
in size, shape, number, and position to each other. A particular model of aircraft can be recognized by recalling these characteristics
from pictures, drawings or past detailed observations.
Identification
is the process where sufficient details are available to uniquely discern a person or object that is previously unknown. Identification
requires sufficient detail to accurately describe or recall the characteristics of the subject at a later time. For example,
a mug shot (booking photograph) is taken following the arrest of a subject as a means of photographing (recording) sufficient
details for later identification by a victim or witness. In video surveillance terms, sufficient detail is calibrated in pixels
per foot of the area recorded by the camera.
The
number of pixels per-foot to identify a subject may, at a minimum, range from 40 to over 150. If the goal, therefore, is to
identify a person entering through a standard 7-foot high doorway, the camera would need to be positioned so that the pixel
per-foot requirement covering the door is met. The door would then need to be covered by 1050 pixels, if the goal is to have
150 pixels per foot; 7 feet x 150 pixels per foot.
As
shown, the video surveillance
image is subject to uneven lighting, the subject is standing near a large window of a lab environment. There is little light
from the internal space with the natural light entering from the side and rear in this scene. This image is from an analog
camera that does not include a wide-dynamic range processing that would improve the image quality in this deployment. This
illustrates the point that the number of pixels alone does not guarantee a high quality image.
Number
of Cameras per Location
The
number of cameras at any one building or facility may vary greatly depending on the coverage requirements and the nature of
the business. While there are some small office deployment scenarios where only a single IP camera is needed, in most cases
even a small office will require more cameras that one might initially expect.
There
is a camera behind each teller station, a camera on the main entrance (both inside and outside), and two cameras in the inner
office area focused on the lobby and half doorway leading into the manager office areas. Additionally, the parking lot area,
side, front, and rear of the branch as well as any exterior ATM would need be covered. This small location may easily require
10 to 16 IP cameras. The Cisco Video Management and Storage System (VMSS) Network Module for the ISR router is targeted at
a 16 to 32 camera deployment any may be implemented in this branch location.
Larger
facilities require more cameras per location. It is not uncommon for a large retail store, home center, or warehouse retailer
to need 100 to 200 IP cameras per location. Public school deployments may need 80 to 100 cameras per building.
Tip
One advantage of deploying high definition cameras over standard definition is fewer cameras may be required to cover an area
of interest with a similar number of pixels per foot.
Frame Rates
As image quality
and frame rate increase, so does bandwidth requirements. The frame rate selected must meet the business requirements, but
it does not need to be higher than what is required and should be considered carefully as frame rate influences both bandwidth
and storage requirements.
Motion pictures
are captured at 24 frames per second (fps). The human eye/brain sees images captured at 24 fps as fluid motion. Televisions
use 25 fps (PAL) or 30 fps (NTSC) as does analog video cameras. These full motion rates are not needed for all video surveillance
applications and in most applications less than 12 to 15 fps is sufficient.
The following are
some industry guidelines:
•Nevada Gaming Commission (NGC) standards for casinos—30 fps
•Cash register, teller stations—12 to 15 fps
•School or office hallways —5 fps
•Parking lots, traffic cameras, overview scenes —1 to 3 fps
•Sports Stadiums on non-event days, less than 1 fps
Movement in Relation
to Camera Placement
If the camera is
placed where the subject moves toward the camera or vertically, the number of frames per second can be less than if the subject
moves from side to side or horizontally within the field of view. The velocity of the subject is also a consideration. A cameras
observing persons jogging or riding a bicycle may require higher frame rates than a person walking.
Progressive Scanning
Analog cameras
capture images using an interlaced scanning method, odd and even scan lines are done alternately. There is approximately 17
ms delay between the scanning of the odd and even lines making up the entire image. Because of this slight delay between scan
passes, objects that are moving in the frame may appear blurred while stationary objects are sharp. Most IP cameras use a
progressive scan that is not subject to this problem. Everything being equal, a progressive scan image has less motion blurring
than an interlace scanned image.
Wide Dynamic Range
Imaging
The Cisco 2500
Series Video Surveillance IP Camera offer wide dynamic range imaging. This technology increases the image quality in harsh
lighting conditions, including back lighted scenes or indoor/outdoor areas such as loading docks or stadiums.
IP Transport
IP cameras and
encoders communicate with the Media Server in different ways, depending on the manufacturer. Some edge devices may support
only MJPEG over TCP, while others may also support MPEG-4 over UDP.
TCP
MJPEG is typically
transported through TCP. TCP provides guaranteed delivery of packets by requiring acknowledgement by the receiver. Packets
that are not acknowledged will be retransmitted. The retransmission of TCP can be beneficial for slightly congested network
or networks with some level of inherent packet loss such as a wireless transport. Live video rendering at the receiving end
may appear to stall or be choppy when packets are retransmitted, but with the use of MJPEG each image stands alone so the
images that are displayed are typically of good quality.
UDP/RTP
MPEG-4/H.264 video
is typically transmitted over UDP or Real-time Transport Protocol (RTP). UDP does not guarantee delivery and provides no facility
for retransmission of lost packets. RTP/UDP transport is most suitable for networks with very little packet loss and bandwidth
that is guaranteed through QoS mechanisms. MPEG-4 over RTP/UDP is relatively intolerant to packet loss; if there is loss in
the stream, there will typically be visible artifacts and degradation of quality in the decoded images.
UDP transport does provide the option
of IP multicast delivery, where a single stream may be received by multiple endpoints. In an IP multicast configuration, the
internetworking devices handle replication of packets for multiple recipients. This reduces the processing load on the video
encoder or IP camera and can also reduce bandwidth consumption on the network.
Some IP cameras
and encoders also provide for TCP transport of MPEG-4. TCP encapsulation can be beneficial for networks with inherent packet
loss. TCP may be useful especially for fixed cameras and streams that are only being recorded and not typically viewed live.
TCP transport induces a little more latency in the transport due to the required packet acknowledgements, so may not be a
desirable configuration for use with a PTZ controlled camera.
Required TCP/UDP
Ports
IP Unicast
Applications that
rely on unicast transmissions send a copy of each packet between one source address and one destination host address. Unicast
is simple to implement but hard to scale if the number of sessions is large. Since the same information has to be carried
multiple times, the impact on network bandwidth requirements may be significant.
The communication
between the Media Server and the viewers is always through IP unicast, making the Media Server responsible for sending a single
stream to each viewer.
Note
The Media Server only supports IP unicast between the Media Server and the viewers.
Network Deployment
Models
This chapter provides
a high-level overview of different deployment models and highlights the typical requirements of campus and wide area networks.
Cisco's Enterprise Systems Engineering team offers detailed network designs that have been deployed by enterprise customers
to provide enhanced availability and performance. Campus
Networks
An infrastructure
that supports physical security applications requires several features from a traditional campus design. A hierarchical campus
design approach has been widely tested, deployed, and documented. This section provides a high-level overview and highlights
some of the design requirements that may apply to a video surveillance solution. A traditional campus design should provide the following:
•High availability—Avoid single points of failure
and provide fast and predictable convergence times.
•Scalability—Support the addition of new services
without major infrastructure changes.
•Simplicity—Ease of management with predictable
failover and traffic paths.
A highly available
network is a network that provides connectivity at all times.
As applications have become more critical, the network has become significantly
more important to businesses. A network design should provide a level of redundancy where no points of failure exist in critical
hardware components. This design can be achieved by deploying redundant hardware (processors, line cards, and links) and by
allowing hardware to be swapped without interrupting the operation of devices.
The enterprise
campus network is a typical campus network.
It provides connectivity to several environments such as IDFs, secondary buildings, data centers, and wide area sites. An
Intermediate Distribution Frame (IDF) is the cable infrastructure used for interconnecting end user devices to the Main Distribution
Frame (MDF) or other buildings and is typically located at a building wiring closet.
Quality-of-service
(QoS) is critical in a converged environment where voice, video, and data traverse the same network infrastructure. Video
surveillance traffic is sensitive to packet loss, delay, and delay variation (jitter) in the network. Cisco switches and routers
provide the QoS features that are required to protect critical network applications from these effects.
Hierarchical Design
The goal of a campus
design is to provide highly available and modular connectivity by separating buildings, floors, and servers into smaller groups.
This multilayer approach combines Layer 2 switching (based on MAC addresses) and Layer 3 switching or routing (based on IP
address) capabilities to achieve a robust, highly available campus network. This design helps reduce failure domains by providing
appropriate redundancy and reducing possible loops or broadcast storms.
With its modular
approach, the hierarchical design has proven to be the most effective in a campus environment. The following are the primary
layers of a hierarchical campus design:
•Core layer—Provides high-speed transport between
distribution-layer devices and core resources. The network's backbone.
•Distribution layer—Implements policies and
provides connectivity to wiring closets. This layer provides first-hop redundancy such as Hot Standby Router Protocol (HSRP)
and Gateway Load Balancing Protocol (GLBP).
•Access layer—User and workgroup access to the
network. Security and QoS can be defined at this layer and propagated to the higher layers.
In smaller environments,
it is typical to collapse the distribution and core layers into a single layer.
Wide Area Networks
A wide-area network
(WAN) is used to connect different local-area networks (LANs) and typically covers a broad geographic area. WAN services are
leased from service providers who provide different speeds and connectivity options.
Deploying a video
surveillance solution through a WAN environment presents challenges that are not typically seen in a LAN. In a LAN environment
it is common to see 1 Gbps and 10 Gbps of bandwidth, while in a WAN environment, most connections are less than 10 Mbps; many
remote connections operate on a single T1 (1.544 Mbps) or less.
These inherent
bandwidth constraints require careful evaluation of the placement of cameras and Media Servers and how many viewers can be
supported at remote sites simultaneously. By using child proxies, bandwidth requirements can be reduced to transport video
streams across WAN connections.
The placement of
recording devices also becomes important. The video may be streamed to a central site using lower frame rates or resolution,
but another attractive alternative is to deploy Media Servers at the remote sites and stream the traffic using the LAN connectivity
within the remote site.
Table 4-3 Service Provider Links (1)
Digital Signal Level
Speed
"T"
Channels or DS0s
DS0
64 kbps
-
1
DS1
1.544 Mbps
T1
24
DS3
44.736 Mbps
T3
672
Table 4-4 Service Provider Links (2)
SONET Signal Level
Speed
SDH Equivalent
STS-OC-1
51.84 Mbps
STM-0
STS-OC-3
155.52 Mbps
STM-1
STS-OC-12
622.08Mbps
STM-4
STS-OC-48
2488.32 Mbps
STM-16
STS-OC-192
9.952 Gbps
A point-to-point
or leased line is a link from a primary site to a remote site using a connection through a carrier network. The link is considered
private and is used exclusively by the customer. The circuit usually is priced based on the distance and bandwidth requirements
of the connected sites.
Technologies such
as Multilink PPP allow several links to be bundled to appear as a single link to upper routing protocols. In this configuration,
several links can aggregate their bandwidth and be managed with only one network address. Because video surveillance traffic
requirements tend to be larger than other IP voice and data applications, this feature is attractive for video surveillance
applications.
Hub-and-spoke,
also known as star topology, relies on a central site router that acts as the connection for other remote sites. Frame Relay
uses hub-and-spoke topology predominantly due to its cost benefits, but other technologies, such as MPLS, have mostly displaced
Frame Relay.
Example 1: Network
Bandwidth Usage
Two OM Viewers
are displaying video streams from Camera 1 and Camera 2 while one OM Viewer is displaying three video streams: two streams
from Camera 1 and one stream from Camera 2. The network bandwidth required to display video streams for Camera 2 in Site A
are relatively small for a LAN environment, but the traffic from Camera 1 can be significant for WAN environments since four
different 1Mbps streams have to traverse the WAN locations.