2nd CD ISO/IEC 15948: xxxx


Information technology - Computer graphics and image processing -
Portable Network Graphics (PNG): Functional specification

30 April 1999


1 SCOPE 1

2 NORMATIVE REFERENCES 2

3 DEFINITIONS 4

4 CONCEPTS 10

4.1 Images 10

4.2 Colour spaces 12

4.3 Source to PNG image transformation 12

4.3.1 Introduction 12

4.3.2 Implied alpha 12

4.3.3 Scaling 12

4.3.4 Indexing 13

4.3.5 Alpha compaction 14

4.4 PNG image 14

4.5 Encoding the PNG image 16

4.5.1 Introduction 16

4.5.2 Pass abstraction 16

4.5.3 Scanline abstraction 17

4.5.4 Filtering 17

4.5.5 Compression 18

4.5.6 Chunking 19

4.6 Ancillary information 19

4.7 PNG datastream 20

4.7.1 Chunks 20

4.7.2 Chunk types 20

4.8 Error handling 20

4.9 Registration 22

5 DATASTREAM AND DATA REPRESENTATION 23

5.1 Integers and byte order 23

5.2 PNG signature 23

5.3 Chunk layout 24

5.4 Chunk naming conventions 24

5.5 Cyclic Redundancy Code algorithm 26

5.6 Chunk ordering 26

6 REFERENCE IMAGE TO PNG IMAGE TRANSFORMATION 29

6.1 Introduction 29

6.2 Colour types and values 29

6.3 Scaling 29

6.4 Scanline 30

6.5 Alpha representation 30

6.6 Filtering 31

6.7 Text strings 31

7 INTERLACING AND PASS ABSTRACTION 32

7.1. Introduction 32

7.2. Interlace methods 32

8 FILTERING 33

8.1 Filter methods and filter types 33

8.2 Filter types for filter method 0 33

8.3 Filter type 3: Average 34

8.4 Filter type 4: Paeth 34

9 COMPRESSION 36

9.1 Compression of filtered scanlines 36

9.2 Other uses of compression 37

10 CHUNK SPECIFICATIONS 38

10.1 Introduction 38

10.2 Critical chunks 38

10.2.1 General 38

10.2.2 IHDR Image header 38

10.2.3 PLTE Palette 39

10.2.4 IDAT Image data 40

10.2.5 IEND Image trailer 40

10.3 Ancillary chunks 40

10.3.1 General 40

10.3.2 Additional colour information 40

10.3.2.1 cHRM Primary chromaticities and white point 40

10.3.2.2 gAMA Image gamma 41

10.3.2.3 iCCP Embedded ICC profile 41

10.3.2.4 sBIT Significant bits 42

10.3.2.5 sRGB Standard RGB colour space 43

10.3.3 Additional source image information 44

10.3.3.1 bKGD Background colour 44

10.3.3.2 hIST Image histogram 44

10.3.3.3 tRNS Transparency 45

10.3.3.4 pHYs Physical pixel dimensions 45

10.3.3.5 sPLT Suggested palette 46

10.3.4 Textual information 47

10.3.4.1 iTXt International textual data 47

10.3.4.2 tEXt Textual data 48

10.3.4.3 zTXt Compressed textual data 49

10.3.5 Time stamp information 49

10.3.5.1 tIME Image last-modification time 49

11 PNG ENCODERS 51

11.1 Introduction 51

11.2 Encoder gamma handling 51

11.3 Encoder colour handling 53

11.4 Alpha channel creation 54

11.5 Sample depth scaling 55

11.6 Suggested palettes 56

11.7 Interlacing 57

11.8 Filter selection 57

11.9 Compression 57

11.10 Chunking 58

11.10.1 Text chunk processing 58

11.10.2 Use of private chunks 58

11.10.3 Private type and method codes 58

11.10.4 Ancillary chunks 59

12 PNG DECODERS AND VIEWERS 60

12.1 Introduction 60

12.2 Error checking 60

12.3 Security considerations 61

12.4 Chunking 61

12.4.1 General 61

12.4.2 Pixel dimensions 61

12.4.3 Text chunk processing 61

12.5 Decompression 62

12.6 Filtering 62

12.7 Interlacing and progressive display 62

12.8 Truecolour image handling 64

12.9 Sample depth rescaling 64

12.10 Decoder gamma handling 65

12.11 Decoder colour handling 66

12.12 Background colour 67

12.13 Alpha channel processing 67

12.14 Suggested-palette and histogram usage 70

13 EDITORS AND EXTENSIONS 72

13.1 Additional chunk types 72

13.2 Behaviour of PNG editors 72

13.3 Ordering of chunks 73

13.3.1 Ordering of critical chunks 73

13.3.2 Ordering of ancillary chunks 73

14 CONFORMANCE 75

14.1 Introduction 75

14.1.1 Objectives 75

14.1.2 Scope 75

14.2 Conformance 75

14.2.1 Conformance of PNG datastreams 75

14.2.2 Conformance of PNG encoders 76

14.2.3 Conformance of PNG decoders 76

14.2.4 Conformance of PNG editors 77

ANNEX A FILE CONVENTIONS AND INTERNET MEDIA TYPE 78

A.1 File name extension 78

A.2 Internet media type 78

A.3 Macintosh file layout 78

ANNEX B GUIDELINES FOR NEW CHUNK TYPES 79

ANNEX C GAMMA AND CHROMATICITY 80

ANNEX D SAMPLE CYCLIC REDUNDANCY CODE 82

ANNEX E ONLINE RESOURCES 84

Introduction 84

Archive sites 84

PNG home page 84

Sample implementation and test images 84

Electronic mail 84

ANNEX F RELATIONSHIP TO W3C PNG 85

Editor (Version 1.0) 85

Editor (Version 1.1) 85

Contributing Editor (Version 1.0) 85

Contributing Editor (Version 1.1) 85

Authors (Versions 1.0 and 1.1 combined) 85

List of changes between W3C Recommendation PNG Specification Version 1.0
and this International Standard 86

Editorial changes 86

Technical changes 86

BIBLIOGRAPHY 88


1 Scope

This International Standard specifies a datastream and associated file
format for single computer graphics images transmitted across the
Internet. The datastream and associated file format is of value outside
its main design goal.

The design goals were:

a) Portability: encoding, decoding and transmission should be software
and hardware platform independent.

b) Completeness: the representation of true colour, indexed-colour and
greyscale images, in each case with the option of transparency.

c) Serial encode and decode: it should be possible for datastreams to be
generated serially and read serially allowing the datastream format to
be used for on-the-fly generation and display of images across a serial
communication channel.

d) Progressive presentation: it should be possible to transmit
datastreams so that an approximation of the whole computer graphics
image can be presented initially and progressively enhanced as the
datastream is received.

e) Robustness to transmission errors: it should be possible to detect
datastream transmission errors effectively.

f) Lossless: filtering and compression should be without any loss of
information for any arbitrary image.

g) Performance: any filtering, compression, and progressive
image presentation should be aimed at efficient decoding and
presentation. Fast encoding is a less important goal than fast
decoding. Decoding speed may be achieved at the expense of encoding
speed.

h) Compression: images should be compressed effectively, consistent with
the other design goals.

i) Simplicity: developers should be able to implement the standard
easily.

j) Interchangeability: any standard-conforming PNG decoder shall read
all conforming PNG files.

k) Flexibility: future extension and private additions should be
catered for without compromising the interchangeability of standard PNG
datastreams.

l) Legally unencumbered: no algorithms should be used that are not
freely available.

2 Normative references

The following normative documents contain provisions which, through
reference in this text, constitute provisions of this International
Standard. For dated references, subsequent amendments to, or revisions
of, any of these publications do not apply. However, parties to
agreements based on this International Standard are encouraged to
investigate the possibility of applying the most recent editions of the
normative documents indicated below. For undated references, the latest
edition of the normative document referred to applies. Members of ISO
and IEC maintain registers of currently valid International Standards.

[CIE-15] CIE Recommendations on uniform color spaces, color-difference
equations, psychometric color terms, CIE Publication No 15, 1978.

[CIE-15.2] CIE Colorimetry, Supplement Number 15.2 to CIE Publication No
15, 1986.

[ICCP-1] International Color Consortium, "Specification ICC.1: 1998-09,
File Format for Color Profiles", 1998, available at http://www.color.org

[ICCP-1A] International Color Consortium, "Specification
ICC.1A: 1999-02, Addendum 1 to ICC.1: 1998-09", 1999, available at
http://www.color.org

[ISO-639:1988] International Organization for Standardization, Code for
the representation of names of languages.

[ISO-646:1991] International Organization for Standardization,
Information technology --- ISO 7-bit coded character set for information
exchange.

[ISO-3309:1984] International Organization for Standardization,
Information processing systems --- Data communication High-level data
link control procedure --- Frame structure, 3rd Edition.

[ISO-8859-1:1987] International Organization for Standardization,
Information processing --- 8-bit single-byte coded graphic character
sets --- Part 1: Latin alphabet No. 1. Also see sample files at

ftp://ftp.uu.net/graphics/png/documents/iso_8859-1.*

[ISO-9899:1990] International Organization for Standardization,
Programming languages --- C.

[ISO-10646-1:1993] International Organization for Standardization,
Information Technology --- Universal multiple-octet coded character sets
(UCS) --- Part 1: Architecture and basic multilingual plane.

[RFC-1123] Braden, R., Editor, "Requirements for Internet Hosts ---
Application and Support", STD 3, RFC 1123, USC/Information Sciences
Institute, October 1989. ftp://ftp.isi.edu/in-notes/rfc1123.txt

[RFC-2045] Freed, N. and Borenstein, N. , "MIME (Multipurpose
Internet Mail Extensions) Part One: Format of Internet Message
Bodies", RFC 2045, Innosoft, First Virtual, November 1996.
ftp://ftp.isi.edu/in-notes/rfc2045.txt

[RFC-2048] Freed, N., Klensin, J. and Postel, J., "Multipurpose
Internet Mail Extensions (MIME) Part Four: Registration
Procedures", RFC 2048, Innosoft, MCI, ISI, November 1996.
ftp://ftp.isi.edu/in-notes/rfc2048.txt

[RFC-1766] Alvestrand, H., "Tags for the Identification of Languages,
Internet standards track protocol", RFC 1766, UNINETT, March 1995.
ftp://ftp.isi.edu/in-notes/rfc1766.txt

[RFC-1950] Deutsch, P. and Gailly, J-L., "ZLIB Compressed Data Format
Specification version 3.3", RFC 1950, Aladdin Enterprises, May 1996.
ftp://ftp.isi.edu/in-notes/rfc1950.txt

[RFC-1951] Deutsch, P., "DEFLATE Compressed Data Format
Specification version 1.3", RFC 1951, Aladdin Enterprises, May 1996.
ftp://ftp.isi.edu/in-notes/rfc1951.txt

3 Definitions

For the purposes of this International Standard the following
definitions apply.

3.1 ab

exponentiation; a raised to the power b. In gamma-related calculations,
zero raised to any positive power is valid and shall give a zero result.

3.2 alpha

value representing a pixel's degree of opacity. The more opaque a
pixel, the more it hides the background against which the image is
presented. Zero alpha represents a completely transparent pixel, maximum
alpha represents a completely opaque pixel.

3.3 alpha compaction

an implicit representation of transparent pixels. If every pixel with
a specific colour or greyscale value is fully transparent and all
other pixels are fully opaque, the alpha channel may be represented
implicitly.

3.4 ancillary chunk

class of chunk that provides additional information. A PNG decoder,
without processing an ancillary chunk, can still produce a meaningful
image, though not necessarily the best possible image.

3.5 bit depth

For indexed-colour images, the number of bits per palette index. For
other images, the number of bits per sample in the image. This is the
value that appears in the IHDR chunk.

3.6 byte

eight bits; also called an octet. The highest bit (value 128) of a byte
is numbered bit 7; the lowest bit (value 1) is numbered bit 0.

3.7 byte order

ordering of bytes for multi-byte data values within a PNG file or PNG
datastream. PNG uses network byte order.

3.8 channel

array of all per-pixel information of a particular kind within a
reference image. There are five kinds of information: red, green, blue,
greyscale and alpha. For example the alpha channel is the array of alpha
values within a reference image.

3.9 chromaticity (CIE)

pair of values x,y that precisely specify a colour, except for the
brightness information.

3.10 chunk

section of a PNG datastream. Each chunk has a chunk type name. Most
chunks also include data. The format and meaning of the data within the
chunk are determined by the chunk type name.

3.11 colour type

value denoting how colour and alpha are specified in the PNG
image. Colour types are sums of the following values: 1 (palette used),
2 (truecolour used), 4 (alpha used). The permitted values of colour type
are 0, 2, 3, 4 and 6.

3.12 composite (verb)

to form an image by merging a foreground image and a background image,
using transparency information to determine where and to what extent
the background should be visible. The foreground image is said to be
"composited against" the background.

3.13 CRC

Cyclic Redundancy Code. A CRC is a type of check value designed to
detect and correct particular types of transmission errors. CRCs are
designed to catch most transmission errors. A decoder calculates the CRC
for the received data and checks by comparing it to the CRC appended to
the data that the encoder calculated. A mismatch indicates that the data
or the CRC were corrupted in transit.

3.14 critical chunk

chunk that shall be understood and processed by the decoder in order to
produce a meaningful image from a PNG datastream.

3.15 CRT

Cathode Ray Tube: a common type of computer display hardware.

3.16 datastream

sequence of bytes. This term is used rather than "file" to describe a
byte sequence that may be only a portion of a file. It is also used to
emphasize that the sequence of bytes might be generated and consumed "on
the fly", never appearing in a stored file at all.

3.17 deflate

name of a particular compression algorithm. This algorithm is used, in
compression mode 0, in conforming PNG datastreams. Deflate is a member
of the LZ77 family of compression methods. It is defined in [RFC-1951].

3.18 display image

image constructed for display from a decoded PNG datastream.

3.19 filter

transformation applied to an array of scanlines with the aim of
improving their compressibility. PNG uses only lossless (reversible)
filter algorithms.

3.20 frame buffer

the final digital storage area for the image shown by most types of
computer display. Software causes an image to appear on screen by
loading the image into the frame buffer.

3.21 gamma

exponent that describes approximations to certain non-linear transfer
functions encountered in image capture and reproduction. Within this
International Standard, gamma is the exponent in the transfer function
from image-sample to display-output

display-output = image-samplegamma

where both image-sample and display-output are scaled to the range 0 to
1.

3.22 greyscale

image representation in which each pixel is defined by a single sample
of colour information, representing overall luminance (on a scale from
black to white), and optionally an alpha sample.

3.23 image data

1-dimensional array of scanlines within an image.

3.24 implied alpha

implicit representation of the alpha channel. If every pixel is fully
opaque the alpha channel can be implied by the colour type and is not
stored explicitly as part of the image data.

3.25 indexed-colour

image representation in which each pixel of the original image is
represented by a single index into a palette. The selected palette entry
defines the actual colour of the pixel.

3.26 indexing

image represented by a palette and an array of indices pointing to
entries in the palette. The alpha information is represented in a
separate chunk. The number of distinct colour and alpha or greyscale and
alpha values has to be less than 256.

3.27 interlaced PNG image

sequence of reduced images generated from the PNG image by pass
abstraction.

3.28 lossless compression

method of data compression that reconstructs the original data exactly,
bit-for-bit.

3.29 lossy compression

method of data compression that reconstructs the original data
approximately, rather than exactly.

3.30 LSB

Least Significant Byte of a multi-byte value.

3.31 luminance

formal definition of luminance is in [CIE-15]. Informally it is the
perceived brightness, or greyscale level, of a colour. Luminance and
chromaticity together fully define a perceived colour.

3.32 LUT

Look Up Table. In frame buffer hardware, a LUT can be used to map
indexed-colour pixels into a selected set of truecolour values, or to
perform gamma correction. In software, a LUT can often be used as a
fast way of implementing any mathematical function of a single integer
variable.

3.33 LZ77

data compression algorithm described by Ziv and Lempel in their 1977
paper [ZL].

3.34 MSB

Most Significant Byte of a multi-byte value.

3.35 network byte order

byte order in which the most significant byte comes first, then the
less significant bytes in descending order of significance (MSB LSB for
two-byte integers, MSB B2 B1 LSB for four-byte integers).

3.36 palette

indexed table of 3 8-bit sample values, red, green and blue, which with
an indexed-colour image defines the red, green and blue sample values of
the reference image. Alpha samples may be defined for palette entries,
via the tRNS chunk and may be used to reconstruct the alpha sample
values of the reference image.

3.37 pass abstraction

organizing a PNG image as a sequence of reduced images to change the
order of transmission and enable progressive display.

3.38 pixel

information stored for a single grid point in an image. A pixel consists
of a sequence of samples from all channels. The complete image is a
rectangular array of pixels.

3.39 PNG datastream

result of encoding a PNG image. A PNG datastream consists of a PNG
signature followed by a sequence of critical and ancillary chunks.

3.40 PNG decoder

process or device which reconstructs a reference image equivalent to the
source image from a PNG datastream.

3.41 PNG editor

creates a modification of an existing PNG datastream, preserving
unmodified ancillary information wherever possible and obeying the chunk
ordering rules, even for unknown chunk types.

3.42 PNG encoder

process or device which generates a PNG datastream from a source image.

3.43 PNG file

PNG datastream stored as a file on some physical medium.

3.44 PNG image

result of transforming a reference image.

3.45 PNG signature

sequence of bytes appearing at the start of every PNG datastream. It
differentiates a PNG datastream from other types of datastream and
allows an early detection of some transmission errors.

3.46 reduced image

pass of the interlaced PNG image extracted from the PNG image by pass
abstraction.

3.47 reference image

rectangular array of pixels, each having red, green and blue (RGB) and
alpha samples. Every reference image can be represented exactly by a PNG
datastream and every PNG datastream can be converted into a reference
image. Sample values can have a sample depth from 1 to 16 and may be
different for each channel.

3.48 sample

intersection of a channel and a pixel in an image.

3.49 sample depth

number of bits used to represent a sample value. In an indexed-colour
PNG image, samples are stored in the palette and thus the sample depth
is always 8 by definition of the palette. In other types of PNG image it
is the same as the bit depth.

3.50 sample depth scaling

mapping of a range of sample values onto the full range of a sample
depth allowed in a PNG datastream.

3.51 scanline

row of entries within an image or interlaced PNG image.

3.52 source image

image which is presented to a PNG encoder.

3.53 truecolour

image representation in which each pixel is defined by samples,
representing red, green, and blue intensities and optionally an alpha
sample.

3.54 white point

chromaticity of a computer display's nominal white value.

3.55 zlib

particular format for data that have been compressed using deflate-style
compression. Also the name of a library containing a sample
implementation of this method. The format is defined in [RFC-1950].

4 Concepts

4.1 Images

This International Standard specifies the PNG datastream and places
some requirements on PNG encoders which generate PNG datastreams, PNG
decoders which interpret PNG datastreams and PNG editors which transform
one PNG datastream into another. A PNG editor is defined as an editor
that transforms a PNG datastream into another standard PNG datastream
even when the PNG editor has incomplete knowledge of all the information
in the PNG datastream. The standard does not specify the interface
between an application and either a PNG encoder, decoder or editor. The
precise form in which an image is presented to an encoder or delivered
by a decoder is not specified. Four kinds of image are distinguished,
some of which only exist conceptually.

a) The source image is the image which is presented to a
PNG encoder. The result of encoding a source image is a PNG
datastream. Conceptually, the source image always has a colour or
greyscale value for each pixel, although this information may not be
available explicitly.

b) The reference image, which only exists conceptually, is a rectangular
array of rectangular pixels, each having red, green, blue (RGB) and
alpha samples. The reference image may use sample depths in the range 1
to 16 bits. Every reference image can be represented exactly by a PNG
datastream and every PNG datastream can be converted into a reference
image. Every source image that can be encoded by PNG has an equivalent
representation as a reference image, though since the colour space of
the source image is not specified by PNG, this equivalence can involve
loss if the colour space of the source image does not have a direct
representation in an RGB colour space. The reference image is conceptual
in the sense that an encoder may not necessarily need to explicitly
compute and store the reference image. A PNG encoder conceptually
encodes the reference image.

c) The PNG image is obtained from the reference image by a series
of transformations: alpha separation, scaling, indexing and alpha
indexing. Five types of PNG image are defined (see 6.2: Colour types and
values). Which of the transformation steps will need to be performed
by a PNG encoder will depend on the specific form in which the source
image is presented and the type of PNG image to be generated. Some
encoding steps may be optimized or not be required depending on the
form of the source image. Although not all sample depths in the range
1 to 16 bits are explicitly supported in the PNG image, the number of
significant bits in each channel of the image may be recorded. A PNG
encoder generates a PNG datastream from the PNG image. A PNG decoder
takes the PNG datastream and recreates the PNG image. The reference
image is a canonical form of the PNG image.

d) The display image is constructed from the PNG image obtained by
decoding a PNG datastream. No specific format is specified for the
display image. A viewer presents the decoded PNG image to the user as
close to the original source image as it can achieve.

The relationships between the four kinds of image are illustrated in
figure 4.1.


Figure 4.1: Relationships between source, reference, PNG and display
images

The array of samples of a particular type (for example, green samples)
is called a channel. Each horizontal row of pixels is called a
scanline. Pixels are ordered from left-to-right within each scanline
and scanlines are ordered top-to-bottom. All pixels have the same
aspect ratio. The sample depth is defined as the number of bits used to
specify a sample in an image. The samples for a pixel may have different
sample depths. An alpha value of zero defines a fully transparent pixel
and a fully opaque pixel is defined by the maximum value. If the only
alpha values used are zero and the maximum value, the alpha sample
depth conveys no information. Generating a reference image where the
only alpha values used are zero and the maximum value is regarded as
lossless even when the maximum value in the decoded reference image is
different from the one in the reference image that was encoded. These
relationships are illustrated in figure 4.2.


Figure 4.2: Relationships between sample, sample depth, pixel and
channel

4.2 Colour spaces

The RGB colour space in which colour samples are situated may be
specified in one of three ways:

a) by an ICC profile;

b) by specifying explicitly that the colour space is sRGB when the
samples conform to this colour space;

c) by specifying the value of gamma and the 1931 CIE x, y chromaticities
of the red, green and blue primaries used in the image and the reference
white point.

For high-end applications the first method provides the most flexibility
and control. The second method is appropriate when the sRGB colour space
is supported. It is recommended that explicit gamma and chromaticity
information is also provided when either the first or second methods are
used, as in cases where full ICC profiles or the sRGB colour space are
not supported, PNG decoders can still make sensible use of chromaticity
information.

4.3 Source to PNG image transformation

4.3.1 Introduction

To improve the encoding of the source image, a number of transformations
are applied to the reference image to create the PNG image to be encoded
(see figure 4.3). Before these transformations are applied, there may
also be a transformation from RGB sample values to greyscale sample
values. The transformations are not necessarily independent and need not
necessarily be applied in the sequence shown. The source image may have
some or all of these transformations already applied in which case a PNG
encoder only needs to perform the ones still required.


Figure 4.3: Reference image to PNG image transformation

4.3.2 Implied alpha

If all the alpha samples for the reference image have the maximum
value for the specified sample depth, the alpha channel for the PNG
image may be represented implicitly. If the alpha channel is not
explicitly present in the source image, all pixels are considered to
be fully opaque (i.e. have the maximum allowed alpha value) in the
reference image. If the alpha channel is represented implicitly, no
alpha channel appears in the PNG image. This transformation is called
alpha separation. In decoding the PNG image, if no alpha information is
associated with the PNG image, it is assumed that all alpha samples are
the same and have the maximum allowed value.

4.3.3 Scaling

In the PNG image, not all sample depths are supported (see 6.2: Colour
types and values). If the sample depth in the reference image does not
correspond to an allowed sample depth for the PNG image being encoded,
the possible sample values in the reference image are linearly mapped
into the next allowable range for the PNG image. Figure 4.4 shows, if
the sample depth in the reference image was 3 and the next allowable
depth in the PNG image was 4, how the possible values are mapped into a
PNG image with depth 4.


Figure 4.4: Scaling sample values

4.3.4 Indexing

If the number of distinct pixels in the reference image is small, an
alternative representation of the reference image may be more efficient
for encoding. This representation is called indexed-colour. The array
of pixels in the reference image is replaced by an array of indices of
the same dimension. Each position in the indexed colour array contains
a pointer into a table, called the palette, and an associated alpha
value. The associated alpha values are stored (possibly implicitly) in a
separate array called the tRNS chunk with each palette entry having an
associated entry in the tRNS chunk. Colour images that are not indexed
are called truecolour images.


Figure 4.5: Indexed-colour image

A suggested palette or palettes may be constructed even when the
reference image is left as a truecolour image. In this case, the
suggested palette provides a set of colours which the truecolour image
could be transformed into. It is included so that a viewer which
cannot display truecolour images directly has some guidance as to an
appropriate palette to use.

4.3.5 Alpha compaction

The final transformation stage is provided for images where the number
of distinct alpha values is limited or the extreme case where alpha
values are either completely opaque or completely transparent.

For indexed-colour images, encoders are encouraged to rearrange the
palette so that the table entries with the maximum alpha value are
grouped at the end. In this case, the tRNS chunk can be shortened and
not include these entries. A palette entry with no associated entry in
the tRNS chunk has the maximum value of alpha.

For greyscale images, if all greyscale values are completely opaque
apart from one, which is completely transparent, the alpha channel
may be discarded from the PNG image. The tRNS chunk contains a single
greyscale value defining the greyscale that is completely transparent.

For truecolour images, if all colour values are completely opaque apart
from one, which is completely transparent, the alpha channel may be
discarded from the PNG image. The tRNS chunk contains a single colour
value defining the colour that is completely transparent.

4.4 PNG image

After transformation, the reference image has been transformed into one
of five types of PNG image (see figure 4.5) :

a) Truecolour with alpha: the pixel consists of four channels. The first
three channels define the truecolour while the fourth channel defines
the alpha value. Apart from scaling, the PNG image is similar to the
representation of the reference image.

b) Greyscale with alpha: the pixel consists of two channels. The
first channel defines the greyscale and the second channel contains
the alpha value. Apart from scaling, the PNG image is similar to the
representation of the reference image.

c) Truecolour: the pixel consists of three channels containing
values defining the red, green and blue components in the RGB colour
model. Black is represented by all three channels containing the value 0
and white by all three channels containing the largest value. The alpha
channel is represented by a single value in the tRNS chunk defining the
colour value in the PNG image that is completely transparent. All other
colour values are completely opaque.

d) Greyscale: the pixel has a single channel which contains a value
relating to the luminance of the pixel, where a value of zero is black
and the largest value is white. The alpha channel is represented by a
single value in the tRNS chunk defining the greyscale value in the PNG
image that is completely transparent. All other greyscale values are
completely opaque.

e) Indexed-colour: the pixel has a single channel which contains an
index into a palette which specifies the colour of the pixel. The tRNS
chunk contains a table of alpha values. The table length is equal to
or less than the palette. If the table lengths are the same, the entry
in the tRNS chunk gives the alpha value for the corresponding entry
in the palette. If the alpha table in the tRNS chunk is smaller than
the palette, the entry in the tRNS chunk gives the alpha value for the
corresponding entry in the palette. For those entries in the palette
with no entry in the alpha table in the tRNS chunk, the alpha value for
the palette entry is the maximum alpha value representing completely
opaque.


Figure 4.6: Possible PNG image pixel types

4.5 Encoding the PNG image

4.5.1 Introduction

The process of encoding a PNG image is given in figure 4.7. The steps
refer to the operations on the array of pixels or indices in the PNG
image. The palette and tRNS chunk are not encoded in this way.


Figure 4.7: Encoding PNG image

a) Pass abstraction: to allow for progressive display, the PNG image is
converted into one or more images.

b) Scanline abstraction: the image is processed a scanline at a time.

c) Filtering: each scanline is transformed into a filtered scanline
using one of the defined filter types to prepare the scanline for image
compression.

d) Compression: occurs on all the filtered scanlines in the image.

e) Chunking: the compressed image is divided into conveniently sized
IDAT chunks. An error detection code is added to each chunk.

f) Datastream construction: the IDAT chunks are inserted into the
datastream.

4.5.2 Pass abstraction

Pass abstraction (see figure 4.8) splits a PNG image into a sequence
of reduced images where the first image defines a coarse view of the
original source image and subsequent images enhance this coarse view
until the last image completes the original source image. The set of
reduced images is also called an interlaced PNG image. Two interlace
methods are defined in this International Standard. The first method is
a null method; pixels are stored sequentially from left to right and
scanlines from top to bottom. The second method makes multiple passes
over the image to produce a sequence of seven reduced images. The seven
passes for a sample image are illustrated in figure 4.8. See clause
7: Interlacing and pass abstraction.


Figure 4.8: Pass abstraction

4.5.3 Scanline abstraction

PNG standardizes a number of filter methods and filter types that may
be used to prepare image data for compression. Scanline abstraction
depends on the chosen filter method and filter type. The filter method
defines a set of strategies for filtering. Within each filter method,
the filter type defines the specific filtering to be applied to a
specific scanline. The encoder shall use only a single filter method for
an interlaced PNG image, but may use different filter types for each
scanline in a reduced image. See clause 8: Filtering.

4.5.4 Filtering

Filtering transforms the byte sequence in a scanline to an equal length
sequence of bytes preceded by the filter type (see figure 4.9 for an
example).


Figure 4.9: Filtering a scanline

4.5.5 Compression

The sequence of filtered scanlines in the interlaced PNG image derived
from the reference image are compressed (see figure 4.10) by one of the
available compression methods. The concatenated filtered scanlines for
the interlaced PNG image form the input to the compression stage. The
output from the compression stage is a single compressed datastream. See
clause 9: Compression.


Figure 4.10: Compression

4.5.6 Chunking

Chunking provides a convenient breakdown of the compressed datastream
into manageable chunks (see figure 4.10). Each chunk has its own
redundancy check. See clause 10: Chunk specifications.

4.6 Ancillary information

Ancillary information may be associated with an image. Decoders may
ignore all or some of the ancillary information. The types of ancillary
information provided are described in Table 4.1.

Table 4.1 -- Ancillary information types

Type

Description

Background colour

colour to be used when presenting the image if no better option is
available.

Gamma and chromaticity

gamma characteristic of the image with respect to the original scene
together with chromaticity characteristics of the RGB values defined
in the source image may be provided. Display programs are strongly
encouraged to use this information, plus information about the display
device they are using and room lighting, to present the image to the
viewer in a way that reproduces what the image's original author saw
as closely as possible. For high-precision applications, the exact
chromaticity of the RGB data in an image can be specified, allowing more
accurate colour matching than gamma correction alone will provide. These
allow more realistic presentation of the source image. See Annex
C: Gamma and chromaticity.

ICC profile

description of the colour space (in the form of an International Color
Consortium ICC profile) to which the samples in the reference image
conform. Since not all applications may be capable of colour management,
it is recommended that separate gamma and chromaticity information
be provided for use by such applications, whenever ICC profiles are
written. PNG decoders that are capable of colour management shall
interpret ICC profiles and ignore any additional gamma and chromaticity
information provided in the PNG datastream.

Image histogram

frequency estimates of the usage of each entry in the palette by the PNG
image.

Physical pixel dimensions

intended pixel size and aspect ratio to be used in presenting the source
image.

Significant bits

gives information concerning the number of bits that are significant in
the channel values or the palette.

sRGB colour space

the sRGB colour space and the required rendering intent (as defined
by the International Color Consortium) may be specified. To allow PNG
datastreams to be sensibly interpreted when a colour management system
is not in use, it is recommended that separate gamma and chromaticity
information for the sRGB colour also be provided.

Suggested palette

a reduced palette or palettes may be provided for use when the display
device is not capable of displaying the full range of colours in the
image. Suggested palettes may appear for any type of image. A suggested
palette may be provided by the sPLT chunk, whic can also provide
histogram data. It is also possible to use the PLTE and hIST chunks for
this purpose, though this is not recommended.

Textual data

textual information associated with the source image that may be
compressed. International text, specified using the UTF-8 encoding of
the Unicode character set, may also be included.

Time

gives the time when the PNG image was last modified.

Transparency

provides alpha information that allows the reference image to be
reconstructed when the alpha channel is not retained in the PNG
image. Decoders may ignore the transparency information.

4.7 PNG datastream

4.7.1 Chunks

The PNG datastream consists of a PNG signature (see 5.2: PNG signature)
and a sequence of chunks (see clause 10: Chunk specifications). Each
chunk has a chunk type which specifies its function. The chunks defined
in 4.5.6: Chunking, for example, are image data chunks named IDAT
chunks.

Each chunk consists of four parts (see figure 4.11). Length is the
number of bytes in the chunk data part. The chunk type defines the
meaning of the chunk and its chunk properties. Some processing of chunks
is possible when only the chunk properties are known.


Figure 4.11: Chunk parts

The cyclic redundancy code (CRC) is calculated from the data in the
chunk type and chunk data and can be used to perform a check on the
consistency of that data.

4.7.2 Chunk types

There are 18 chunk types defined in this International Standard. The
first four are termed critical chunks that shall be understood and
correctly interpreted according to the provisions of this International
Standard.

a) IHDR: image header that is the first chunk in a PNG datastream.

b) PLTE: palette table associated with indexed PNG images.

c) IDAT: image data chunks.

d) IEND: image trailer that is the last chunk in a PNG datastream.

The remaining 14 chunk types are termed ancillary chunk types which
encoders may generate and decoders may interpret.

e) cHRM, gAMA, iCCP, sBIT, sRGB: provide additional information
concerning the colours in the source image.

f) bKGD, hIST, tRNS, pHYS,sPLT: provide additional information
concerning the source image and its presentation after decoding.

g) iTXt, tEXt, zTXt: provide textual information related to the source
image.

h) tIME: provides a time stamp indicating when the PNG image was last
modified.

4.8 Error handling

Errors in a PNG datastream will fall into three general classes:

a) transmission errors or damage to a computer file system, which will
tend to corrupt much or all of the datastream;

b) code errors, which appear as invalid values fields in chunks, missing
or misplaced chunks, or chunks unrecognized by the decoder.

c) unknown codes, codes or value fields not recognized by the
decoder. These could be private codes or registered codes not supported
by the decoder.

Examples of the transmission errors include transmission in "text" or
"ascii" mode, in which byte codes 13 and/or 10 may be added, removed or
converted throughout the datastream; unexpected termination, in which
the datastream is truncated; or a physical error on a storage device, in
which one or more blocks (typically 512 bytes each) will have garbled or
random values. Code errors include an invalid value for a row filter, an
invalid compression method, an invalid chunk length, the absence of a
PLTE chunk before IDAT in an indexed image, or the presence of multiple
gAMA chunks. PNG handles errors as follows:

d) Detect errors as early as possible using the PNG signature bytes
and CRCs on each chunk. A CRC should be checked before using the chunk
data. Sometimes this is impractical, for example when a streaming PNG
decoder is processing a large IDAT chunk, but even then the CRC should
be checked when the end of the chunk is reached.

e) Recover from an error, if possible; otherwise fail gracefully. Errors
that have little or no effect on the processing of the image may be
ignored, while those that affect critical data shall be dealt with in a
manner appropriate to the application.

f) Provide helpful messages describing errors, including recoverable
errors.

Three classes of PNG chunks are relevant to this philosophy. For the
purposes of this classification, an "unknown chunk" is either one whose
type was genuinely unknown to the decoder's author, or one that the
author chose to treat as unknown, because default handling of that chunk
type would be sufficient for the program's purposes. Other chunks are
called "known chunks". Given this definition, the three classes are as
follows:

g) known chunks, which necessarily includes all of the critical chunks
defined in this International Standard (IHDR, PLTE, IDAT, IEND)

h) unknown critical chunks (bit 5 of the first byte of the chunk type is
0)

i) unknown ancillary chunks (bit 5 of the first byte of the chunk type
is 1)

See 5.4: Chunk naming conventions for a full description of chunk naming
conventions.

PNG chunk names are marked "critical" or "ancillary" according to
whether the chunks are critical for the purpose of extracting a viewable
image (as with IHDR, PLTE and IDAT) or critical to understanding
the datastream structure (as with IEND). This is a specific kind
of criticality and one that is not necessarily relevant to every
conceivable decoder. For example, a program whose sole purpose is to
extract text annotations (for example, copyright information) does not
require a viewable image. Another decoder might consider tRNS and gAMA
essential to its proper execution.

Code errors always involve known chunks as code errors in unknown chunks
cannot be detected. The PNG decoder has to determine whether a code
error is fatal (unrecoverable) or not, depending on its requirements
and the situation. For example, most decoders can ignore an invalid
IEND chunk; a text-extraction program can ignore the absence of IDAT;
an image viewer cannot recover from an empty PLTE chunk in an indexed
image but it can ignore an invalid PLTE chunk in a truecolour image; and
a program that extracts the alpha channel can ignore an invalid gAMA
chunk, but may consider the presence of two tRNS chunks to be a fatal
error. Anomalous situations other than code errors shall be treated as
follows:

j) Encountering an unknown ancillary chunk is never an error. The chunk
can simply be ignored.

k) Encountering an unknown critical chunk is a fatal condition for any
decoder trying to extract the image from the datastream. By ignoring a
critical chunk, a decoder could not know whether the image it extracted
was the one intended by the encoder.

l) A PNG signature mismatch, a CRC mismatch, or an unexpected
end-of-stream indicates a corrupted datastream, and may be regarded
as a fatal error. A decoder could try to salvage something from the
datastream, but the extent of the damage will not be known.

m) Encountering an unexpected value in a known chunk. If the chunk is an
ancillary chunk, the entire chunk can be ignored. If it is a critical
chunk, the chunk is treated as an unknown critical chunk and is a fatal
condition.

When a fatal condition occurs, the decoder should fail immediately,
signal an error to the user if appropriate, and optionally continue
displaying any image data already visible to the user (i.e. "fail
gracefully"). The application as a whole need not terminate.

When a non-fatal error occurs, the decoder should signal a warning to
the user if appropriate, recover from the error, and continue processing
normally.

Decoders that do not compute CRCs should interpret apparent code errors
as indications of corruption (see also 12.2: Error checking).

Errors in compressed chunks (IDAT, zTXt, iTXt, iCCP) could lead to
buffer overruns. Implementors of deflate decompressors should guard
against this possibility.

4.9 Registration

For some facilities in PNG, there are a number of alternatives defined
and this International Standard allows other alternatives to be defined
by registration. According to the rules for the designation and
operation of registration authorities in the ISO/IEC Directives, the
ISO and IEC Councils have designated the following as the registration
authority:

The World-Wide Web Consortium Host at INRIA

The Registration Authority for PNG

INRIA- Sophia Antipolis

BP 93

06902 Sophia Antipolis Cedex

FRANCE

5 Datastream and data representation

5.1 Integers and byte order

This clause defines basic data representations used in reference and PNG
images, as well as the expected representation of the image data.

All integers that require more than one byte shall be in network
byte order (as illustrated in figure 5.1): the most significant byte
comes first, then the less significant bytes in descending order of
significance (MSB LSB for two-byte integers, MSB B2 B1 LSB for four-byte
integers). The highest bit (value 128) of a byte is numbered bit 7;
the lowest bit (value 1) is numbered bit 0. Values are unsigned unless
otherwise noted. Values explicitly noted as signed are represented in
two's complement notation.

Unless otherwise stated, four-byte unsigned integers are limited to the
range 0 to 231-1 and four-byte signed integers to the range -(231-1) to
231-1.


Figure 5.1: Number representation in PNG

5.2 PNG signature

A PNG datastream consists of a PNG signature followed by a sequence
of chunks. This clause defines the signature and the basic properties
of chunks. Individual chunk types are discussed in clause 10: Chunk
specifications. The first eight bytes of a PNG datastream always contain
the following (decimal) values:

   137 80 78 71 13 10 26 10

This signature indicates that the remainder of the datastream contains
a single PNG image, consisting of a series of chunks beginning with an
IHDR chunk and ending with an IEND chunk.

5.3 Chunk layout

Each chunk consists of four fields (see figure 4.11):

Length

A four-byte unsigned integer giving the number of bytes in the chunk's
data field. The length counts only the data field, not itself, the chunk
type, or the CRC. Zero is a valid length. Although encoders and decoders
should treat the length as unsigned, its value shall not exceed 231-1
bytes.

Chunk Type

A four-byte chunk type. Each byte of a chunk type is restricted to the
decimal values 65 to 90 and 97 to 122. These correspond to the uppercase
and lowercase ISO 646 letters (A-Z and a-z) respectively. Encoders
and decoders shall treat the chunk types as fixed binary values, not
character strings. For example, it would not be correct to represent the
chunk type IDAT by the equivalents of those letters in some different
character set. Additional naming conventions for chunk types are
discussed in 5.4: Chunk naming conventions.

Chunk Data

The data bytes appropriate to the chunk type, if any. This field can be
of zero length.

CRC

A four-byte CRC (Cyclic Redundancy Code) calculated on the preceding
bytes in the chunk, including the chunk type field and chunk data
fields, but not including the length field. The CRC is always present,
even for chunks containing no data. See 5.5: Cyclic Redundancy Code
algorithm.

The chunk data length may be any number of bytes up to the maximum;
therefore, implementors cannot assume that chunks are aligned on any
boundaries larger than bytes.

Chunks may appear in any order, subject to the restrictions placed on
each chunk type (see 5.6: Chunk ordering). (One notable restriction is
that IHDR shall appear immediately after the PNG signature and IEND
shall appear last; thus the IEND chunk serves as an end-of-datastream
marker.) Multiple chunks of the same type may appear, but only if
specifically permitted for that type.

5.4 Chunk naming conventions

Chunk type codes are assigned so that a decoder can determine some
properties of a chunk even when the type code is not recognized. These
rules allow safe, flexible extension of the PNG format. A PNG decoder
can determine an appropriate action when an unrecognized chunk type
is encountered. (The chunk types standardized in this International
Standard are defined in clause 10: Chunk specifications and the way
to define non-standard chunks is defined in clause 13: Editors and
extensions.) The naming rules are normally only of interest when the
decoder does not recognize the chunk's type.

Four bits of the type code, the property bits, namely bit 5 (value 32)
of each byte, are used to convey chunk properties. This choice means
that a human can read off the assigned properties according to whether
each letter of the type code is uppercase (bit 5 is 0) or lowercase (bit
5 is 1). However, decoders should test the properties of an unknown
chunk by numerically testing the specified bits; testing whether a
character is uppercase or lowercase is inefficient, and even incorrect
if a locale-specific case definition is used.

The property bits are an inherent part of the chunk name, and hence are
fixed for any chunk type. Thus, TEXT and Text would be unrelated chunk
type codes, not the same chunk with different properties.

The semantics of bit 5 of each byte are:

First byte: Ancillary bit

0 (uppercase) = critical, 1 (lowercase) = ancillary.

Chunks that are necessary for successful display of the datastream's
contents are critical chunks, for example the image header chunk (IHDR).

Chunks that are not strictly necessary in order to meaningfully display
the contents of the datastream are ancillary chunks, for example the
time chunk (tIME).

Second byte: Private bit

0 (uppercase) = public, 1 (lowercase) = private.

A public chunk is one that is defined in this International
Standard or is registered in the list of PNG special-purpose public
chunk types maintained by the Registration Authority (see 4.9
Registration). Applications can also define private (unregistered)
chunks for their own purposes. The names of private chunks have a
lowercase second letter, while public chunks will always be assigned
names with uppercase second letters. Decoders do not need to test the
private-chunk property bit, since it has no functional significance;
it is simply an administrative convenience to ensure that public and
private chunk names will not conflict. See clause 13: Editors and
extensions and 11.10.2: Use of private chunks.

Third byte: Reserved bit

0 (uppercase) in datastreams conforming to this version of PNG.

The significance of the case of the third letter of the chunk name is
reserved for possible future extension. In this International Standard,
all chunk names shall have uppercase third letters.

Fourth byte: Safe-to-copy bit

0 (uppercase) = unsafe to copy, 1 (lowercase) = safe to copy.

This property bit is not of interest to pure decoders, but it is needed
by PNG editors. This bit defines the proper handling of unrecognized
chunks in a datastream that is being modified. Rules for PNG editors are
discussed further in 13.2: Behaviour of PNG editors.

EXAMPLE The hypothetical chunk type name "bLOb" has the property bits:


   bLOb <-- 32 bit chunk type code represented in text form


   |||+- Safe-to-copy bit is 1 (lower case letter; bit 5 is 1)

   ||+-- Reserved bit is 0 (upper case letter; bit 5 is 0)

   |+--- Private bit is 0 (upper case letter; bit 5 is 0)

   +---- Ancillary bit is 1 (lower case letter; bit 5 is 1)

Therefore, this name represents an ancillary, public, safe-to-copy
chunk, conforming to this version of PNG.

5.5 Cyclic Redundancy Code algorithm

CRC fields are calculated using standardized CRC methods with pre and
post conditioning, as defined by ISO 3309 [ISO-3309] and ITU-T V.42
[ITU-T-V42]. The CRC polynomial employed is

x32 + x26 + x23 + x22 + x16 + x12 + x11 + x10 + x8 + x7 + x5 + x4 + x2 +
x + 1

In PNG, the 32-bit CRC is initialized to all 1's, and then the data from
each byte is processed from the least significant bit (1) to the most
significant bit (128). After all the data bytes are processed, the CRC
is inverted (its ones complement is taken). This value is transmitted
(stored in the datastream) MSB first. For the purpose of separating
into bytes and ordering, the least significant bit of the 32-bit CRC is
defined to be the coefficient of the x31 term.

Practical calculation of the CRC often employs a precalculated table to
accelerate the computation. See Annex D: Sample Cyclic Redundancy Code.

5.6 Chunk ordering

The constraints on the positioning of the individual chunks are listed
in Table 5.1 and illustrated diagrammatically in figure 5.2 and figure
5.3. These lattice diagrams represent the constraints on positioning
imposed by this International Standard. The lines in the diagrams define
partial ordering relationships. Chunks to the left shall appear before
chunks to the right. Chunks which are vertically aligned and appear
between two other chunk types may appear in any order between the two
chunk types to which they are connected. The superscript associated with
the chunk type indicates whether the chunk is mandatory, optional or may
appear more than once. A vertical bar between two chunk types indicates
alternative (see Table 5.2).

Table 5.1 -- Chunk ordering rules

Critical chunks (shall appear in this order, except PLTE is optional)

Chunk name

Multiple allowed

Ordering constraints

IHDR

No

Shall be first

PLTE

No

Before first IDAT

IDAT

Yes

Multiple IDAT chunks shall be consecutive

IEND

No

Shall be last

Ancillary chunks(need not appear in this order)

Chunk name

Multiple allowed

Ordering constraints

cHRM

No

Before PLTE and IDAT

gAMA

No

Before PLTE and IDAT

iCCP

No

Before PLTE and IDAT

sBIT

No

Before PLTE and IDAT

sRGB

No

Before PLTE and IDAT

bKGD

No

After PLTE; before IDAT

hIST

No

After PLTE; before IDAT

tRNS

No

After PLTE; before IDAT

pHYS

No

Before IDAT

sPLT

No

Before IDAT

tIME

No

None

tIME

No

None

iTXt

No

None

tEXt

No

None

zTXt

No

None


Table 5.2 -- Meaning of symbols used in lattice diagrams

Symbol

Meaning

+

One or many

1

Only one

0

Zero or one

*

Zero or many


Alternative


Figure 5.2: Lattice diagram: PNG images with palette in datastream


Figure 5.3: Lattice diagram: PNG images with no palette in datastream

6 Reference image to PNG image transformation

6.1 Introduction

This clause gives further details concerning the datatypes and
structures used in the transformation of the reference image to the PNG
image.

6.2 Colour types and values

There are five types of PNG image. PNG images may be either greyscale,
truecolour or indexed-colour. The greyscale and truecolour PNG images
may have an alpha channel. For indexed-colour images the alpha channel
may be specified by a tRNS chunk. The allowed PNG image types and
corresponding colour types are listed in Table 6.1. Colour types are
sums of the following values: 1 (palette used), 2 (truecolour used) and
4 (alpha used).

Table 6.1 -- PNG image types and colour types

PNG image type

Colour type

Greyscale, no alpha channel

0

Truecolour, no alpha channel

2

Indexed-colour

3

Greyscale with alpha channel

4

Truecolour with alpha channel

6

Greyscale samples represent luminance (if the gAMA chunk is present) or
device greyscale (if the gAMA chunk is absent).

Pixels in truecolour images consist of three sample values, RGB
(representing Red, Green and Blue). Pixels in indexed colour images
consist of a pointer index. Indexed-colour PNG images have a palette
consisting of RGB sample values for each entry in the palette. RGB
sample values represent device-independent colour information (if a
colour space definition is present) or uncalibrated device-dependent
colour information (if a colour space definition is absent).

Greyscale and RGB sample values range from zero (least intense) to the
maximum value for the sample depth (most intense). Note that the maximum
value at a given sample depth is (2sampledepth-1), not 2sampledepth. For
greyscale sample values, a value of zero is black and the maximum value
is white. For truecolour sample values and palette entries, the red,
green and blue values all set to zero represent black and the red, green
and blue values all set to the maximum value represent white.

Sample values are not necessarily proportional to light intensity;
the gAMA chunk specifies the relationship between sample values and
display output intensity. Viewers are strongly encouraged to compensate
properly. See 4.6: Ancillary information.

6.3 Scaling

Reference image sample values with a precision not directly supported
in PNG (for example, 5 bit/sample truecolour) shall be scaled up to a
higher supported bit depth, preferably the next higher supported bit
depth. The sample value precision for R, G, B, and alpha in colour
images may be different. Also, the sample value precision for greyscale
and alpha in greyscale images may be different. The possible precisions
for each type of PNG image are given in Table 6.2. Note that in the case
of indexed-colour, the indices in the image may have precision 1, 2, 4
or 8 but the sample values in the palette have precision 8.


Table 6.2 -- Allowed combinations of PNG image type, bit depth and
sample depth

PNG image type

Colour type

Allowed image bit depth

Allowed sample depths

Greyscale, no alpha channel

0

1, 2, 4, 8, 16

1, 2, 4, 8, 16

Truecolour, no alpha channel

2

8, 16

8, 16

Indexed-colour

3

1, 2, 4, 8

8

Greyscale with alpha channel

4

8, 16

8, 16

Truecolour, with alpha channel

6

8, 16

8, 16

Scaling is reversible with no loss of data, and scaling reduces the
number of cases that decoders have to cope with. See 11.5: Sample depth
scaling and 12.9: Sample depth rescaling.

6.4 Scanline

A PNG image is a rectangular pixel array, with pixels appearing
left-to-right within each scanline, and scanlines appearing
top-to-bottom. The size of each pixel is determined by the bit depth.

Pixels within a scanline in a PNG image are always packed into a
sequence of bytes with no wasted bits between pixels. Scanlines always
begin on byte boundaries. Permitted bit depths and colour types are
restricted so that in all cases the packing is simple and efficient.

Greyscale (without an alpha channel) and indexed-colour PNG images have
a single channel. The sample value for each pixel may have values of
precision less than a byte (1, 2 or 4 bits). These sample values are
packed into bytes with the leftmost sample value in the high-order bits
of a byte followed by the other sample values for the scanline.

If the scanline width does not result in the last byte being filled,
the low-order bits in the last byte of each scanline are not used. The
contents of these unused bits are not specified.

PNG images that are not indexed-colour images may have sample values
with a bit depth of 16. Such sample values are in network byte order
(MSB first, LSB second).

6.5 Alpha representation

Transparency of the PNG image may be defined in four different ways (see
4.3.2: Implied alpha and 4.3.5: Alpha compaction):

a) there may be an alpha channel as part of the PNG image;

b) the tRNS chunk may contain a table that effectively adds an alpha
entry to the palette;

c) an entry in the tRNS chunk may specify which pixels are transparent
and which are opaque;

d) no tRNS chunk is present and all pixels are fully opaque. Gamma
correction is not applied to the alpha sample values.

Alpha channels have samples of bit depth 8 or 16. Alpha samples are
represented with the same bit depth used for the image samples. The
alpha sample for each pixel is stored immediately following the
greyscale or RGB samples of the pixel. Alpha samples always represent a
linear fraction of full opacity.

An alpha value of zero represents full transparency, and a value of 28
- 1 or 216 - 1 represents a fully opaque pixel depending on whether
the sample is 8 or 16 bits. Intermediate values indicate partially
transparent pixels that can be composited with a background image to
yield a composite image.

The colour values stored for a pixel are not premultiplied by the
alpha value assigned to the pixel. This rule is sometimes called
"unassociated" or "non-premultiplied" alpha. (Another common technique
is to store sample values premultiplied by the alpha value; in effect,
such an image is already composited against a black background. PNG does
not use premultiplied alpha. In consequence an image editor can take a
PNG image and easily change its transparency.)

See 11.4: Alpha channel creation, and 12.13: Alpha channel processing.

6.6 Filtering

PNG allows the image data to be filtered before it is
compressed. Filtering can improve the compressibility of the data. The
filter step itself results in a sequence of bytes of the same size
as the incoming sequence, but in a different representation, plus a
one-byte filter type per scanline. Filtering does not reduce the size of
the actual scanline data. All PNG filters are strictly lossless.

Because different filter types can be used for different scanlines,
the filter algorithm is specified for each scanline by a filter
type byte that precedes the filtered scanline in the precompression
datastream. The filter type byte is not considered part of the image
data, but it is included in the datastream sent to the compression step.

See clause 8: Filtering.

6.7 Text strings

A PNG datastream may store text associated with the image, such as an
image description or copyright notice. Keywords are used to indicate
what each text string represents.

In the tEXt and zTXt chunks, it is recommended that the [ISO-8859-1]
(Latin-1) character set is used in text strings with newlines specified
by the LineFeed character (code 10). This character set is a superset
of 7-bit ASCII. Using other character codes is strongly discouraged as
their meaning is implementation dependent.

The iTXt chunk allows international text to be used, specified using the
UTF-8 encoding of the Unicode character set [ISO/IEC 10646-1].

The zTXt chunk contains compressed text. The iTXt chunk may optionally
contain compressed text. See 9.2: Other uses of compression.

7 Interlacing and pass abstraction

7.1. Introduction

Pass abstraction (see figure 4.8) splits a PNG image into a sequence
of reduced images (the interlaced PNG image) where the first image
defines a coarse view of the original source image and subsequent
images enhance this coarse view until the last image completes the PNG
image. This allows progressive display of the interlaced PNG image by
the decoder and allows images to "fade in" when they are being displayed
on-the-fly. On average, interlacing slightly expands the datastream
size, but it gives the user a meaningful display much more rapidly.

7.2. Interlace methods

Two interlace methods are defined in this International
Standard, methods 0 and 1. Other methods may be registered (see
4.9: Registration).

With interlace method 0, pixels are abstracted sequentially from left to
right, and scanlines sequentially from top to bottom. The interlaced PNG
image is a single reduced image.

Interlace method 1, known as Adam7, defines seven distinct passes over
the image. Each pass transmits a subset of the pixels in the source
image. The pass in which each pixel is transmitted (numbered from 1
to 7) is defined by replicating the following 8-by-8 pattern over the
entire image, starting at the upper left corner:

   1 6 4 6 2 6 4 6

   7 7 7 7 7 7 7 7

   5 6 5 6 5 6 5 6

   7 7 7 7 7 7 7 7

   3 6 4 6 3 6 4 6

   7 7 7 7 7 7 7 7

   5 6 5 6 5 6 5 6

   7 7 7 7 7 7 7 7

Figure 4.8 shows the seven passes of interlace method 1. Within each
pass, the selected pixels are transmitted left to right within a
scanline, and selected scanlines sequentially from top to bottom. For
example, pass 2 contains pixels 4, 12, 20, etc. of scanlines 0, 8, 16,
etc. (where scanline 0, pixel 0 is the upper left corner). The last
pass contains all of scanlines 1, 3, 5, etc. The transmission order is
defined so that all the scanlines transmitted in a pass will have the
same number of pixels; this is necessary for proper application of some
of the filters. The interlaced PNG image consists of a sequence of seven
reduced images. For example, if the PNG image is 16 by 16 pixels, then
the third pass will be a reduced image of two scanlines, each containing
four pixels.

Scanlines that do not completely fill an integral number of bytes are
padded as defined in 6.4: Scanline.

NOTE: If the reference image contains fewer than five columns or fewer
than five rows, some passes will be empty.

8 Filtering

8.1 Filter methods and filter types

A filter transforms the PNG image for optimum compression. PNG provides
filter methods and a set of filter types associated with each filter
method. All the reduced images in an interlaced image shall use a single
filter method, but individual scanlines in each reduced image may use
different filter types defined by that filter method. Only filter method
0 is defined by this International Standard. Other filter methods may be
registered (see 4.9 Registration).

PNG imposes no additional restriction on which filter types can be
applied to an interlaced PNG image. However, the filters are not equally
effective on all types of data.

See 11.8: Filter selection.

Filtering transforms the byte sequence in a scanline to an equal length
sequence of bytes preceded by the filter type (see figures 8.1 and
8.2). Filter type bytes are only associated with non-empty scanlines. No
filter type bytes are present in an empty pass. See 12.7: Interlacing
and progressive display.

8.2 Filter types for filter method 0

Filtering algorithms are applied to bytes, not to pixels, regardless
of the bit depth or colour type of the image. The filters operate on
the byte sequence formed by a scanline that has been represented as
described in 6.4: Scanline. If the image includes an alpha channel, the
alpha data is filtered in the same way as the image data.

Filters may use the original values of the following bytes to generate
the new byte value:

x

the value of the byte being filtered

a

the value of the corresponding byte in the pixel immediately before the
byte being filtered (or the entire byte being filtered, when the bit
depth is 8 or less)

b

the value of the corresponding byte in the previous scanline

c

the value of the corresponding byte in the pixel immediately before
the byte being filtered in the previous scanline (or the value of the
entire byte immediately before the byte being filtered in the previous
scanline, when the bit depth is 8 or less)

Figure 8.1 shows the positions of the a, b and c pixels (or bytes).

PNG filter method 0 defines five basic filter types as listed in Table
8.1. Orig(y) denotes the orginal (unfiltered) value of byte y. Filt(y)
denotes the value after a filter has been applied. Recon(y) denotes
the value after the corresponding reconstruction function has been
applied. The filter function for the Paeth type PaethPredictor is
defined below.

Filter method 0 in IHDR specifies exactly this set of five filter
types and these shall not be extended by Registration (see 4.9
Registration). If additional filter types are proposed for registration,
a different filter method number will be assigned to the extended set by
the Registration Authority. This ensures that decoders for filter method
0 need not decompress the data to discover that it contains unsupported
filter types.

Table 8.1 -- Filter types

Type

Name

Filter Function

Reconstruction Function

0

None

Filt(x) = Orig(x)

Recon(x) = Filt(x)

1

Sub

Filt(x) = Orig(x) - Orig(a)

Recon(x) = Filt(x) + Recon(a)

2

Up

Filt(x) = Orig(x) - Orig(b)

Recon(x) = Filt(x) + Recon(b)

3

Average

Filt(x) = Orig(x)

  - floor((Orig(a)+Orig(b))/2)

Recon(x) = Filt(x) +

  floor((Recon(a)+Recon(b))/2)

4

Paeth

Filt(x) = Orig(x) -

  PaethPredictor(Orig(a),

           Orig(b),Orig(c))

Recon(x) = Filt(x) +

  PaethPredictor(Recon(a),

       Recon(b),Recon(c))

For all filters, the bytes "to the left of" the first pixel in a
scanline shall be treated as being zero. For filters that refer to the
prior scanline, the entire prior scanline and bytes "to the left of" the
first pixel in the prior scanline shall be treated as being zeroes for
the first scanline of a reduced image.

To reverse the effect of a filter requires the decoded values of the
prior pixel on the same scanline, the pixel immediately above the
current pixel on the prior scanline, and the pixel just to the left of
the pixel above.

Unsigned arithmetic modulo 256 is used, so that both the inputs and
outputs fit into bytes. The sequence of Filt values is transmitted as
the filtered scanline.

8.3 Filter type 3: Average

The sum Orig(a)+Orig(b) shall be formed without overflow (using at least
nine-bit arithmetic). floor() indicates that the result of the division
is rounded to the next lower integer if fractional; in other words, it
is an integer division or right shift operation.

8.4 Filter type 4: Paeth

The Paeth filter function computes a simple linear function of the three
neighbouring pixels (left, above, upper left), then chooses as predictor
the neighbouring pixel closest to the computed value. The algorithm used
in this International Standard is an adaptation of the technique due to
Alan W. Paeth [PAETH].

The PaethPredictor function is defined in figure 8.1 and is applied to
each byte regardless of bit depth.


Figure 8.1: The PaethPredictor function

The calculations within the PaethPredictor function shall be performed
exactly, without overflow.

The order in which the comparisons are performed is critical and shall
not be altered. This corresponds to testing pixels in the order: pixel
to the left, pixel above, pixel to the upper left and tries to establish
in which of the three directions: vertical, horizontal or diagonal, the
gradient of the image is smallest.

Exactly the same PaethPredictor function is used by both encoder and
decoder.

9 Compression

9.1 Compression of filtered scanlines

PNG allows for a number of compression methods to be defined. Only PNG
compression method 0 is defined by this International Standard. Other
compression methods may be Registered (see 4.9: Registration). PNG
compression method 0 is deflate/inflate compression with a sliding
window of at most 32768 bytes. Deflate compression is an LZ77 derivative
[ZL].

Deflate-compressed datastreams within PNG are stored in the "zlib"
format, which has the structure:


zlib compression method/flags code

1 byte

Additional flags/check bits

1 byte

Compressed data blocks

n bytes

Check value

4 bytes

Further details on this format are given in the zlib specification
[RFC-1950].

For PNG compression method 0, the zlib compression method/flags code
shall specify method code 8 ("deflate" compression) and an LZ77 window
size of not more than 32768 bytes. The zlib compression method number
is not the same as the PNG compression method number entry in the IHDR
chunk (see 10.2.2 IHDR Image header). The additional flags shall not
specify a preset dictionary.

If the data to be compressed contains 16384 bytes or fewer, the PNG
encoder may set the window size by rounding up to a power of 2 (256
minimum). This decreases the memory required for both encoding and
decoding, without adversely affecting the compression ratio.

The compressed data within the zlib datastream are stored as a
series of blocks, each of which can represent raw (uncompressed)
data, LZ77-compressed data encoded with fixed Huffman codes, or
LZ77-compressed data encoded with custom Huffman codes. A marker bit in
the final block identifies it as the last block, allowing the decoder
to recognize the end of the compressed datastream. Further details on
the compression algorithm and the encoding are given in the deflate
specification [RFC-1951].

The check value stored at the end of the zlib datastream is calculated
on the uncompressed data represented by the datastream. The algorithm
used to calculate this is not the same as the CRC calculation used for
PNG chunk CRC field values. The zlib check value is useful mainly as
a cross-check that the deflate and inflate algorithms are implemented
correctly. Verifying the chunk CRCs provides confidence that the PNG
datastream has been transmitted undamaged.

In a PNG datastream, the concatenation of the contents of all the IDAT
chunks makes up a zlib datastream. This datastream decompresses to
filtered image data.

It is important to emphasize that the boundaries between IDAT chunks are
arbitrary and can fall anywhere in the zlib datastream. There is not
necessarily any correlation between IDAT chunk boundaries and deflate
block boundaries or any other feature of the zlib data. For example, it
is entirely possible for the terminating zlib check value to be split
across IDAT chunks.

Similarly, there is no required correlation between the structure of the
image data (i.e., scanline boundaries) and deflate block boundaries or
IDAT chunk boundaries. The complete filtered PNG image is represented by
a single zlib datastream that is stored in a number of IDAT chunks.

9.2 Other uses of compression

PNG also uses zlib datastreams in iTXt, iCCP and zTXt chunks. Unlike
the image data, such datastreams are not split across chunks; each such
chunk contains an independent zlib datastream.

10 Chunk specifications

10.1 Introduction

The PNG datastream format consists of a PNG signature (see 5.2: PNG
signature) followed by a sequence of chunks. Each chunk has a chunk
type which specifies its function. This clause defines the types
of PNG chunks standardized in this International Standard. The PNG
datastream structure is defined in clause 5: Datastream and data
representation. This also defines the order in which chunks may
appear. For details specific to encoders see 11.10.1: Text chunk
processing. For details specific to decoders see 12.4.3: Text chunk
processing.

10.2 Critical chunks

10.2.1 General

Critical chunks are those chunks that are absolutely required in order
to successfully decode a PNG image in a PNG datastream. Extension
chunks may be defined as critical chunks (see clause 13: Editors and
extensions), though this practice is strongly discouraged.

A valid PNG datastream shall contain an IHDR chunk immediately following
the PNG signature, one or more IDAT chunks, and an IEND chunk as the
last chunk. Only one IHDR chunk and one IEND chunk are allowed in a PNG
datastream.

10.2.2 IHDR Image header

The four-byte chunk type field contains the decimal values

73 72 68 82

The IHDR chunk shall be the first chunk in the PNG datastream. It
contains:


Width

4 bytes

Height

4 bytes

Bit depth

1 byte

Colour type

1 byte

Compression method

1 byte

Filter method

1 byte

Interlace method

1 byte

Width and height give the image dimensions in pixels. They are four-byte
integers. Zero is an invalid value. The maximum for each is 231-1 in
order to accommodate programming languages that have difficulty with
unsigned four-byte values.

Bit depth is a single-byte integer giving the number of bits per sample
or per palette index (not per pixel). See 6.2: Colour types and values.

Colour type is a single-byte integer that describes the interpretation
of the image data. Valid values are 0, 2, 3, 4, and 6. See 6.2: Colour
types and values

Bit depth restrictions for each colour type are imposed to simplify
implementations and to prohibit combinations that do not compress
well. The allowed combinations are defined in Table 10.1.

Table 10.1 -- Allowed combinations of colour type and bit depth

Colour type

Allowed bit depths

Interpretaion

0

1, 2, 4, 8, 16

Each pixel is a greyscale sample

2

8, 16

Each pixel is an R,G,B triple

3

1, 2, 4, 8

Each pixel is a palette index; a PLTE chunk shall appear.

4

8, 16

Each pixel is a greyscale sample, followed by an alpha sample.

6

8, 16

Each pixel is an R,G,B triple, followed by an alpha sample.

The sample depth is the same as the bit depth except in the case
of colour type 3, in which the sample depth is always 8 bits (see
6.3: Scaling).

Compression method is a single-byte integer that indicates the
method used to compress the image data. Only compression method 0
(deflate/inflate compression with a 32K sliding window) is defined
in this International Standard. All conforming PNG images shall be
compressed with this scheme. Other schemes may be registered with the
Registration Authority (see 4.9: Registration).

Filter method is a single-byte integer that indicates the preprocessing
method applied to the image data before compression. Only filter method
0 (adaptive filtering with five basic filter types) is defined in this
International Standard. See clause 8: Filtering for details.

Interlace method is a single-byte integer that indicates the
transmission order of the image data. Two values are defined in this
International Standard: 0 (no interlace) or 1 (Adam7 interlace). See
clause 7: Interlacing and pass abstraction for details.

10.2.3 PLTE Palette

The four-byte chunk type field contains the decimal values

80 76 84 69

The PLTE chunk contains from 1 to 256 palette entries, each a three-byte
series of the form:


Red

1 byte

Green

1 byte

Blue

1 byte

The number of entries is determined from the chunk length. A chunk
length not divisible by 3 is an error.

This chunk shall appear for colour type 3, and may appear for colour
types 2 and 6; it shall not appear for colour types 0 and 4. There shall
not be more than one PLTE chunk.

For colour type 3 (indexed-colour), the PLTE chunk is required. The
first entry in PLTE is referenced by pixel value 0, the second by pixel
value 1, etc. The number of palette entries shall not exceed the range
that can be represented in the image bit depth (for example, 24 = 16 for
a bit depth of 4). It is permissible to have fewer entries than the bit
depth would allow. In that case, any out-of-range pixel value found in
the image data is an error.

For colour types 2 and 6 (truecolour and truecolour with alpha), the
PLTE chunk is optional. If present, it provides a suggested set of
colours (from 1 to 256) to which the truecolour image can be quantized
if it cannot be displayed directly. It is, however, recommended
that the sPLT chunk be used for this purpose, rather than the PLTE
chunk. If neither PLTE nor sPLT chunks are present and the image cannot
be displayed directly quantization has to be done by the viewing
system. However, it is often preferable that the selection of colours is
done once by the PNG encoder. (See 11.6: Suggested palettes.)

Note that the palette uses 8 bits (1 byte) per sample regardless of the
image bit depth specification. In particular, the palette is 8 bits deep
even when it is a suggested quantization of a 16-bit truecolour image.

There is no requirement that the palette entries all be used by the
image, nor that they all be different.

10.2.4 IDAT Image data

The four-byte chunk type field contains the decimal values

73 68 65 84

The IDAT chunk contains the actual image data which is the output
stream of the compression algorithm. See clause 8: Filtering and clause
9: Compression for details.

There may be multiple IDAT chunks; if so, they shall appear
consecutively with no other intervening chunks. The compressed
datastream is then the concatenation of the contents of all the IDAT
chunks.

10.2.5 IEND Image trailer

The four-byte chunk type field contains the decimal values

73 69 78 68

The IEND chunk marks the end of the PNG datastream. The chunk's data
field is empty.

10.3 Ancillary chunks

10.3.1 General

The ancillary chunks defined in this International Standard are listed
in the order in 4.7.2: Chunk types. This is not the order in which
they appear in a PNG datastream. Ancillary chunks may be ignored by a
decoder. For each ancillary chunk, the actions described are on the
assumption that the decoder is not ignoring the chunk.

10.3.2 Additional colour information

10.3.2.1 cHRM Primary chromaticities and white point

The four-byte chunk type field contains the decimal values

99 72 82 77

The cHRM chunk may be used to specify the 1931 CIE x,y chromaticities
of the red, green, and blue primaries used in the image, and the
referenced white point. See Annex C: Gamma and chromaticity for more
information. The iCCP and sRGB chunks provide more sophisticated support
for colour management and control.

The cHRM chunk contains:


White point x

4 bytes

White point y

4 bytes

Red x

4 bytes

Red y

4 bytes

Green x

4 bytes

Green y

4 bytes

Blue x

4 bytes

Blue y

4 bytes

Each value is encoded as a four-byte unsigned integer, representing the
x or y value times 100000.

EXAMPLE A value of 0.3127 would be stored as the integer 31270.

cHRM is allowed in all PNG datastreams, although it is of little value
for greyscale images.

An sRGB chunk or iCCP chunk when present and recognized, overrides the
cHRM chunk.

10.3.2.2 gAMA Image gamma

The four-byte chunk type field contains the decimal values

103 65 77 65

The gAMA chunk specifies the relationship between the image samples and
the desired display output intensity.

The gAMA chunk contains:


Image gamma

4 bytes

The value is encoded as a four-byte unsigned integer, representing gamma
times 100000.

EXAMPLE A gamma of 0.45 would be stored as the integer 45000.

10.3.2.3 iCCP Embedded ICC profile

The four-byte chunk type field contains the decimal values

105 67 67 80

The iCCP chunk contains:


Profile name

1-79 bytes (character string)

Null separator

1 byte (null character)

Compression method

1 byte

Compressed profile

0-n bytes

The profile name may be any convenient name for referring to the
profile. It is case-sensitive and subject to the same restrictions
as the keyword parameter for the tEXt chunk. Profile names shall
contain only printable Latin-1 characters and spaces (only character
codes 32-126 and 161-255 decimal are allowed). Leading, trailing and
consecutive spaces are not permitted. The only compression method
defined in this International Standard is method 0 (deflate/inflate
compression). The compression method entry is followed by a compressed
datastream that makes up the remainder of the chunk. Decompression of
this datastream yields the ICC profile.

If the iCCP chunk is present, the image samples conform to the colour
space represented by the embedded ICC profile as defined by the
International Colour Consortium [ICC]. The colour space of the ICC
profile shall be an RGB colour space for colour images (PNG colour
types 2, 3 and 6), or a greyscale colour space for greyscale images
(PNG colour types 0 and 4). A PNG encoder that writes the iCCP chunk is
encouraged to also write gAMA and cHRM chunks that approximate the ICC
profile, to provide compatibility with applications that do not use the
iCCP chunk. When the iCCP chunk is present, PNG decoders that recognize
it and are capable of colour management [ICC] shall ignore the gAMA and
cHRM chunks and use the iCCP chunk instead and interpret it according to
[ICC-1] and [ICC-1A]. PNG decoders that are incapable of full-fledged
colour management shall use the gAMA and cHRM chunks if present.

A PNG datastream shall contain at most one embedded profile, whether
specified explicitly with an iCCP chunk or implicitly with an sRGB
chunk.

10.3.2.4 sBIT Significant bits

The four-byte chunk type field contains the decimal values

115 66 73 84

To simplify decoders, PNG specifies that only certain sample depths may
be used, and further specifies that sample values should be scaled to
the full range of possible values at the sample depth. However, the sBIT
chunk is provided in order to store the original number of significant
bits (which is less than the sample depth). This allows PNG decoders to
recover the original data losslessly even if the data had a sample depth
not directly supported by PNG.

The sBIT chunk contains:


Colour type 0


significant source image greyscale bits

1 byte

Colour types 2 and 3


significant source image red bits

1 byte

significant source image green bits

1 byte

significant source image blue bits

1 byte

Colour type 4


significant greyscale bits

1 byte

significant source image alpha bits

1 byte

Colour type 6


significant source image red bits

1 byte

significant source image green bits

1 byte

significant source image blue bits

1 byte

significant source image alpha bits

1 byte

Each depth specified in sBIT shall be greater than zero and less than or
equal to the sample depth (which is 8 for indexed-colour images, and the
bit depth given in IHDR for other colour types).

10.3.2.5 sRGB Standard RGB colour space

The four-byte chunk type field contains the decimal values

115 82 71 66

If the sRGB chunk is present, the image samples conform to the sRGB
colour space [sRGB] and should be displayed using the specified
rendering intent defined by the International Colour Consortium [ICC-1]
and [ICC-1A].


Rendering intent

1 byte

The following values are defined for rendering intent:


0

Perceptual

for images preferring good adaptation to the output device gamut at the
expense of colorimetric accuracy, such as photographs.

1

Relative colorimetric

for images requiring colour appearance matching (relative to the output
device white point) such as logos.

2

Saturation

for images preferring preservation of saturation at the expense of hue
and lightness, such as charts and graphs.

3

Absolute colorimetric

for images requiring preservation of absolute colorimetry, such as
previews of images destined for a different output device (proofs).

It is recommended that a PNG encoder that writes the sRGB chunk also
writes a gAMA chunk (and optionally a cHRM chunk) for compatibility with
encoders that do not use the sRGB chunk. Only the following values shall
be used.

gAMA

Gamma

45455

cHRM


White point x

31270

White point y

32900

Red x

64000

Red y

33000

Green x

30000

Green y

60000

Blue x

15000

Blue y

6000

When the sRGB chunk is present, it is recommended that decoders that
recognize it and are capable of colour management [ICC] ignore the gAMA
and cHRM chunks and use the sRGB chunk instead. Decoders that recognize
the sRGB chunk but are not capable of full-fledged colour management are
recommended to ignore the gAMA and cHRM chunks and use the values given
above as if they had appeared in gAMA and cHRM chunks.

It is recommended that the sRGB and iCCP chunks do not both appear in a
PNG datastream.

10.3.3 Additional source image information

10.3.3.1 bKGD Background colour

The four-byte chunk type field contains the decimal values

98 75 71 68

The bKGD chunk specifies a default background colour to present the
image against. If there is any other preferred background, either
user-specified or part of a larger page (as in a browser), the bKGD
chunk should be ignored. The bKGD chunk contains:


Colour types 0 and 4


Greyscale

2 bytes

Colour types 2 and 6


Red

2 bytes

Green

2 bytes

Blue

2 bytes

Colour type 3


Palette index

1 byte

For colour type 3 (indexed-colour), the value is the palette index of
the colour to be used as background.

For colour types 0 and 4 (greyscale, with or without alpha), the
value is the grey level to be used as background in the range 0
to (2bitdepth)-1. For colour types 2 and 6 (truecolour, with or
without alpha), the values give the RGB colour in the range 0 to
(2bitdepth)-1. In each case, for consistency, two bytes per sample are
used regardless of the image bit depth. If the image bit depth is less
than 16, the least significant bits are used and the others are 0.

10.3.3.2 hIST Image histogram

The four-byte chunk type field contains the decimal values

104 73 83 84

The hIST chunk gives the approximate usage frequency of each colour
in the palette. A histogram chunk can appear only when a PLTE chunk
appears. If a viewer is unable to provide all the colours listed in the
palette, the histogram may help it decide how to choose a subset of the
colours for display.

The hIST chunk contains a series of two-byte (16 bit) unsigned
integers. There shall be exactly one entry for each entry in the PLTE
chunk. Each entry is proportional to the fraction of pixels in the image
that have that palette index; the exact scale factor is chosen by the
encoder.

Histogram entries are approximate, with the exception that a zero entry
specifies that the corresponding palette entry is not used at all in the
image. A histogram entry shall be nonzero if there are any pixels of
that colour.

NOTE When the palette is a suggested quantization of a truecolour image,
the histogram is necessarily approximate, since a decoder may map pixels
to palette entries differently than the encoder did. In this situation,
zero entries should not normally appear; it is expected that all entries
will be used.

10.3.3.3 tRNS Transparency

The four-byte chunk type field contains the decimal values

116 82 78 83

The tRNS chunk specifies either alpha values that are associated with
palette entries (for indexed-colour images) or a single transparent
colour (for greyscale and truecolour images). The tRNS chunk contains:


colour type 0


Grey sample value

2 bytes

colour type 2


Red sample value

2 bytes

Blue sample value

2 bytes

Green sample value

2 bytes

colour type 3


Alpha for palette index 0

1 byte

Alpha for palette index 1

1 byte

...etc

1 byte

For colour type 3 (indexed-colour), the tRNS chunk contains a series of
one-byte alpha values, corresponding to entries in the PLTE chunk. Each
entry indicates that pixels of the corresponding palette index shall
be treated as having the specified alpha value. Alpha values have the
same interpretation as in an 8-bit full alpha channel: 0 is fully
transparent, 255 is fully opaque, regardless of image bit depth. The
tRNS chunk shall not contain more alpha values than there are palette
entries, but tRNS may contain fewer values than there are palette
entries. In this case, the alpha value for all remaining palette entries
is assumed to be 255. In the common case in which only palette index 0
need be made transparent, only a one-byte tRNS chunk is needed.

For colour types 0 or 2, two bytes are used regardless of the image
bit depth (see 5.1: Integers and byte order). Pixels of the specified
grey sample value or RGB sample values are specified as transparent
(equivalent to alpha value 0); all other pixels are to be treated as
fully opaque (alpha value 2bitdepth-1). If the image bit depth is less
than 16, the least significant bits are used and the others are 0.

A tRNS shall not appear for colour types 4 and 6, since a full alpha
channel is already present in those cases.

10.3.3.4 pHYs Physical pixel dimensions

The four-byte chunk type field contains the decimal values

112 72 89 115

The pHYs chunk specifies the intended pixel size or aspect ratio for
display of the image. It contains:


Pixels per unit, X axis

4 bytes (unsigned integer)

Pixels per unit, Y axis

4 bytes (unsigned integer)

Unit specifier

1 byte

The following values are defined for the unit specifier:


0

unit is unknown

1

unit is the metre

When the unit specifier is 0, the pHYs chunk defines pixel aspect ratio
only; the actual size of the pixels remains unspecified.

If this ancillary chunk is not present, pixels are assumed to be square,
and the physical size of each pixel is unspecified.

10.3.3.5 sPLT Suggested palette

The four-byte chunk type field contains the decimal values

115 80 76 84

The sPLT chunk contains a null-terminated text string that names the
palette and a one-byte sample depth, followed by a series of palette
entries, each a six-byte or ten-byte series containing five unsigned
integers (red, green, blue, alpha and frequency):


Palette name

1-79 bytes (character string)

Null terminator

1 byte (null character)

Sample depth

1 byte

Red

1 or 2 bytes

Green

1 or 2 bytes

Blue

1 or 2 bytes

Alpha

1 or 2 bytes

Frequency

2 bytes

... etc ...


There may be any number of entries. A PNG decoder determines the number
of entries from the length of the chunk remaining after the sample depth
byte. This shall be divisible by 6 if the sPLT sample depth is 8, or by
10 if the sPLT sample depth is 16. Entries shall appear in decreasing
order of frequency. There is no requirement that the entries all be used
by the image, nor that they all be different.

The palette name can be any convenient name for referring to the palette
(for example "256 colour including Macintosh default", "256 colour
including Windows-3.1 default", "Optimal 512"). The choice of the most
appropriate suggested palette may be aided by the inclusion of more than
one in a PNG datastream.

The palette name is case-sensitive and subject to the same restrictions
as the keyword parameter for the tEXt chunk. Palette names shall
contain only printable Latin-1 characters and spaces (only character
codes 32-126 and 161-255 decimal are allowed). Leading, trailing and
consecutive spaces are not permitted.

The sPLT sample depth shall be 8 or 16.

The red, green, blue and alpha samples are either one or two bytes
each, depending on the sPLT sample depth, regardless of the image bit
depth. The colour samples are not premultiplied by alpha, nor are they
precomposited with any background. An alpha value of 0 means fully
transparent, whilst an alpha value of 255 (when the sPLT sample depth is
8) or 216 - 1 (when the sPLT sample depth is 16) means fully opaque. The
palette samples are in the same colour space as the colour samples of
the PNG image.

Each frequency value is proportional to the fraction of pixels in the
image that are closest to that palette entry in RGBA space, before the
image has been composited against any background. The exact scale factor
is chosen by the PNG encoder; it is recommended that the resulting
range of individual values reasonably fills the range 0 to 65535. A PNG
encoder may artificially inflate the frequencies for colours considered
to be "important", for example the colours used in a logo or the facial
features of a portrait. Zero is a valid frequency meaning that the
colour is "least important" or that it is rarely, if ever, used. When
all the frequencies are zero, they are meaningless, that is to say,
nothing may be inferred about the actual frequencies with which the
colours appear in the PNG image.

Multiple sPLT chunks are permitted, but each shall have a different
palette name.

10.3.4 Textual information

10.3.4.1 iTXt International textual data

The four-byte chunk type field contains the decimal values

105 84 88 116

The iTXt chunk contains textual data, just as tEXt does; however, iTXt
allows international text, specified using the UTF-8 encoding of the
Unicode character set [ISO/IEC 10646-1] to be included. This chunk also
allows text to be compressed.

An iTXt chunk contains:


Keyword

1-79 bytes (character string)

Null separator

1 byte (null character)

Compression flag

1 byte

Compression method

1 byte

Language tag

0-n bytes (character string)

Null separator

1 byte (null character)

Translated keyword

0-n bytes

Null separator

1 byte (null character)

Text

0-n bytes

The keyword is the same as in the tEXt chunk (see 10.3.4.2: tEXt Textual
data).

The compression flag is 0 for uncompressed text, 1 for compressed
text. Only the text field may be compressed. The compression method
entry defines the compression method used. The only compression
method defined in this International Standard is 0 (deflate/inflate
compression). For uncompressed text, encoders shall set the compression
method to 0 and decoders shall ignore it.

The language tag [RFC-1766] indicates the human language used by the
translated keyword and the text. Unlike the keyword, the language tag is
case-insensitive. It is an ISO 646 string consisting of hyphen-separated
words of 1-8 letters each (for example cn, en-uk, no-bok, x-klingon,
x-KlInGoN). If the first word is two letters long, it is an ISO
language code [ISO-639]. If the language tag is empty, the language is
unspecified.

The translated keyword and text both use the UTF-8 encoding of the
Unicode character set [ISO/IEC 10646-1], and neither shall contain a
zero byte (null character). The text, unlike other strings, is not
null-terminated; its length is implied by the chunk length.

Line breaks shall not appear in the translated keyword. In the text, a
newline shall be represented by a single line feed character (decimal
10). The remaining control characters (1-9, 11-31, 127-159) are
discouraged in both the translated keyword and text. In UTF-8 there is
a difference between the characters 128-159 (which are discouraged) and
the bytes 128-159 (which are often necessary).

The translated keyword, if not empty, shall contain a translation
of the keyword into the language indicated by the language tag, and
applications displaying the keyword should display the translated
keyword in addition.

10.3.4.2 tEXt Textual data

The four-byte chunk type field contains the decimal values

116 69 88 116

Textual information that the encoder wishes to record with the image can
be stored in tEXt chunks. Each tEXt chunk contains a keyword and a text
string, in the format:


Keyword

1-79 bytes (character string)

Null character

1 byte (null character)

Text string

0-n bytes (character string)

Neither the keyword nor the text string may contain a null
character. Note that the text string is not null-terminated (the length
of the chunk defines the ending). The text string may be of any length
from zero bytes up to the maximum permissible chunk size less the length
of the keyword and null character separator.

Any number of tEXt chunks may appear, and more than one with the same
keyword is permissible.

The keyword indicates the type of information represented by the text
string. The following keywords are predefined and should be used where
appropriate:


Title

Short (one line) title or caption for image

Author

Name of image's creator

Description

Description of image (possibly long)

Copyright

Copyright notice

Creation Time

Time of original image creation

Software

Software used to create the image

Disclaimer

Legal disclaimer

Warning

Warning of nature of content

Source

Device used to create the image

Comment

Miscellaneous comment

For the Creation Time keyword, the date format defined in section 5.2.14
of RFC 1123 is suggested, but not required [RFC-1123].

Other keywords may be invented for other purposes. Keywords of general
interest can be registered with the PNG Registration Authority (see 4.9
Registration). However, it is also permitted to use private unregistered
keywords. (Private keywords should be reasonably self-explanatory, in
order to minimize the chance that the same keyword will be used for
incompatible purposes by different people.)

Both keyword and text are interpreted according to the ISO 8859-1
(Latin-1) character set [ISO-8859-1]. The text string can contain any
Latin-1 character. Newlines in the text string should be represented
by a single linefeed character (decimal 10); use of other control
characters in the text is not recommended as their interpretation will
be implementation dependent.

Keywords shall contain only printable Latin-1 characters and spaces;
that is, only character codes 32-126 and 161-255 decimal are allowed. To
reduce the chances for human misreading of a keyword, leading and
trailing spaces are not permitted, as are consecutive spaces. Also the
non-breaking space (code 160) is not permitted in keywords, since it is
visually indistinguishable from an ordinary space.

Keywords shall be spelled exactly as registered, so that decoders can
use simple literal comparisons when looking for particular keywords. In
particular, keywords are considered case-sensitive.

10.3.4.3 zTXt Compressed textual data

The four-byte chunk type field contains the decimal values

122 84 88 116

The zTXt chunk contains textual data, just as tEXt does; however, zTXt
takes advantage of compression. zTXt and tEXt chunks are semantically
equivalent, but zTXt is recommended for storing large blocks of text.

A zTXt chunk contains:


Keyword

1-79 bytes (character string)

Null character

1 byte (null character)

Compression method

1 byte

Compressed text

0-n bytes (character string)

The keyword and null character are the same as the tEXt chunk (see
10.3.4.2: tEXt Textual data). The keyword is not compressed. The
compression method entry defines the compression method used. The only
value defined in this International Standard is 0 (deflate/inflate
compression). Other values may be registered with the Registration
Authority (see 4.9 Registration). The compression method entry is
followed by a compressed datastream that makes up the remainder of the
chunk. For compression method 0, this datastream adheres to the zlib
datastream format (see 9.2: Other uses of compression). Decompression of
this datastream yields Latin-1 text that is identical to the text that
would be stored in an equivalent tEXt chunk.

Any number of zTXt chunks may appear in the same datastream. See the
definition of the tEXt chunk in 10.3.4.2: tEXt Textual data for the
predefined keywords and the recommended format of the text.

10.3.5 Time stamp information

10.3.5.1 tIME Image last-modification time

The four-byte chunk type field contains the decimal values

116 73 77 69

The tIME chunk gives the time of the last image modification (not the
time of initial image creation). It contains:


Year

2 bytes (complete; for example, 1995, not 95)

Month

1 byte (1-12)

Day

1 byte (1-31)

Hour

1 byte (0-23)

Minute

1 byte (0-59)

Second

1 byte (0-60) (to allow for leap seconds)

Universal Time (UTC) should be specified rather than local time.

The tIME chunk is intended for use as an automatically-applied time
stamp that is updated whenever the image data are changed.

11 PNG Encoders

11.1 Introduction

This annex gives requirements and recommendations for encoder
behaviour. The absolute requirement on a PNG encoder is that it produces
PNG datastreams that conform to the format specified in the preceding
clauses. Best results will usually be achieved by following the
additional recommendations given here.

11.2 Encoder gamma handling

See Annex C: Gamma and chromaticity for a discussion of gamma issues.

PNG encoders capable of full colour management [ICC] will perform more
sophisticated calculations than those described here and may choose
to use the iCCP chunk. If it is known that the image samples conform
to the sRGB specification [sRGB] the sRGB chunk shall be used without
additional gamma handling. In both cases it is recommended that an
appropriate gAMA chunk be generated, for use by PNG decoders in an
environment where colour management is not available.

A PNG encoder has to determine:

a) what value to write in the gAMA chunk;

b) how to transform the image samples provided into the values to be
written in the PNG datastream.

The value to write in the gAMA chunk is that value which causes a PNG
decoder to behave in the required way.

The transform to be applied to samples depends on the nature of the
image samples and their precision. If the samples represent light
intensity in floating-point or high precision integer form (perhaps from
a computer graphics renderer), the encoder may perform "gamma encoding"
(applying a power function with exponent less than 1) before quantizing
the data to integer values for inclusion in the PNG datastream. This
results in fewer banding artifacts at a given sample depth, or allows
smaller samples while retaining the same visual quality. An intensity
level expressed as a floating-point value in the range 0 to 1 can be
converted to a file image sample by:

integer_sample = floor((2bitdepth-1)intensityencoding_exponent + 0.5)

If the intensity in the equation is the desired object intensity, the
encoding_exponent is the gamma value to be used in the gAMA chunk.

If the intensity available to the PNG encoder is the original scene
intensity, another transformation may be needed. There is sometimes
a requirement for the displayed image to have higher contrast than
the original source image. This corresponds to an end-to-end transfer
function from original scene to display output with an exponent greater
than 1. In this case:

gamma = encoding_exponent/end_to_end_exponent

If it is not known whether the conditions under which the original image
was captured or calculated warrant such a contrast change, it may be
assumed that the display intensities are proportional to original scene
intensities, i.e. the end_to_end_exponent is 1 and hence:

gamma = encoding_exponent

If the image is being written to a datastream only, the encoder is free
to choose the encoding exponent. Choosing a value that causes the gamma
value in the gAMA chunk to be 1/2.2 is often a reasonable choice because
it minimizes the work for a PNG decoder displaying on a typical video
monitor.

Some image renderers may simultaneously write the image to a PNG
datastream and display it on-screen. The displayed pixels should be
gamma corrected for the display system and viewing conditions in use, so
that the user sees a proper representation of the intended scene.

If the renderer wants to write the displayed sample values to the PNG
datastream, avoiding a separate gamma encoding step for the datastream,
the renderer should approximate the transfer function of the display
system by a power function, and write the reciprocal of the exponent
into the gAMA chunk. This will allow a PNG decoder to reproduce what was
displayed on screen for the orginator during rendering.

However, it is equally reasonable for a renderer to compute displayed
pixels appropriate for the display device, and to perform separate gamma
encoding for data storage and transmission, arranging to have a value in
the gAMA chunk more appropriate to the future use of the image.

Computer graphics renderers often do not perform gamma encoding, instead
making sample values directly proportional to scene light intensity. If
the PNG encoder receives sample values that have already been quantized
into integer values, there is no point in doing gamma encoding on them;
that would just result in further loss of information. The encoder
should just write the sample values to the PNG datastream. This does not
imply that the gAMA chunk should contain a gamma value of 1.0 because
the desired end-to-end transfer function from scene intensity to display
output intensity is not necessarily linear. However, the desired gamma
value is probably not far from 1.0. It may depend on whether the scene
being rendered is a daylight scene or an indoor scene, etc.

When the sample values come directly from a piece of hardware, the
correct gAMA value can in principle be inferred from the transfer
function of the hardware and lighting conditions of the scene. In the
case of video digitizers ("frame grabbers"), the samples are probably
in the sRGB colour space, because the sRGB specification was designed
to be compatibile with modern video standards. Image scanners are
less predictable. Their output samples may be proportional to the
input light intensity since CCD sensors themselves are linear, or the
scanner hardware may have already applied a power function designed to
compensate for dot gain in subsequent printing (an exponent of about
0.57), or the scanner may have corrected the samples for display on a
monitor. Reference to the scanner's manual or scanning a calibrated
target may be necessary to determine the characteristics of a particular
scanner. It should be remembered that gamma relates samples to desired
display output, not to scanner input.

Datastream converters generally should not attempt to convert supplied
images to a different gamma. The data should be stored in the PNG
datastream without conversion, and the source gamma should be recorded
if it is known. Gamma alteration at datastream conversion time causes
re-quantization of the set of intensity levels that are represented,
introducing further roundoff error with little benefit. It is almost
always better to just copy the sample values intact from the input to
the output file.

If the source datastream describes the gamma characteristics of the
image, a datastream converter is strongly encouraged to write a gAMA
chunk. Some file formats specify the display-exponent of the function
(mapping samples to display output rather than the other direction). If
the source file's gamma value is greater than 1.0, it is probably a
display-exponent, and the reciprocal of this value should be used for
the PNG gamma value. If the source file format records the relationship
between image samples and a quantity other than display output, it will
be more complex than this to deduce the PNG gamma value.

If a PNG encoder or datastream convertor knows that the image has been
displayed satisfactorily using a display system whose transfer function
can be approximated by a power function with exponent display_exponent,
the image can be marked as having the gamma value:

    gamma = 1/display_exponent

It is better to write a gAMA chunk with a value that is approximately
correct than to omit the chunk and force PNG decoders to guess an
approximate gamma. If a PNG encoder is unable to infer the gamma value,
it is preferable to omit the gAMA chunk. If a guess has to be made this
should be left to the PNG decoder.

Gamma does not apply to alpha samples; alpha is always represented
linearly.

See also 12.10: Decoder gamma handling.

11.3 Encoder colour handling

See Annex C: Gamma and chromaticity for a discussion of colour issues.

PNG encoders capable of full colour management [ICC] will perform more
sophisticated calculations than those described here and may choose to
use the iCCP chunk. If it is known that the image samples conform to the
sRGB specification [sRGB] PNG encoders are strongly encouraged to use
the sRGB chunk.

If it is possible for the encoder to determine the chromaticities of
the source display primaries, or to make a strong guess based on the
origin of the image or the hardware running it, the encoder is strongly
encouraged to output the cHRM chunk. If this is done, the gAMA chunk
should also be written; decoders can do little with cHRM if gAMA is
missing.

There are a number of recommendations and standards for primaries and
white points, some of which are linked to particular technologies, for
example the CCIR 709 standard [ITU-R-BT709] and the SMPTE-C standard
[SMPTE-170M].

There are three cases that need to be considered:

a) the encoder is part of the generation system;

b) the source image is captured by a camera or scanner;

c) the PNG datastream was generated by translation from some other
format.

Scanners that produce PNG datastreams as output should insert the filter
chromaticities into a cHRM chunk.

In the case of hand-drawn or digitally edited images, it has to be
determined what monitor they were viewed on when being produced. Many
image editing programs allow the type of monitor being used to
be specified. This is often because they are working in some
device-independent space internally. Such programs have enough
information to write valid cHRM and gAMA chunks, and are strongly
encouraged to do so automatically.

If the encoder is compiled as a portion of a computer image renderer
that performs full-spectral rendering, the monitor values that were used
to convert from the internal device-independent colour space to RGB
should be written into the cHRM chunk. Any colours that are outside the
gamut of the chosen RGB device should be mapped to be within the gamut;
PNG does not store out of gamut colours.

If the computer image renderer performs calculations directly in
device-dependent RGB space, a cHRM chunk should not be written unless
the scene description and rendering parameters have been adjusted for a
particular monitor. In that case, the data for that monitor should be
used to construct a cHRM chunk.

A few image formats store calibration information, which can be used to
fill in the cHRM chunk. For example, all PhotoCD images use the CCIR
709 primaries and D65 whitepoint, so these values can be written into
the cHRM chunk when converting a PhotoCD file. PhotoCD also uses the
SMPTE-170M transfer function, which is closely approximated by a gAMA
of 0.5. (PhotoCD can store colours outside the RGB gamut, so the image
data will require mapping to be within the gamut.) TIFF 6.0 files can
optionally store calibration information, which if present should be
used to construct the cHRM chunk.

Video created with recent video equipment probably uses the CCIR 709
primaries and D65 white point [ITU-R-BT709], which are given in Table
11.1.

Table 11.1 -- CCIR 709 primaries and D65 whitepoint


R

G

B

White

x

0.640

0.300

0.150

0.3127

y

0.330

0.600

0.060

0.3290

An older but still very popular video standard is SMPTE-C [SMPTE-170M]
given in Table 11.2.

Table 11.2 -- SMPTE-C video standard


R

G

B

White

x

0.630

0.310

0.155

0.3127

y

0.340

0.595

0.070

0.3290

It is not recommended that datastream converters attempt to convert
supplied images to a different RGB colour space. The data should be
stored in the PNG datastream without conversion, and the source primary
chromaticities should be recorded if they are known. Colour space
transformation at datastream conversion time is a bad idea because of
gamut mismatches and rounding errors. As with gamma conversions, it is
better to store the data losslessly and incur at most one conversion
when the image is finally displayed.

See also 12.11: Decoder colour handling.

11.4 Alpha channel creation

The alpha channel can be regarded either as a mask that temporarily
hides transparent parts of the image, or as a means for constructing a
non-rectangular image. In the first case, the colour values of fully
transparent pixels should be preserved for future use. In the second
case, the transparent pixels carry no useful data and are simply there
to fill out the rectangular image area required by PNG. In this case,
fully transparent pixels should all be assigned the same colour value
for best compression.

Image authors should keep in mind the possibility that a decoder will
ignore transparency control. Hence, the colours assigned to transparent
pixels should be reasonable background colours whenever feasible.

For applications that do not require a full alpha channel, or cannot
afford the price in compression efficiency, the tRNS transparency chunk
is also available.

If the image has a known background colour, this colour should be
written in the bKGD chunk. Even decoders that ignore transparency may
use the bKGD colour to fill unused screen area.

If the original image has premultiplied (also called "associated") alpha
data, it can be converted to PNG's non-premultiplied format by dividing
each sample value by the corresponding alpha value, then multiplying by
the maximum value for the image bit depth, and rounding to the nearest
integer. In valid premultiplied data, the sample values never exceed
their corresponding alpha values, so the result of the division should
always be in the range 0 to 1. If the alpha value is zero, output black
(zeroes).

11.5 Sample depth scaling

When encoding input samples that have a sample depth that cannot be
directly represented in PNG, the encoder shall scale the samples up to a
sample depth that is allowed by PNG. The most accurate scaling method is
the linear equation:

   output = floor((input * MAXOUTSAMPLE / MAXINSAMPLE) + 0.5)

where the input samples range from 0 to MAXINSAMPLE and the outputs
range from 0 to MAXOUTSAMPLE (which is 2sampledepth-1).

A close approximation to the linear scaling method is achieved by "left
bit replication", which is shifting the valid bits to begin in the most
significant bit and repeating the most significant bits into the open
bits. This method is often faster to compute than linear scaling.

EXAMPLE Assume that 5-bit samples are being scaled up to 8 bits. If the
source sample value is 27 (in the range from 0-31), then the original
bits are:

   4 3 2 1 0

   ---------

   1 1 0 1 1

Left bit replication gives a value of 222:

   7 6 5 4 3 2 1 0

   ----------------

   1 1 0 1 1 1 1 0

   |=======| |===|

       |      Leftmost Bits Repeated to Fill Open Bits


   Original Bits

which matches the value computed by the linear equation. Left bit
replication usually gives the same value as linear scaling, and is never
off by more than one.

A distinctly less accurate approximation is obtained by simply
left-shifting the input value and filling the low order bits with
zeroes. This scheme cannot reproduce white exactly, since it does not
generate an all-ones maximum value; the net effect is to darken the
image slightly. This method is not recommended in general, but it does
have the effect of improving compression, particularly when dealing
with greater-than-eight-bit sample depths. Since the relative error
introduced by zero-fill scaling is small at high sample depths, some
encoders may choose to use it. Zero-fill shall not be used for alpha
channel data, however, since many decoders will treat alpha values of
all zeroes and all ones as special cases. It is important to represent
both those values exactly in the scaled data.

When the encoder writes an sBIT chunk, it is required to do the scaling
in such a way that the high-order bits of the stored samples match the
original data. That is, if the sBIT chunk specifies a sample depth of S,
the high-order S bits of the stored data shall agree with the original
S-bit data values. This allows decoders to recover the original data by
shifting right. The added low-order bits are not constrained. All the
above scaling methods meet this restriction.

When scaling up source image data, it is recommended that the low-order
bits be filled consistently for all samples; that is, the same source
value should generate the same sample value at any pixel position. This
improves compression by reducing the number of distinct sample
values. This is not a mandatory requirement, and some encoders may
choose not to follow it. For example, an encoder might instead dither
the low-order bits, improving displayed image quality at the price of
increasing file size.

In some applications the original source data may have a range that
is not a power of 2. The linear scaling equation still works for this
case, although the shifting methods do not. It is recommended that an
sBIT chunk not be written for such images, since sBIT suggests that the
original data range was exactly 0..2S-1.

11.6 Suggested palettes

Suggested palettes may appear as sPLT chunks in any PNG datastream,
or as a PLTE chunk in truecolour PNG datastreams. In either case,
the suggested palette is not an essential part of the image data,
but it may be used to present the image on indexed-colour display
hardware. Suggested palettes are of no interest to viewers running on
truecolour hardware.

When an sPLT chunk is used to provide a suggested palette, it is
recommended that the encoder use the frequency fields to indicate the
relative importance of the palette entries, rather than leave them
all zero (meaning undefined). The frequency values are most easily
computed as "nearest neighbour" counts, that is, the approximate usage
of each RGBA palette entry if no dithering is applied. (These counts
will often be available "for free" as a consequence of developing the
suggested palette.) Because the suggested palette includes transparency
information, it should be computed for the uncomposited image.

Even for indexed-colour images, sPLT can be used to define alternative
reduced palettes for viewers that are unable to display all the colours
present in the PLTE chunk.

An older method for including a suggested palette in a truecolour PNG
datastream uses the PLTE chunk. If this method is used, the histogram
(frequencies) should appear in a separate hIST chunk. The PLTE chunk
does not include transparency information. Hence for images of colour
type 6 (truecolour with alpha channel), it is recommended that a bKGD
chunk appear and that the palette and histogram be computed with
reference to the image as it would appear after compositing against the
specified background colour. This definition is necessary to ensure that
the useful palette entries are generated for pixels having fractional
alpha values. The resulting palette will probably be useful only to
viewers that present the image against the same background colour. It is
recommended that PNG editors delete or recompute the palette if the bKGD
chunk is altered or removed in an image of colour type 6.

For images of colour type 2 (truecolour without alpha channel), it is
recommended that the PLTE and hIST be computed with reference to the
RGB data only, ignoring any transparent-colour specification. If the
datastream uses transparency (has a tRNS chunk), viewers can easily
adapt the resulting palette for use with their intended background
colour. They need only replace the palette entry closest to the tRNS
colour with their background colour (which may or may not match the
datastream's bKGD colour, if any).

If PLTE appears without bKGD in an image of colour type 6, the
circumstances under which the palette was computed are unspecified.

sPLT is more flexible than PLTE for providing suggested palettes in the
following ways:

d) With sPLT multiple suggested palettes may be provided. A PNG decoder
may choose an appropriate palette based on name or number of entries.

e) In a type 6 PNG datastream (truecolour with alpha channel), PLTE
represents a palette already composited against the bKGD colour, so it
is useful only for display against that background colour. The sPLT
chunk provides an uncomposited palette, which is useful for display
against backgrounds chosen by the PNG decoder.

f) Since sPLT is a noncritical chunk, a PNG editor may add or
modify suggested palettes without being forced to discard unknown
unsafe-to-copy chunks.

g) Whereas sPLT is allowed in PNG datastreams for colour types 0, 3
and 4 (greyscale and indexed), PLTE cannot be used to provide reduced
palettes in these cases.

h) More than 256 entries may appear in sPLT.

A PNG encoder that uses sPLT may choose to write a PLTE and hIST
suggested palette as well, for compatibility with decoders that do not
recognize sPLT.

11.7 Interlacing

This International Standard defines two interlacing methods, one of
which is no interlacing. Interlacing provides a convenient basis from
which decoders can progressively display an image, as described in
12.7: Interlacing and progressive display.

11.8 Filter selection

For images of colour type 3 (indexed-colour), filter type 0 (None) is
usually the most effective. Colour images with 256 or fewer colours
should almost always be stored in indexed-colour format; truecolour
format is likely to be much larger.

Filter type 0 is also recommended for images of bit depths less than
8. For low-bit-depth greyscale images, in rare cases, better compression
may be obtained by first expanding the image to 8-bit representation and
then applying filtering.

For truecolour and greyscale images, any of the five filters may prove
the most effective. If an encoder uses a fixed filter, the Paeth filter
is most likely to be the best.

For best compression of truecolour and greyscale images, an adaptive
filtering approach is recommended in which a filter is chosen for
each scanline. The following simple heuristic has performed well in
early tests: compute the output scanline using all five filters, and
select the filter that gives the smallest sum of absolute values
of outputs. (Consider the output bytes as signed differences for
this test.) This method usually outperforms any single fixed filter
choice. However, it is likely that much better heuristics will be found
as more experience is gained with PNG.

Filtering according to these recommendations is effective in conjunction
with either of the two interlacing methods defined in this International
Standard.

11.9 Compression

There is no required correlation between the structure of the interlaced
PNG image data (i.e., scanline boundaries) and deflate block boundaries
or IDAT chunk boundaries. The complete image data is represented
by a single zlib datastream that is stored in some number of IDAT
chunks. Some encoder implementations may emit datastreams in which some
of these structures are indeed related. But decoders cannot rely on
this.

The encoder may divide the compressed datastream into IDAT chunks
however it wishes. (Multiple IDAT chunks are allowed so that encoders
may work in a fixed amount of memory; typically the chunk size will
correspond to the encoder's buffer size.) It is important to emphasize
that IDAT chunk boundaries have no semantic significance and can occur
at any point in the compressed datastream. A PNG datastream in which
each IDAT chunk contains only one data byte is legal, though remarkably
wasteful of space. (Zero-length IDAT chunks are also legal, though even
more wasteful.)

11.10 Chunking

11.10.1 Text chunk processing

A nonempty keyword shall be provided for each text chunk. The generic
keyword "Comment" can be used if no better description of the text is
available. If a user-supplied keyword is used, encoders should check
that it meets the restrictions on keywords.

For the tEXt and zTXt chunks, PNG text strings are expected to use
the Latin-1 character set. Encoders should avoid storing characters
that are not defined in Latin-1, and should provide character code
remapping if the local system's character set is not Latin-1. Encoders
should discourage the creation of single lines of text longer than 79
characters, in order to facilitate easy reading. It is recommended
that text items less than 1024 bytes in size should be output using
uncompressed tEXt chunks. It is recommended that the basic title and
author keywords be output using uncompressed tEXt chunks. Placing large
tEXt and zTXt chunks after the image data (after the IDAT chunks) can
speed up image display in some situations, as the decoder will decode
the image data first. It is recommended that small text chunks, such as
the image title, appear before the IDAT chunks.

The iTXt chunk provides support for international text, represented
using the UTF-8 encoding of the Unicode character set.

11.10.2 Use of private chunks

Applications can use PNG private chunks to carry information that need
not be understood by other applications. Such chunks shall be given
chunk types with bit 5 of the second byte (the private bit) set to 1,
to ensure that they can never conflict with any future public chunk
definition. However, there is no guarantee that some other application
will not use the same private chunk name. If a private chunk type is
used, it is prudent to store additional identifying information at the
beginning of the chunk data.

An ancillary chunk type (bit 5 of the first byte of the chunk type code,
the ancillary bit, is set to 0, see 5.4: Chunk naming conventions), not
a critical chunk type, should be used for all private chunks that store
information that is not absolutely essential to view the image. Creation
of private critical chunks is discouraged because they render PNG
datastreams unportable. Such chunks should not be used in publicly
available software or datastreams. If private critical chunks are
essential for an application, it is recommended that one appear near the
start of the datastream, so that a standard decoder need not read very
far before discovering that it cannot handle the datastream.

If other organizations need to understand a new chunk type, it should
be submitted to the Registration Authority (see 4.9: Registration). A
proposed public chunk name (with bit 5 of the second byte of the chunk
type code, the private bit, set to 0) shall not be used in publicly
available software or datastreams until registration has been approved.

If an ancillary chunk contains textual information that might be of
interest to a human user, a special chunk type should not be defined
for it. Instead a tEXt chunk should be used and a suitable keyword
defined. The information will then be available to other users.

Keywords in tEXt chunks should be reasonably self-explanatory, since
the aim is to let other users understand what the chunk contains. If
of general usefulness, new keywords should be registered with the
Registration Authority. (see 4.9: Registration). However, it is
permissible to use keywords without registering them first.

11.10.3 Private type and method codes

This specification defines the meaning of only some of the possible
values of some fields. For example, only compression method 0 and filter
types 0 through 4 are defined in this International Standard. Numbers
greater than 127 shall be used when inventing experimental or private
definitions of values for any of these fields. Numbers below 128 are
reserved for possible future public extensions of this specification via
the Registration Authority. The use of private type codes may render
a datastream unreadable by standard decoders. Such codes are strongly
discouraged except for experimental purposes, and should not appear in
publicly available software or datastreams.

11.10.4 Ancillary chunks

All ancillary chunks are optional, in the sense that encoders need
not write them and decoders may ignore them. However, encoders are
encouraged to write the standard ancillary chunks when the information
is available, and decoders are encouraged to interpret these chunks when
appropriate and feasible.

12 PNG decoders and viewers

12.1 Introduction

This annex gives some requirements and recommendations for decoder
behaviour and viewer behaviour. A viewer presents the decoded PNG
image to the user. Since viewer and decoder behaviour is closely
connected, decoders and viewers are treated together here. The only
absolute requirement on a PNG decoder is that it successfully reads
any datastream conforming to the format specified in the preceding
chapters. However, best results will usually be achieved by following
these additional recommendations.

Decoders shall support all legal combinations of bit depth and colour
type.

All ancillary chunks are optional; encoders need not write them and
decoders may ignore them. However, encoders are encouraged to write
the standard ancillary chunks when the information is available, and
decoders are encouraged to interpret these chunks when appropriate and
feasible.

12.2 Error checking

The PNG error handling philosophy is described in 4.8: Error handling.

To ensure early detection of common file-transfer problems, decoders
should verify that all eight bytes of the PNG file signature are
correct. A decoder can have additional confidence in the datastream's
integrity if the next eight bytes are an IHDR chunk with the correct
chunk length.

Unknown chunk types shall be handled as described in 5.4: Chunk naming
conventions. An unknown chunk type is not to be treated as an error
unless it is a critical chunk.

In some situations it is desirable to check chunk headers (length and
type code) before reading the chunk data and CRC. The chunk type can
be checked for plausibility by seeing whether all four bytes are ASCII
letters (codes 65-90 and 97-122); note that this need only be done for
unrecognized type codes. If the total datastream size is known (from
file system information, HTTP protocol, etc), the chunk length can be
checked for plausibility as well.

If CRCs are not checked, dropped/added data bytes or an erroneous
chunk length can cause the decoder to get out of step and misinterpret
subsequent data as a chunk header. Verifying that the chunk type
contains letters is an inexpensive way of providing early error
detection in this situation.

For known-length chunks such as IHDR, decoders should treat an
unexpected chunk length as an error. Future extensions to this
specification will not add new fields to existing chunks; instead, new
chunk types will be added to carry new information.

Unexpected values in fields of known chunks (for example, an unexpected
compression method in the IHDR chunk) shall be checked for and treated
as errors. However, it is recommended that unexpected field values be
treated as fatal errors only in critical chunks. An unexpected value
in an ancillary chunk can be handled by ignoring the whole chunk as
though it were an unknown chunk type. (This recommendation assumes that
the chunk's CRC has been verified. In decoders that do not check CRCs,
it is safer to treat any unexpected value as indicating a corrupted
datastream.)

Compression method is a single-byte integer that indicates the
method used to compress the image data. At present, only compression
method 0 (deflate/inflate compression with a 32K sliding window) is
defined. Standard PNG images shall be compressed with this scheme. The
compression method field is provided for possible future expansion or
proprietary variants. Decoders must check this byte and report an error
if it holds an unrecognized code. See clause 9: Compression for details.

12.3 Security considerations

A PNG datastream is composed of a collection of explicitly typed
"chunks". Chunks whose contents are defined by the specification could
actually contain anything, including malicious code. But there is no
known risk that such malicious code could be executed on the recipient's
computer as a result of decoding the PNG image.

The possible security risks associated with future chunk types cannot
be specified at this time. Security issues will be considered by the
Registration Authority when evaluating chunks proposed for registration
as public chunks. There is no additional security risk associated with
unknown or unimplemented chunk types, because such chunks will be
ignored, or at most be copied into another PNG datastream.

The iTXt, tEXt and zTXt chunks contain data that is meant to be
displayed as plain text. It is possible that if the decoder displays
such text without filtering out control characters, especially the
ESC (escape) character, certain systems or terminals could behave in
undesirable and insecure ways. It is recommended that decoders filter
out control characters to avoid this risk; see 12.4.3: Text chunk
processing.

Every chunk begins with a length field, which makes it easier to write
decoders that are invulnerable to fraudulent chunks that attempt to
overflow buffers. The CRC at the end of every chunk provide a robust
defence against accidentally corrupted data. The PNG signature bytes
provide early detection of common file transmission errors.

A decoder that fails to check CRCs could be subject to data
corruption. The only likely consequence of such corruption is
incorrectly displayed pixels within the image. Worse things might happen
if the CRC of the IHDR chunk is not checked and the width or height
fields are corrupted. See 12.2: Error checking.

A poorly written decoder might be subject to buffer overflow, because
chunks can be extremely large, up to 231-1 bytes long. But properly
written decoders will handle large chunks without difficulty.

12.4 Chunking

12.4.1 General

Decoders shall recognize type codes by a simple four-byte literal
comparison; it is incorrect to perform case conversion on type codes. A
decoder encountering an unknown chunk in which the ancillary bit is 1
may safely ignore the chunk and proceed to display the image. A decoder
encountering an unknown chunk in which the ancillary bit is 0 shall
indicate to the user that the image contains information it cannot
safely interpret.

(Decoders should not flag an error if bit 5 is set to 1 in the third
byte (the reserved bit), however, as some future version of the PNG
specification could define a meaning for this bit. It is sufficient to
treat a chunk with this bit set in the same way as any other unknown
chunk type.)

12.4.2 Pixel dimensions

Rectangular pixels can be represented (see 10.3.3.4: pHYs Physical pixel
dimensions), but viewers are not required to account for them; a viewer
can present any PNG datastream as though its pixels are square.

Conversely, viewers running on display hardware with rectangular pixels
are strongly encouraged to rescale images for proper display.

12.4.3 Text chunk processing

PNG decoders should attempt to display to the user all the iTXt, tEXt
and zTXt chunks found in the datastream. Even if the decoder does
not recognize a particular text keyword, the user might be able to
understand it.

It is recommended that text contained in tEXt and zTXt chunks be
characters in the ISO 8859-1 "Latin-1" character set (that is it
does not contain codes 0-31 and 127-159), and the linefeed character
(decimal 10). If decoders encounter other characters, some can be safely
displayed (e.g., TAB, FF, and CR, decimal 9, 12, and 13, respectively),
but others, especially the ESC character (decimal 27), could pose a
security hazard (because unexpected actions may be taken by display
hardware or software). Decoders should not attempt to directly display
any non-Latin-1 characters (except for newline and perhaps TAB, FF, CR)
encountered in a tEXt or zTXt chunk. Instead, they should be ignored
or displayed in a visible notation such as "\nnn". See 12.3: Security
considerations.

Even though encoders are recommended to represent newlines as linefeed
(decimal 10), it is recommended that decoders not rely on this; it is
best to recognize all the common newline combinations (CR, LF, and
CR-LF) and display each as a single newline. TAB can be expanded to the
proper number of spaces needed to arrive at a column multiple of 8.

Decoders running on systems with non-Latin-1 character set encoding
should provide character code remapping so that Latin-1 characters are
displayed correctly. Some systems may not provide all the characters
defined in Latin-1. Mapping unavailable characters to a visible notation
such as "\nnn" is a good fallback. Character codes 127-255 should
be displayed only if they are printable characters on the decoding
system. Some systems may interpret such codes as control characters;
for security, decoders running on such systems should not display such
characters literally.

Decoders should be prepared to display text chunks that contain any
number of printing characters between newline characters. However it
is recommended that encoders avoid creating lines in excess of 79
characters.

12.5 Decompression

The compression technique used in this International Standard does
not require the entire compressed datastream to be available before
decompression can start. Display can therefore commence before the
entire decompressed datastream is available. It is extremely unlikely
that any general purpose compression methods registered in the future
will not have this property.

12.6 Filtering

To reverse the effect of a filter, the decoder may need to use
the decoded values of the prior pixel on the same line, the pixel
immediately above the current pixel on the prior line, and the pixel
just to the left of the pixel above. This implies that at least one
scanline's worth of image data needs to be stored by the decoder at all
times. Even though some filter types do not refer to the prior scanline,
the decoder will always need to store each scanline as it is decoded,
since the next scanline might use a filter that refers to it.

Similarly, there is no required correlation between the structure of
the image data (i.e., scanline boundaries) and deflate block boundaries
or IDAT chunk boundaries. The complete image data is represented by a
single zlib datastream that is stored in some number of IDAT chunks; a
decoder that assumes any more than this is incorrect. (Of course, some
encoder implementations may emit datastreams in which some of these
structures are indeed related. But decoders cannot rely on this.)

12.7 Interlacing and progressive display

Decoders are required to be able to read interlaced images, whether
or not they actually perform progressive display. If the reference
image contains fewer than five columns or fewer than five rows, some
passes will be empty. Encoders and decoders shall handle this case
correctly. In particular, filter type bytes are only associated with
nonempty scanlines; no filter type bytes are present in an empty reduced
image.

When receiving images over slow transmission links, decoders can
improve perceived performance by displaying interlaced images
progressively. This means that as each reduced image is received,
an approximation to the complete image is displayed based on the
data received so far. One simple yet pleasing effect can be obtained
by expanding each received pixel to fill a rectangle covering the
yet-to-be-transmitted pixel positions below and to the right of the
received pixel. This process can be described by the following ISO C
code [ISO-9899]:


/*

    variables declared and initialized elsewhere in the code:

        height, width

    functions or macros defined elsewhere in the code:

        visit(), min()

 */


int starting_row[7] = { 0, 0, 4, 0, 2, 0, 1 };

int starting_col[7] = { 0, 4, 0, 2, 0, 1, 0 };

int row_increment[7] = { 8, 8, 8, 4, 4, 2, 2 };

int col_increment[7] = { 8, 8, 4, 4, 2, 2, 1 };

int block_height[7] = { 8, 8, 4, 4, 2, 2, 1 };

int block_width[7] = { 8, 4, 4, 2, 2, 1, 1 };


int pass;

long row, col;


pass = 0;

while (pass < 7)

{

    row = starting_row[pass];

    while (row < height)

    {

        col = starting_col[pass];

        while (col < width)

        {

            visit(row, col,

                  min(block_height[pass], height - row),

                  min(block_width[pass], width - col));

            col = col + col_increment[pass];

        }

        row = row + row_increment[pass];

    }

    pass = pass + 1;

}

The function "visit(row,column,height,width)" obtains the next
transmitted pixel and paints a rectangle of the specified height and
width, whose upper-left corner is at the specified row and column, using
the colour indicated by the pixel. Note that row and column are measured
from 0,0 at the upper left corner.

If the decoder is merging the received image with a background image,
it may be more convenient just to paint the received pixel positions
(the "visit()" function sets only the pixel at the specified row and
column, not the whole rectangle). This produces a "fade-in" effect
as the new image gradually replaces the old. An advantage of this
approach is that proper alpha or transparency processing can be done as
each pixel is replaced. Painting a rectangle as described above will
overwrite background-image pixels that may be needed later, if the
pixels eventually received for those positions turn out to be wholly or
partially transparent. This is only a problem if the background image is
not stored anywhere offscreen.

12.8 Truecolour image handling

To achieve PNG's goal of universal interchangeability, decoders
shall accept all types of PNG image: indexed-colour, truecolour, and
greyscale. Viewers running on indexed-colour display hardware need to
be able to reduce truecolour images to indexed-colour for viewing. This
process is called "colour quantization".

A simple, fast way of colour quantization is to reduce the image to a
fixed palette. Palettes with uniform colour spacing ("colour cubes") are
usually used to minimize the per-pixel computation. For photograph-like
images, dithering is recommended to avoid ugly contours in what should
be smooth gradients; however, dithering introduces graininess that can
be objectionable.

The quality of rendering can be improved substantially by using a
palette chosen specifically for the image, since a colour cube usually
has numerous entries that are unused in any particular image. This
approach requires more work, first in choosing the palette, and second
in mapping individual pixels to the closest available colour. PNG
allows the encoder to supply a suggested palette in one or more sPLT
chunks or a PLTE chunk. However, not all encoders will do so, and the
suggested palette may be unsuitable in any case (it may have too many or
too few colours). Therefore, high-quality viewers will need to have a
palette selection routine at hand. A large lookup table is usually the
most feasible way of mapping individual pixels to palette entries with
adequate speed.

Numerous implementations of colour quantization are available. The PNG
sample implementation, libpng, (ftp://ftp.uu.net/graphics/png/src/),
includes code for the purpose.

12.9 Sample depth rescaling

Decoders may wish to scale PNG data to a lesser sample depth (data
precision) for display. For example, 16-bit data will need to be reduced
to 8-bit depth for use on most present-day display hardware. Reduction
of 8-bit data to 5-bit depth is also common.

The most accurate scaling is achieved by the linear equation

output = floor((input * MAXOUTSAMPLE / MAXINSAMPLE) + 0.5)

where

MAXINSAMPLE = (2sampledepth)-1

MAXOUTSAMPLE = (2desired_sampledepth)-1

A slightly less accurate conversion is achieved by simply shifting
right by sampledepth-desired_sampledepth places. For example, to reduce
16-bit samples to 8-bit, the low-order byte can be discarded. In many
situations the shift method is sufficiently accurate for display
purposes, and it is certainly much faster. (But if gamma correction is
being done, sample rescaling can be merged into the gamma correction
lookup table, as is illustrated in 12.10: Decoder gamma handling.)

When an sBIT chunk is present, the reference image data can be recovered
by shifting right to the sample depth specified by sBIT. Note that
linear scaling will not necessarily reproduce the original data, because
the encoder is not required to have used linear scaling to scale the
data up. However, the encoder is required to have used a method that
preserves the high-order bits, so shifting always works. This is the
only case in which shifting might be said to be more accurate than
linear scaling. A decoder need not pay attention to sBIT; the stored
image is a valid PNG datastream of the sample depth indicated by IHDR.

When comparing pixel values to tRNS chunk values to detect transparent
pixels, the comparison has to be done exactly. Therefore, transparent
pixel detection shall be done before reducing sample precision.

12.10 Decoder gamma handling

See Annex C: Gamma and chromaticity for a discussion of gamma issues.

Decoders capable of full colour management [ICC] will perform more
sophisticated calculations than those described here.

For an image display program to produce correct tone reproduction, it
is necessary to take into account the relationship between samples and
display output and the transfer function of the display system. This can
be done by calculating:

sample = integer_sample / (2bitdepth - 1.0)

display_output = sample1.0/gamma

display_input = inverse_display_transfer(display_output)

framebuf_sample = floor(display_input * MAX_FRAMEBUF_SAMPLE)+0.5)

where integer_sample is the sample value from the datastream,
framebuf_sample is the value to write into the frame buffer, and
MAX_FRAMEBUF_SAMPLE is the maximum value of a frame buffer sample (255
for 8-bit, 31 for 5-bit, etc). The first line converts an integer sample
into a normalized floating point value (in the range 0.0 to 1.0),
the second converts to a value proportional to the desired display
output intensity, the third accounts for the display system's transfer
function, and the fourth converts to an integer frame buffer sample.

A step could be inserted between the second and third to adjust
display_output to account for the difference between the actual
viewing conditions and the reference viewing conditions. However, this
adjustment requires accounting for veiling glare, black mapping and
colour appearance models, none of which can be well approximated by
power functions. Such calculations are not described here. If viewing
conditions are ignored, the error will usually be small.

The display transfer function can be typically approximated by a power
function with exponent display_exponent, in which case the second and
third lines can be merged into:

   display_input = sample(1.0/(gamma * display_exponent)) =
   sampledecoding_exponent

so as to perform only one power calculation. For colour images, the
entire calculation is performed separately for R, G, and B values.

The value of gamma can be taken directly from the gAMA
chunk. Alternatively, an application may wish to allow the user to
adjust the appearance of the displayed image by influencing the value
of gamma. For example, the user could manually set a parameter,
user_exponent which defaults to 1.0, and the application could set:

    gamma = gamma_from_file / user_exponent

    decoding_exponent = 1.0 / (gamma * display_exponent)

        = user_exponent / (gamma_from_file * display_exponent)

The user would set user_exponent greater than 1 to darken the mid-level
tones, or less than 1 to lighten them.

It is not necessary to perform a transcendental mathematical computation
for every pixel. Instead, a lookup table can be computed that gives
the correct output value for every possible sample value. This
requires only 256 calculations per image (for 8-bit accuracy), not
one or three calculations per pixel. For an indexed-colour image, a
one-time correction of the palette is sufficient, unless the image uses
transparency and is being displayed against a nonuniform background.

If floating-point calculations are not possible, gamma correction tables
can be computed using integer arithmetic and a precomputed table of
logarithms. Example code appears in [PNG-EXTENSIONS].

When the incoming image has unknown gamma (gAMA, sRGB and iCCP all
absent), choose a likely default gamma value, but allow the user to
select a new one if the result proves too dark or too light. The default
gamma may depend on other knowledge about the image, for example whether
it came from the Internet or from the local system.

In practice, it is often difficult to determine what value of display
exponent should be used. In systems with no built-in gamma correction,
the display exponent is determined entirely by the CRT. A CRT_gamma
of 2.2 should be used unless detailed calibration measurements are
available for the particular CRT used.

Many modern frame buffers have lookup tables that are used to perform
gamma correction, and on these systems the display exponent value
should be the exponent of the lookup table and CRT combined. It may
not be possible to find out what the lookup table contains from within
the viewer application, in which case it may be necessary to ask the
user to supply the display system's exponent value. Unfortunately,
different manufacturers use different ways of specifying what should go
into the lookup table, so interpretation of the system gamma value is
system-dependent. Annex C: Gamma and chromaticity gives some examples.

The response of real displays is actually more complex than can
be described by a single number (the display exponent). If actual
measurements of the monitor's light output as a function of voltage
input are available, the third and fourth lines of the computation above
can be replaced by a lookup in these measurements, to find the actual
frame buffer value that most nearly gives the desired brightness.

12.11 Decoder colour handling

See Annex C: Gamma and chromaticity for a discussion of colour issues.

In many cases, decoders will treat image data in PNG datastreams as
device-dependent RGB values and display them without modification
(except for appropriate gamma correction). This provides the fastest
display of PNG images. But unless the viewer uses exactly the same
display hardware as that used by the author of the original image, the
colours will not be exactly the same as those seen by the original
author, particularly for darker or near-neutral colours. The cHRM chunk
provides information that allows closer colour matching than that
provided by gamma correction alone.

Decoders can use the cHRM data to transform the image data from RGB
to XYZ and thence into a perceptually linear colour space such as
CIE LAB. They can then partition the colours to generate an optimal
palette, because the geometric distance between two colours in CIE LAB
is strongly related to how different those colours appear (unlike, for
example, RGB or XYZ spaces). The resulting palette of colours, once
transformed back into RGB colour space, could be used for display or
written into a PLTE chunk.

Decoders that are part of image processing applications might also
transform image data into CIE LAB space for analysis.

In applications where colour fidelity is critical, such as product
design, scientific visualization, medicine, architecture, or
advertising, PNG decoders can transform the image data from source
RGB to display RGB space of the monitor used to view the image. This
involves calculating the matrix to go from source RGB to XYZ and the
matrix to go from XYZ to display RGB, then combining them to produce the
overall transformation. The PNG decoder is responsible for implementing
gamut mapping.

Decoders running on platforms that have a Colour Management System (CMS)
can pass the image data, gAMA and cHRM values to the CMS for display or
further processing.

PNG decoders that provide colour printing facilities can use the
facilities in Level 2 PostScript to specify image data in calibrated RGB
space or in a device-independent colour space such as XYZ. This will
provide better colour fidelity than a simple RGB to CMYK conversion. The
PostScript Language Reference manual [POSTSCRIPT] gives examples. Such
decoders are responsible for implementing gamut mapping between source
RGB (specified in the cHRM chunk) and the target printer. The PostScript
interpreter is then responsible for producing the required colours.

PNG decoders can use the cHRM data to calculate an accurate greyscale
representation of a colour image. Conversion from RGB to grey is simply
a case of calculating the Y (luminance) component of XYZ, which is a
weighted sum of R, G and B values. The weights depend upon the monitor
type, i.e. the values in the cHRM chunk. PNG decoders may wish to do
this for PNG datastreams with no cHRM chunk. In this case, a reasonable
default would be the CCIR 709 primaries [ITU-R-BT709]. The original
NTSC primaries should not be used unless the PNG image really was
colour-balanced for such a monitor.

12.12 Background colour

The background colour given by bKGD will typically be used to fill
unused screen space around the image, as well as any transparent pixels
within the image. (Thus, bKGD is valid and useful even when the image
does not use transparency.) If no bKGD chunk is present, the viewer will
need to decide upon a suitable background colour.

Viewers that have a specific background against which to present the
image (such as Web browsers) should ignore the bKGD chunk, in effect
overriding bKGD with their preferred background colour or background
image.

The background colour given by bKGD is not to be considered transparent,
even if it happens to match the colour given by tRNS (or, in the case
of an indexed-colour image, refers to a palette index that is marked
as transparent by tRNS). Otherwise one would have to imagine something
"behind the background" to composite against. The background colour is
either used as background or ignored; it is not an intermediate layer
between the PNG image and some other background.

Indeed, it will be common that bKGD and tRNS specify the same colour,
since then a decoder that does not implement transparency processing
will give the intended display, at least when no partially-transparent
pixels are present.

12.13 Alpha channel processing

The alpha channel can be used to composite a foreground image against a
background image. The PNG datastream defines the foreground image and
the transparency mask, but not the background image. PNG decoders are
not required to support this most general case. It is expected that most
will be able to support compositing against a single background colour.

The equation for computing a composited sample value is:

   output = alpha * foreground + (1-alpha) * background

where alpha and the input and output sample values are expressed as
fractions in the range 0 to 1. This computation should be performed with
intensity samples (not gamma-encoded samples). For colour images, the
computation is done separately for R, G, and B samples.

The following code illustrates the general case of compositing a
foreground image against a background image. It assumes that the
original pixel data are available for the background image, and that
output is to a frame buffer for display. Other variants are possible;
see the comments below the code. The code allows the sample depths and
gamma values of foreground image, background image, and frame buffer/CRT
all to be different and not necessarily suited to the display system. In
practice this should not be assumed without first checking.

This code is ISO C [ISO-9899], with line numbers added for reference in
the comments below.

   01 int foreground[4]; /* image pixel: R, G, B, A */

   02 int background[3]; /* background pixel: R, G, B */

   03 int fbpix[3]; /* frame buffer pixel */

   04 int fg_maxsample; /* foreground max sample */

   05 int bg_maxsample; /* background max sample */

   06 int fb_maxsample; /* frame buffer max sample */

   07 int ialpha;

   08 float alpha, compalpha;

   09 float gamfg, linfg, gambg, linbg, comppix, gcvideo;


       /* Get max sample values in data and frame buffer */

   10 fg_maxsample = (1 << fg_sample_depth) - 1;

   11 bg_maxsample = (1 << bg_sample_depth) - 1;

   12 fb_maxsample = (1 << frame_buffer_sample_depth) - 1;

       /*

        * Get integer version of alpha.

        * Check for opaque and transparent special cases;

        * no compositing needed if so.

        *

        * We show the whole gamma decode/correct process in

        * floating point, but it would more likely be done

        * with lookup tables.

        */

   13 ialpha = foreground[3];


   14 if (ialpha == 0) {

           /*

            * Foreground image is transparent here.

            * If the background image is already in the frame

            * buffer, there is nothing to do.

            */

   15 ;

   16 } else if (ialpha == fg_maxsample) {

           /*

            * Copy foreground pixel to frame buffer.

            */

   17 for (i = 0; i < 3; i++) {

   18 gamfg = (float) foreground[i] / fg_maxsample;

   19 linfg = pow(gamfg, 1.0/fg_gamma);

   20 comppix = linfg;

   21 gcvideo = pow(comppix,1.0 / display_exponent);

   22 fbpix[i] = (int) (gcvideo * fb_maxsample + 0.5);

   23 }

   24 } else {

           /*

            * Compositing is necessary.

            * Get floating-point alpha and its complement.

            * Note: alpha is always linear; gamma does not

            * affect it.

            */

   25 alpha = (float) ialpha / fg_maxsample;

   26 compalpha = 1.0 - alpha;


   27 for (i = 0; i < 3; i++) {

               /*

                * Convert foreground and background to floating

                * point, then undo gamma encoding.

                */

   28 gamfg = (float) foreground[i] / fg_maxsample;

   29 linfg = pow(gamfg, 1.0/fg_gamma);

   30 gambg = (float) background[i] / bg_maxsample;

   31 linbg = pow(gambg, 1.0/bg_gamma);

               /*

                * Composite.

                */

   32 comppix = linfg * alpha + linbg * compalpha;

               /*

                * Gamma correct for display.

                * Convert to integer frame buffer pixel.

                */

   33 gcvideo = pow(comppix,1.0 / display_exponent);

   34 fbpix[i] = (int) (gcvideo * fb_maxsample + 0.5);

   35 }

   36 }

Variations:

a) If output is to another PNG image datastream instead of a frame
buffer, lines 21, 22, 33, and 34 should be changed along the following
lines

   /*

    * Gamma encode for storage in output datastream.

    * Convert to integer sample value.

    */

   gamout = pow(comppix, outfile_gamma);

   outpix[i] = (int) (gamout * out_maxsample + 0.5);

Also, it becomes necessary to process background pixels when alpha is
zero, rather than just skipping pixels. Thus, line 15 will need to be
replaced by copies of lines 17-23, but processing background instead of
foreground pixel values.

b) If the sample depths of the output file, foreground file, and
background file are all the same, and the three gamma values also
match, then the no-compositing code in lines 14-23 reduces to copying
pixel values from the input file to the output file if alpha is one,
or copying pixel values from background to output file if alpha
is zero. Since alpha is typically either zero or one for the vast
majority of pixels in an image, this is a significant saving. No gamma
computations are needed for most pixels.

c) When the sample depths and gamma values all match, it may appear
attractive to skip the gamma decoding and encoding (lines 28-31, 33-34)
and just perform line 32 using gamma-encoded sample values. Although
this does not have too bad an effect on image quality, the time savings
are small if alpha values of zero and one are treated as special cases
as recommended here.

d) If the original pixel values of the background image are no longer
available, only processed frame buffer pixels left by display of the
background image, then lines 30 and 31 need to extract intensity from
the frame buffer pixel values using code such as

   /*

    * Convert frame buffer value into intensity sample.

    */

   gcvideo = (float) fbpix[i] / fb_maxsample;

   linbg = pow(gcvideo, display_exponent);

However, some roundoff error can result, so it is better to have the
original background pixels available if at all possible.

e) Note that lines 18-22 are performing exactly the same gamma
computation that is done when no alpha channel is present. If the
no-alpha case is handled with a lookup table, the same lookup table can
be used here. Lines 28-31 and 33-34 can also be done with (different)
lookup tables.

f) Integer arithmetic can be used instead of floating point, providing
care is taken to maintain sufficient precision throughout.

NOTE In floating point, no overflow or underflow checks are needed,
because the input sample values are guaranteed to be between 0 and 1,
and compositing always yields a result that is in between the input
values (inclusive). With integer arithmetic, some roundoff-error
analysis might be needed to guarantee no overflow or underflow.

When displaying a PNG image with full alpha channel, it is important
to be able to composite the image against some background, even if it
is only black. Ignoring the alpha channel will cause PNG images that
have been converted from an associated-alpha representation to look
wrong. (Of course, if the alpha channel is a separate transparency mask,
then ignoring alpha is a useful option: it allows the hidden parts of
the image to be recovered.)

Even if the decoder author does not wish to implement true compositing
logic, it is simple to deal with images that contain only zero and one
alpha values. (This is implicitly true for greyscale and truecolour
PNG datastreams that use a tRNS chunk; it is easy to check whether
tRNS contains any values other than 0 and 255.) In this simple case,
transparent pixels are replaced by the background colour, while others
are unchanged.

If a decoder contains only this much transparency capability, it should
deal with a full alpha channel by treating all nonzero alpha values as
fully opaque or by diterhing. Neither approach will yield very good
results for images converted from associated-alpha formats, but this is
preferable to doing nothing. Dithering full alpha to binary alpha is
very much like dithering greyscale to black-and-white, except that all
fully transparent and fully opaque pixels should be left unchanged by
the dither.

12.14 Suggested-palette and histogram usage

For viewers running on indexed-colour hardware attempting to display
a truecolour image, or an indexed-colour image whose palette is too
large for the framebuffer, the encoder may have provided one or more
suggested palettes in sPLT chunks. If one of these is found to be
suitable, based on size and perhaps name, the PNG decoder can use that
palette. Suggested palettes with a sample depth different from what
the decoder needs can be converted using sample depth rescaling (see
12.9: Sample depth rescaling).

When the background is a solid colour, the PNG decoder should composite
the image and the suggested palette against that colour, then quantize
the resulting image to the resulting RGB palette. When the image uses
transparency and the background is not a solid colour, no suggested
palette is likely to be useful.

For truecolour images, a suggested palette might also be provided in a
PLTE chunk. If the image has a tRNS chunk and the background is a solid
colour, the viewer will need to adapt the suggested palette for use with
its desired background colour. To do this, the palette entry closest to
the tRNS colour should be replaced with the desired background colour;
or alternatively a palette entry for the background colour can be added,
if the viewer can handle more colours than there are PLTE entries.

For images of colour type 6 (truecolour with alpha channel), any
suggested palette should have been designed for display of the image
against a uniform background of the colour specified by bKGD. Viewers
should probably ignore the palette if they intend to use a different
background, or if the bKGD chunk is missing. Viewers can use a suggested
palette for display against a different background than it was intended
for, but the results may not be very good.

If the viewer presents a transparent truecolour image against a
background that is more complex than a uniform colour, it is unlikely
that the suggested palette will be optimal for the composite image. In
this case it is best to perform a truecolour compositing step on the
truecolour PNG image and background image, then colour-quantize the
resulting image.

In truecolour PNG datastreams, if both PLTE and sPLT chunks appear,
the PNG decoder may choose from among the palettes suggested by both,
bearing in mind the different transparency semantics described above.

The frequencies in the sPLT and hIST chunks are useful when the viewer
cannot provide as many colours as are used in the palette in the PNG
datastream. If the viewer has a shortfall of only a few colours, it is
usually adequate to drop the least-used colours from the palette. To
reduce the number of colours substantially, it is best to choose
entirely new representative colours, rather than trying to use a subset
of the existing palette. This amounts to performing a new colour
quantization step; however, the existing palette and histogram can be
used as the input data, thus avoiding a scan of the image data in the
IDAT chunks.

If no palette or histogram chunk is provided, a decoder can develop
its own, at the cost of an extra pass over the image data in the IDAT
chunks. Alternatively, a default palette (probably a colour cube) can be
used.

See also 11.6: Suggested palettes.

13 Editors and extensions

13.1 Additional chunk types

The provisions of this International Standard may be extended by adding
new chunk types, which may be either private or public. Applications can
use private chunk types to carry data that is not of interest to other
people's applications.

Decoders shall be prepared to encounter unrecognized public or private
chunk type codes. The chunk naming conventions (see 5.4: Chunk naming
conventions) enable critical/ancillary, public/private and safe to
copy/unsafe to copy chunks to be distinguished.

Additional public PNG chunk types are defined in the document Register
of PNG extensions [PNG-EXTENSIONS]. Chunks described there are expected
to be less widely supported than those defined in this International
Standard. However, application authors are encouraged to use those chunk
types whenever appropriate for their applications. Additional chunk
types can be proposed for inclusion in that list by contacting the PNG
Registration Authority (see 4.9: Registration).

New public chunks will only be registered if they are of use to others
and do not violate the design philosophy of PNG. Chunk registration
is not automatic, although it is the intent of the Registration
Authority that it be straightforward when a new chunk of potentially
wide application is needed. The creation of new critical chunk types is
discouraged unless absolutely necessary.

13.2 Behaviour of PNG editors

A "PNG editor" is defined as a program that modifies some ancillary
information and preserves other ancillary information in a PNG
datastream. Two examples of PNG editors are a program that adds or
modifies text chunks, and a program that adds a suggested palette to a
truecolour PNG datastream. Ordinary image editors are not PNG editors
because they usually discard all unrecognized information while reading
in an image.

To allow new chunk types to be added to PNG, it is necessary to
establish rules about the ordering requirements for all chunk
types. Otherwise a PNG editor does not know what to do when it
encounters an unknown chunk.

EXAMPLE Consider a hypothetical new ancillary chunk type that is
safe-to-copy and is required to appear after PLTE if PLTE is present. If
a program attempts to add a PLTE chunk and does not recognize this
new chunk, it may insert the PLTE chunk in the wrong place, namely
after the new chunk. Such problems could be prevented by requiring PNG
editors to discard all unknown chunks, but that is a very unattractive
solution. Instead, PNG requires new ancillary chunks not to have
ordering restrictions like this.

To prevent this type of problem while allowing for future extension,
constraints are placed on both the behaviour of PNG editors and the
allowed ordering requirements for chunks. The safe-to-copy bit defines
the proper handling of unrecognized chunks in a datastream that is being
modified.

a) If a chunk's safe-to-copy bit is 1, the chunk may be copied to
a modified PNG datastream whether or not the PNG editor recognizes
the chunk type, and regardless of the extent of the datastream
modifications.

b) If a chunk's safe-to-copy bit is 0, it indicates that the chunk
depends on the image data. If the program has made any changes to
critical chunks, including addition, modification, deletion, or
reordering of critical chunks, then unrecognized unsafe chunks shall not
be copied to the output PNG datastream. (Of course, if the program does
recognize the chunk, it can choose to output an appropriately modified
version.)

c) A PNG editor is always allowed to copy all unrecognized ancillary
chunks if it has only added, deleted, modified, or reordered ancillary
chunks. This implies that it is not permissible for ancillary chunks to
depend on other ancillary chunks.

d) PNG editors shall terminate on encountering an unrecognized critical
chunk type, because there is no way to be certain that a valid
datastream will result from modifying a datastream containing such a
chunk. (Simply discarding the chunk is not good enough, because it might
have unknown implications for the interpretation of other chunks.) The
safe/unsafe mechanism is intended for use with ancillary chunks. The
safe-to-copy bit will always be 0 for critical chunks.

The rules governing ordering of chunks are as follow.

e) When copying an unknown unsafe-to-copy ancillary chunk, a PNG editor
shall not move the chunk relative to any critical chunk. It may relocate
the chunk freely relative to other ancillary chunks that occur between
the same pair of critical chunks. (This is well defined since the editor
shall not add, delete, modify, or reorder critical chunks if it is
preserving unknown unsafe-to-copy chunks.)

f) When copying an unknown safe-to-copy ancillary chunk, a PNG editor
shall not move the chunk from before IDAT to after IDAT or vice
versa. (This is well defined because IDAT is always present.) Any other
reordering is permitted.

g) When copying a known ancillary chunk type, an editor need only
honour the specific chunk ordering rules that exist for that chunk
type. However, it may always choose to apply the above general rules
instead.

These rules are expressed in terms of copying chunks from an input
datastream to an output datastream, but they apply in the obvious way if
a PNG datastream is modified in place.

See also 5.4: Chunk naming conventions.

It is recommended that the tIME chunk is not modified by PNG editors
that do not change the image data. The Creation Time keyword in the
tEXt, zTXt and iTXt chunks may be used for a user-supplied time.

13.3 Ordering of chunks

13.3.1 Ordering of critical chunks

Critical chunks may have arbitrary ordering requirements, because PNG
editors are required to terminate if they encounter unknown critical
chunks.

EXAMPLE IHDR has the special ordering rule that it shall always appear
first. A PNG editor, or indeed any PNG-writing program, shall know
and follow the ordering rules for any critical chunk type that it can
generate.

13.3.2 Ordering of ancillary chunks

The strictest ordering rules for an ancillary chunk type are:

a) Unsafe-to-copy chunks may have ordering requirements relative to
critical chunks.

b) Safe-to-copy chunks may have ordering requirements relative to IDAT.

The actual ordering rules for any particular ancillary chunk type may be
weaker. See for example the ordering rules for the standard ancillary
chunk types in 5.6: Chunk ordering.

Decoders shall not assume more about the positioning of any ancillary
chunk than is specified by the chunk ordering rules. In particular, it
is never valid to assume that a specific ancillary chunk type occurs
with any particular positioning relative to other ancillary chunks.

EXAMPLE It is unsafe to assume that a particular private ancillary chunk
occurs immediately before IEND. Even if it is always written in that
position by a particular application, a PNG editor might have inserted
some other ancillary chunk after it. But it is safe to assume that the
chunk will remain somewhere between IDAT and IEND.

14 Conformance

14.1 Introduction

14.1.1 Objectives

This clause addresses conformance of PNG datastreams, PNG encoders, PNG
decoders, and PNG editors.

The primary objectives of the specifications in this clause are:

a) to promote interoperability by eliminating arbitrary subsets of, or
extensions to, this International Standard;

b) to promote uniformity in the development of conformance tests;

c) to promote consistent results across PNG encoders, decoders, and
editors;

d) to facilitate automated test generation.

14.1.2 Scope

Conformance is defined for PNG datastreams and for PNG encoders,
decoders, and editors.

This clause addresses the PNG datastream and implementation requirements
including the range of allowable differences for PNG encoders, PNG
decoders, and PNG editors. This clause does not directly address the
environmental, performance, or resource requirements of the encoder,
decoder, or editor.

The scope of this clause is limited to rules for the open interchange of
PNG datastreams.

14.2 Conformance

14.2.1 Conformance of PNG datastreams

A PNG datastream conforms to this International Standard if the
following conditions are met:

a) The PNG datastream contains a PNG signature as the first content (see
5.2 PNG file signature).

b) With respect to the chunk types defined in this International
Standard:

* The PNG datastream contains as its first chunk, an IHDR chunk,
immediately following the PNG signature.

* The PNG datastream contains as its last chunk, an IEND chunk.

c) No chunks or other content follow the IEND chunk.

d) All chunks contained therein match the specification of the
corresponding chunk types of this International Standard. The PNG
datastream shall obey the relationships among chunk types defined in
this International Standard.

e) The sequence of chunks in the PNG datastream obeys the ordering
relationship specified in this International Standard.

f) All field values in the PNG datastream obey the relationships
specified in this International Standard producing the structure
specified in this International Standard.

g) No chunks appear in the PNG datastream other than those specified in
this International Standard or those defined according to the rules for
creating new chunk types as defined in this International Standard.

h) The PNG datastream is encoded according to the rules of this
International Standard.

14.2.2 Conformance of PNG encoders

A PNG encoder conforms to this International Standard if it satisfies
the following conditions:

a) All PNG datastreams that are generated by the PNG encoder are
conforming PNG datastreams.

b) When encoding input samples that have a sample depth that cannot be
directly represented in PNG, the encoder scales the samples up to the
next higher sample depth that is allowed by PNG. The data are scaled in
such a way that the high-order bits match the original data.

c) Only Latin-1 character set characters and the LineFeed character are
encoded into PNG text strings except for the international text chunk.

d) Numbers greater than 127 are used when encoding experimental or
private definitions of values for any of the method or type fields.

14.2.3 Conformance of PNG decoders

A PNG decoder conforms to this International Standard if it satisfies
the following conditions:

a) It is able to read any PNG datastream that conforms to this
International Standard including both public or private chunks whose
types may not be recognized.

b) It presents the graphical characteristics of the standardized
critical chunks, compression, filter and interlace methods and types in
any PNG datastream that conforms to this International Standard.

c) Unknown chunk types are handled as described in 5.4 Chunk naming
conventions. An unknown chunk type is not treated as an error unless it
is a critical chunk.

d) Unexpected values in fields of known chunks (for example, an
unexpected compression method in the IHDR chunk) are treated as errors.

e) All types of PNG images (indexed-colour, truecolour, greyscale,
truecolour with alpha, and greyscale with alpha) are processed. For
example, decoders which are part of viewers running on indexed-colour
display hardware shall reduce truecolour images to indexed format for
viewing.

f) Encountering an unknown chunk in which the ancillary bit is 0
generates an error.

g) A chunk with bit five of the third byte in the chunk type code set is
treated as an unknown chunk type.

h) All legal combinations of bit depth and colour type as defined in
10.2.2: IHDR Image header are supported.

i) An error is reported if an unrecognized value is encountered in the
compression method, filter method, or interlace method bytes of the IHDR
chunk.

j) When processing 16-bit greyscale or truecolour data in the tRNS
chunk, both bytes of the sample values are evaluated to determine
whether a pixel is transparent.

k) When processing an image compressed by compression method 0, the
decoder assumes no more than that the complete image data is represented
by a single compressed datastream that is stored in some number of IDAT
chunks.

l) No assumptions are made concerning the positioning of any ancillary
chunk other than those that are specified by the chunk ordering rules.

14.2.4 Conformance of PNG editors

A PNG editor conforms to this International Standard if:

a) it conforms to the requirements for PNG encoders;

b) it conforms to the requirements for PNG decoders;

c) it is able to encode all chunks that it decodes;

d) it preserves the ordering of the chunks presented within the rules in
5.6: Chunk ordering.

e) It properly processes the safe-to-copy bit information.

Annex A File conventions and Internet media type

(informative)

A.1 File name extension

On systems where file names customarily include an extension signifying
file type, the extension ".png" is recommended for PNG files. Lower case
".png" is preferred if file names are case-sensitive.

A.2 Internet media type

The internet media type "image/png" is the Internet Media Type for PNG
[RFC-2045], [RFC-2048]. It is recommended that implementations also
recognize the media type "image/x-png".

A.3 Macintosh file layout

In the Apple Computer Inc. Macintosh system, the following conventions
are recommended:

a) The four-byte file type code for PNG files is "PNGf". (This code has
been registered with Apple Computer Inc. for PNG files.) The creator
code will vary depending on the creating application.

b) The contents of the data fork is a PNG file exactly as described in
this International Standard.

c) The contents of the resource fork are unspecified. It may be empty or
may contain application-dependent resources.

d) When transferring a Macintosh PNG file to a non-Macintosh system,
only the data fork should be transferred.

Annex B Guidelines for new chunk types

(informative)

This International Standard allows extension through the addition
of new chunk types and new interlace, filter and compression
methods. Extensions might be made to the standard either for
experimental purposes or by organizations for internal use.

Chunk types that are intended for general, public, use, or are required
for specific application domains, should be standardized through
registration (see 4.9 Registration). The process for registration is
defined by the Registration Authority. The conventions for naming chunks
are given in 5.4: Chunk naming conventions.

Some guidelines for defining private chunks are given below.

a) Do not define new chunks that redefine the meaning of existing chunks
or change the interpretation of an existing standardized chunk, e.g. do
not add a new chunk to say that RGB and alpha values actually mean CMYK.

b) Minimize the use of private chunks to aid portability.

c) Avoid defining chunks that depend on total datastream contents, if
such chunks have to be defined, make them critical chunks.

d) For textual information that is representable in Latin-1 avoid
defining a new chunk type. Use a tEXt or zTXt chunk with a suitable
keyword to identify the type of information. For textual information
that is not representable in Latin-1 but which can be represented in
UTF-8, use an iTXt chunk with a suitable keyword.

e) Group mutually dependent ancillary information into a single
chunk. This avoids the need to introduce chunk ordering relationships.

f) Avoid defining private critical chunks.

Annex C Gamma and chromaticity

(informative)

Gamma is a numerical parameter used to describe approximations to
certain non-linear transfer functions encountered in image capture and
reproduction. Gamma is the exponent in a power law function, for example
the function:

intensity = (voltage + constant)gamma

which is used to model the non-linearity of CRT displays.

For the purposes of this International Standard, it is convenient to
consider five places in a general image pipeline at which non-linear
transfer functions may occur and which may be modelled by power
laws. The characteristic gamma associated with each is given a specific
name.

input-exponent

the characteristic of the image sensor.

encoding-exponent

the gamma of any transfer function applied before the image is stored in
a PNG datastream.

decoding-exponent

the gamma of any transfer function applied before the image is stored in
a frame buffer for display.

LUT-exponent

the gamma of the transfer function applied between the frame buffer and
the display device (typically this is applied by a Look Up Table).

output-exponent

the gamma of the display device. For a CRT, this is typically a value
close to 2.2.

It is convenient to define some additional entities that describe some
composite transfer functions, or combinations of stages.

gamma

gamma = 1.0 / (decoding_exponent * display_exponent) This models the
transfer function that has been applied to the image data contained in a
PNG datastream. This gamma value is the exponent of the function mapping
display output intensity to samples in the PNG datastream.

display-gamma

display-exponent = LUT-exponent * output-exponent This value
characterizes the transfer function applied between the frame buffer and
the display surface of the display device.

end_to_end_exponent

the exponent of the function mapping image sensor input intensity to
display output intensity. This is generally a value in the range 1.0 to
1.5.

The PNG gAMA chunk is used to record the gamma value, and characterizes
any non-linear transfer function that has been applied before the
sample values are encoded in the PNG datastream. This information may
be used by decoders together with additional information about the
display environment in order to achieve, or approximate, a desired
end_to_end_exponent.

For additional information about this subject, see the book by Poynton
[POYNTON], especially chapter 6.

Background information about chromaticity and colour spaces may be
found in references [COLOUR-1], [COLOUR-2], [COLOUR-3], [COLOUR-4],
[COLOUR-5].

Annex D Sample Cyclic Redundancy Code

(informative)

The following sample code represents a practical implementation of the
CRC (Cyclic Redundancy Check) employed in PNG chunks. (See also ISO 3309
[ISO-3309] or ITU-T V.42 [ITU-T-V42] for a formal specification.)

The sample code is in the ISO C [ISO-9899] programming language. The
following hints may help non C users to read the code more easily.

Table D.1 -- Hints for reading ISO C code

&

Bitwise AND operator.

^

Bitwise exclusive-OR operator. (Caution: elsewhere in this document, ^
represents exponentiation.)


Bitwise right shift operator. When applied to an unsigned quantity, as
here, right shift inserts zeroes at the left.

!

Logical NOT operator.

++

"n++" increments the variable n. In "for" loops, it is applied after the
variable is tested.

0xNNN

0x introduces a hexadecimal (base 16) constant. Suffix L indicates a
long value (at least 32 bits).


   /* Table of CRCs of all 8-bit messages. */

   unsigned long crc_table[256];


   /* Flag: has the table been computed? Initially false. */

   int crc_table_computed = 0;


   /* Make the table for a fast CRC. */

   void make_crc_table(void)

   {

     unsigned long c;

     int n, k;


     for (n = 0; n < 256; n++) {

       c = (unsigned long) n;

       for (k = 0; k < 8; k++) {

         if (c & 1)

           c = 0xedb88320L ^ (c >> 1);

         else

           c = c >> 1;

       }

       crc_table[n] = c;

     }

     crc_table_computed = 1;

   }


   /* Update a running CRC with the bytes buf[0..len-1]--the CRC

      should be initialized to all 1's, and the transmitted value

      is the 1's complement of the final running CRC (see the

      crc() routine below)). */


   unsigned long update_crc(unsigned long crc, unsigned char *buf,

                            int len)

   {

     unsigned long c = crc;

     int n;


     if (!crc_table_computed)

       make_crc_table();

     for (n = 0; n < len; n++) {

       c = crc_table[(c ^ buf[n]) & 0xff] ^ (c >> 8);

     }

     return c;

   }


   /* Return the CRC of the bytes buf[0..len-1]. */

   unsigned long crc(unsigned char *buf, int len)

   {

     return update_crc(0xffffffffL, buf, len) ^ 0xffffffffL;

   }

Annex E Online resources

(informative)

Introduction

This annex gives the locations of some Internet resources for PNG
software developers. By the nature of the Internet, the list is
incomplete and subject to change.

Archive sites

This International Standard can be found at http://www.w3.org/TR/REC...
[to be completed when published].

PNG home page

There is a World Wide Web home page for PNG at
http://www.cdrom.com/pub/png/. This page is a central location for
current information about PNG and PNG-related tools.

Additional documentation and portable C code for deflate and inflate are
available from the Info-ZIP archives at the zlib home page.

Sample implementation and test images

A sample implementation in portable C can be accessed from the PNG home
page. Test images can also be accessed from this home page.

Electronic mail

Queries concerning PNG developments may be addressed to
png-info@uunet.uu.net or to png-group@w3.org.

Annex F Relationship to W3C PNG

(informative)

This International Standard is strongly based on W3C Recommendation PNG
Specification Version 1.0 which was reviewed by W3C members, approved
as a W3C Recommendation and published in October 1996 according to the
established W3C process. Subsequent amendments to the PNG Specification,
Version 1.1, have also been incorporated into this International
Standard.

A complete review of the document has been done by ISO/IEC/JTC 1/SC 24
in collaboration with W3C in order to transform this recommendation
into an ISO/IEC international standard. A major design goal during this
review was to avoid changes that will invalidate existing files, editors
or viewers that conform to W3C Recommendation PNG Specification Version
1.0.

The W3C PNG Recommendation was developed with major contribution from
the following people.

Editor (Version 1.0)

Thomas Boutell, boutell @ boutell.com

Editor (Version 1.1)

Glenn Randers-Pehrson, randeg @ alum.rpi.edu

Contributing Editor (Version 1.0)

Tom Lane, tgl @ sss.pgh.pa.us

Contributing Editor (Version 1.1)

Adam M. Costello, amc @ cs.berkeley.edu

Authors (Versions 1.0 and 1.1 combined)

Authors' names are presented in alphabetical order.

* Mark Adler, madler @ alumni.caltech.edu

* Thomas Boutell, boutell @ boutell.com

* John Bowler, jbowler @ acm.org

* Christian Brunschen, cb @ df.lth.se

* Adam M. Costello, amc @ cs.berkeley.edu

* Lee Daniel Crocker, lee @ piclab.com

* Andreas Dilger, adilger @ enel.ucalgary.ca

* Oliver Fromme, oliver @ fromme.com

* Jean-loup Gailly, jloup @ gzip.org

* Chris Herborth, chrish @ qnx.com

* Alex Jakulin, Aleks.Jakulin @ snet.fri.uni-lj.si

* Neal Kettler, neal @ westwood.com

* Tom Lane, tgl @ sss.pgh.pa.us

* Alexander Lehmann, alex @ hal.rhein-main.de

* Chris Lilley, chris @ w3.org

* Dave Martindale, davem @ cs.ubc.ca

* Owen Mortensen, ojm @ acm.org

* Keith S. Pickens, ksp @ rice.edu

* Robert P. Poole

* Glenn Randers-Pehrson, randeg @ alum.rpi.edu

* Greg Roelofs, newt @ pobox.com

* Willem van Schaik, willem @ schaik.com

* Guy Schalnat, gschal @ infinet.com

* Paul Schmidt, pschmidt @ photodex.com

* Michael Stokes, Michael_Stokes @ hp.com

* Tim Wegner, twegner @ phoenix.net

* Jeremy Wohl, jeremyw @ evantide.com

List of changes between W3C Recommendation PNG Specification Version 1.0
and this International Standard

Editorial changes

The document has been reformatted according to the requirements of ISO:

a) a concepts clause has been introduced;

b) conformance for files, editors and viewers has been defined in a
conformance clause.

Technical changes

a) Bytes in Latin-1 previously discouraged are now not allowed;

b) in this International Standard, decoders are required to report
an error if an unrecognized interlace method is encountered in the
interlace method bytes of the IHDR chunk.

c) new chunk types introduced in PNG version 1.1 have been incorporated
(iCCP, sRGB, sPLT). A new chunk for international text (iTXt) has also
been added. In accord with version 1.1, the scope of the 31-bit limit
on chunk lengths and image dimensions has been extended to apply to all
four-byte unsigned integers. The value -231 is not allowed.

Bibliography

[COLOUR-1] Hall, Roy, Illumination and Color in Computer Generated
Imagery. Springer-Verlag, New York, 1989. ISBN 0-387-96774-5.

[COLOUR-2] Kasson, J., and W. Plouffe, "An Analysis of Selected Computer
Interchange Color Spaces", ACM Transactions on Graphics, vol 11 no 4
(1992), pp 373-405.

[COLOUR-3] Lilley, C., F. Lin, W.T. Hewitt, and T.L.J. Howard, Colour
in Computer Graphics. CVCP, Sheffield, 1993.  ISBN 1-85889-022-5. Also
available from http://info.mcc.ac.uk/MVC/training/gravigs/colour/

[COLOUR-4] Stone, M.C., W.B. Cowan, and J.C. Beatty, "Color gamut
mapping and the printing of digital images", ACM Transactions on
Graphics, vol 7 no 3 (1988), pp 249-292.

[COLOUR-5] Travis, David, Effective Color Displays --- Theory and
Practice. Academic Press, London, 1991. ISBN 0-12-697690-2.

[GAMMA-FAQ] Poynton, C., "Gamma FAQ".
URL:http://www.inforamp.net/~poynton/Poynton-colour.html

[ICC]The International Color Consortium. http://www.color.org/

[ITU-R-BT709] International Telecommunications Union, "Basic Parameter
Values for the HDTV Standard for the Studio and for International
Programme Exchange", ITU-R Recommendation BT.709 (formerly CCIR
Rec. 709), 1990.

[ITU-T-V42] International Telecommunications Union, "Error-correcting
Procedures for DCEs Using Asynchronous-to-Synchronous Conversion", ITU-T
Recommendation V.42, 1994, Rev. 1.

[PAETH] Paeth, A.W., "Image File Compression Made Easy", in Graphics
Gems II, James Arvo, editor. Academic Press, San Diego, 1991. ISBN
0-12-064480-0.

[POSTSCRIPT] Adobe Systems Incorporated, PostScript Language Reference
Manual, 2nd edition. Addison-Wesley, Reading, 1990. ISBN 0-201-18127-4.

[POYNTON] Poynton, Charles A., A Technical Introduction to Digital
Video. John Wiley and Sons, Inc., New York, 1996. ISBN 0-471-12253-X.

[PNG-1.0] W3C Recommendation, "PNG (Portable Network Graphics)
Specification, Version 1.0", 1996. Available in several
formats from http://www.w3.org/Graphics/PNG/ and from
ftp://ftp.uu.net/graphics/png/documents/png-1.0-*

[PNG-1.1] PNG Development Group, "PNG (Portable Network Graphics)
Specification, Version 1.1", 1999.  Available in several formats from
ftp://ftp.uu.net/graphics/png/documents/png-1.1-pdg.*

[PNG-EXTENSIONS] PNG Group, "Extensions to the PNG 1.1
Specification, Version 1.1.0". Available in several formats from
ftp://ftp.uu.net/graphics/png/documents/pngext-.*

[SMPTE-170M] Society of Motion Picture and Television Engineers,
"Television -- Composite Analog Video Signal -- NTSC for Studio
Applications", SMPTE-170M, 1994.

[sRGB] M. Stokes, M. Anderson, S. Chandrasekar and R. Motto,
A Standard Default Color Space for the Internet - sRGB.
http://www.w3.org/Graphics/Color/sRGB. The key portions of this document
are being adopted with revisions into: International Electrotechnical
Commission, "Colour Measurement and Management in Multimedia Systems and
Equipment - Part 2-1: Default RGB Colour Space - sRGB", IEC 61966-2-1.

[ZL] J. Ziv and A. Lempel, A Universal Algorithm for Sequential Data
Compression, IEEE Transactions on Information Theory, Vol. IT-23, No. 3,
pp. 337 - 343, 1977.

Additional documentation and portable C code for deflate
and inflate are available from the zlib home page at
http://www.cdrom.com/pub/infozip/zlib/ .

2nd CD Text ISO/IEC 15948: xxxx


2nd CD Text ISO/IEC 15948: xxxx


86


85


i


�