Prob
From Djvu
Current revision as of 14:32, 21 November 2006
<plain_html>
<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns="http://www.w3.org/TR/REC-html40">
<head> <meta http-equiv=Content-Type content="text/html; charset=windows-1252"> <meta name=ProgId content=Word.Document> <meta name=Generator content="Microsoft Word 9"> <meta name=Originator content="Microsoft Word 9"> <link rel=File-List href="./what_is_files/filelist.xml"> <link rel=Edit-Time-Data href="./what_is_files/editdata.mso"> <title> </title> <style> </style>
</head>
<body lang=EN-US link="#124a9c" vlink=purple style='tab-interval:36.0pt'>
<![if !vml]><img width=445 height=53 src="what_is_files/image001.gif" v:shapes="_x0000_i1025"><![endif]><o:p></o:p>
<![if !vml]><img width=188 height=192 src="what_is_files/image002.jpg" align=right border=0 v:shapes="_x0000_s1026"><![endif]>Over 90 percent of the information in the world is still on paper. Many of those paper documents include color graphics and/or photographs that represent significant invested value. And almost none of that rich content is on the Internet. <o:p></o:p>
That's
because scanning such documents and getting them onto a Web site has been
problematic at best. At the high resolution necessary to ensure the readability
of the text and to preserve the quality of the images, file sizes become far
too bulky for acceptable download speed. Reducing resolution to achieve
satisfactory download speed means forfeiting quality and legibility. Conventional
web formats such as JPEG, GIF, and PNG produce prohibitively large image files
at decent resolution. As a result, Web site content developers have been
largely unable to leverage existing printed materials.
<![if !vml]><img width=168 height=168
src="what_is_files/image003.jpg" align=left border=0 v:shapes="_x0000_s1027"><![endif]>DjVu (pronounced "dйjа
vu") is a new image compression technology developed since 1996 at
AT&T Labs to solve precisely that problem. DjVu allows the distribution on
the Internet of very high resolution images of scanned documents, digital
documents, and photographs. DjVu allows content developers to scan
high-resolution color pages of books, magazines, catalogs, manuals, newspapers,
historical or ancient documents, and make them available on the Web. <o:p></o:p>
Information that was previously trapped in hard copy form can now be made available to wide audience. <o:p></o:p>
<![if !vml]><img width=232 height=53 src="what_is_files/image004.gif" v:shapes="_x0000_i1026"><![endif]>Research institutions, libraries, and government agencies can give access to their archives. Companies can distribute internal documents on their intranets. <o:p></o:p>
<![if !vml]><img width=235 height=296
src="what_is_files/image005.jpg" align=right border=0 v:shapes="_x0000_s1028"><![endif]>The commercialization of DjVu is
handled by Seattle-based <a href="http://www.lizardtech.com/">LizardTech Inc.</a>
in partnership with AT&T Labs. DjVu is an open standard. The file format
specification, as well as an open source implementations of the decoder (and
part of the encoder) are available.
<![if !vml]><img border=0 width=325 height=52
src="what_is_files/image006.gif" v:shapes="_x0000_i1027"><![endif]>
DjVu typically achieves <![if !vml]><img width=309 height=226
src="what_is_files/image007.jpg" align=left border=0 v:shapes="_x0000_s1029"><![endif]>compression ratios about 5 to 10
times better than existing methods such as JPEG and GIF for color documents,
and 3 to 8 times than TIFF for black and white documents. Scanned pages at 300
DPI in full color can be compressed down to 30 to 100KB files from 25MB..
Black-and-white pages at 300 DPI typically occupy 5 to 30KB when compressed.
This puts the size of high-quality scanned pages within the realm of an average
HTML page (which is typically around 50KB). <o:p></o:p>
For color document images that contain both text and pictures, DjVu files are typically 5 to 10 times smaller than JPEG at similar quality. For black-and-white pages, DjVu files are typically 10 to 20 times smaller than JPEG and five times smaller than GIF. DjVu files are also about 3 to 8 times smaller than black and white PDF files produced from scanned documents (scanned documents in color are impractical in PDF). <o:p></o:p>
In addition to scanned documents, DjVu can also be applied to documents produced electronically in formats such as Adobe's PostScript or PDF. In that case, the file sizes are between 15 to 20KB per page at 300 DPI. <o:p></o:p>
<![if !vml]><img width=185 height=130 src="what_is_files/image008.jpg" align=left border=0 v:shapes="_x0000_s1030"><![endif]>The DjVu plug-in is available for standard Web browsers on various platforms. The DjVu plug-in allows for easy panning and zooming of document images. A unique on the fly decompression technology allows images that normally require 25MB of RAM to be decompressed to require only 2MB of RAM. <o:p></o:p>
<![if !vml]><img width=202 height=129
src="what_is_files/image009.jpg" align=right border=0 v:shapes="_x0000_s1031"><![endif]>Conventional image viewing software
decompresses images in their entirety before displaying them. This is
impractical for high-resolution document images since they typically go beyond
the memory capacity of many PCs, causing excessive disk swapping. DjVu, on the
other hand, never decompresses the entire image,
<![if !vml]><img width=181 height=137
src="what_is_files/image010.jpg" align=left border=0 v:shapes="_x0000_s1032"><![endif]>but instead keeps the image in
memory in a compact form, and decompresses the piece displayed on the screen in
real time as the user views the image. Images as large as 2,500 pixels by 3,300
pixels (a standard page image at 300 DPI) can be downloaded and displayed on
very low-end PCs. <o:p></o:p>
<![if !vml]><img width=148 height=134 src="what_is_files/image011.jpg" align=right border=0 v:shapes="_x0000_s1033"><![endif]>The DjVu format is progressive. Users get an initial version of the page very quickly, and the visual quality of the page progressively improves as more bits arrive. For example, the text of a typical magazine page would appear in just three seconds over a 56Kbps modem connection. In another second or two, the first versions of the pictures and backgrounds will appear. Then, after a few more seconds, the final full-quality version of the page is completed. <o:p></o:p>
One
of the main technologies behind DjVu is the ability to separate an image into a
background layer (i.e., paper texture and pictures) and foreground layer (text
and line drawings). Traditional image compression techniques are fine for
simple photographs, but they drastically degrade sharp color transitions
between adjacent highly contrasted areas - which is why they render type so
poorly. By separating the text from the backgrounds, DjVu can keep the text at
high resolution (thereby preserving the sharp edges and maximizing legibility),
while at the same time compressing the backgrounds and pictures at lower
resolution with a wavelet-based compression technique.
DjVu is used by many commercial and non-commercial web sites on the Web today. <o:p></o:p>
<![if !supportEmptyParas]> <![endif]><o:p></o:p>
</body>
</html>
</plain_html>