|
- Typically, the converted data are stored in the
Portable Document Format (PDF), developed by Adobe
Systems and now the industry standard for
electronic publishing. The input data can also
be converted into other formats, such as XML, HTML, OEB,
LIT, Glassbook, and RCA Gemstar eBooks.
- For black-and-white image files, the input resolution
is 600 dpi, gray scale; for color image files, the input
resolution is 300 dpi. The output resolution is
typically 52 dpi, which can be raised to as much as 6,000
dpi or set to the customer's specific requirements.
- The conversion can take the form of a single layer
(Clean Text) or a double layer (Image with searchable
ASCII).
- The final products can be stored on CDs or DVDs, or
hosted online. A typical CD holds 650 MB
to 700 MB of recordable storage and a typical DVD
holds 8 GB of recordable storage.
|
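As a rough illustration of what the input resolutions above imply for storage, the sketch below computes the uncompressed size of one scanned page. The page size (US Letter, 8.5 x 11 inches) and the bit depths (8 bits per pixel for gray scale, 24 bits per pixel for color) are assumptions for the example, not values taken from this specification, and the function name is hypothetical.

```python
def uncompressed_bytes(width_in, height_in, dpi, bits_per_pixel):
    """Return the raw (uncompressed) size in bytes of one scanned page."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    total_bits = width_px * height_px * bits_per_pixel
    # Round up to a whole number of bytes.
    return (total_bits + 7) // 8


# Assumed page size: US Letter, 8.5 x 11 inches.
gray_600 = uncompressed_bytes(8.5, 11, 600, 8)     # 600 dpi gray scale
color_300 = uncompressed_bytes(8.5, 11, 300, 24)   # 300 dpi color

print(gray_600)   # 33660000 bytes, about 33.7 MB
print(color_300)  # 25245000 bytes, about 25.2 MB
```

Under these assumptions, only a few dozen uncompressed pages would fit on one CD, which is why compressed output formats matter for delivery on physical media.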
1.1 Single Layer Conversion (Clean Text)
|
- Single layer conversion of input documents, in the
agreed-upon format, except those determined by the
processor to be unsuitable for manufacturing and those
excepted in the following points for reasons of their
content.
- Agreed-upon output formats may include the following:
|
| 1.1.1 PDF requirements |
- The PDF page size is the same as the final
trim page size.
- The page text is centered on each PDF page.
- The last page in a book shall be blank.
- The page count of a book shall be divisible by 4.
|
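The two page-count rules in the PDF requirements (blank last page, total divisible by 4) can be combined into a small calculation. This is a minimal sketch under one reading of those rules: the blank pages are appended after the content, so at least one blank page is always added. The function name and the `signature` parameter are hypothetical.

```python
def blank_pages_needed(content_pages, signature=4):
    """Return how many blank pages to append so that the last page is
    blank and the total page count is a multiple of `signature`."""
    if content_pages < 0:
        raise ValueError("content_pages must not be negative")
    # Reserve one blank page, then round up to the next multiple.
    total = -(-(content_pages + 1) // signature) * signature
    return total - content_pages


for pages in (10, 11, 12):
    print(pages, blank_pages_needed(pages))
# 10 content pages -> 2 blank pages (12 total)
# 11 content pages -> 1 blank page  (12 total)
# 12 content pages -> 4 blank pages (16 total)
```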
| 1.1.2 XML |
- The conversion provides the DTD
|
| 1.1.3 HTML |
- One file per book
- The title, subtitle, author, date of publication and
body text shall be converted to and saved as an edited
digital ASCII file with a word accuracy of 99.99 % or
better, that is, a word error rate of 0.01 % or less,
measured against the original text (the "primary
file").
- The processor establishes the count of pages in each
book.
- The processor establishes the character count in each
book.
|
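The specification does not say how the word error rate of the primary file is measured. One common approach, shown here only as an assumed method, is the word-level edit distance (substitutions, insertions and deletions) between the original text and the converted text, divided by the number of words in the original. The function name is hypothetical.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance between two texts, divided by the
    number of words in the reference text."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference text is empty")
    # prev[j] = edit distance between the words seen so far in ref
    # and the first j words of hyp.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# One wrong word out of four gives an error rate of 0.25 (25 %).
print(word_error_rate("the quick brown fox", "the quick brawn fox"))
```

A converted file could then be checked by comparing the returned rate with whatever threshold the parties have agreed on.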
1.2 Double Layer Conversion (Image with searchable ASCII)
|
- Double layer conversion of input documents, in the
agreed-upon format, except those determined by the
processor to be unsuitable for manufacturing and those
excepted in the following points for reasons of their
content.
- Agreed-upon output formats may include the following:
|
| 1.2.1 TIFF requirements |
- One file per page
- CCITT Group 4
- File naming convention that identifies the cover-to-cover
order of the images (example: the file name for the first
page could be page00001, the file name for the second page
could be page00002)
|
| 1.2.2 ASCII requirements |
- One file per page
- Standard ASCII text file from OCR
- An unedited file consists of the converted text (from
OCR) with error markings intact
- File naming convention that links the text to its
page image (example: if the "image" file name is
page00001, then the "text" file name is text00001).
- The service provider establishes the count of pages
in each book.
- The service provider establishes the character count
in each book.
|
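The naming convention described in the TIFF and ASCII requirements can be sketched as follows. The prefixes and the five-digit, zero-padded page number come from the examples above (page00001, text00001); the function names are hypothetical, and file extensions are left out because the specification does not give them.

```python
def page_file_names(page_number, width=5):
    """Return the (image, text) file names for one page, numbered in
    cover-to-cover order starting at 1."""
    if page_number < 1:
        raise ValueError("page numbers start at 1")
    suffix = f"{page_number:0{width}d}"
    return f"page{suffix}", f"text{suffix}"


def text_name_for(image_name):
    """Derive the text file name that belongs to a page image name."""
    if not image_name.startswith("page"):
        raise ValueError("not a page image name: " + image_name)
    return "text" + image_name[len("page"):]


print(page_file_names(1))          # ('page00001', 'text00001')
print(page_file_names(2))          # ('page00002', 'text00002')
print(text_name_for("page00001"))  # text00001
```

Because the numbers are zero-padded to a fixed width, sorting the file names alphabetically also sorts the pages in cover-to-cover order.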