Our publishing experts can assist an author/publisher in indexing the book to make the content of the book quickly accessible to the reader. An index serves as a road map to specific information in the book and contributes to the value of the book as a reference source.
The true test of an index is whether the degree of detail, both in subject selection and in identification of the relationship among the subject matches the need of the reader.
|
OCR – Read & Clean UP
We extract the text from the image files and store them in multiple file formats such as:
- Adobe PDF
- MS Word
- MS Excel
- Rich Text Format
- ASCII Text, Unicode Text
- XML and
- Other word processing formats such as WordPerfect, Star Office etc.
Text Proofing
The extracted text will be proof read by comparing with the original source file received – PDF through eye-ball comparison. This process ensures the expected level of text accuracy (99.995%).
Level Identification
The e-publishing team will first analyse the different styles applied in the source file, for instance – the font, emphasis and separators - comma, colon, semi-colon to differentiate the main heading and its corresponding sub / sub-sub headings. After completion of style analysis, they differentiate the main heading, sub / sub-sub heading, page numbers and cross references – “see”, “see also”, “see under” by tab delimiter.
Formatting
The level identified text is formatted as follows:
- The main and sub / sub-sub heading text are changed to title case except for connecting words.
- The personal names are standardized to American name format
- The cross reference words like “see”, “see also”, “see under” are changed to lower case and the other cross reference text are capitalized
- All the end punctuations like – comma, colon, semi-colon are removed.
Quality Assurance (QA)
At every stage of production, the data is run through the in-process quality check to ensure the data quality – typo graphical error, spelling error, text mismatch and incorrect level.
Our quality assurance expert check the production completed data by random sampling method to ensure the final delivery quality. |