

the output quality depends on much about how it was created." The determining variable is, as daisymina8912 stated: With that said Acrobat, in particular Acrobat X, or other products can provide a relatively 'ok' export to Word / RTF from an untagged PDF. After all, one of Tagged PDFs 'design criteria' is to support

Retaining "format" upon export of PDF page content is often a 'no-start' if the PDF is not, at a minimum, a workable "Tagged PDF". convert pdf to word and remain the format?" "best-estimate" of the tag structure (after which you'd do manual cleanup). Then you can manually develop the tagged PDF or use Acrobat to get a Use Acrobat's OCR mode Formatted Text & Graphics (Acrobat 8.x or earlier) or ClearScan (Acrobat 9.x). With that said, if the PDF content is a scanned image then you'd Alternatively, you could let Acrobat make a "best-estimate" tagging If the PDF is not a well-formed, tagged PDF or is not tagged at all you can use Acrobat Professional to manuallyĭevelop the tagged PDF. | Making content accessible to users with visual impairments | Conversion to other common file formats (such as HTML, XML, and RTF) with document structure and basic styling information preserved | Processing text for such purposes as searching, indexing, and spell-checking | Automatic reflow of text and associated graphics to fit a page of a different size than was assumed for the original layout | Simple extraction of text and graphics for pasting into other applications PDF’s logical structure facilities provide a mechanism for incorporating structural information about a document’s content into a PDF file. It defines a set of standard structure types and attributes that allow page content (text, graphics, and images) to be extracted and reused for other purposes.


Tagged PDF is a stylized use of PDF that builds on the logical structure framework. The content author must master content in a well-formed, logical hierarchy in conjunction with using built-in paragraph/character tagsĬurrently, three applications have adequate tag management.Īdobe FrameMaker, Adobe InDesign and MS Word (via Adobe PDFMaker or (for Office 2007) the Office save as to PDF or XPS.įrom this, a well-formed Tagged output PDF may be provided. Such comes from an authoring application that has adequate tag management provided. That is to say the PDF is compliant to ISO 32000-1 discussion of logical hierarchy and tagged PDF. You would need to start with a well-formed, tagged PDF. could anybody tell me how to convert pdf to word and remain the format?
