Tuesday, March 15, 2016

Word Corrupt Document Checker - Open Source Freeware

https://wordcorruptdocchecker.codeplex.com/ - "I created this tool to fix corrupt Word documents (non-binary, just open xml files). Mainly this applies to the .docx format. It basically just checks through a list of corrupt xml tags that I've come across and attempts to fix the tags. It will create a copy of the file you provide and attempt to fix the copy. The original file should be left unchanged."

A well coded app for fixing errors of the type "The end tag does not match the start tag" where one or more "MC" tags have gotten out of order. It could be used as an alternative to my Savvy DOCX Recovery which is also open source freeware but which doesn't always work.

A Comprehensive Guide on How to Repair or Recover Corrupt Word Documents

Microsoft Word Math Error
Typical error from a Word 2013 or before
installation when loading a corrupt file
This is a cross-post from an answer I made to a Quora question a few days ago and I have been trying to perfect, although this version is better edited. It attempts to give a comprehensive answer of how to recover from corrupt Word files that either won't open or is behaving strangely. For quick fixes for your corrupt DOC or DOCX file, try these 3 pieces of freeware: Repair My Word(for a DOC file), Savvy DOCX Recovery(for a DOCX file), Word Corrupt Document Checker(also for a DOCX file - better than Savvy DOCX Recovery for certain kinds of corruption). Thanks to Rohn007 for editing advice.


Thankfully these days, the number of Word files becoming corrupt, may be diminishing and the new since Word 2007 DOCX file format, is tailor made to allow for easier file repair, however a corrupt Word file is still a common experience and it often happens to valuable documents at just the wrong time in their composition. A corrupt file is one that opens but is missing previously written parts of the document or otherwise acting abnormally. They are also documents which no longer open, instead throwing an error message, like the one above, when you try to work with them.
Below are steps you can follow and links to open source software, freeware, commercial software and videos you can watch helping you to recover from Word document corruption. BEFORE YOU DO ANYTHING ELSE, MAKE COPIES OF YOUR CORRUPT FILE AND ONLY WORK ON THE COPIES. SOME OF THE METHODS DESCRIBED BELOW ARE DESTRUCTIVE TO THE CONTENT OF YOUR DOCUMENTS!!!
Note, DOCX files are in reality conventionally zipped collections of mostly XML subfiles. You can see this by changing the extension of the file from .docx to .zip in the files properties (the quickest way is just to add .zip to the ,docx), by right clicking on the file and choosing to rename or renaming starting by hitting the F2 key. DOC files, an earlier format, are complicated collections of objects too, but are mostly not stored as conventionally zipped elements. I have seen zipped up elements of DOC files toward the end of the code but not a document.xml which contains all the text in DOCX files. In fact in DOC documents often the text is stored in plain text mode. You can see this by opening a copy of your corrupt file in Notepad or by changing the extension of the file from DOC to TXT and opening it in WordNotepad destroys/corrupts the binary structure of DOC files so only look at it this way on a copy or you will be adding to your file's corruption!
File corruption can occur for a number of reasons including software crashes, CPU mistakes, computers overheating (due to weather or dust buildup), power outages, viruses, partial hdd and ssd disk failures, other kind of drive failures including unsafe removal of usb and other connected connected memory cards. Also unfortunately these days a lot of usb memory and memory cards are "fake" in that they tell the computer they are one size, say 128GB which is a size you supposedly bought it at, but in reality the memory stick or card only holds 8GB. When the fake memory reaches its real storage limit, it begins overwriting and corrupting the oldest files or simply throwing away the new data leading in the latter case to only a partial newly saved file or in the former case a partially corrupted or missing older file(s). You can test your USB or "camera" memory with the software described here.
Depiction of the 2 Corrupting strategies fake memory uses with overflow data
What the two kinds of fake memory do to data what's they reach capacity.

Corrupt Word File Recover & Repair Methods

  • Find Previous, Lost, Deleted, Temporary and Unsaved Versions of Your File - the idea here is to get a version of your file that if not identical to the last good instance of what you were working on, is at least close to what you need to start reconstructing your lost work.

    • Previous Versions

      • Recovering Previous Versions from the tab in Explorer, requires that System Protection be turned for the disk where the file reside and a System Restore Point has been made. Wikipedia says: "Shadow copies are created automatically once per day, or manually when triggered by the backup utility or installer applications which create a restore point," so only copies made during those processes can be restored.
      • System Restore is not configured to be on by default in Windows 8, 8.1 and 10, it has to be turned on manually. Also for earlier versions, the default behavior is for the feature to be only turned on for the operating system drive, not for other partitions or disks.
      • Shadow Explorer (freeware) - again requires that System Protection be turned on and a restore point made. Works with all versions of Windows after XP, may not work with XP except on a network that has the Shadow Copy/System Restore service set up.
      • Z-VSScopy (freeware) - ditto.
Z-VSScopy Settings tab
Z-VSScopy screenshot.

        • Previous Version File Recoverer (open source freeware) - my piece of software that focuses on individual files which you browse to and select from the current (non-shadow copy) of the file. A list of the shadow copies of the file are shown and you can select and save the one desired. Also requires that System Protection be turned on and a restore point made and will not work with XP except on a network that has been set up for it.
        • Using File History to recover previous versions of files - starting with Windows 8, a new "previous version" system was setup as the Shadow Copy/System Restore feature was perceived by Microsoft to be little used and too much of a drag on resources. File History requires configuration as well with a designation of external or network drive location for the backup copies of the files to reside. After that, every hour, Windows will check if you files from your Libraries, Contacts, Desktop and Favorites (and only these locations) and will make new backup copies if they have.

          To restore a file from a Previous version using this system, click on History on the Home tab of Explorer.
Location of File History Restore on the Home tab of Explorer
Click on History on the Home Tab of Explorer.

    • Lost Versions

      • Everything (freeware) - this program makes a quick index of all the files on your system and then provide a much faster file search interface than Windows provides, although Cortana's search capabilities on Windows 10 might rival it.
Everything with loaded file name query
Everything in action.

    • Deleted Versions

      • Check your Recycle Bin. Right click on your file and choose Restore.
      • Recuva (freeware) - find deleted file even after they have been removed from the Recycle Bin or been formatted over. This is because deleted files on the spinning platter drives are not actually erased until the file system needs the space. Instead they are just removed from the file index that Windows keeps.
Initial Recuva window where you select the file type to recover
Recuva screenshot displaying the window where you can choose what kind of data to recover.

      • PhotoRec (freeware) - also a favorite of many. The interface is not as fancy as Recuva but it works well.
PhotoRec recovery filetype choice screen
PhotoRec's recovery filetype configuration dialogue.

      • Restoration (freeware) - another oldy but goody in the category of restoring deleted files.
S2 Service Word Recovery Tools main interface
S2 Service Word Recovery Tools main interface.

    • Unsaved Versions

      • The default location may be C:\Users\[Your_User_Name]\AppData\Local\Microsoft
        or C:\Users\[Your_User_Name]\AppData\Roaming\Microsoft
        You will need to change the extension from ASD to DOC, not DOCX! ASD files are still in the old format DOC files.
      • If you need to find unsaved changes to files that have been already saved at least once, the default location may be C:\Users\[Your_User_Name]\AppData\Roaming\Microsoft\Word\. The actual path is stored in the Word Options under the Save region. Again the ASD extension needs to be changed to DOC, not DOCX.
Options-Save area with the Auto-Recover file location indicated
The AutoRecover file location.
Configuration window
PC Inspector File Recovery configuration window where the type of recovery is chosen.

      • Recover My Files (commercial)
      • Word Regenerator (commercial - Rohn007 recommends the umbrella program Office Regenerator which probably works the same. "On my 100GB data partition it ran for about 1 hour and found 691 files, 450 MB, going back almost 7 months! I don’t remember if that was the last time I used a wipe freespace tool on my drive. They definitely were not in my recycle bin. Don’t do first auto run. The interface provides lots of Filter criteria such as: date created range, last saved range, last printed range, title, num pages, editing time, revision number, file type, author. All of the recovered files I looked at were properly structured." - note the other undeleters above may be similarly effective. Try the demos first!)

    • If You Can Still Open the File - your file may be corrupt if symptoms appear such as Word: repeatedly renumbering the existing pages in the document, repeatedly rearranging the page breaks in the document, providing incorrect document layout and formatting, showing unreadable characters on the screen, displaying error messages during processing, causing the computer to hang when the document is loaded or any other unexpected behavior that cannot be attributed to the typical operation of the program. Note that it could also be that Microsoft Word itself or the entire Microsoft Office suite may need repairing or reinstalling. See here regarding steps on how to do this.

      • Make a few copies of your corrupt document and only work on the copies! Some these procedure will possibly destroy or make your file worse :-(.
      • Start Word in Safe Modeby holding down the CTL key as you click to start Word. View and save the file if it opens without error.
      • Start Word without loading macros by holding down the Shift key while starting Word. This can also be done by pasting in winword.exe /m in the Run commandCommand Prompt or the Search/Cortana field of Windows. Afterwords, try loading your document and see if it loads without corruption. If it does then you need to try to disable all macros you have in your template that start when Word starts and then re-enable them one at a time until you find the one that is causing the issue.
      • Change the template that you are using by renaming either the global template (Normal.dotm) if you using that, or find the one you are using and renaming that one.
The Normal.dotm template selected in its location in Explorer
My Word Normal.dotm template and its location (with my user name blurred out).

      • Start Word without it loading add-ins or the normal template by: starting it from a Command Prompt, the Run Box, or the Cortana/Search field and using the command: winword.exe /a. If your file opens normally without corruption, then there is an Add-in causing the trouble. In Word, go to File then Options and then Add-ins. In the panel windows that opens, look for which kind of Add-ins are active and then go to the bottom and choose to manage those types of Add-ins. Uncheck the running Add-ins one by one or all at once if you think you don't need them and then try reopening your file.
Cortana/Windows Search field
The Winword.exe /a command typed into the
Windows 10 Start Menu Search/Cortana field.

      • Change the print driver by adding another printer like another instance of the Microsoft XPS Document Writer, printing from that, verifying it worked, deleting the original printer, reinstalling it and then trying the first printer again.
      • Force Word to repair the file by starting an open dialog window in Word, selecting your file in the Explorer tree, and choosing Open and Repair by clicking on the tiny arrow to the right of the word Open on the open button and choosing the command from the drop down menu.
Where to find Open and Repair in the Word Open Window
Where to find the Open and Repair function in Word.

Where to find "Show formatting" on the Home menu
Word's toolbar displaying the Show formatting menu command icon.
      • If you text is not appearing after a certain point in your document, note the page number then switch to Web or Draft view. In this view, if you can now see paragraphs beyond the ones that appear in your Print view, just after where the truncation appears, delete the next paragraph table or object (maybe even an equation) in the new view. Switch now to Print Layout view and see if the truncation of the document below the presumably bad element, has gone away. If it has not, continue to switch to Draft or Web view and delete more further paragraphs or objects.
      • Rohn007's and Microsoft's advice is to group offending elements togethor. Per Roh007's reply to a post of mine: "If you can open a file and you have a lot of objects of one kind that you suspect are causing corruption you can group them togethor to possible stop future corruption from occurring. Grouping objects by first turning on the Selection pane. This can be found on the far right of the Home tab of the ribbon. The editing group of the Home tab has a dropdown button named Select. Click the Select button, and then click Selection Pane...Press the Ctrl button on your keyboard and then click each text box (or other object?) in the selection pane. Click the Group button under the Format tab. This will group all the objects together. As soon as you have all objects grouped on each page, save the document under a new name."

The Home Tab With Selection Menu and Section Pane Indicated

    • If You Can No Longer Open the FileMake a few copies of your corrupt document and only work on the copies! Some these procedure will possibly destroy or make your file worse :-(.

      • Repair the zip structure of the DOCX file if you are working with that kind of format document (and not the older DOC one). DOCX files are in reality conventionally zipped collections of mostly XML sub-files.Repairing the zip structure is sometimes all Word needs to stop throwing errors and open the file. You can repair the zip structure with the repair feature of some freeware zip suites like iZarc, or with the Repair feature of Corrupt Extractor for Microsoft Office, which I contracted to be custommed programmed for me. The program also sometimes recovers the text from corrupt DOCX files, as well as the media, such as images and inserted Excel spreadsheets.
Where to find the Zip repair feature
The zip repair feature in Corrupt Extractor for Microsoft Office.
      • Start Word in Safe Mode by holding down the Ctl key as you click to start Word. View and save the file if it opens.
Where to begin to find where to uncheck add-ins to stop them loading
The Add-in "tab" of the File -> Options window.

      • Start Word without it loading add-ins or the normal template by: starting it from a Command Prompt, the Run box, or the Cortana/ Search field and using the command: winword.exe /a. If your file opens, then there is an Add-in causing the trouble. Go to File then Options and then Add-ins. In the panel windows that opens, look for which kind of Add-ins are active and then go to the bottom and choose to manage those types of Add-ins. Uncheck the running Add-ins one by one or all at once if you think you don't need them and try reopening your file.
      • Start Word without loading macros by holding down the Shift key while starting Word. This can also be done by pasting in winword.exe /m in the Command Prompt, the Run box, or the Cortana/ Search field of Windows. Afterwords, try loading your document.
      • Check to see if you have any other copies of your file. Have you Emailed it recently? If so check your sent folder. Have you deleted a version? Check the Recycle Bin or use RecuvaRestorationRecover My FilesWord Regenerator mentioned earlier or use another technique to find a previous, lost, temporary or unsaved version of your file with the software mentioned above or using the techniques mentioned in the Advice section at the end of the article.
      • Open with Microsoft Word's in-built method Open and Repair by choosing the File menu, then Open, then Select file, then clicking the down arrow beside the open button and choosing Open and Repair. See above for an image.
      • Try changing the extension to DOC if a DOCX file or vice versa. 
      • Also look at the raw characters of a copy of your corrupt file in NotePadNotePad++WordPad or Word (for the latter two programs, change the extension to txt) that a lot of instances of the characters "PK" indicate a DOCX file and those two characters start the raw character parade of a normal zip file as well as a DOCX one. The characters "ÐÏ" normally start a DOC file and towards the end there are normally a few zipped up sub-files (as indicated by section starting with "PK" again) even in the DOC file format.
What a DOC file looks like in NotePad++
The raw characters at the beginning of DOC file as displayed in NotePad++.
      • Try opening the file in Draft view without updating the links by first switching the view to Draft on a blank document. Next open the Options and choose Advanced and click with the left mouse button to put checkmarks next to "Use Draft font in Draft and Outline Views" and "Show picture placeholders" instead of the images in the file. Also continuing in the Advanced Options scroll down to the General section of the window and uncheck the Update automatic links at open option. Now try to open the file. If it opens, repair it by looking for the truncations in Print Layout and removing objects from Draft or Web view just after the truncations as mentioned before.
Where to check to show Picture Placeholders and to use Draft fonts in Draft and Outline views
Checks to Show Picture Placeholders and to use Draft fonts in Draft and Outline views.
      • Insert the text into a new file by first choosing to an insert an Object, then on the window that opens choose the Create from File tab and browse to your corrupt file.
Insert text from a file dialogue window
How to insert a text from a file by choosing to Insert an Object and the Create from a file tab.
      • Create a link to the damaged document by opening a blank document and typing something like "This is test." Save the new file with the name "Rescue link." Copy the text. Create a second new document. In the new file, choose Paste Special (by clicking on the tiny arrow underneath the Paste button on the Home Menu) and then move the radio button on the left of the Paste Special window to Paste link and choose Formatted Text RTF. Now select the linked text, right click on it, choose first Linked Document Object and then Links. Find your new linked document in the list, and choose Change Source and change the source to your damaged document. See if any text or other recoverable elements appear. Finally right click on the recovered text and/or objects, choose Linked Document Object again choose Links again. This time choose Break Links...This may only work if your corrupt document is a DOC one and not DOCX format.
Changing the source document
The Link Document Object -> Links window, with the a File Open dialog box choosing the corrupted file to link to.
      • Use the Recover Text from Any File function of Word by browsing to your corrupt file, and then selecting the little down arrow on the button above the Open and Cancel buttons on the lower right corner of the Open dialog box and choosing, Recover Text from Any File...This again may only work with DOC files.
Where to find "recover text from any file" on the Word Open dialogue box
The Recover Text from Any File choice chosen in a Word File Open dialog box.
      • Open the file in Word Viewer 2007 or some other viewer. If the viewer doesn't have the ability to convert to another format, you can try to select the whole document by hitting Ctl-A on your keyboard and then Ctl-C to copy the text (and possibly formatting too) to your clipboard. From there you can paste it into a new blank Word document.
      • Another way to do things is to take screenshots of the results in the viewer and then do OCR, for example using one of these solutions, and then paste the results of the text for each page image into a new Word file. This may of course be very labor intensive. Another easier alternative in this situation is to first print from the viewer to a PDF Printer like PDF Creator, which saves the document as a PDF file. At this point you can open the file in Adobe Reader and then on the File menu of the program, choosing first Save As Other and then Text. Of course if you have Adobe Acrobat Standard or Professional, the Export To menu choice will also let you convert directly back to a Word document. There are also a few pieces of freeware and free services which claim they can convert PDF's directly back to Word documents. The quality varies though and at least in the past you could end up with each line of your document being an independent text box, which obviously makes editing a nightmare. In the most recent version of Adobe Acrobat DC, the new incarnation Adobe Acrobat Professional, the Word conversion text flow problems, appear to be gone or greatly lessened.
      • Try to convert the text using a text/data extracting/converter service like: ZamzarCometdocsYouConvertItDocMorphFreeFileConvertand others.
      • Try to open the file on a different computer and if possible a different version of Word.
      • Try reinstalling or repairing Word in your Control Panel applet called Programs and Features. After repairing the install, try opening the file again.
      • Try re-registering the ole32.dll file. First open an elevated command prompt. To do this, click Start, click All Programs, click Accessories, right-click Command Prompt, and then click Run as administrator. If you are prompted for an administrator password or for a confirmation, type the password, or click Allow. Type the following: regsvr32 %SystemRoot%\system32\ole32.dll and then click OK. When you receive the message "DllRegisterServer in ole32.dll succeeded.", click OK, and try opening your file again.
      • Force Word to re-register itself by typing winword /regserver. Quit Word, then restart it and try opening your file. NOTE: Use this switch when you want Microsoft Word to rewrite all of its registry keys and reassociate itself with Word files, such as documents and templates.
      • Open Word, go to Options, click on Trust Center. Next click on the Trusted Center Settings and then Protect View. From there, uncheck the three kinds of Enable Protected ViewSave and try to reopen your file. You may need to repair the install of Word or Office first before this works.
What 3 boxes to check in the Protected View section of Word's options
The Protected View section of the Trust Center section of the File -> Option menu. 
The arrows point to where to uncheck Enable Protected View from various data origins.

  • Try to Open the File in Another Word Processor (which is hopefully less picky about format than Word).

    • Stand Alone Word Processors

      • WordPad - can be started by typing "wordpad" without the quotes at a Command Prompt, the Run box or the Cortana/Search box. It also can be started by opening Explorer and pasting "%WINDIR%\write.exe" into the address bar without the quotes. The %WINDIR%\write.exe command will also work in the Command Prompt, the Run box or the Cortana/Search boxWordPad is also in the Windows Accessories folder of the Start Menu.
         is a very effective opener of corrupt DOCX (but not DOC) files and is installed by default in Windows. It is particularly effective if you have you pretreat the DOCX files with a zip repair program. Simply repairing the zip corruption is often enough to get it to open in WordPad (or even Word!). Corrupt Extractor for Microsoft OfficeInfo-ZIP's ZipPeazip and Izarc all have zip repair facilities and are freeware or open source freeware.
A sample file open in WordPad
A sample DOCX file open in WordPad.
      • Opening a copy of the file is really the first step that should be done when trying to repair or recover it. You can also try viewing a copy of the file in Windows Notepad, or the free NotePad++ or even changing the extension of the copy of your file to TXT and trying to open the file in WordPad or Word. This will for instance, tell you if there are any characters in the file at all or if they are all the NUL or completely blank (filled with the space character). If the file is made up of blank spaces or filled with the NUL character, nothing can be done with the document because there is no content. Instead you need to find an earlier, lost, temporary, deleted or unsaved version of your file to look at. NOTE: it is very important to not open the original file in NotePad as it changes the structure of the file, itself causing corruption. NotePad++ doesn't have the problem. Note too, as mentioned, you can look for the characters "PK" in DOCX files as they indicate that the file has recoverable zip structure. Each subfile in a zip file is itself zipped up in a separate package and each of those packages starts with the characters "PK". If you change the extension to TXT and open it in WordPad or Word, it may also corrupt the DOCX or DOC structure, so be sure to only use a copy of your baby. DOC files normally start with the characters "ÐÏ" without the quotes. They also normally have a few zipped up sub-files at the end of the file. However they do not have the all important document.xml sub-file where all the text is stored in a DOCX file.
A DOCX file this time open in NotePad++ displaying the raw characters
A screenshot of raw characters again in NotePad++, but this time of a DOCX file. Additionally
it shows green highlighting of where the Zip sub-file indicating characters "PK" exist.
A document loaded in AbiWord
AbiWord with document loaded.
SoftMaker FreeOffice 2016 TextMaker with a loaded sample document
SoftMaker FreeOffice 2016 TextMaker with a loaded sample document.
Zoho Writer with a loaded sample document
Zoho Office Writer with a loaded sample document.

  • Corrupt Word Document Repair Software

    • Corrupt File Text Recovery (Freeware and Open Source Freeware)

      • BinText - a GUI text extractor useful mainly for DOC files (freeware).
BinText displaying extracted text from a sample DOC document
Screenshot of BinText extracting text from the a Word DOC file
made from the content found on the McAfee BinText web page.
      • CMD Corrupt OfficeOpen2txt - command line version * Corrupt Extractor for Microsoft Office - see below (freeware)
      • Corrupt DOCX Salvager - a simple gui DOCX file only text extractor (open source freeware).
      • Corrupt Extractor for Microsoft Office - a gui text and image extractor for DOCX, XLSX and PPTX files. It also has a zip repair function built in (open source freeware).
      • Corrupt Office Salvager - another simple GUI for opening Microsoft Office files including Word. Is very similar to Corrupt DOCX Salvager (open source freeware).
      • MvOLE - command line DOC only reader of even corrupt files (open source freeware).
      • Repair My Word - A gui DOC file only text extractor (freeware).
      • SilverCoder's DocToText - excellent command line app which Works for DOC and DOCX as well as ODT Open Office format files (open source freeware).
      • Sandeep Kumar's Docx2txt - command line utility for extracting text from DOCX files. I believe it has a different algorithm than DocToText (open source freeware).
Microsoft Office Visualization Tool with loaded and parsed document
Screenshot of Microsoft Office Visualization Tool with a file loaded
and parsed and showing the Repair and Defragment menu choice.
Savvy DOCX Recovery in midst of recovery
Savvy DOCX Recovery in the middle of processing a file.
Easy Office Recovery, a favorite of mine, showing successfully recovered DOCX and DOC files.
It has the capacity to do bulk recoveries and works very quickly. It recovers text and formatting.
R-Word demo with a loaded folder containing Word files.
WordFix in the midst of recovery
WordFix in the midst of recovery.

    • Patches to Prevent Word from Causing Corruption In the Future

      • Word 2000 Fix: Font Repair Macro - Fixes Word 2000 Eastern European Font Error Text Character Corruption.
      • Service Pack 2 (and SP1 for that matter) for Office 2010 fixes the issue of "The end tag does not match the start tag" error for Word.
      • Word 2007 hotfix for fixing "unspecified errors" occurring after installation of security update MS08-072 (you must have Office SP1 or SP2 installed).
      • Word 2010 hotfix for corruption (probably "unspecified errors" again) caused when editing math formula. 

  • Corrupt Word Document Repair Services

Office Recovery online welcome and upload page
OfficeRecovery Online's web page. The service
offers free recovery if you are willing to wait 14 days.
Munsoft online recovery welcome and upload page
From the makers of EasyOffice Recovery and is certainly
easy and economical if you have only a few files to recover.
Online File Repair Service upload and prices page
Online File Repair Service may offer the most
inexpensive commercial recovery available.
S2 Services upload and price page
S2 Services File Recovery, my services, is falsely flagged by Google as a Malware site. I
offer reasonably 
priced manual analysis and recovery service. Because I'm most often the 
last resort resource, most files I get, cannot be recovered and I only charge the $5 analysis fee.

  • Advice

Manually Finding Autosaved Word Files and Opening Them in Word.

Using Corrupt office2txt, now called Corrupt Office Salvager.
Corrupt office2txt is very similar to Corrupt DOCX Salvager.)

(the video is a bit jumpy and seems to repeat itself).

In a Mac using a viewer to view a corrupted file that will no longer open in Pages, 
Taking screenshots and then doing OCR to recreate the text, like scanner software 
might do with a page of a book scanned to get the text into editable text. This 
would apply to Windows as well if you could find a viewer that opens your file.

Using Hetman File Recovery to find lost, deleted, temporary, 
previous and unsaved versions of your file. Word Regenerator
PhotoRec and Recover My Files probably work similarly.
Try reinstalling or repairing Word in your Control Panel applet called 
Programs and Features. After repairing the install, try opening the file again.
Video showing how to speed the opening of Word by removing the Protected
View options. 
To do this, open a new blank document in Word, go to Options, click
on Trust Center, clicking on 
the Trusted Center Settings and then Protect View, from
there the video maker, unchecks the 
three kinds of Enable Protected View. He/She
saves the configuration and successfully reopens 
the file. As is done in the video you may
need to first repair the install of Microsoft Office 
or Word. https://notepad-plus-plus.org/

Hasleo Data Recovery FreeV3.2 - Free as in Freeware - Permanently from Hasleo Software

https://www.hasleo.com/win-data-recovery/free-data-recovery.html "Hasleo Data Recovery FreeV3.2 100% Free Data Recovery Software...