FOSS Content Creator - Extract images from Word* docx with One Simple Command on GNU Linux* or use the proprietary application, WinZip*, for Windows*

ID 659463
Updated 7/2/2019
Version Latest
Public

author-image

By

If you use proprietary applications for documentation, you may not be aware of this but a docx is really an archive.

Often, these *.docx documents are used to transfer content to web developers to be placed for online content in Content Management Systems, wikis or blogs. Often, Word documents are the only resource for the images.

Example: 26 images

This example includes 26 images within a docx document. How, when using a proprietary solution, would you extract these images?

Option 1. Click each image and "Save As Picture" for all 26 images.

Option 2. Unzip in GNU Linux*

Option 3. Unzip in Windows*

Extract Images using GNU Linux

To extract images from a *.docx file, run the command in a GNU Linux terminal:

unzip NameofFile.docx

In this case, the document is 26images.docx so the command would be:

unzip 26images.docx

The docx unzips and extracts the xml and images.

click icon  Click the images below to open a larger display in a new window.

shell
  Extracting
Extracted files and directories
Location of images: /extractiondir/word/media

Proprietary Solution

If you must use proprietary software, WinZip* will extract the images. Click File > Open and select the file. The location of the images is:

word\media\allthepngsarehere.png

proprietary way

Conclusion

Extracting image files instead of the manual process of saving each image one at at time can save time and dollars for your projects. Streamline your projects using GNU Linux and you can potentially save man hours and dollars.

All images included in this blog post are GFDL. A sample docx is attached so you can try this for yourself.