Rapid7 Vulnerability & Exploit Database

Gather PDF Authors

Back to Search

Gather PDF Authors

Created
05/30/2018

Description

This module downloads PDF documents and extracts the author's name from the document metadata. This module expects a URL to be provided using the URL option. Alternatively, multiple URLs can be provided by supplying the path to a file containing a list of URLs in the URL_LIST option. The URL_TYPE option is used to specify the type of URLs supplied. By specifying 'pdf' for the URL_TYPE, the module will treat the specified URL(s) as PDF documents. The module will download the documents and extract the authors' names from the document metadata. By specifying 'html' for the URL_TYPE, the module will treat the specified URL(s) as HTML pages. The module will scrape the pages for links to PDF documents, download the PDF documents, and extract the author's name from the document metadata.

Author(s)

  • bcoles <bcoles@gmail.com>

Development

Module Options

To display the available options, load the module within the Metasploit console and run the commands 'show options' or 'show advanced':

msf > use auxiliary/gather/http_pdf_authors
msf auxiliary(http_pdf_authors) > show actions
    ...actions...
msf auxiliary(http_pdf_authors) > set ACTION < action-name >
msf auxiliary(http_pdf_authors) > show options
    ...show and set options...
msf auxiliary(http_pdf_authors) > run 

Time is precious, so I don’t want to do something manually that I can automate. Leveraging the Metasploit Framework when automating any task keeps us from having to re-create the wheel as we can use the existing libraries and focus our efforts where it matters.

– Jim O’Gorman | President, Offensive Security

;