LinkedIn Document Extractor
$/month

LinkedIn Document Extractor

Collect LinkedIn document publication activity data.

Overview

This app collects data on LinkedIn document publication activities for specified users. It scrapes information such as the document URL, title, page count, author, and publication timestamp, as well as engagement metrics like reactions, comments, and reposts.  The collected data is saved as an Excel file. Whether you're analyzing competitor content, tracking industry trends, or monitoring influencers, this app helps you stay on top of the latest reactions with ease.

Want a fuller view of user engagement?

Explore our complete suite of LinkedIn Activity Extractor tools — including posts, comments, articles, newsletters, events, and reactions — to track how your target users engage beyond just documents.

How to Use

🛠 First time using an Octoparse AI app?

Make sure you've installed the Octoparse AI client and completed the required setup (including browser extension installation) before running this app.

👉 Follow our beginner setup guide

  1. Download the app from Octoparse AI app store.

  2. (Optional) Customize the workflow.

    This app allows you to customize it to create your own LinkedIn workflow. Simply click the "Edit" button next to the app in your list to open the workflow editor.

  3. Launch the app in your list.

    1. Parameter Description

      1. ProfileUrls: Select an Excel file containing up to 80 LinkedIn profile URLs from which you want to extract documents. The URLs should be formatted as https://www.linkedin.com/in/****/ and start from cell A2 in Sheet1 (with the first row as the header).

      2. DocumentCount: Specify the number of document data points to collect for each profile.

      3. ExportPath: Select the folder where you'd like to save the scraped data.

      4. Browser: Select the browser where the app will run.

      5. AdsPowerProfileId: If you're using AdsPower, open a profile first and enter its ID here.

    2. Click "Run application"

  4. Output

For each document publication activity, this app collects data including: profile URL, document URL, document title, document page count, author, reaction count, comment count, repost count, and the timestamp of the post.  Each time the app runs, it generates a new Excel file with the collected data.

Notes

  • Please log into your LinkedIn account in the specified browser beforehand and ensure the browser is in full-screen mode while the app is running.

  • Do not use the mouse or keyboard while the app is running to avoid interruptions.

  • Please follow LinkedIn's automation rate limits and be mindful of how many profiles you extract document data from each day.

    • For best results, limit each run to 10 profiles, and no more than 80 profiles per day.

    • If you're pulling 100 document results per profile, try cutting that number in half to stay within safe limits.

    • Start small and increase gradually to reduce the chance of being flagged.

    • Stick with the same IP when possible, and if you run into blocks or connection issues, scale down your extraction volume.

Troubleshooting

Encounter obstacles but don't know how to resolve them when executing the app?

Please contact our support team at [email protected] to find the way out!

Version

version 5

2025-04-21

Now editable, with updated instructions.

version 4

2025-02-07

version 3

2025-01-24

version 2

2025-01-23

version 1

2025-01-23

DEVELOPER

View profile
ownerName

Octopus Data Inc

Tags

  • LinkedIn
  • Web scraper
  • Sales workflow

App Support

Report a problem