Exhaustive Job Extractor

Chrome Extension for Multi-Source Job Scraping

Version 1.8.0

12+ Job Boards

LinkedIn, Indeed, Glassdoor, Dice, and more

AI-Powered

Automatic skill extraction and enrichment

Export Ready

JSON, CSV, and bulk operations

Installation

Note: This extension is currently in development. Follow the steps below to install it locally.
  1. Download the Extension
    git clone https://github.com/yourusername/get-me-the-job.git
    cd get-me-the-job/job-extractor
  2. Open Chrome Extensions Page

    Navigate to chrome://extensions/ in your browser

  3. Enable Developer Mode

    Toggle the switch in the top-right corner

  4. Load Unpacked Extension

    Click "Load unpacked" and select the job-extractor folder

  5. Pin the Extension

    Click the puzzle icon in Chrome toolbar and pin "Exhaustive Job Extractor"

Supported Job Boards

Major Platforms
Specialized Platforms

How to Use

  1. Navigate to any supported job posting (e.g., LinkedIn job detail page)
  2. Click the extension icon in your browser toolbar
  3. Click the "Extract Job" button
  4. Wait for extraction to complete (~3 seconds)
  5. Review the extracted data in the side panel
  6. Click "Download JSON" to save the job data
Tip: The extension automatically extracts skills, salary, company details, and more!

  1. Go to LinkedIn and perform a job search
  2. Click the extension icon
  3. Click "Crawl Search Results"
  4. When prompted, enter how many pages to scrape (1-40)
  5. The extension will:
    • Phase 1: Collect all job URLs from multiple pages
    • Phase 2: Extract detailed data from each job
  6. Click "Download All" when complete
Note: LinkedIn may rate-limit after ~50 requests. Use delays and be respectful!

Enhance your job data with AI-powered analysis using Google Gemini:

  1. Get a free API key from Google AI Studio
  2. Click the extension icon and go to "Options" (gear icon)
  3. Paste your Gemini API key
  4. Enable "AI enrichment" checkbox in the side panel
  5. Extract jobs as normal - AI analysis will be added automatically
AI Enrichment adds:
  • Enhanced skill extraction
  • Experience level detection
  • Required vs. preferred qualifications
  • Job responsibilities breakdown
  • Company culture insights

Save time by filtering jobs before extracting full details:

  1. Enable "Filter jobs before scraping" checkbox
  2. Set your filters:
    • Required skills: e.g., "python, react"
    • Excluded keywords: e.g., "senior, lead"
    • Min/Max salary: e.g., $80K - $120K
    • Location: e.g., "Remote, San Francisco"
  3. Start crawling - only matching jobs will be extracted
Filters are applied during Phase 1, significantly reducing scraping time!

Export Formats:
JSON Format

Complete data structure with all fields

CSV Format

Spreadsheet-ready format

Filename Verification:
  • Checked: Chrome will ask where to save and let you rename files
  • Unchecked: Files auto-save to Downloads folder with timestamps

The extension automatically detects duplicate jobs:

  1. After scraping, duplicates are highlighted in yellow
  2. Click "View Duplicates" to see details
  3. Options:
    • Delete duplicates: Keep only unique jobs
    • Export report: Save duplicate analysis as JSON
    • Keep all: Proceed with duplicates included
Duplicate Detection Methods:
  • Job ID matching (most reliable)
  • URL matching
  • Title + Company similarity (fuzzy matching)

Key Features

Speed Control

Adjust scraping speed (Fast/Medium/Slow) to avoid rate limits

Pause/Resume

Pause multi-page scraping and resume later

Local Storage

All scraped jobs saved locally - view anytime

Search & Filter

Search through saved jobs by title, company, or location

Tips & Best Practices

Do's:
  • Use "Medium" or "Slow" speed for large scraping jobs
  • Enable filters to reduce unnecessary API calls
  • Regularly export your data as backup
  • Check for duplicates before downloading
  • Use AI enrichment for better skill extraction
Don'ts:
  • Don't scrape more than 50 jobs at once without delays
  • Don't use "Fast" speed on LinkedIn (risk of rate limiting)
  • Don't close the extension during multi-page scraping
  • Don't scrape the same search results repeatedly
  • Don't share your Gemini API key publicly

Troubleshooting

Issue Solution
"This page is not supported" Make sure you're on a job detail page, not a search results page
Extension icon doesn't appear Go to chrome://extensions/ and click "Reload"
No data extracted Refresh the job page and wait 3 seconds before extracting
AI enrichment not working Verify your Gemini API key in Options and check console for errors
LinkedIn rate limiting Wait 1 hour, use slower speed, or scrape in smaller batches
CSV export empty Make sure you have jobs in the extension storage first

Technical Details

Technology Stack:
  • Manifest V3 Chrome Extension
  • Content Scripts for data extraction
  • Service Worker for background tasks
  • Google Gemini AI API integration
  • Local Chrome Storage API
Data Extracted:
  • Job title, company, location
  • Salary range (if available)
  • Skills and technologies
  • Experience requirements
  • Company details and size
  • Full job description (HTML)
Privacy: All data is stored locally in your browser. Nothing is sent to external servers except for optional AI enrichment via Gemini API.

Ready to supercharge your job search?

Install Extension View on GitHub