Capturing a full-page screenshot using Selenium's Marionette with Python

Question

Capturing a full-page screenshot using Selenium's Marionette with Python

Following the recent update of Firefox to version 47, we found it necessary to add the Marionette extension in order to continue using Selenium Webdriver. In my case, I also had to upgrade from Selenium 2.52 to 2.53.

I rely on the Python version of Selenium Webdriver to capture high-resolution images of maps created with HTML and JavaScript. Previously, this process worked seamlessly in Firefox, allowing me to take screenshots of entire pages that were much larger than the size of my screen. However, with the recent changes, screenshots are now limited to the visible area on the screen only. The code snippet I use is as follows:

import time
from selenium import webdriver
from selenium.webdriver.common.desired_capabilities import DesiredCapabilities

caps = DesiredCapabilities.FIREFOX
caps["marionette"] = True

browser = webdriver.Firefox(capabilities=caps)
browser.get(html_file)
time.sleep(15)

browser.save_screenshot(image_name)
browser.quit()

I have explored various options such as downgrading, stitching together multiple screenshots, or transitioning to Qgis. Nevertheless, I am searching for a more refined solution that allows me to continue utilizing the latest Firefox version and a similar workflow. Does anyone have insight into a potential solution? Perhaps by manipulating Selenium to perceive a larger viewport or by adopting another Linux-compatible browser that supports full-page screenshot capabilities?

python selenium firefox screenshot firefox-marionette

Answer 1

Answer №1

Here is the solution I use for capturing a full page screenshot:

#!/usr/bin/python
from selenium import webdriver
from PIL import Image
from cStringIO import StringIO

verbose = 1

browser = webdriver.Firefox()
browser.get('http://stackoverflow.com/questions/37906704/taking-a-whole-page-screenshot-with-selenium-marionette-in-python')

# JavaScript code to get the height of the entire document
js = 'return Math.max( document.body.scrollHeight, document.body.offsetHeight,  document.documentElement.clientHeight,  document.documentElement.scrollHeight,  document.documentElement.offsetHeight);'

scrollheight = browser.execute_script(js)

if verbose > 0: 
    print scrollheight

slices = []
offset = 0
while offset < scrollheight:
    if verbose > 0: 
        print offset

    browser.execute_script("window.scrollTo(0, %s);" % offset)
    img = Image.open(StringIO(browser.get_screenshot_as_png()))
    offset += img.size[1]
    slices.append(img)

    if verbose > 0:
        browser.get_screenshot_as_file('%s/screen_%s.png' % ('/tmp', offset))
        print scrollheight


screenshot = Image.new('RGB', (slices[0].size[0], scrollheight))
offset = 0
for img in slices:
    screenshot.paste(img, (0, offset))
    offset += img.size[1]

screenshot.save('/tmp/test.png')

You can find the code snippet here on GitHub.

An issue with scrolling and stitching is that HTML nodes set to "display: fixed" will appear repeatedly in each screenshot.

Answer 2

Here is the solution I use for capturing a full page screenshot:

#!/usr/bin/python
from selenium import webdriver
from PIL import Image
from cStringIO import StringIO

verbose = 1

browser = webdriver.Firefox()
browser.get('http://stackoverflow.com/questions/37906704/taking-a-whole-page-screenshot-with-selenium-marionette-in-python')

# JavaScript code to get the height of the entire document
js = 'return Math.max( document.body.scrollHeight, document.body.offsetHeight,  document.documentElement.clientHeight,  document.documentElement.scrollHeight,  document.documentElement.offsetHeight);'

scrollheight = browser.execute_script(js)

if verbose > 0: 
    print scrollheight

slices = []
offset = 0
while offset < scrollheight:
    if verbose > 0: 
        print offset

    browser.execute_script("window.scrollTo(0, %s);" % offset)
    img = Image.open(StringIO(browser.get_screenshot_as_png()))
    offset += img.size[1]
    slices.append(img)

    if verbose > 0:
        browser.get_screenshot_as_file('%s/screen_%s.png' % ('/tmp', offset))
        print scrollheight


screenshot = Image.new('RGB', (slices[0].size[0], scrollheight))
offset = 0
for img in slices:
    screenshot.paste(img, (0, offset))
    offset += img.size[1]

screenshot.save('/tmp/test.png')

You can find the code snippet here on GitHub.

An issue with scrolling and stitching is that HTML nodes set to "display: fixed" will appear repeatedly in each screenshot.

Answer 3

Answer №2

My experience with this approach has been quite positive. Although it operates in headless mode, the results achieved are similar to those obtained in normal mode.

from selenium import webdriver

firefox_options = webdriver.FirefoxOptions()
firefox_options.set_headless() 

firefox_driver = webdriver.Firefox(executable_path=<path_to_gecko_driver>, firefox_options=firefox_options)
firefox_driver.get(<some_url>)

firefox_elem = firefox_driver.find_element_by_tag_name('html')
firefox_elem.screenshot(<png_screenshot_file_path>)

Answer 4

My experience with this approach has been quite positive. Although it operates in headless mode, the results achieved are similar to those obtained in normal mode.

from selenium import webdriver

firefox_options = webdriver.FirefoxOptions()
firefox_options.set_headless() 

firefox_driver = webdriver.Firefox(executable_path=<path_to_gecko_driver>, firefox_options=firefox_options)
firefox_driver.get(<some_url>)

firefox_elem = firefox_driver.find_element_by_tag_name('html')
firefox_elem.screenshot(<png_screenshot_file_path>)

Capturing a full-page screenshot using Selenium's Marionette with Python

Answer №1

Answer №2

Similar questions

Using a custom filename for image downloads with Scrapy

Locating a table in Java using Selenium without an ID

Divide the strings using punctuation marks, but leave the tags intact

Ways to determine if an item is present in my list

Using pysnmp to fetch SNMP data: a step-by-step guide

What is the process for invoking a function in a Python script right before it is terminated using the kill command?

Discovering xpath in Chrome Version 58.0.3029.81 (64-bit) can be done effortlessly with the help of xpathfinder. Unfortunately, the shortcut shift+ctrl+x

The alignment of margins appears as intended in Opera, Chrome, and IE, but unfortunately it does not

"Flawed spacing in python-mode of emacs causing incorrect indentation

How to programmatically upload files to various S3 buckets using Django Storages and Boto3

Exploring the method to iterate with Selenium RC directly on XPath search outcomes

Suggestions for resolving the error message "Meta.fields cannot be a string. Did you intend to type: 'name'?"

It appears that there is a security concern with your current connection while utilizing Selenium.WebDriver version 3.6.0 with Firefox version 56

Utilizing pytest and tox for managing environment variables

A guide on mimicking URL input and pressing ENTER in Chrome using Selenium and NodeJS

Error related to environment / authentication - BigQuery Admin: {invalid_grant, Invalid JWT Signature}

What is the rationale behind assigning names to variables in Tensorflow?

Having trouble accessing a website using Selenium-wire after compiling with Pyinstaller

Django redirects to an alternative template instead of the default one

Truth Values and their Roles in Functions