javascript - Selenium 网络驱动程序 : firefox headless inject javascript to modify browser property-6ren

javascript - Selenium 网络驱动程序 : firefox headless inject javascript to modify browser property

转载作者：数据小太阳更新时间：2023-10-29 05:17:23

我正在尝试弄清楚如何将 selenium webdriver 与 python 或 java 结合使用来注入(inject) javascript 以修改浏览器属性/属性。我的最终目标是获得类似于 this 的东西使用 selenium 和 firefox，因为它是更开放和灵活的选择。

Puppeter 和 chromium 文件 test.js:

const puppeteer = require("puppeteer");

(async () => {
  const browser = await puppeteer.launch({
    args: ["--no-sandbox"],
    headless: true,
  });
  const page = await browser.newPage();
  const fs = require("fs");

  // In your puppeteer script, assuming the javascriptChromium.js file is in same folder of our script
  const preloadFile = fs.readFileSync("./javascriptChromium.js", "utf8");
  await page.evaluateOnNewDocument(preloadFile);

  const testUrl="https://intoli.com/blog/not-possible-to-block-chrome-headless/chrome-headless-test.html";

  await page.goto(testUrl);

  // save screenshot
  await page.screenshot({path: "puppeteer-chromium-async-script-test.png"});
  await browser.close()
})();

Javascript 文件 javascriptChromium.js

// overwrite the `languages` property to use a custom getter
Object.defineProperty(navigator, "languages", {
  get: function() {
    return ["en-US", "en", "es"];
  }
});

// Overwrite the `plugins` property to use a custom getter.
Object.defineProperty(navigator, 'plugins', {
  get: () => [1, 2, 3, 4, 5],
});

// Pass the Webdriver test
Object.defineProperty(navigator, 'webdriver', {
  get: () => false,
});

此代码运行良好，我检查属性是否已通过此 test Web site 更改.

现在，selenium 和 firefox:

import os
from selenium import webdriver

def readJSFile(scriptFile):
    with open(scriptFile, 'r') as fileHandle:  
        script=fileHandle.read()
    return script
injectedJavascript=readJSFile("./javascriptFirefox.js")

options=webdriver.FirefoxOptions()
options.set_headless(True)
driver=webdriver.Firefox(options=options)
driver.set_script_timeout(3)

# inject JavaScript
try:
    driver.execute_async_script(injectedJavascript)
except:
    print("Timeout")

# solution found here https://stackoverflow.com/questions/17385779/how-do-i-load-a-javascript-file-into-the-dom-using-selenium
driver.execute_script("var s=window.document.createElement('script'); s.src='javascriptFirefox.js';window.document.head.appendChild(s);")
testUrl="https://intoli.com/blog/not-possible-to-block-chrome-headless/chrome-headless-test.html";
driver.get(testUrl)

# example sync script
time=driver.execute_script("return performance.timing.loadEventEnd - performance.timing.navigationStart;")
print(time)
# example async script
time=driver.execute_async_script("var callback = arguments[arguments.length-1]; const time = () => { total=performance.timing.loadEventEnd - performance.timing.navigationStart; callback(total); }; time();")
print(time)

file="selenium-firefox-async-script-test.png"
driver.save_screenshot(file)

driver.quit()

Javascript 文件 javascriptFirefox.js

// overwrite the `languages` property to use a custom getter
const setProperty = () => {
    Object.defineProperty(navigator, "languages", {
        get: function() {
            return ["en-US", "en", "es"];
        }
    });

    // Overwrite the `plugins` property to use a custom getter.
    Object.defineProperty(navigator, 'plugins', {
        get: () => [1, 2, 3, 4, 5],
    });

    // Pass the Webdriver test
    Object.defineProperty(navigator, 'webdriver', {
      get: () => false,
    });
    callback();
};
setProperty();

我是 javascript 的新手，但这两种方法(puppeteer 和 selenium)之间的不同之处在于它们如何管理当前选项卡/页面。前者通过页面类和方法page.evaluateOnNewDocument而对于后者，我没有找到等效的方法。我也尝试过使用 greasemonkey 或 violentlmonkey 来注入(inject) javascript 但没有成功。

你有什么建议吗？

谢谢

最佳答案

我按照这个 post 找到了问题的解决方案.简而言之，通过使用扩展，可以将 javascript 代码注入(inject)到 Firefox 的网页中。为了避免其他用户浪费时间，主要文件有:

Python文件:selenium+firefox

import json
import os
import sys

from selenium import webdriver
from selenium.common.exceptions import NoSuchElementException
from selenium.webdriver.firefox.firefox_profile import AddonFormatError

# Patch in support for WebExtensions in Firefox.
# See: https://intoli.com/blog/firefox-extensions-with-selenium/
class FirefoxProfileWithWebExtensionSupport(webdriver.FirefoxProfile):
    def _addon_details(self, addon_path):
        try:
            return super()._addon_details(addon_path)
        except AddonFormatError:
            try:
                with open(os.path.join(addon_path, "manifest.json"), "r") as f:
                    manifest = json.load(f)
                    return {
                        "id": manifest["applications"]["gecko"]["id"],
                        "version": manifest["version"],
                        "name": manifest["name"],
                        "unpack": False,
                    }
            except (IOError, KeyError) as e:
                raise AddonFormatError(str(e), sys.exc_info()[2])

profile_folder="profile_path"
profile=FirefoxProfileWithWebExtensionSupport(profile_folder)
extension_directory="extension"
profile.add_extension(extension_directory)
# firefox dev it is necessary for custom profile, not for standard one
firefox_binary="/usr/bin/firefox-dev"
options=webdriver.FirefoxOptions()
# firefox 56+ headless mode https://developer.mozilla.org/en-US/Firefox/Headless_mode
options.set_headless(True)
driver=webdriver.Firefox(options=options, firefox_profile=profile, firefox_binary=firefox_binary)

test_url="https://intoli.com/blog/not-possible-to-block-chrome-headless/chrome-headless-test.html";
driver.get(test_url)

file="selenium-firefox-extension-profile-script-second-test.png"
driver.save_screenshot(file)

test_url="https://intoli.com/blog/making-chrome-headless-undetectable/chrome-headless-test.html";
driver.get(test_url)

file="selenium-firefox-extension-profile-script-first-test.png"
driver.save_screenshot(file)

driver.quit()

扩展文件:manifest.js 和 content.js

{
  "manifest_version": 2,
  "name": "Smart Extension",
  "version": "1.0.0",
  "applications": {
    "gecko": {
      "id": "user@protonmail.com"
    }
  },
  "content_scripts": [
    {
      "matches": ["*://*/*"],
      "js": ["content.js"],
      "run_at": "document_start"
    }
  ]
}

var script=document.createElement("script");
script.src=browser.extension.getURL("myscript.js");
script.async=false;
document.documentElement.appendChild(script);

Javascript 文件:myscript.js

// overwrite the `languages` property to use a custom getter
Object.defineProperty(navigator, "languages", {
  get: function() {
    return ["en", "es"];
  }
});

// Overwrite the `plugins` property to use a custom getter.
Object.defineProperty(navigator, "plugins", {
  get: () => new Array(Math.floor(Math.random() * 6) + 1),
});

// Pass the Webdriver test
Object.defineProperty(navigator, "webdriver", {
  get: () => false,
});

// hairline: store the existing descriptor
const elementDescriptor=Object.getOwnPropertyDescriptor(HTMLElement.prototype, "offsetHeight");

// redefine the property with a patched descriptor
Object.defineProperty(HTMLDivElement.prototype, "offsetHeight", {
    ...elementDescriptor,
  get: function() {
    if (this.id === "modernizr") {
      return 1;
    }
    return elementDescriptor.get.apply(this);
  },
});

["height", "width"].forEach(property => {
  // store the existing descriptor
  const imageDescriptor=Object.getOwnPropertyDescriptor(HTMLImageElement.prototype, property);

  // redefine the property with a patched descriptor
  Object.defineProperty(HTMLImageElement.prototype, property, {
    ...imageDescriptor,
    get: function() {
      // return an arbitrary non-zero dimension if the image failed to load
      if (this.complete && this.naturalHeight == 0) {
        return 24;
      }
      // otherwise, return the actual dimension
      return imageDescriptor.get.apply(this);
    },
  });
});

const getParameter=WebGLRenderingContext.getParameter;
WebGLRenderingContext.prototype.getParameter=function(parameter) {
  // UNMASKED_VENDOR_WEBGL WebGLRenderingContext.prototype.VENDOR
  if (parameter === 37445) {
    return "Intel Open Source Technology Center";
  }
  // UNMASKED_RENDERER_WEBGL WebGLRenderingContext.prototype.RENDERER
  if (parameter === 37446) { 
    return "Mesa DRI Intel(R) Ivybridge Mobile";
  }
  return getParameter(parameter);
};

这适用于图形模式下的所有测试，而在 headless 模式下，除了似乎影响 bug 的 WebGL 测试之外的所有测试。 .

关于javascript - Selenium 网络驱动程序 : firefox headless inject javascript to modify browser property，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/51439377/

文章推荐： javascript - react 函数说 "is not a function"

文章推荐： javascript - Angular 动画 : Animate Parent and Child Elements

文章推荐： javascript - Ng 日期时间选择器的值更改不起作用

文章推荐： javascript - JSON.Stringify 方法替换参数不适用于嵌套对象

java - Firefox headless (headless)在 headless (headless) Redhat Linux 机器上崩溃
我正在尝试使用 firefox headless 在 headless (headless) redhat linux 构建机器上运行 selenium 测试。我创建驱动程序的方法如下所示: priv
python - 带 Selenium 的 headless (headless) Chrome ，只能找到滚动非 headless (headless)的方法
关于这个主题有很多东西可以找到，但无法弄清楚。我需要滚动到(不太长)无限滚动页面的末尾。我有 2 个选项可以使用 chrome 非 headless (headless)但似乎不能 headless
headless - 如何正确编译 ParaView 以进行 headless (headless)离屏渲染？
我在远程服务器上运行 OpenFOAM，基本上设法通过 paraview 的 pvserver 可视化结果 as described here .然而，在连接后，客户端产生 Server DISPLA
headless (headless)模式下的Android屏幕截图很难看
我想在 headless 模式下截取 Android 设备的屏幕截图，也就是说我是这样创建的: echo no | /opt/android/android-sdk-linux/tools/andro
selenium - headless (headless)浏览器在自动化方面的差异
主要区别在于，基于GUI和非GUI（Headless）执行。我正在寻找所有Headless浏览器之间的差异，但是很遗憾，我没有找到任何差异。我一个接一个地经历，这使我更加困惑。如果有人可以分享具有差
linux - OpenOffice headless (headless)
已关闭。这个问题是 off-topic 。目前不接受答案。想要改进这个问题吗？ Update the question所以它是on-topic用于堆栈溢出。已关闭10 年前。 Improve th
Java headless (headless)双三次图像调整大小
我需要在没有 X 服务器的情况下执行 java 图像裁剪和调整大小。我尝试了几种方法。下面的第一种方法有效，但输出了一个相当难看的调整大小的图像(可能使用最近邻算法来调整大小: static Buf
Python - Firefox headless (headless)
过去几天我一直在使用 Selenium、Tor 和 Firefox 作为多个任务的组合。我已经设法用 Python 编写了一个简单的脚本，它通过 Selenium 控制 Firefox，而 Firef
python - Pygame headless (headless)设置
我正在使用 pygame 的操纵杆 api 在 headless (headless)系统上对我的项目使用操纵杆，但是 pygame 需要一个“屏幕”，所以我设置了一个虚拟视频系统来克服这个问题。它工
firefox - 截图后不存在 headless (headless)Firefox
我想使用 headless firefox 在 macos 上捕获网页的图像。这是我执行的命令:/Applications/Firefox.app/Contents/MacOS/firefox-bi
php - headless (headless) Chromium 浏览器始终显示验证码
我正在使用带有 headless-chromium-php 的 google chrome headless (headless)浏览器导航到某些网站，但它总是被验证码检测到我尝试使用此 plug
terminal - 使用 Octave headless (headless)
是否有可能使用 Octave headless。像这样的东西 octave result.txt 最佳答案使用 octave --silent --eval 5+4 > result.txt 你会
android - 在 headless (headless)模式下获取本地化字符串
我目前正在尝试在 headless (headless)模式下运行应用程序，我定义了后台回调: void callbackInBackground() { // Invoked from the s
opengl - 在 headless (headless)LibGDX单元测试中创建纹理
我正在使用LibGDX headless backend运行jUnit测试。这在某些测试中效果很好，但是如果我尝试创建new Texture('myTexture.png');，则会收到NullPoi
Selenium 只能在非 headless (headless)模式下工作吗？
我想在这个页面上使用 Selenium:https://www.avis.com/en/home 如果没有 headless (headless)模式，该代码一切正常: import requests
jasmine - Jasmine headless (headless)Webkit中更好的故障报告
在Jasmine headless (headless)Webkit中运行测试时，我遇到了一个简单的TypeError: 'undefined' is not an object失败。但是没有提示在哪
firefox - 真正的 headless (headless)浏览器
我负责测试一个大量使用 AJAX 的企业 Web 应用程序。我需要构建一个系统，允许在没有人工干预的情况下连续运行测试。目前我最感兴趣的是负载测试，但我希望用于生成负载的相同脚本用于功能测试。目前用
docker - headless Docker主机与 headless 容器
TL; DR:我可以配置一个容器来原生访问VGA，以覆盖主机视频输出吗？我正在考虑处置低功耗的XenServer(以前为ESXi)白盒以设置docker最小安装(例如CoreOS，RancherOs
docker - headless (headless) Protractor 不分片测试
我正在尝试 headless (headless)运行我的测试，并将我的两个测试套件分片以并行运行它们。在我的本地计算机上，它们并行运行，但在这种 headless (headless)设置中，它们一
eclipse - 是否有一种 headless (headless)方式来导入项目并刷新工作区？
仍在尝试为大型大学项目(RCP 产品)建立 headless (headless)构建。每个 Eclipse 用户都知道以下手动功能:“文件 --> 导入 --> 将现有项目导入工作区”以及“构建工

数据小太阳

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

javascript - Selenium 网络驱动程序 : firefox headless inject javascript to modify browser property