Selenium Webdriverを使用して、ページ全体ではなく特定の要素のスクリーンショットをキャプチャする方法は？

Question

現在、Selenium WebDriverを使用してスクリーンショットをキャプチャしようとしています。しかし、ページ全体のスクリーンショットしか取得できません。ただし、私が欲しかったのは、単にページの一部をキャプチャするか、IDまたは特定の要素ロケーターに基づいて特定の要素のみをキャプチャすることです。（たとえば、画像id = "Butterfly"の画像をキャプチャしたい）

選択したアイテムまたは要素ごとにスクリーンショットをキャプチャする方法はありますか？

Surya · Accepted Answer

以下のようにページのスクリーンショット全体をトリミングすることにより、要素のスクリーンショットを取得できます。

driver.get("http://www.google.com"); WebElement ele = driver.findElement(By.id("hplogo")); // Get entire page screenshot File screenshot = ((TakesScreenshot)driver).getScreenshotAs(OutputType.FILE); BufferedImage fullImg = ImageIO.read(screenshot); // Get the location of element on the page Point point = ele.getLocation(); // Get width and height of the element int eleWidth = ele.getSize().getWidth(); int eleHeight = ele.getSize().getHeight(); // Crop the entire page screenshot to get only element screenshot BufferedImage eleScreenshot= fullImg.getSubimage(point.getX(), point.getY(), eleWidth, eleHeight); ImageIO.write(eleScreenshot, "png", screenshot); // Copy the element screenshot to disk File screenshotLocation = new File("C:\images\GoogleLogo_screenshot.png"); FileUtils.copyFile(screenshot, screenshotLocation);

ambodi · Answer

Node.jsで、動作する次のコードを書きましたが、Seleniumの公式WebDriverJSに基づいているのではなく、SauceLabs's WebDriverに基づいています。 WD.js および非常にコンパクトな画像ライブラリ EasyImage 。

要素のスクリーンショットを実際に撮ることはできないことを強調したいのですが、最初にページ全体のスクリーンショットを撮ってから、ページの好きな部分を選択し、その特定の部分をトリミングすることです：

browser.get(URL_TO_VISIT) .waitForElementById(dependentElementId, webdriver.asserters.isDisplayed, 3000) .elementById(elementID) .getSize().then(function(size) { browser.elementById(elementID) .getLocation().then(function(location) { browser.takeScreenshot().then(function(data) { var base64Data = data.replace(/^data:image\/png;base64,/, ""); fs.writeFile(filePath, base64Data, 'base64', function(err) { if (err) { console.log(err); } else { cropInFile(size, location, filePath); } doneCallback(); }); }); }); });

そしてcropInFileFunctionは次のようになります。

var cropInFile = function(size, location, srcFile) { easyimg.crop({ src: srcFile, dst: srcFile, cropwidth: size.width, cropheight: size.height, x: location.x, y: location.y, gravity: 'North-West' }, function(err, stdout, stderr) { if (err) throw err; }); };

Alex Siminiuc · Answer

YandexのASHOTフレームワークは、Selenium WebDriverスクリプトでスクリーンショットを撮るために使用できます。

完全なWebページ
ウェブ要素

このフレームワークは https://github.com/yandex-qatools/ashot にあります。

スクリーンショットを撮るためのコードは非常に簡単です。

全ページ

screenshot = new AShot().shootingStrategy( new ViewportPastingStrategy(1000)).takeScreenshot(driver); ImageIO.write(screenshot.getImage(), "PNG", new File("c:\temp\results.png"));

特定のウェブ要素

screenshot = new AShot().takeScreenshot(driver, driver.findElement(By.xpath("(//div[@id='ct_search'])[1]"))); ImageIO.write(screenshot.getImage(), "PNG", new File("c:\temp\div_element.png"));

この記事の詳細とコードサンプルを参照してください。

Rohith R Nair · Answer

これは、Selenium WebdriverとPillowを使用したPython 3バージョンです。このプログラムは、ページ全体のスクリーンショットをキャプチャし、その場所に基づいて要素をトリミングします。要素の画像は、image.pngとして利用できます。 Firefoxは、element.screenshot_as_png（ 'image_name'）を使用した要素画像の直接保存をサポートしています。

from Selenium import webdriver from PIL import Image driver = webdriver.Chrome() driver.get('https://www.google.co.in') element = driver.find_element_by_id("lst-ib") location = element.location size = element.size driver.save_screenshot("shot.png") x = location['x'] y = location['y'] w = size['width'] h = size['height'] width = x + w height = y + h im = Image.open('shot.png') im = im.crop((int(x), int(y), int(width), int(height))) im.save('image.png')

更新

現在、chromeは個々の要素のスクリーンショットもサポートしています。以下のように、Web要素のスクリーンショットを直接キャプチャできます。

from Selenium import webdriver driver = webdriver.Chrome() driver.get('https://www.google.co.in') image = driver.find_element_by_id("lst-ib").screenshot_as_png # or # element = driver.find_element_by_id("lst-ib") # element.screenshot_as_png("image.png")

user2504655 · Answer

スクリーンショットを撮るのに多くの時間を無駄にしました。あなたのスクリーンショットを保存したいと思います。 chrome + Selenium + c＃を使用しましたが、結果はまったくひどいものでした。最後に、関数を書きました：

driver.Manage().Window.Maximize(); RemoteWebElement remElement = (RemoteWebElement)driver.FindElement(By.Id("submit-button")); Point location = remElement.LocationOnScreenOnceScrolledIntoView; int viewportWidth = Convert.ToInt32(((IJavaScriptExecutor)driver).ExecuteScript("return document.documentElement.clientWidth")); int viewportHeight = Convert.ToInt32(((IJavaScriptExecutor)driver).ExecuteScript("return document.documentElement.clientHeight")); driver.SwitchTo(); int elementLocation_X = location.X; int elementLocation_Y = location.Y; IWebElement img = driver.FindElement(By.Id("submit-button")); int elementSize_Width = img.Size.Width; int elementSize_Height = img.Size.Height; Size s = new Size(); s.Width = driver.Manage().Window.Size.Width; s.Height = driver.Manage().Window.Size.Height; Bitmap bitmap = new Bitmap(s.Width, s.Height); Graphics graphics = Graphics.FromImage(bitmap as Image); graphics.CopyFromScreen(0, 0, 0, 0, s); bitmap.Save(filePath, System.Drawing.Imaging.ImageFormat.Png); RectangleF part = new RectangleF(elementLocation_X, elementLocation_Y + (s.Height - viewportHeight), elementSize_Width, elementSize_Height); Bitmap bmpobj = (Bitmap)Image.FromFile(filePath); Bitmap bn = bmpobj.Clone(part, bmpobj.PixelFormat); bn.Save(finalPictureFilePath, System.Drawing.Imaging.ImageFormat.Png);

Brook · Answer

C＃でコードを要求するすべての人のために、以下は私の実装の簡易バージョンです。

public static void TakeScreenshot(IWebDriver driver, IWebElement element) { try { string fileName = DateTime.Now.ToString("yyyy-MM-dd HH-mm-ss") + ".jpg"; Byte[] byteArray = ((ITakesScreenshot)driver).GetScreenshot().AsByteArray; System.Drawing.Bitmap screenshot = new System.Drawing.Bitmap(new System.IO.MemoryStream(byteArray)); System.Drawing.Rectangle croppedImage = new System.Drawing.Rectangle(element.Location.X, element.Location.Y, element.Size.Width, element.Size.Height); screenshot = screenshot.Clone(croppedImage, screenshot.PixelFormat); screenshot.Save(String.Format(@"C:\SeleniumScreenshots\" + fileName, System.Drawing.Imaging.ImageFormat.Jpeg)); } catch (Exception e) { logger.Error(e.StackTrace + ' ' + e.Message); } }

H&#252;seyin Yağlı · Answer

C＃の拡張関数は次のとおりです。

public static BitmapImage GetElementImage(this IWebDriver webDriver, By by) { var elements = webDriver.FindElements(by); if (elements.Count == 0) return null; var element = elements[0]; var screenShot = (webDriver as ITakesScreenshot).GetScreenshot(); using (var ms = new MemoryStream(screenShot.AsByteArray)) { Bitmap screenBitmap; screenBitmap = new Bitmap(ms); return screenBitmap.Clone( new Rectangle( element.Location.X, element.Location.Y, element.Size.Width, element.Size.Height ), screenBitmap.PixelFormat ).ToBitmapImage(); } }

これを使用して、次のような要素の画像を取得できます。

var image = webDriver.GetElementImage(By.Id("someId"));

rath · Answer

Suryaの答えディスクIOにかかわることを気にしないのなら、うまくいきます。そうではない場合は、この方法が適している可能性があります

private Image getScreenshot(final WebDriver d, final WebElement e) throws IOException { final BufferedImage img; final Point topleft; final Point bottomright; final byte[] screengrab; screengrab = ((TakesScreenshot) d).getScreenshotAs(OutputType.BYTES); img = ImageIO.read(new ByteArrayInputStream(screengrab)); //crop the image to focus on e //get dimensions (crop points) topleft = e.getLocation(); bottomright = new Point(e.getSize().getWidth(), e.getSize().getHeight()); return img.getSubimage(topleft.getX(), topleft.getY(), bottomright.getX(), bottomright.getY()); }

必要に応じて、screengrabの宣言をスキップして、代わりに

img = ImageIO.read( new ByteArrayInputStream( ((TakesScreenshot) d).getScreenshotAs(OutputType.BYTES)));

よりクリーンですが、わかりやすくするために残しておきます。その後、ファイルとして保存または JPanelに保存を心ゆくまでお楽しみください。

sillicon · Answer

JavaScriptソリューションを探しているなら、ここに私の要点があります：

https://Gist.github.com/sillicon/4abcd9079a7d29cbb53ebee547b55fba

基本的な考え方は同じです。最初にスクリーンショットを撮り、それからトリミングします。ただし、私のソリューションでは、純粋なWebDriver APIコードだけで他のライブラリは必要ありません。ただし、副作用は、テストブラウザの負荷が増加する可能性があることです。

Waqar Ullah Khan · Answer

public void GenerateSnapshot(string url, string selector, string filePath) { using (IWebDriver driver = new ChromeDriver()) { driver.Navigate().GoToUrl(url); var remElement = driver.FindElement(By.CssSelector(selector)); Point location = remElement.Location; var screenshot = (driver as ChromeDriver).GetScreenshot(); using (MemoryStream stream = new MemoryStream(screenshot.AsByteArray)) { using (Bitmap bitmap = new Bitmap(stream)) { RectangleF part = new RectangleF(location.X, location.Y, remElement.Size.Width, remElement.Size.Height); using (Bitmap bn = bitmap.Clone(part, bitmap.PixelFormat)) { bn.Save(filePath, System.Drawing.Imaging.ImageFormat.Png); } } } driver.Close(); } }

rovr138 · Answer

Python 3

Selenium 3.141.0とchromedriver 73.0.3683.68を試してみましたが、これは動作しますが、

from Selenium import webdriver chromedriver = '/usr/local/bin/chromedriver' chromeOptions = webdriver.ChromeOptions() chromeOptions.add_argument('window-size=1366x768') chromeOptions.add_argument('disable-extensions') cdriver = webdriver.Chrome(options=chromeOptions, executable_path=chromedriver) cdriver.get('url') element = cdriver.find_element_by_css_selector('.some-css.selector') element.screenshot_as_png('elemenent.png')

フル画像を取得し、フルスクリーン画像の一部を取得する必要はありません。

Rohitの答えが作成されたとき、これは利用できなかったかもしれません。

ER.swatantra · Answer

Seleniumの特定の要素のスナップショットを取得する機能の下。ここで、ドライバーはWebDriverの一種です。

private static void getScreenshot(final WebElement e, String fileName) throws IOException { final BufferedImage img; final Point topleft; final Point bottomright; final byte[] screengrab; screengrab = ((TakesScreenshot) driver).getScreenshotAs(OutputType.BYTES); img = ImageIO.read(new ByteArrayInputStream(screengrab)); topleft = e.getLocation(); bottomright = new Point(e.getSize().getWidth(), e.getSize().getHeight()); BufferedImage imgScreenshot= (BufferedImage)img.getSubimage(topleft.getX(), topleft.getY(), bottomright.getX(), bottomright.getY()); File screenshotLocation = new File("Images/"+fileName +".png"); ImageIO.write(imgScreenshot, "png", screenshotLocation); }

Jan Rozycki · Answer

自動視覚比較ツール https://github.com/bfirsh/needle を使用することを検討してください。これには、特定の要素（CSSセレクターによって選択された）のスクリーンショットを撮影できる機能が組み込まれています。このツールはSeleniumのWebDriverで動作し、Pythonで書かれています。

Mnemo · Answer

using System.Drawing; using System.Drawing.Imaging; using OpenQA.Selenium; using OpenQA.Selenium.Firefox; public void ScreenshotByElement() { IWebDriver driver = new FirefoxDriver(); String baseURL = "www.google.com/"; //url link String filePath = @"c:\img1.png"; driver.Navigate().GoToUrl(baseURL); var remElement = driver.FindElement(By.Id("Butterfly")); Point location = remElement.Location; var screenshot = (driver as FirefoxDriver).GetScreenshot(); using (MemoryStream stream = new MemoryStream(screenshot.AsByteArray)) { using (Bitmap bitmap = new Bitmap(stream)) { RectangleF part = new RectangleF(location.X, location.Y, remElement.Size.Width, remElement.Size.Height); using (Bitmap bn = bitmap.Clone(part, bitmap.PixelFormat)) { bn.Save(filePath, ImageFormat.Png); } } } }

Green Lei · Answer

クロムで例外Java.awt.image.RasterFormatExceptionが発生した場合、または要素をスクロールして表示したい場合は、スクリーンショットをキャプチャします。

@Suryaの回答からのソリューションを以下に示します。

 JavascriptExecutor jsExecutor = (JavascriptExecutor) driver; Long offsetTop = (Long) jsExecutor.executeScript("window.scroll(0, document.querySelector(\""+cssSelector+"\").offsetTop - 0); return document.querySelector(\""+cssSelector+"\").getBoundingClientRect().top;"); WebElement ele = driver.findElement(By.cssSelector(cssSelector)); // Get entire page screenshot File screenshot = ((TakesScreenshot)driver).getScreenshotAs(OutputType.FILE); BufferedImage fullImg = ImageIO.read(screenshot); // Get the location of element on the page Point point = ele.getLocation(); // Get width and height of the element int eleWidth = ele.getSize().getWidth(); int eleHeight = ele.getSize().getHeight(); // Crop the entire page screenshot to get only element screenshot BufferedImage eleScreenshot= fullImg.getSubimage(point.getX(), Math.toIntExact(offsetTop), eleWidth, eleHeight); ImageIO.write(eleScreenshot, "png", screenshot); // Copy the element screenshot to disk File screenshotLocation = new File("c:\temp\div_element_1.png"); FileUtils.copyFile(screenshot, screenshotLocation);

Andrew · Answer

c＃コード：

public Bitmap MakeElemScreenshot( IWebDriver driver, WebElement elem) { Screenshot myScreenShot = ((ITakesScreenshot)driver).GetScreenshot(); Bitmap screen = new Bitmap(new MemoryStream(myScreenShot.AsByteArray)); Bitmap elemScreenshot = screen.Clone(new Rectangle(elem.Location, elem.Size), screen.PixelFormat); screen.Dispose(); return elemScreenshot; }