Questions tagged [jsoup]

jsoup is a Java library for parsing HTML. It extracts and manipulates HTML data using DOM traversal, CSS selectors, and jQuery-like methods.

What is the best way to extract and parse CSS from an HTML page in Java?

I'm looking to extract and parse the content of HTML pages using the Jsoup API. However, I've encountered a challenge in extracting CSS that is located in a different folder. How can this be achieved? Currently, my code saves the ...

Finding text outside of a HTML tag with jsoup

I am working with the following HTML code: Data <div class="alg"></div> <div class="alg"></div> Pepsi 791 <div class="alg"></div> <div class="alg"></ ...

Fetching table data with a curl command

I need help parsing data from a website. The JSON request runs successfully from the command line but runs into problems when executed on Android. The request: "curl '' -H 'Host: billetterie.ctm.ma' -H ' ...

jsoup - locate element and delete it along with the element preceding it

In my Android app, I am working on extracting data from a stock market historical prices table. However, there are certain rows in the table that need to be removed for clarity. Specifically, I am trying to remove a row in the third tr. I have managed to r ...

Tips for extracting specific sub-sections of HTML content with Jsoup

Attempting to extract prices from an HTML file using Jsoup. The structured HTML looks like this: //some html <div class="price-point-wrap use-roundtrippricing"> <div class="price-point-wrap-top use-roundtrippricing"> <div class=" ...

Preserving camel case when using element.tagName(String) with jsoup

Looking to change the name of an HTML tag using Jsoup in the following way - element.tagName("p:panelGroup"); But when I run this code, the output changes the case and I end up with p:panelgroup. Is there a method to preserve the camel case in the resul ...

Extract data from a multi-page table using Jsoup

Currently, I am working on extracting player statistics from the table of data found on this website using jsoup and Selenium in Java. However, I have encountered a problem when attempting to parse a table that spans multiple pages. Does anyone have any s ...

Parsing a website with jsoup when redirection is involved

While parsing data from this URL, I encountered an issue with the ajax load process. Despite my efforts to parse the website, I only receive the body element and not the desired response within the ticket_lists. I am unsure how to tackle the redirection on ...

Selecting links nested within a div using Jsoup

I'm attempting to extract all the links nested within the div class news column index. Here is a snippet of the HTML structure: https://i.stack.imgur.com/zdFWS.jpg Below is the code I have used, but unfortunately, it's not yielding any results. I'm quite ...

Selecting elements with a CSS selector in Jsoup

Is it possible to retrieve all images that do not have the first class attribute as "content.slide0"? In my example, I am utilizing the Jsoup library to display selectable elements in a WebView. Elements element = doc.select("HERE_SOLUTION"); <head ...

Nesting a <div> inside an <a> with Jsoup

According to this answer: Under HTML 4.01, <a> elements may contain only inline elements. Since a <div> is a block element, it may not appear inside an <a>. However... In HTML5, <a> elements are all ...

Opening a PDF file via URL shows binary data in the browser instead of the document

I'm currently developing a Java application and facing an issue with opening PDF files in a web page using both IE and Chrome. After extensive research and attempts using the JSoup API, here is the complete process from HTML to Java controller to display t ...