Parse an HTML page to retrieve some text

joliveau.loan · July 5, 2021, 4:49pm

Hello,
To be clear, what I need is something to parse the HTML content of a webpage and then being able to use the text contained in those elements into proper Bubble.io text elements.
Don’t hesitate to ask for more details if needed.
Thanks.

alanpieczonka · July 5, 2021, 5:56pm

Hi @joliveau.loan ,

to begin with, have you checked if the webpage that you want to crawl doesn’t have some sort of API so you could retrieve the data that way?

Regards,
Alan

joliveau.loan · July 5, 2021, 5:57pm

Hello @alanpieczonka ,
I have checked, and no, the page doesn’t have any kind of API.

alanpieczonka · July 5, 2021, 6:11pm

Unfortunately I haven’t done this exact feature so I can’t be very specific here, but you could probably use combination of tools like Octoparse (for crawling the data) and Parabola (for pushing the data into Bubble).
But maybe there is a simpler way and someone will post it here.

Jici · July 5, 2021, 7:25pm

You can first use API Connector to retrieve the whole page HTML (GET request to the page url, and type to “text”)
Using regex,you can extract what you need

ZeroqodeSupport · July 6, 2021, 10:11am

Hello, @joliveau.loan!

Check our Any Page Parsing plugin. Perhaps it can do the job you want:

Hope it helps.
Regards,
Zeroqode Support Team

Topic		Replies	Views
Webscraping - Return HTML of external page? Plugins	7	812	June 1, 2023
Parse HTML data return into lists Need help	5	568	June 29, 2023
Use Toolbox Plugin and Javascript to scrape a page Plugins	5	620	April 5, 2022
Fetch data from website (scraping?) Plugins	6	4673	October 25, 2021
Read data from an external webpage Questions	6	3147	July 22, 2019

Parse an HTML page to retrieve some text

Related topics