Help with regex returning blanks

jasonahowie · February 8, 2023, 7:22pm

Hey all!

I’m getting a list of URLs from an API; some are duplicates because of query parameters and anchors (fragments) found on the page. I want to clean up URLs like these:
https://somewebsite.com/awesome?p=isforparameter
https://somewebsite.com/awesome#anchorsayyyymatey

I have this regex:
(http|ftp|https)://([\w_-]+(?:(?:\.[\w_-]+)+))([\w.,@^=%&:/~+-]*[\w@^=%&/~+-])

It correctly identifies the parts of the URL that I want to keep, as demonstrated here:

But when I use it in Bubble, like so:

I get blanks in the URL text field.

Any suggestions?


bubble.trouble · February 8, 2023, 8:21pm

This might be a dumb question, but have you tried it without the opening “/” and the closing “/i”?
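
To illustrate why the delimiters could matter, here is a rough Python sketch (not Bubble itself). It assumes the pattern from the first post with the dot escaped and "+-" written out, which is a guess at what the forum rendering changed. The helper name `extract` is made up for this example. When the "/" and "/i" wrapper is treated as part of the pattern, it has to match literally, so nothing is found.

```python
import re

# Pattern from the first post. The escaped dot and the "+-" are assumptions
# about what the forum rendering altered.
PATTERN = r"(http|ftp|https)://([\w_-]+(?:(?:\.[\w_-]+)+))([\w.,@^=%&:/~+-]*[\w@^=%&/~+-])"

urls = [
    "https://somewebsite.com/awesome?p=isforparameter",
    "https://somewebsite.com/awesome#anchorsayyyymatey",
]


def extract(pattern: str, text: str) -> str:
    """Return the first match of pattern in text, or an empty string."""
    match = re.search(pattern, text)
    return match.group(0) if match else ""


# Bare pattern: matching stops at "?" or "#", since neither is in the
# character classes.
bare = [extract(PATTERN, u) for u in urls]

# Same pattern wrapped in "/" and "/i" as literal text: the URL would need
# a "/" right before "https" and "/i" right after the match, so it fails.
wrapped = [extract("/" + PATTERN + "/i", u) for u in urls]

print(bare)
print(wrapped)
```

Regex101 shows the slashes and flags around the pattern, but they are delimiter syntax rather than part of the expression, which would explain blanks if they were pasted in along with it.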

keith · February 8, 2023, 10:11pm

A far simpler way to remove the querystring from a URL is:

some_url :split [by ?] :first item

Note: the second item in the split list will be the querystring (sans the “?”), should one exist.
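
For anyone who wants to see the same idea outside Bubble, here is a minimal Python sketch of the split approach. The function names are made up for illustration, and splitting on "#" as well is an addition to cover the anchor case from the first post.

```python
def strip_query(url: str) -> str:
    """Keep everything before the first "?" (the "first item" of the split)."""
    return url.split("?", 1)[0]


def strip_query_and_anchor(url: str) -> str:
    """Drop the "#anchor" part first, then the "?querystring" part."""
    return url.split("#", 1)[0].split("?", 1)[0]


# The second item of the split is the querystring without the "?".
parts = "https://somewebsite.com/awesome?p=isforparameter".split("?", 1)
print(parts[0])
print(parts[1])
print(strip_query_and_anchor("https://somewebsite.com/awesome#anchorsayyyymatey"))
```

A URL with no "?" or "#" comes back unchanged, because the split then produces a single item.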

(I realize @jasonahowie has a list of URLs…. You can apply the same principle in :formatted as)
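
Applied to a whole list, the same principle might look like the Python sketch below. This is only an analogy for what the Bubble expression would do per item; `clean_urls` is a hypothetical helper, and it uses the standard library URL parser instead of a manual split. It also removes the duplicates that stripping leaves behind.

```python
from urllib.parse import urlsplit


def clean_urls(urls: list[str]) -> list[str]:
    """Strip query and fragment from each URL, then drop duplicates in order."""
    stripped = [
        urlsplit(u)._replace(query="", fragment="").geturl()
        for u in urls
    ]
    # dict.fromkeys keeps the first occurrence of each URL and preserves order.
    return list(dict.fromkeys(stripped))


cleaned = clean_urls([
    "https://somewebsite.com/awesome?p=isforparameter",
    "https://somewebsite.com/awesome#anchorsayyyymatey",
])
print(cleaned)
```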

jasonahowie · February 15, 2023, 11:33pm

Relied too much on Regex101… thank you.
