[Solved] Regular expression - parse string

Sam32 · January 17, 2015, 1:53pm

Hi,

I’m trying to extract all content of below string starting from tag [caption] to tag [/caption] including the tags. Preg_match_all seems to me the most adequate function. Can you help with the pattern please?

$str = "Some text... [caption id=\"attachment_275305\" align=\"aligncenter\" width=\"640\"]<a href=\"http://www.site.com\"><img class=\"size-full\" src=\"http://www.site.com\" alt=\"pic\" width=\"640\" height=\"345\" /><\/a> Photo by[/caption] ...more text";

Many thanks.

StarLion · January 17, 2015, 2:07pm

"~\[caption(.*?)\](.*?)\[/caption\]~i"

Note: Trim all subpatterns for extraneous spacing.
Edit: Probably also want the case insensitive flag.

Sam32 · January 17, 2015, 2:22pm

It worked just like I wanted.

Is it also possible to remove the block [caption][/caption] and remain only with the text using regular expressions.

Thanks for your help.

StarLion · January 17, 2015, 2:23pm

stick that pattern into preg_replace, passing it “$2” as the replacement.

Sam32 · January 17, 2015, 2:36pm

Just noticed it’s returning only a single block [caption][/caption]. It can happen that the string has 2 or more. Sorry, I should had mentioned this on my first post.

As to the preg_replace, unfortunately the link was kept.

echo preg_replace("~\[caption(.*?)\](.*?)\[/caption\]~i", "$2", $str);

Some text... <a href="http://www.site.com"><img class="size-full" src="http://www.site.com" alt="pic" width="640" height="345" /><\/a> Photo by ...more text

Thanks once again.

StarLion · January 17, 2015, 2:46pm

so use [FPHP]preg_match_all[/FPHP] instead of preg_match?

And if you want to remove HTML tags, [FPHP]strip_tags[/FPHP].

Sam32 · January 17, 2015, 2:55pm

Sorry, a little confusion with local/production environment. I didn’t replace preg_match with preg_match_all.

All is working now. Thanks!

Sam32 · January 17, 2015, 10:20pm

There is still an issue.

$str = "Some text... [caption id=\"attachment_275305\" align=\"aligncenter\" width=\"640\"]<a href=\"http://www.site.com\"><img class=\"size-full\" src=\"http://www.site.com\" alt=\"pic\" width=\"640\" height=\"345\" /><\/a> Photo by[/caption] ...more text";

echo strip_tags(preg_replace("~\[caption(.*?)\](.*?)\[/caption\]~i", "$2", $str));

The result is:
“Some text… Photo by …more text”

Instead of:
“Some text… …more text”

I wanted to remove all content within block [caption] [/caption]. Why is it that “Photo by” still appears?

Mittineague · January 17, 2015, 10:30pm

Because you said you wanted to extract it. That’s what the regex is doing.

If you mean you want to replace it with an empty string, change the
"$2" to ""

Sam32 · January 17, 2015, 10:36pm

Working now. Thanks.

system · April 19, 2015, 5:46am

This topic was automatically closed 91 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
preg_match question PHP	7	659	April 10, 2010
Help with regex to grab all instances of pattern from string PHP	5	264	November 13, 2010
Brain Freeze on REGEX PHP	5	531	January 8, 2010
Preg_Match Question PHP	8	1167	October 8, 2014
Regex Preg_match_all search PHP	3	2722	October 8, 2014

[Solved] Regular expression - parse string

Related topics