Extract a number from inside an HTML comment

busboy · January 26, 2016, 3:56am

What is the best way to extract 12345 when it resides in an HTML comment tag like this:

<!--12345-->

Thank you.

busboy · January 26, 2016, 4:12am

Doing some more searching online I came across what I think will work, substr.

$string = "<!--12345--> a small html comment in my example";

$sID = substr($string, 4, 5);

echo $sID;

This returns 12345, but how can I accommodate larger or smaller numbers? If 67812545 is the number, the 4, 5 parameters in substr will no longer extract the full number.

Please advise.

tracknut · January 26, 2016, 4:19am

Before you end up running around in circles, make sure the spec is right. For example, I notice in your second post you added the text " a small html comment in my example" after the <!--12345--> that you listed in the first post. Can you likewise have text in front:

here it is <!--12345-->

Do you allow spaces, as:

<!-- 12345 -->

What about fractional numbers:

<!--123.45-->

There are probably other cases I didn’t think of. All of these can complicate the solution for what was originally a very simple extract.

Mittineague · January 26, 2016, 4:20am

I guess it might be possible to use DOM functions to extract comment nodes, but I don’t know, as I’ve never needed to try it that way.
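For reference, the DOM route could look roughly like the sketch below. It assumes PHP's DOM extension is enabled; the helper name commentTexts and the sample markup (wrapped in a div) are made up for illustration and are not from the original question.

```php
<?php
// Hypothetical helper: return the text of every comment node in an HTML fragment.
function commentTexts(string $html): array
{
    // Silence libxml warnings about incomplete markup, then restore the old setting.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $texts = [];
    $xpath = new DOMXPath($doc);
    // The XPath node test comment() selects comment nodes anywhere in the tree.
    foreach ($xpath->query('//comment()') as $comment) {
        $texts[] = trim($comment->nodeValue);
    }
    return $texts;
}

print_r(commentTexts('<div><!--12345--> a small html comment in my example</div>'));
```

This parses the whole document, so for a single comment at a known position it is likely heavier than the string functions discussed in this thread.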

The more common way is to use
http://php.net/manual/en/function.preg-match.php

This requires loading the HTML as a string, so depending on how big it is there may be a performance hit, negligible or otherwise.
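A minimal preg_match() sketch for this case might look like the following. The helper name extractCommentNumber is hypothetical, and the pattern is an assumption about the spec: it tolerates optional spaces and a fractional part, and it only matches a comment that contains nothing but a number.

```php
<?php
// Hypothetical helper: return the number inside the first HTML comment that
// contains nothing but a number, or null if there is no such comment.
function extractCommentNumber(string $html): ?string
{
    // "<!--", optional whitespace, digits with an optional fractional part,
    // optional whitespace, "-->".
    if (preg_match('/<!--\s*(\d+(?:\.\d+)?)\s*-->/', $html, $matches) === 1) {
        return $matches[1];
    }
    return null;
}

echo extractCommentNumber("<!--12345--> a small html comment in my example"), PHP_EOL; // 12345
echo extractCommentNumber("here it is <!-- 123.45 -->"), PHP_EOL; // 123.45
```

Because the pattern is not anchored to the start of the string, it also works when text precedes the comment.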

busboy · January 26, 2016, 4:21am

The string will ALWAYS begin with the following:

<!--

There will not be any spaces in the html comment. Does this help?

busboy · January 26, 2016, 4:22am

The string can sometimes include several paragraphs of text, equating to hundreds of words in length. I’m not sure how much text would need to be involved in this function to take a performance hit.

tracknut · January 26, 2016, 4:30am

One option might be:

$n = substr($s,4)-strstr($s,"-->");

Given your last comment about having a lot of text after the closing comment, I’m wondering though whether there’s a better solution that doesn’t involve subtracting the string like that.
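The subtraction relies on PHP implicitly converting both strings to numbers, which recent PHP versions treat more strictly. One way to avoid it, sketched here under the assumption that the string always starts with "<!--" followed directly by digits, is to count the run of digits with strspn(). The function name leadingCommentDigits is made up for illustration.

```php
<?php
// Hypothetical helper: return the digits that directly follow a leading "<!--".
function leadingCommentDigits(string $s): string
{
    $prefixLength = strlen("<!--"); // 4
    // strspn() counts how many consecutive characters, starting at the offset,
    // belong to the given set; here that is the length of the number.
    $digitCount = strspn($s, "0123456789", $prefixLength);
    return substr($s, $prefixLength, $digitCount);
}

echo leadingCommentDigits("<!--67812545--> a small html comment in my example"), PHP_EOL; // 67812545
```

This never scans the text after the number, so the length of the trailing paragraphs does not matter.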

busboy · January 26, 2016, 4:33am

May the Lord bless you tracknut for your kindness. What you have supplied works perfectly.

Thank you.

tracknut · January 26, 2016, 4:34am

No problem. If you’re concerned about speed, I’m going to guess this:

$n = substr($s,4,strpos($s,"-->")-4);

would be faster. I haven’t timed it though.
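A slightly more defensive version of the same strpos() idea could be written as below. It is only a sketch: the function name numberFromLeadingComment is hypothetical, and returning null for malformed input is an assumption about what the caller wants.

```php
<?php
// Hypothetical helper: return whatever sits between a leading "<!--" and the
// first "-->" after it, or null if the string does not have that shape.
function numberFromLeadingComment(string $s): ?string
{
    $open = "<!--";
    $close = "-->";
    $start = strlen($open);

    // The string must begin with the opening delimiter.
    if (strncmp($s, $open, $start) !== 0) {
        return null;
    }
    // strpos() returns false when the closing delimiter is missing.
    $end = strpos($s, $close, $start);
    if ($end === false) {
        return null;
    }
    return substr($s, $start, $end - $start);
}

echo numberFromLeadingComment("<!--67812545--> a small html comment in my example"), PHP_EOL; // 67812545
```

The explicit false check matters because substr() with a length computed from a failed strpos() would silently return the wrong slice.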

Mittineague · January 26, 2016, 4:36am

What I’ve used before is a recursive directory search to get all PHP files, then file_get_contents() to put the PHP code into a string so I could then use preg_match().

No way I could have got this information without it.

A list of the 168 PHP Classes and 3,856 PHP Functions found in WordPress version 3.0

PHP and HTML files are both text files, so it works.

The trick is getting the regex right.
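As a rough illustration of that workflow (not the script actually used for the WordPress list), the sketch below walks a directory tree, reads each .php file into a string and collects function names. The helper name findFunctionNames and the deliberately simplified pattern are assumptions; the pattern will also match the word "function" inside comments or strings.

```php
<?php
// Hypothetical helper: collect the names of functions declared in .php files under $dir.
function findFunctionNames(string $dir): array
{
    $names = [];
    $files = new RecursiveIteratorIterator(
        new RecursiveDirectoryIterator($dir, FilesystemIterator::SKIP_DOTS)
    );
    foreach ($files as $file) {
        if (!$file->isFile() || strtolower($file->getExtension()) !== 'php') {
            continue;
        }
        $code = file_get_contents($file->getPathname());
        if ($code === false) {
            continue; // unreadable file, skip it
        }
        // Simplified pattern: the keyword "function", an optional "&", then a name and "(".
        if (preg_match_all('/\bfunction\s+&?\s*([A-Za-z_]\w*)\s*\(/', $code, $matches)) {
            $names = array_merge($names, $matches[1]);
        }
    }
    sort($names);
    return array_values(array_unique($names));
}
```

Calling findFunctionNames() on a project directory returns a sorted list of unique names; anonymous functions are skipped because no name follows the keyword.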

busboy · January 26, 2016, 4:37am

Thank you Mittineague.

spaceshiptrooper · January 26, 2016, 6:36am

I’ve always found strtr very helpful when trying to strip or otherwise change parts of a string. You can pass it an array of replacements, so instead of writing a separate line for each thing you want to strip, you do it all in one line.

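<?php
$string = "<!--12345--> a small html comment in my example";
// Map each delimiter variant (with or without a padding space) to an empty string.
$array = ['<!-- ' => '', '<!--' => '', ' -->' => '', '-->' => ''];
$string = strtr($string, $array);
echo $string, PHP_EOL; // Prints "12345 a small html comment in my example": the <!-- and --> are gone, the trailing text remains
echo (int) $string, PHP_EOL; // Prints 12345: the cast keeps only the leading digits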

It also leaves your numbers untouched as long as you don’t list them in the array.

busboy · January 27, 2016, 3:43am

Thank you spaceshiptrooper!
