Is it bad practice to execute this query in a for or foreach-loop?

vbasic41 · January 15, 2016, 12:13pm

So I have the following query:

$new = $result[$i]['resclients']; `
$resclients=$mysqli->query("SELECT id,client_name FROM clients WHERE id IN ($new)");`

And I am wondering, is it bad practice to execute the above query in a for or foreach-loop, does it hurt the MySQL server?

Or, is it better to do LEFT JOINS or INNER JOINS or RIGHT JOINS?

Forgot to add, the $result is actually a two dimensional array.

Array
(
    [0] => Array
        (
            [id] => 7
            [resclients] => 6,7,8,9,10,11,12,13,14,15
        )

    [1] => Array
        (
            [id] => 5
            [resclients] => 5
        )

    [2] => Array
        (
            [id] => 4
            [resclients] => 4
        )
)

Just a small portion of it.

droopsnoot · January 15, 2016, 1:10pm

I would guess that if it suits the rest of the code to combine those into a single query, then the database server would probably have less work to do. But if doing so makes it much more complex to read and understand the code, possibly not worth it. All depends on what else is happening with the code, size of the data, size of the server.

s_molinari · January 15, 2016, 2:58pm

What is the object of the result array or rather what is it you are wanting as a final result? Can you form it into a question? Like, "For the first 10 results (what object are the results?), what are the id and client_name of each result’s resclients? Generally, from what I’ve learned, putting queries in a loop, especially a loop with a lot of iterations, is bad practice.

Scott

felgall · January 15, 2016, 7:35pm

If you do decide to use the loop then convert the code to use prepare/bind instead of query and keep the prepare outside of the loop so that the database can do as much as possible of the processing outside of the loop and only needs to substitute a few variables to execute the code inside the loop.

Mittineague · January 15, 2016, 9:28pm

IMHO nesting queries is best avoided if at all possible and is usually in code of those that have trouble putting together more complex queries.

As a simplified pseudocode example, say I have 3 tables

first - id, f_name, last_id 
last - id, l_name, age_id 
age - id, years

And I want to get the first name, last name and age of all “Johns”

$result = SELECT f_name, last_id FROM first WHERE f_name LIKE 'John' 
 while ($result) 
  $result2 = SELECT l_name, age_id FROM last WHERE id = $result['last_id'] 
   while ($result2) 
    $result3 = SELECT years FROM age WHERE id = $result2['age_id'] 
     while ($result3) 
      echo $result['f_name'] . $result2['l_name'] . $result3['years']

OK, it may work - BUT - the code is hitting the database hard.

Compare to

$result = SELECT first.f_name, last.l_name, age.years 
          FROM first 
          INNER JOIN last 
           ON first.last_id = last.id 
          INNER JOIN age 
           ON last.age_id = age.id 
          WHERE first.f_name LIKE 'John'

The query is more complex - BUT - the code is hitting the database only once

The first example consumes more PHP memory saving all the “result” variable values.

As long as the database is designed with good indexes, there will be less resource use with the second example.

It depends of course on the size of the tables etc. Inefficient code may work fine on a small scale, but when things get big it could bring things to a crawl.

felgall · January 16, 2016, 3:13am

You can always add or remove indexes later if necessary without having to change the code.

vbasic41 · January 16, 2016, 11:11am

@felgall, @Mittineague, @s_molinari, and @droopsnoot, thanks for your replies.

I want to achive the following ouput:

[0] represents someone
length([resclients]) holds IDs, which are in ASC, and in return order are DESC, so the length becomes 10.

Department A added 10 clients, now to grab data about each client I need to use the query, and somehow loop to get separate outputs.

Department A added 10 clients => [resclients] => 6,7,8,9,10,11,12,13,14,15

Department B added 1 client => [resclients] => 5

Department C added 1 client => [resclients] => 4

Each of these IDs come from the same table, then I am trying to ensure that “Dep N added x clients”, and then grab each clients name etc.

Not sure if I am on the right track! Thanks a lot folks.

Mittineague · January 16, 2016, 5:06pm

I don’t know if it is outside of your control, but it looks like the problem may have more to do with poor database design.
Anytime a column holds multiple values like “3,5,6,8,9” it’s a good indication that there’s a good chance it could be improved somehow.

In any case, since IN works with an array, you might be able to do something like

loop 
$new = array_merge($new, [$i]) 
endloop 
WHERE IN $new

instead of

loop 
$new = [$i] 
WHERE IN $new 
endloop

felgall · January 16, 2016, 9:18pm

I think they are referring to the list of values to be returned. I don’t think they mean that the data is stored like that in the database.

What is the actual structure of the table. You are now talking about departments but there has been no mention of departments in the list of fields in the table mentioned so far.

If the department is identified by a field in the table then counting how many rows there are for each is trivial. If it isn’t in the table then you can’t query data that hasn’t been stored in the first place.

Mittineague · January 16, 2016, 10:59pm

I should clarify this.

IN is not working with an array, but an array value that is a comma separated list

So putting those values into an array would require an implode() to get it back to being a string of the list

s_molinari · January 17, 2016, 8:14am

I am still not sure I understand the goal of the query. I’ll try to put it into a question from the given information and you can tell me if I am wrong or right.

How many departments gained new clients and what are the name and id of those clients?

Is that correct? If it is, what determines a new client from an old one?

Scott

vbasic41 · January 17, 2016, 8:15am

CREATE Table department_activity (
  id BIGINT(100) AUTO_INCREMENT PRIMARY KEY,
  department_id BIGINT(100) NOT NULL, 
  object_id BIGINT(100) NOT NULL, 
  object_type VARCHAR(50) NOT NULL,
  action_name VARCHAR(50) NOT NULL, 
  activity_date DATETIME NOT NULL
);

id - Unique Activity Item ID.
department_id - ID of the department who created the activity item.
object_id - Internal ID of the object.
object_type - Type of object.
action_name - The action taken against the object.
activity_date - Timestamp that the action was created.

More information below:

INSERT INTO department_activity (department_id, object_id, object_type, action_name, activity_date)
VALUES ('1197381911108', '3438983', 'client', 'added' '2016-01-17 09:18:43');

1197381911108 - ID of the department.
3438983 - ID of the client.
‘client’ - The type of object.
‘added’ - The action taken.
‘2016-01-17 09:18:43’ - Timestamp when the action was taken.

I am producing it through this query:

FROM 
    (SELECT department_activity.*, date(department_activity.activity_date) groupby_date,
    COUNT(department_activity.id) AS number_of_clients_added,
    GROUP_CONCAT(department_activity.id) AS clients_comma_list 
    FROM department_activity
    INNER JOIN subscriptions 
    ON department_activity.user_id = subscriptions.subscribing_user
    WHERE subscriptions.user_id = '0'
    GROUP BY department_activity.user_id,
             date(department_activity.activity_date)
    ) As department_activities 
GROUP BY object_id,
      groupby_date
ORDER BY activity_date DESC
LIMIT 20;

I hope this information is enough to get us somewhere.

Mittineague · January 17, 2016, 8:25am

Just curious.
Is there a reason id isn’t UNSIGNED i.e. negative id values are possible?
The field name object_id suggests it’s numeric, is it really a String?

http://dev.mysql.com/doc/refman/5.7/en/example-auto-increment.html

Use the smallest integer data type for the AUTO_INCREMENT column that is large enough to hold the maximum sequence value you will need. When the column reaches the upper limit of the data type, the next attempt to generate a sequence number fails. Use the UNSIGNED attribute if possible to allow a greater range. For example, if you use TINYINT, the maximum permissible sequence number is 127. For TINYINT UNSIGNED, the maximum is 255. See Section 11.2.1, “Integer Types (Exact Value) - INTEGER, INT, SMALLINT, TINYINT, MEDIUMINT, BIGINT” for the ranges of all the integer types.

vbasic41 · January 17, 2016, 8:39am

A little bit off-topic here. Those are minor issues.

s_molinari · January 17, 2016, 9:50am

What if the department went and deleted one or more of the clients they just added?

Are the clients at all connected to the departments other than through the activity table?

What determines an old client from a new one?

Scott

vbasic41 · January 17, 2016, 10:23am

@s_molinari,

Then that row on the department_activity table will be removed as well.

Only through the department_activity table.

The department_activity table determines this.

vbasic41 · January 17, 2016, 2:30pm

Can I use the comma separated values in the same query with a UNION to query more records from a number of tables.

GROUP_CONCAT(department_activity.id) AS clients_comma_list

and use clients_comma_list in a way that would allow me to pull everything in one single query. Just curious.

s_molinari · January 18, 2016, 2:33pm

But, then then both “activities” are lost. That makes no sense at all, if this is some sort of historical tracking system for auditing purposes.

Then I’d do a join through the department_activity table.

What in the department_activity table determines a new to old client?

Scott

vbasic41 · January 18, 2016, 2:34pm

TIMESTAMP.

How?

s_molinari · January 18, 2016, 2:36pm

Are you storing a timestamp somewhere else, in order to determine the new customers? Like last time the query was made? Or somthing to that effect? The timestamp alone doesn’t determine “new” from “old”.

Scott