Key Takeaways
- cURL, a powerful tool used for a variety of tasks from sending emails to downloading subtitles, can be used in PHP through an extension offering the same functionality as the console utility.
- cURL can be used to retrieve web pages, log into websites, work with FTP and send multiple requests. For instance, you can simulate logging into a WordPress-powered website by sending a POST request with specific details.
- Troubleshooting cURL requests is simplified with the use of two functions: curl_getinfo() and curl_error(). These functions provide detailed information about the channel and any errors that may occur during the request.
- cURL is an efficient and powerful tool for making remote calls, making it ideal for tasks such as accessing external APIs or crawling. It offers a user-friendly interface and relatively easy execution of requests.
How Does it Work?
All cURL requests follow the same basic pattern:
- First we initialize the cURL resource (often abbreviated as ch for “cURL handle”) by calling the curl_init() function.
- Next we set various options, such as the URL, request method, payload data, etc. Options can be set individually with curl_setopt(), or we can pass an array of options to curl_setopt_array().
- Then we execute the request by calling curl_exec().
- Finally, we free the resource to clear out memory.
<?php
// init the resource
$ch = curl_init();
// set a single option...
curl_setopt($ch, OPTION, $value);
// ... or an array of options
curl_setopt_array($ch, array(
OPTION1 => $value1,
OPTION2 => $value2
));
// execute
$output = curl_exec($ch);
// free
curl_close($ch);
The only thing that changes for the request is what options are set, which of course depends on what you’re doing with cURL.
Retrieve a Web Page
The most basic example of using cURL that I can think of is simply fetching the contents of a web page. So, let’s fetch the homepage of the BBC as an example.
<?php
$ch = curl_init();
curl_setopt_array(
    $ch, array(
    CURLOPT_URL => 'http://www.bbc.co.uk/',
    CURLOPT_RETURNTRANSFER => true
));
$output = curl_exec($ch);
curl_close($ch);
echo $output;
Check the output in your browser and you should see the BBC website displayed. The page renders correctly only because the site links to its stylesheets and images with absolute URLs.
The options we just used were:
- CURLOPT_URL – specifies the URL for the request
- CURLOPT_RETURNTRANSFER – when set to false, curl_exec() outputs the response directly and returns true or false depending on the success of the request. When set to true, curl_exec() returns the contents of the response.
Log in to a Website
cURL executed a GET request to retrieve the BBC page, but cURL can also use other methods, such as POST and PUT. For this example, let’s simulate logging into a WordPress-powered website. Logging in is done by sending a POST request to http://example.com/wp-login.php with the following details:
- log – the username
- pwd – the password
- redirect_to – the URL we want to go to after logging in
- testcookie – should be set to 1 (this is just for WordPress)
<?php
$postData = array(
    'log' => 'acogneau',
    'pwd' => 'secretpassword',
    'redirect_to' => 'http://example.com',
    'testcookie' => '1'
);
$ch = curl_init();
curl_setopt_array($ch, array(
    CURLOPT_URL => 'http://example.com/wp-login.php',
    CURLOPT_RETURNTRANSFER => true,
    CURLOPT_POST => true,
    CURLOPT_POSTFIELDS => $postData,
    CURLOPT_FOLLOWLOCATION => true
));
$output = curl_exec($ch);
echo $output;
The new options are:
- CURLOPT_POST – set this to true if you want to send a POST request
- CURLOPT_POSTFIELDS – the data that will be sent in the body of the request
- CURLOPT_FOLLOWLOCATION – if set to true, cURL will follow redirects
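One detail worth knowing about CURLOPT_POSTFIELDS: when it is given an array, cURL sends the body as multipart/form-data, whereas a URL-encoded string is sent as application/x-www-form-urlencoded, which is what an ordinary HTML form submits. The following is a minimal sketch of building such a string with http_build_query(); the field names and values are made up for illustration.

```php
<?php
// Hypothetical form fields, for illustration only.
$formFields = array(
    'username' => 'acogneau',
    'password' => 'secret password'
);

// http_build_query() URL-encodes each value and joins the pairs with "&".
$body = http_build_query($formFields);

echo $body; // username=acogneau&password=secret+password
```

Passing $body instead of the array to CURLOPT_POSTFIELDS changes only how the request body is encoded; the other options stay the same.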
Logging in is only useful if later requests can reuse the session, so we also tell cURL to store the cookies it receives:
<?php
curl_setopt_array($ch, array(
    CURLOPT_URL => 'http://example.com/wp-login.php',
    CURLOPT_RETURNTRANSFER => true,
    CURLOPT_POST => true,
    CURLOPT_POSTFIELDS => $postData,
    CURLOPT_FOLLOWLOCATION => true,
    CURLOPT_COOKIESESSION => true,
    CURLOPT_COOKIEJAR => 'cookie.txt'
));
The new options are:
- CURLOPT_COOKIESESSION – if set to true, cURL will start a new cookie session and ignore any previous cookies
- CURLOPT_COOKIEJAR – this is the name of the file where cURL should save cookie information. Make sure you have the correct permissions to write to the file!
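The saved cookies only help if a later request sends them back. Here is a hedged sketch of such a follow-up request, assuming an earlier login wrote its cookies to cookie.txt. The createAuthenticatedHandle() helper and the wp-admin URL are placeholders; CURLOPT_COOKIEFILE is the option that tells cURL which file to read cookies from.

```php
<?php
// Build a handle for a page that needs the cookies saved by an earlier login.
// The helper name, URL and cookie file name are placeholders.
function createAuthenticatedHandle($url, $cookieFile)
{
    $ch = curl_init();
    curl_setopt_array($ch, array(
        CURLOPT_URL => $url,
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_COOKIEFILE => $cookieFile, // read the cookies saved earlier
        CURLOPT_COOKIEJAR => $cookieFile   // write back any updated cookies
    ));
    return $ch;
}

$ch = createAuthenticatedHandle('http://example.com/wp-admin/', 'cookie.txt');
// $output = curl_exec($ch); // uncomment to actually perform the request
curl_close($ch);
```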
Working with FTP
Using cURL to download and upload files via FTP is easy as well. Let’s look at downloading a file:
<?php
$ch = curl_init();
curl_setopt_array($ch, array(
    CURLOPT_URL => 'ftp://ftp.example.com/test.txt',
    CURLOPT_RETURNTRANSFER => true,
    CURLOPT_USERPWD => 'username:password'
));
$output = curl_exec($ch);
curl_close($ch);
echo $output;
Note that there aren’t many public FTP servers that allow anonymous uploads and downloads for security reasons, so the URL and credentials above are just placeholders.
This is almost the same as sending an HTTP request, with only a couple of minor differences:
- CURLOPT_URL – the URL of the file; note the use of “ftp://” instead of “http://”
- CURLOPT_USERPWD – the login credentials for the FTP server
Uploading a file takes a few more options:
<?php
$fp = fopen('test.txt', 'r');
$ch = curl_init();
curl_setopt_array($ch, array(
    CURLOPT_URL => 'ftp://ftp.example.com/test.txt',
    CURLOPT_USERPWD => 'username:password',
    CURLOPT_UPLOAD => true,
    CURLOPT_INFILE => $fp,
    CURLOPT_INFILESIZE => filesize('test.txt')
));
curl_exec($ch);
fclose($fp);
curl_close($ch);
The important options here are:
- CURLOPT_UPLOAD – set this to true to upload a file rather than download one
- CURLOPT_INFILE – a readable stream for the file we want to upload
- CURLOPT_INFILESIZE – the size of the file we want to upload in bytes
Sending Multiple Requests
Imagine we have to perform five requests to retrieve all of the necessary data. Keep in mind that some things will be beyond our control, such as network latency and the response speed of the target servers. It should be obvious then that any delays when issuing five consecutive calls can really add up!
One way to mitigate this problem is to issue the requests asynchronously. Asynchronous techniques are more common in the JavaScript and Node.js communities, but briefly: instead of waiting for a time-consuming task to complete, we assign the task to a different thread or process and continue to do other things in the meantime. When the task is complete we come back for its result. The important thing is that we haven’t wasted time waiting for a result; we spent it executing other code independently.
The approach for performing multiple asynchronous cURL requests is a bit different from before. We start out the same – we initiate each channel and then set the options – but then we initiate a multihandler using curl_multi_init() and add our channels to it with curl_multi_add_handle(). We execute the handlers by looping through them and checking their status. In the end we get a response’s content with curl_multi_getcontent().
<?php
// URLs we want to retrieve
$urls = array(
'http://www.google.com',
'http://www.bing.com',
'http://www.yahoo.com',
'http://www.twitter.com',
'http://www.facebook.com'
);
// initialize the multihandler
$mh = curl_multi_init();
$channels = array();
foreach ($urls as $key => $url) {
// initiate individual channel
$channels[$key] = curl_init();
curl_setopt_array($channels[$key], array(
CURLOPT_URL => $url,
CURLOPT_RETURNTRANSFER => true,
CURLOPT_FOLLOWLOCATION => true
));
// add channel to multihandler
curl_multi_add_handle($mh, $channels[$key]);
}
// execute - if there is an active connection then keep looping
$active = null;
do {
$status = curl_multi_exec($mh, $active);
}
while ($active && $status == CURLM_OK);
// echo the content, remove the handlers, then close them
foreach ($channels as $chan) {
echo curl_multi_getcontent($chan);
curl_multi_remove_handle($mh, $chan);
curl_close($chan);
}
// close the multihandler
curl_multi_close($mh);
The above code took around 1,100 ms to execute on my laptop. Performing the same requests sequentially, without the multi interface, took around 2,000 ms. Imagine what your gain will be if you are sending hundreds of requests!
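A loop that calls curl_multi_exec() continuously keeps the CPU busy while the transfers are in flight. A common variation, sketched here, waits for network activity with curl_multi_select() between calls. The runMultiHandle() helper name and the one-second timeout are assumptions made for illustration.

```php
<?php
// Run every handle attached to $mh until all transfers have finished.
// Hypothetical helper; returns the last status from curl_multi_exec().
function runMultiHandle($mh)
{
    $active = 0;
    do {
        $status = curl_multi_exec($mh, $active);
        if ($active && $status === CURLM_OK) {
            // Wait up to one second for activity on any transfer
            // instead of spinning in a tight loop.
            curl_multi_select($mh, 1.0);
        }
    } while ($active && $status === CURLM_OK);

    return $status;
}
```

In a script that has already added its channels to $mh, the do/while block would be replaced by a call to runMultiHandle($mh); reading the content and closing the handles stays the same.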
Multiple projects exist that abstract and wrap the multi interface. Discussing them is beyond the scope of the article, but if you’re planning to issue multiple requests asynchronously then I recommend you take a look at them:
- github.com/petewarden/ParallelCurl
- semlabs.co.uk/journal/object-oriented-curl-class-with-multi-threading
Troubleshooting
If you’re using cURL then you are probably performing your requests to third-party servers. You can’t control them and much can go wrong: servers can go offline, directory structures can change, etc. We need an efficient way to find out what’s wrong when something doesn’t work, and luckily cURL offers two functions for this: curl_getinfo() and curl_error().
curl_getinfo() returns an array with all of the information regarding the channel, so if you want to check if everything is all right you can use:
<?php
var_dump(curl_getinfo($ch));
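The full array is verbose, and often only a few entries matter. Below is a small sketch that picks out the status code, the URL and the total time. The summarizeTransfer() helper is hypothetical, and the sample array stands in for what curl_getinfo($ch) returns after a real request; http_code, url and total_time are keys of that array.

```php
<?php
// Condense the info array into a single line of text.
function summarizeTransfer(array $info)
{
    return sprintf(
        '%d %s (%.2f s)',
        $info['http_code'],
        $info['url'],
        $info['total_time']
    );
}

// Sample values for illustration; in practice pass curl_getinfo($ch).
$info = array(
    'url' => 'http://example.com/',
    'http_code' => 404,
    'total_time' => 0.3125
);

echo summarizeTransfer($info); // 404 http://example.com/ (0.31 s)
```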
If an error pops up, you can check it out with curl_error():
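<?php
// compare strictly: with CURLOPT_RETURNTRANSFER set, an empty response is
// an empty string, which a loose check would also treat as a failure
if (curl_exec($ch) === false) {
    // curl_exec() returned false and thus failed
    echo 'An error has occurred: ' . curl_error($ch);
}
else {
    echo 'everything was successful';
}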
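Because curl_exec() returns the response body when CURLOPT_RETURNTRANSFER is on, a strict comparison with false distinguishes a failed transfer from an empty response. The sketch below wraps that check in a hypothetical fetchOrFail() helper and adds curl_errno() for the numeric error code. The deliberately invalid URL scheme is only there to trigger a failure without touching the network.

```php
<?php
// Fetch a URL and throw if the transfer itself fails.
function fetchOrFail($url)
{
    $ch = curl_init();
    curl_setopt_array($ch, array(
        CURLOPT_URL => $url,
        CURLOPT_RETURNTRANSFER => true
    ));

    $output = curl_exec($ch);
    if ($output === false) {
        // Capture the error details before the handle is closed.
        $message = sprintf('cURL error %d: %s', curl_errno($ch), curl_error($ch));
        curl_close($ch);
        throw new RuntimeException($message);
    }

    curl_close($ch);
    return $output;
}

try {
    fetchOrFail('nosuchscheme://example.com/');
} catch (RuntimeException $e) {
    echo $e->getMessage();
}
```

Note that this only catches transport-level failures; an HTTP 404 or 500 still counts as a successful transfer and has to be checked separately via curl_getinfo().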
Conclusion
cURL offers a powerful and efficient way to make remote calls, so if you’re ever in need of a crawler or something to access an external API, cURL is a great tool for the job. It provides us a nice interface and a relatively easy way to execute requests. For more information, check out the PHP Manual and the cURL website. See you next time!
Frequently Asked Questions (FAQs) about Using cURL for Remote Requests
What is cURL and why is it used in PHP?
cURL, or Client URL, is a library that allows you to make HTTP requests in PHP. It’s used to communicate with different types of servers and to download or upload data. cURL supports various protocols like HTTP, HTTPS, FTP, and more. It’s a powerful tool that can be used to interact with APIs, scrape web pages, or even download files.
How do I install and enable cURL in PHP?
The cURL extension ships with most PHP installations. If it’s not enabled, you can enable it by editing your php.ini file. Locate the line that says “;extension=curl” and remove the semicolon. If the line doesn’t exist, you can add it at the end of the file. After making changes, save the file and restart your web server.
How do I make a simple cURL request in PHP?
To make a simple cURL request, you first need to initialize cURL with the curl_init() function. Then, set your options using the curl_setopt() function. Finally, execute the request with curl_exec() and close the session with curl_close(). Here’s a basic example:
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, "http://example.com");
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
$output = curl_exec($ch);
curl_close($ch);
How can I handle errors in cURL?
You can handle errors in cURL by using the curl_errno() and curl_error() functions. These functions return the last error number and error message respectively. Here’s an example:
if (curl_errno($ch)) {
    echo 'Error:' . curl_error($ch);
}
How do I send a POST request using cURL?
To send a POST request, you need to set the CURLOPT_POST option to true and the CURLOPT_POSTFIELDS option to the data you want to send, either as an array or as a URL-encoded string. Here’s an example:
curl_setopt($ch, CURLOPT_POST, 1);
curl_setopt($ch, CURLOPT_POSTFIELDS, "postvar1=value1&postvar2=value2");
How can I set custom headers for a cURL request?
You can set custom headers by using the CURLOPT_HTTPHEADER option. This option takes an array of headers as its value. Here’s an example:
$headers = array(
    'Content-Type: application/json',
    'Authorization: Bearer ' . $token
);
curl_setopt($ch, CURLOPT_HTTPHEADER, $headers);
How do I follow redirects with cURL?
To follow redirects, you need to set the CURLOPT_FOLLOWLOCATION option to true. Here’s how you can do it:
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
How can I get the response headers from a cURL request?
To get the response headers, you can set the CURLOPT_HEADER option to true. This will include the headers in the output. Here’s an example:
curl_setopt($ch, CURLOPT_HEADER, true);
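With CURLOPT_HEADER enabled, the headers and the body arrive as one string. One way to separate them, sketched here, is to cut at the offset reported by curl_getinfo($ch, CURLINFO_HEADER_SIZE). The splitResponse() helper and the sample response are hypothetical.

```php
<?php
// Split a raw response into its header block and its body.
// $headerSize is the value of curl_getinfo($ch, CURLINFO_HEADER_SIZE).
function splitResponse($raw, $headerSize)
{
    return array(
        'headers' => substr($raw, 0, $headerSize),
        // The cast covers older PHP versions, where substr() can return false.
        'body' => (string) substr($raw, $headerSize)
    );
}

// Stand-in for the output of curl_exec() with CURLOPT_HEADER set.
$headerBlock = "HTTP/1.1 200 OK\r\nContent-Type: text/plain\r\n\r\n";
$raw = $headerBlock . 'hello';

$parts = splitResponse($raw, strlen($headerBlock));
echo $parts['body']; // hello
```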
How do I send a file using cURL?
To send a file, pass a CURLFile object in the CURLOPT_POSTFIELDS array. Older tutorials prefix the file path with an @ symbol instead, but that syntax was removed in PHP 7. Here’s an example:
curl_setopt($ch, CURLOPT_POSTFIELDS, array('file' => new CURLFile('/path/to/file.txt')));
How do I use cURL with a proxy?
To use cURL with a proxy, you can set the CURLOPT_PROXY option to the address of the proxy. Here’s how you can do it:
curl_setopt($ch, CURLOPT_PROXY, "http://proxy.example.com:8080");