This package provides a Guzzle 6/7 middleware that transparently converts documents obtained by Guzzle from its native encoding to UTF-8 (or any other specified encoding). It supports the following features:
- Detection of charset from
Content-Type
HTTP header. - Detection of charset from
meta
element in HTML document. - Detection of charset from XML declaration in RSS and other XML documents.
- Updating the
Content-Type
header in theResponse
object according to target encoding. - Updating the metadata in the
Response
body according to target encoding (not enabled by default).
It is recommended to install the library using Composer:
composer require fossar/guzzle-transcoder
use Fossar\GuzzleTranscoder\GuzzleTranscoder;
use GuzzleHttp\Client;
use GuzzleHttp\HandlerStack;
$stack = HandlerStack::create();
$stack->push(new GuzzleTranscoder);
$client = new Client(['handler' => $stack]);
$url = 'https://www.myseosolution.de/scripts/encoding-test.php?enc=iso'; // request website with iso-8859-1 encoding
$req = $client->get($url);
echo $req->getBody();
use Fossar\GuzzleTranscoder\GuzzleTranscoder;
use GuzzleHttp\Client;
use GuzzleHttp\HandlerStack;
$stack = HandlerStack::create();
$stack->push(new GuzzleTranscoder([
'targetEncoding' => 'windows-1252',
// Swap the default settings:
'replaceHeaders' => false,
'replaceContent' => true,
]));
$client = new Client(['handler' => $stack]);
$url = 'https://www.myseosolution.de/scripts/encoding-test.php?enc=iso'; // request website with iso-8859-1 encoding
$req = $client->get($url);
echo $req->getHeaderLine('Content-Type') . "\n"; // HTTP header will remain unchanged
echo $req->getBody();
It is largely based on Pascal Landau’s guzzle-auto-charset-encoding-subscriber and web-utility libraries.
We are using Transcoder library. This allows us to fall back to iconv
when mbstring
is not available or an encoding is not supported by it.
The source code is available under the terms of MIT license