Bad Encoding Scenario

Select scenario:
Cyrillic German Mixed CP1252 Latin-1 Invalid UTF-8
Enable iframe (mixed encoding)
Challenge: The HTTP Content-Type header declares a wrong charset. Your scraper must detect the actual encoding and decode the content correctly.
Declared charset (header)utf-8
Actual encoding (body)iso-8859-1
ScenarioFrench/Spanish accented text as raw Latin-1 bytes but header declares charset=utf-8

Latin-1 Accented Text

Café, naïve, résumé – common French words.

Español: ¿Dónde está la biblioteca? ¡Hola mundo!

Português: A criança é muito inteligente.

À la carte, pièce de résistance, tête-à-tête.


All scenarios | Home