Unicode data in response #29

naro · 2014-06-11T09:40:51Z

I'm trying to write a test against a HTML file containing some non ascii characters. Very simple example is this file with a non-ascii dash between words Hello, World:

<!DOCTYPE html>
<html>
<head>
  <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body>
  Hello — world
</body>
</html>

Running this test case fails with UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 140: ordinal not in range(128) error message:

*** Settings ***

Library    HttpLibrary.HTTP

*** Test Cases ***

Test unicode
    Create Http Context  supl.cz  http
    GET  /page2.html
    Response Body Should Contain  Hello

Testing a page without a dash is fine (note page1.html instead of page2.html)

*** Settings ***

Library    HttpLibrary.HTTP

*** Test Cases ***

Test unicode
    Create Http Context  supl.cz  http
    GET  /page1.html
    Response Body Should Contain  Hello

Using a dash character in test script seems to be fine (the following test is expected to fail, because page1.html does not contain "Hello — world" but contains "Hello world"):

*** Settings ***

Library    HttpLibrary.HTTP

*** Test Cases ***

Test unicode
    Create Http Context  supl.cz  http
    GET  /page1.html
    Response Body Should Contain  Hello — world

It looks like we need support for decoding response body to unicode so it can be compared to unicode strings.

The text was updated successfully, but these errors were encountered:

naro · 2014-06-11T11:16:12Z

It seems adding a new method
response_text_should_contain which would use exactly the same code as response_body_should_contain except checking self.response.text instead of self.response.body would solve this issue.
'body' is bytes string, 'text' is unicode string, which is what I'm looking for.

naro · 2014-06-11T11:21:37Z

and also get_response_text would be helpful in that case :)

blunttester · 2017-03-28T09:15:41Z

If we have an unicode character in the url (e.g. é) httplibrary GET is not able to be completed either. So it is not only handling the response.body where the error occurs.

peritus self-assigned this Jun 11, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unicode data in response #29

Unicode data in response #29

naro commented Jun 11, 2014

naro commented Jun 11, 2014

naro commented Jun 11, 2014

blunttester commented Mar 28, 2017

Unicode data in response #29

Unicode data in response #29

Comments

naro commented Jun 11, 2014

naro commented Jun 11, 2014

naro commented Jun 11, 2014

blunttester commented Mar 28, 2017