2019-12-06 22:03 UTC

View Issue Details Jump to Notes ]
IDProjectCategoryView StatusLast Update
0000440NetSurf[All Projects] Generalpublic2015-03-10 11:04
ReporterJohn-Mark Bell 
Assigned ToVincent Sanders 
SeverityminorReproducibilityalways 
StatusclosedResolutionfixed 
Product Version 
Target VersionFixed in Version3.2 
Summary0000440: Fetching of file:/ URLs ignores encoding
DescriptionFetching of file:/ URLs fails to convert the path to the current system local encoding. This generates an inconsistency in the way in which local files are identified.
In the case of a link in a web page, the URL will be UTF-8 encoded.
In the case of a file loaded from disc (or from an URL/URI/IEURL file), the URL will be encoded in the system local encoding.
When fetching the URL, the encoding is ignored (as it's assumed to be correct).
If the file path contains top-bit-set characters (such as a hard-space), the UTF-8 encoded link from an HTML document will fail to fetch the file as it'll look for ...%c2%a0... instead of whatever the representation of hard-space actually is on the system at the time.
To fix this requires a couple of steps:

1) Make the fetching system assume that the URL is UTF-8 encoded (which is already the case in 99% of cases) and, in the case of file:/ URLs, convert the path to the system local encoding before the fetch occurs.

2) When loading a local file (or a reference to a local file), convert the URL to UTF-8 encoding.
Additional InformationImported from sourceforge bug http://sourceforge.net/support/tracker.php?aid=1403338 on Tue Dec 10 17:34:12 2013

TagsNo tags attached.
Fixed in CI build #
Reported in CI build #
URL of problem page
Attached Files

-Relationships
+Relationships

-Notes
Vincent Sanders

~0000369

Vincent Sanders (administrator)

Tested on RISC OS under emulation can now happily navigate to file paths with various encoded characters in them including hard spaces.
Vincent Sanders

~0000706

Vincent Sanders (administrator)

Confirmed fixed in 3.2 release
+Notes

-Issue History
Date Modified Username Field Change
2013-12-10 17:41 Vincent Sanders New Issue
2013-12-10 17:41 Vincent Sanders Status new => assigned
2013-12-10 17:41 Vincent Sanders Assigned To => Sourceforge Import placeholder
2013-12-23 13:57 Vincent Sanders Assigned To Sourceforge Import placeholder =>
2013-12-23 13:57 Vincent Sanders Status assigned => confirmed
2013-12-23 13:57 Vincent Sanders Resolution no change required => open
2013-12-23 13:57 Vincent Sanders Description Updated View Revisions
2013-12-23 13:57 Vincent Sanders Additional Information Updated View Revisions
2014-06-05 21:20 Vincent Sanders Note Added: 0000369
2014-06-05 21:20 Vincent Sanders Assigned To => Vincent Sanders
2014-06-05 21:20 Vincent Sanders Reproducibility have not tried => always
2014-06-05 21:20 Vincent Sanders Status confirmed => resolved
2014-06-05 21:20 Vincent Sanders Resolution open => fixed
2014-06-05 21:20 Vincent Sanders Fixed in Version => 3.2
2015-03-10 11:04 Vincent Sanders Note Added: 0000706
2015-03-10 11:04 Vincent Sanders Status resolved => closed
+Issue History