Best practices when making text/test documents for font testing
I recently posted a text file* to help people who want to quickly get a sense of how their font looks in a variety of languages.
But one of the people using it says they can't see all the glyphs. The format I uses windows linebreaks and Unicode 8 no BOM. I could have used just Unicode 8 or Unicode 16, or Unicode 16, No BOM, or Unicode 16 Little Endian, or Unicode 16 Little Endian, No BOM.
Any idea what line break type is going to be most compatible - and perhaps more importantly what Unicode encoding is best & why?
* The thread is here if you want to use the file or do a technical check:
http://typophile.com/node/31399
It is also here in v1.2:
| Attachment | Size |
|---|---|
| Languages at a glance.v1.2.txt | 49 KB |
| Languages at a glance.v1.2.pdf | 107.82 KB |



17.Oct.2007 10.08am
The most important thing is not to share this as TXT, since the application will need to guess the encoding or even worse: use the default. Make it an RTF and choose the encoding and everything will be fine.
17.Oct.2007 10.17am
Just some feedback:
I opened your file with several applications on Mac OSX (10.4).
Hope that is of use for somebody.
17.Oct.2007 10.49am
TextWrangler also opens the file correctly.
Textwrangler is free.
17.Oct.2007 10.50am
http://www.barebones.com/products/textwrangler/
17.Oct.2007 10.51am
On Mac OSX (10.4)
17.Oct.2007 10.58am
I think UTF-8 enjoys broader support than 16-bit Unicode. But Ralf is probably right in suggesting a format that can represent the encoding explicitly.
17.Oct.2007 11.30am
Thanks everybody. This was very helpful!
I will fix some errors that have come to light and re-post v1.1 in 3 formats: TXT RTF & PDF.
By the way; TextEdit can be made to use 'Unicode UTF-8' also. But you have to make a change in the preferences.
The other thing which I neglected to say and may be useful to know is that to test for Lingala ( one of the languages ) you must have extended Latin -B support in the font being tested. I would like to add more languages like this over time. Maybe then I can group them by Latin, Latin A & Latin B.
What do you think?
17.Oct.2007 2.56pm
The revised (v1.1) files are up. If people want the RTF they will have to email me until such a time as Typophile adds it as a supported file type in threads.
17.Oct.2007 3.05pm
Cool, thanks for posting.
I was wondering why "Type" is capitalized on the first line and then spotted a couple of other typos - "greared" & "Cyrilic".
Thx, Si
17.Oct.2007 10.38pm
Thanks Simon!
18.Oct.2007 9.36am
I have update both threads to have the latest version: 1.2