Word Counter
Crowdin counts words according to the following principles:
- A word is a combination of letters, punctuation marks, and special characters (e.g., @ # $ % ^ & * – _ ` ‘ “) followed by a space.
- A sequence of punctuation marks or special characters is not considered a word.
- By default, HTML tags are considered separate words for most formats, except the following ones: HTML, Front Matter HTML, HAML, MD, Front Matter MD, XML, WEBXML, IDML, XLIFF, XLIFF 2.0, ADOC, DOCX, MIF, DITA. You can change the default word count settings in your project’s Settings > Import > Word count.
- URLs (e.g., https://crowdin.com) and emails (e.g., support@crowdin.com) are considered one word.
- Hieroglyphs in Chinese, Japanese, and other hieroglyphic languages are counted as one word/hieroglyph. For example, “ライフ・イン・トウキョウ。” is counted as ten words.
Other examples of how words are counted:
String | Words |
---|---|
Number is -123.45 | 3 |
<a href=”{0}” target=”_parent”>here</a> | 1 / 7 (if non-HTML-based format is used) |
0 – 1 at 2 | 4 |
two-in-one | 1 |
2-in-one | 1 |
two-in-1 | 1 |
%file_type% | 1 |
hello?world | 1 |
hello ? world | 2 |
<span style="color: #333333;">☂ ☃ ☀⚤</span> | 0 |
© %company% | 1 |
01/01/1980 | 3 |
Monday, August 8, 2011 | 4 |
https://ka-graphie.example.com/6d8b.png | 1 |
Let’s look | 2 |
Let’s look | 3 (another type of apostrophe is used) |
Word(s) | 2 |
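The rules and examples above can be approximated with a short Python sketch. This is not Crowdin's actual implementation: the regular expressions, the separator set (whitespace, `/`, and parentheses), and the `count_words` name are assumptions chosen so that the sketch agrees with several rows of the table. It always strips tags, so it models only the HTML-based case, and it does not reproduce the alternative-apostrophe row.

```python
import re

# All patterns below are illustrative assumptions, not Crowdin's real rules.
HTML_TAG = re.compile(r"<[^>]+>")
URL_OR_EMAIL = re.compile(r"https?://\S+|[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
# Hiragana letters, Katakana letters (excluding the middle dot), and Han ideographs.
CJK_CHAR = re.compile(r"[\u3041-\u3096\u30a1-\u30fa\u4e00-\u9fff]")
# Assumed separators: whitespace, slash, parentheses.
SEPARATORS = re.compile(r"[\s/()]+")


def count_words(text: str) -> int:
    """Approximate word count for a string in an HTML-based format."""
    # Tags are dropped, as for HTML-based formats.
    text = HTML_TAG.sub(" ", text)
    # Each URL or email counts as one word.
    text, n_links = URL_OR_EMAIL.subn(" ", text)
    # Each CJK character counts as one word.
    text, n_cjk = CJK_CHAR.subn(" ", text)
    # A remaining token is a word only if it has at least one letter or digit,
    # so runs of punctuation or symbols are ignored.
    tokens = SEPARATORS.split(text)
    n_plain = sum(1 for t in tokens if any(ch.isalnum() for ch in t))
    return n_links + n_cjk + n_plain


print(count_words("Number is -123.45"))
print(count_words("ライフ・イン・トウキョウ。"))
```

Requiring a letter or digit in each token is what makes `hello ? world` count as two words and the symbol-only `span` example count as zero.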
Translatable HTML Attributes
When working with HTML-based file formats, some HTML attribute values are considered translatable while others are not, depending on the file structure. The table below lists the attributes and the situations in which their values are considered translatable.
Attribute | Details | Example |
---|---|---|
title | translatable if contained in any HTML tag | <p title="value">Text</p> |
src | translatable if contained in img, iframe, embed, video, audio, source*, track* tags (* only if nested in a parent video or audio tag) | <audio src="sound.ogg" controls>Text</audio> |
href | translatable if contained in the a tag | <a href="https://www.site.com">Site name</a> |
data | translatable if contained in object tag | <object data="image.jpg"></object> |
value | translatable if contained in input, button tags | <input value="Text"> |
placeholder | translatable if contained in input, textarea tags | <textarea placeholder="Enter some text here..."></textarea> |
alt | translatable if contained in img tag | <img src="image.jpg" alt="Image"> |
label | translatable if contained in optgroup, track* tags (* only if nested in a parent video or audio tag) | <optgroup label="Text"> |
content | translatable if contained in meta tag | <meta name="description" content="Text"> |
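To make the table concrete, here is a minimal Python sketch that applies these rules with the standard library's `html.parser`. It is an illustration only, not how Crowdin parses files: the `RULES` mapping, the `TranslatableAttrs` class, and the `translatable_attributes` helper are hypothetical names, and "nested in the parent tags" is assumed to mean any enclosing `video` or `audio` element.

```python
from html.parser import HTMLParser

# attribute -> tags where its value is translatable (None means any tag).
# Transcribed from the table above; the structure itself is an assumption.
RULES = {
    "title": None,
    "src": {"img", "iframe", "embed", "video", "audio", "source", "track"},
    "href": {"a"},
    "data": {"object"},
    "value": {"input", "button"},
    "placeholder": {"input", "textarea"},
    "alt": {"img"},
    "label": {"optgroup", "track"},
    "content": {"meta"},
}
MEDIA = {"video", "audio"}
NEEDS_MEDIA_PARENT = {"source", "track"}  # the starred tags in the table


class TranslatableAttrs(HTMLParser):
    def __init__(self):
        super().__init__()
        self.media_depth = 0  # number of currently open video/audio tags
        self.found = []       # (tag, attribute, value) tuples

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name not in RULES or not value:
                continue
            allowed = RULES[name]
            if allowed is not None:
                if tag not in allowed:
                    continue
                # Starred tags qualify only inside video/audio.
                if tag in NEEDS_MEDIA_PARENT and self.media_depth == 0:
                    continue
            self.found.append((tag, name, value))
        if tag in MEDIA:
            self.media_depth += 1

    def handle_endtag(self, tag):
        if tag in MEDIA and self.media_depth > 0:
            self.media_depth -= 1


def translatable_attributes(html: str):
    parser = TranslatableAttrs()
    parser.feed(html)
    parser.close()
    return parser.found


sample = '<img src="image.jpg" alt="Image"><source src="a.ogg">'
print(translatable_attributes(sample))
```

In the sample, the `img` attributes are reported, while the `source` tag is skipped because it is not inside a `video` or `audio` element.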