Tag content of source file can be visualized in the Editor by clicking Expand tags under the heading in the menu and edited by clicking F2.
Important
Issues with tags are a common cause of export errors. Always ensure tags and formatting are correct before exporting files by running quality assurance checks.
All elements have at least the following translatable attributes: title, aria-label, aria-valuetext.
Non standard html tags are:
-
ignored - separate paragraphs are created when is set to false.
-
Standard ignored html tags:
base,basefont,canvas,colgroup,frameset,link,object,script,style
-
-
Imported as single (non-translatable) inline tags when is true
Some elements have more attributes: <img> has alt, but <a> does not as alt is not a standard attribute for <a>.
See HTML elements reference for standard attributes and use of HTML entities.
File Types
-
.HTML
-
.HTM
Import Options
-
-
Converts tags which are not defined as HTML. If there are invalid tags, the system skips them and parses the rest.
Examples:
<html> Test simple <unknown/>html.</html>
Non-HTML tags will be processed as Phrase tags and imported as Test simple
{1}html.<html> Test simple <% unknown %>html.</html>
Invalid variable (
<% unknown %>) causes the parsing to get broken. The broken part will be treated as a non-translatable and the editor will showTest simple, html. -
Imports standalone comments as tags. Inline comments (inside paragraphs) are always imported as tags.
-
Preserves the same whitespace structure in the target document (e.g. spacing, newlines, etc.).
-
All language declarations in the completed HTML files are updated with a proper language code.
-
Disables escaping of
<and&entities in completed files. -
A list of HTML tags separated by commas that should not be imported for translation.
-
Apply regular expressions to convert specified text to tags.
-
Enter a list of character entities that will be converted into tags during translation. These are exported to the translated document in the same form as they are present in the source file.
Example:
-
-
Automated detection is available only for
<meta>tags with the attributename="description"orname="keywords"andattributes="property". -
Use a regular expression to specify attributes in
<meta>tags to import their content. As an example, use(?:pan-keywords)to import meta tags with the attribute name="pan-keywords".
-
-
A list of HTML tags separated by commas that indicate translatable attributes.
-
A list of HTML tags separated by commas that indicate non-translatable inline elements.
-
A list of HTML tags separated by commas that indicate translatable inline elements.
-
Segmentation placeholders for HTML tags
Customize the internal placeholder characters used by the HTML subfilter during segmentation. These placeholders represent open, close, and single HTML tags internally so that SRX segmentation rules can identify them.
Changes affect segmentation only if the SRX rules explicitly reference these characters. This can alter translation memory (TM) leverage for new imports.
Accepted input formats:
-
Unicode code point in the format U+XXXX (for example,
U+E7FD) -
Simple character pasted directly into the field (for example, ␢)
If any of the three fields is left empty, the platform defaults for the corresponding placeholder are used:
-
:
U+E7FD -
:
U+E7FE -
:
U+E7FF
-