Skip to content

Request for clarification(s) of export/import process #7

@JoanEliot

Description

@JoanEliot

Thank you for this very useful software.

After aligning two texts, is there a way to export them, as segmented, and then re-import one of them with its segmentation preserved, so that a previously un-aligned text can now be aligned with those segments?

I'm wanting to align several texts to one reference text. I realize that InterText can only align/display two texts at a time, but my project involves five texts each: one in the original language, three translations, plus a transliterated version for the language that is written in a non-Latin alphabet.

I want to use one text (the one in the original language) as a source or reference version, against which each of the three other texts is aligned. Each segment should contain one and only one paragraph of the source version of the text. So after importing the source language text and one of the translations, first I verified that all the segments are set up correctly in the source version. Then I aligned the other text to that source version. Now I want to align each additional translation to the source version as already segmented.

I've been unable to get that to work by any of the export/re-import commands I've tried. The closest I've come is by defining a custom export profile that omits all XML tags and simply separates paragraphs with empty lines, and sentences with line breaks. But when I re-import the exported source language file, along with another text I want to align with it. Paragraph breaks are preserved correctly, but each segment contains (at most) one sentence, not a whole paragraph. (For that matter, segments are lost even if I simply re-import both of the files I've just exported.)

The technique described in section 10.2 of the manual should work for aligning the translation of the version in the non-Latin script with its transliteration. Otherwise, though, aligning translations to each other rather than to the source language text would not be ideal for me.

I've looked for but haven't found an option that would automatically generate one and only one segment for each paragraph in one of the imported files. Any suggestions?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions