Introduction
Preliminary interactions with users indicate they expect to be able to import and export their data in commonly used formats, such as the export formats of ELAN and FLEx, ELAN Annotation Format (EAF) and FieldWorks XML (FWXML). Both of these formats are commonly used in day-to-day workflows, and are also the deposit format used by some language archives.
Background
These formats are the two most popular that are in use. It's not clear to me that we need to immediately support other formats, but we should look into it more by talking to users. Some other ones that come to mind
- Toolbox, if it has its own format.
- Plaintext: I think users will expect a formatted whitespace representation of at least interlinear data.
- LaTeX interlinear package formats (gb4e, linguex): a nice convenience.
Currently, we do not support any import, and we only support export of the raw JS representation of a document.
Problem
We should add import and/or export support for these formats, prioritizing according to demand. Some decisions that need to be made:
- Where and how in the code should this functionality be implemented? It could take place on the backend, or purely in the frontend.
- For each import/export function, what is the appropriate level of granularity (e.g. project, document, or perhaps even sentence)?
- For particular format requirements, would it be sufficient to support a core set of formats and expect users to write their own code, separate from our system, to accomplish the formatting need they have?
- If the answer to the former is "no", is it worth investing in some kind of bring-your-own-code (e.g. in JavaScript) solution for users who want to produce their own export/import functionality?
Proposal
FLEx and ELAN at the very least should be covered. As for the rest, TBD.
Child Issues
(List child issues here as they are created. This issue should not be closed until all child issues are closed.)
Introduction
Preliminary interactions with users indicate they expect to be able to import and export their data in commonly used formats, such as the export formats of ELAN and FLEx, ELAN Annotation Format (EAF) and FieldWorks XML (FWXML). Both of these formats are commonly used in day-to-day workflows, and are also the deposit format used by some language archives.
Background
These formats are the two most popular that are in use. It's not clear to me that we need to immediately support other formats, but we should look into it more by talking to users. Some other ones that come to mind
Currently, we do not support any import, and we only support export of the raw JS representation of a document.
Problem
We should add import and/or export support for these formats, prioritizing according to demand. Some decisions that need to be made:
Proposal
FLEx and ELAN at the very least should be covered. As for the rest, TBD.
Child Issues
(List child issues here as they are created. This issue should not be closed until all child issues are closed.)