Import/Export for Common Formats #29

@lgessler

Introduction

Preliminary interactions with users indicate they expect to be able to import and export their data in commonly used formats, such as those of ELAN and FLEx: ELAN Annotation Format (EAF) and FieldWorks XML (FWXML), respectively. Both formats are common in day-to-day workflows, and some language archives also use them as deposit formats.

Background

These two formats are the most widely used. It's not clear to me that we need to support other formats immediately, but we should look into it further by talking to users. Some others that come to mind:

  • Toolbox, if it has its own format.
  • Plaintext: I think users will expect a formatted whitespace representation of at least interlinear data.
  • LaTeX interlinear package formats (gb4e, linguex): a nice convenience.
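To make the plaintext and LaTeX options above more concrete, here is a minimal sketch of what such exporters might look like. The `InterlinearSentence` shape and the function names are assumptions made up for illustration, not the project's actual data model, and the sample sentence is only an example. The gb4e output assumes the package's usual `\gll` / `\glt` commands inside an `exe` environment.

```typescript
// Hypothetical shape for one interlinear sentence; not the project's actual data model.
interface InterlinearSentence {
  words: string[];
  glosses: string[];
  translation: string;
}

// Pad with spaces on the right up to the given width.
function padRight(text: string, width: number): string {
  let out = text;
  while (out.length < width) {
    out += " ";
  }
  return out;
}

// Whitespace-aligned plaintext: each word and its gloss share a column.
// Widths use string length, so combining characters may throw alignment off.
function toPlaintext(s: InterlinearSentence): string {
  const columns = Math.max(s.words.length, s.glosses.length);
  const wordCells: string[] = [];
  const glossCells: string[] = [];
  for (let i = 0; i < columns; i++) {
    const word = s.words[i] || "";
    const gloss = s.glosses[i] || "";
    const width = Math.max(word.length, gloss.length);
    wordCells.push(padRight(word, width));
    glossCells.push(padRight(gloss, width));
  }
  const stripTrailing = (line: string): string => line.replace(/\s+$/, "");
  return [
    stripTrailing(wordCells.join("  ")),
    stripTrailing(glossCells.join("  ")),
    "'" + s.translation + "'",
  ].join("\n");
}

// gb4e example using \gll for the two aligned lines and \glt for the translation.
// Escaping of LaTeX special characters (%, &, _, #, ...) is left out of this sketch.
function toGb4e(s: InterlinearSentence): string {
  return [
    "\\begin{exe}",
    "\\ex",
    "\\gll " + s.words.join(" ") + "\\\\",
    s.glosses.join(" ") + "\\\\",
    "\\glt `" + s.translation + "'",
    "\\end{exe}",
  ].join("\n");
}

// Illustrative sentence only.
const sample: InterlinearSentence = {
  words: ["los", "gatos", "duermen"],
  glosses: ["the.PL", "cat-PL", "sleep.3PL"],
  translation: "The cats sleep.",
};
console.log(toPlaintext(sample));
console.log(toGb4e(sample));
```

Both functions work at sentence granularity, which keeps them easy to compose into document-level exports later if that turns out to be the right unit.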

Currently, we do not support any import, and we only support export of the raw JS representation of a document.

Problem

We should add import and/or export support for these formats, prioritizing according to demand. Some decisions that need to be made:

  1. Where and how in the code should this functionality be implemented? It could take place on the backend, or purely in the frontend.
  2. For each import/export function, what is the appropriate level of granularity (e.g. project, document, or perhaps even sentence)?
  3. For more specialized format requirements, would it be sufficient to support a core set of formats and expect users to write their own code, separate from our system, to meet their formatting needs?
  4. If the answer to the previous question is "no", is it worth investing in some kind of bring-your-own-code solution (e.g. in JavaScript) for users who want to write their own import/export functionality?
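One possible way to frame questions 1, 2, and 4 together is a small registry of format handlers, where built-in and user-supplied converters share one interface and each declares the granularity it operates on. The sketch below is only an illustration of that idea. Every name in it (`FormatHandler`, `FormatRegistry`, `Granularity`, the `"json"` id) is hypothetical, and nothing here reflects existing code.

```typescript
// All names below are hypothetical; none of them come from the existing codebase.
type Granularity = "project" | "document" | "sentence";

interface FormatHandler {
  id: string; // e.g. "eaf", "fwxml", "plaintext"
  granularity: Granularity; // the unit this handler reads or writes
  // Either direction is optional, so a format can be export-only or import-only.
  exportData?: (data: unknown) => string;
  importData?: (text: string) => unknown;
}

class FormatRegistry {
  private handlers: { [id: string]: FormatHandler } = {};

  // Register a built-in or user-supplied handler; duplicate ids are rejected.
  register(handler: FormatHandler): void {
    if (Object.prototype.hasOwnProperty.call(this.handlers, handler.id)) {
      throw new Error('Format "' + handler.id + '" is already registered');
    }
    this.handlers[handler.id] = handler;
  }

  exportAs(id: string, data: unknown): string {
    const handler = this.handlers[id];
    if (!handler || !handler.exportData) {
      throw new Error('No exporter registered for "' + id + '"');
    }
    return handler.exportData(data);
  }

  importFrom(id: string, text: string): unknown {
    const handler = this.handlers[id];
    if (!handler || !handler.importData) {
      throw new Error('No importer registered for "' + id + '"');
    }
    return handler.importData(text);
  }

  // List the ids of formats that support the given direction.
  list(direction: "import" | "export"): string[] {
    return Object.keys(this.handlers).filter((id) => {
      const handler = this.handlers[id];
      return direction === "export" ? !!handler.exportData : !!handler.importData;
    });
  }
}

// A raw-representation handler, loosely analogous to today's raw export.
const registry = new FormatRegistry();
registry.register({
  id: "json",
  granularity: "document",
  exportData: (data) => JSON.stringify(data),
  importData: (text) => JSON.parse(text),
});
console.log(registry.list("export"));
```

Because handlers are plain objects holding functions, the same shape could run in the frontend or the backend, and a bring-your-own-code feature would amount to letting users call `register` with their own handler. Whether that is worth the sandboxing and support cost is exactly what question 4 asks.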

Proposal

FLEx and ELAN at the very least should be covered. As for the rest, TBD.

Child Issues

(List child issues here as they are created. This issue should not be closed until all child issues are closed.)

Metadata

Labels

epic: Describes a large or multifaceted piece of functionality. Has sub-issues.
