TSV has invalid format #1

@donkirkby

Thanks for publishing this resource. I think I might be able to build a learning tool from it.

When I go and look at dev-anno.001_zh.001_en.tsv, GitHub displays this warning:

We can make this file beautiful and searchable if this error is corrected: Illegal quoting in line 11.

Most of the other TSV files have the same warning, but dev-anno.002_zh.002_en.tsv doesn't, so you can see what the "beautiful and searchable" mode looks like.

The problem is that intertext2tsv.py builds each row with simple string concatenation and doesn't escape quotes. I'll create a pull request that switches it to the csv module.
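
For illustration, here is a minimal sketch of how Python's csv module handles this. It is not the actual code from intertext2tsv.py: the `write_tsv` helper and the sample sentence pair are hypothetical. The point is that `csv.writer` quotes any field containing a tab or a double quote and doubles the embedded quotes, so the row parses back cleanly.

```python
import csv
import io


def write_tsv(rows, out):
    """Write rows of strings as tab-separated values.

    csv.writer (default QUOTE_MINIMAL) wraps a field in double quotes
    only when it contains the delimiter, a quote character, or a line
    break, and doubles any embedded quote characters.
    """
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerows(rows)


# Hypothetical sentence pair whose fields contain double quotes.
rows = [['他说:"你好。"', 'He said, "Hello."']]

buffer = io.StringIO()
write_tsv(rows, buffer)

# Reading the text back recovers the original fields unchanged.
parsed = list(csv.reader(io.StringIO(buffer.getvalue()), delimiter="\t"))
assert parsed == rows
```

When writing to a real file rather than a `StringIO`, the csv documentation recommends opening it with `newline=""` so line endings are not translated twice.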
