Commit 659bba1c authored by Tomaž Erjavec's avatar Tomaž Erjavec
Browse files

Fix readme.

parent 41c6afab
...@@ -3,12 +3,18 @@ ...@@ -3,12 +3,18 @@
This project contains the siIUS corpus containing historical Slovene This project contains the siIUS corpus containing historical Slovene
legal texts. The complex TEI encoding of the source siIUS digital legal texts. The complex TEI encoding of the source siIUS digital
library is here simplified, and its text tokenised. library available from https://dihur.si/si-ius/ is here simplified,
and its text tokenised.
The complete Git sources are stored in the gitignored GitGroup/ directory,
just the TEI XML documents in the DARIAH/ directory,
the conversion scripts in the bin/ directory (and the Makefile),
and the output corpus files in the CLARIN/ directory.
As further work it will be PoS tagged, and lemmatised with annotated As further work it will be PoS tagged, and lemmatised with annotated
term candidates, converted to vertical file and made available under term candidates, converted to vertical file and made available under
the CLARIN.SI concordancers. the CLARIN.SI concordancers.
The compilation of the siIUS digital library was supported by The compilation of the siIUS digital library is supported by
DARIAH-SI, and the compliation of the siIUS annotated corpus by DARIAH-SI, and the compliation of the siIUS annotated corpus by
CLARIN.SI. CLARIN.SI.
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment