Skip to content
GitLab
Menu
Projects
Groups
Snippets
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
Tomaž Erjavec
siIUS
Commits
659bba1c
Commit
659bba1c
authored
Feb 04, 2020
by
Tomaž Erjavec
Browse files
Fix readme.
parent
41c6afab
Changes
1
Hide whitespace changes
Inline
Side-by-side
README.md
View file @
659bba1c
...
@@ -3,12 +3,18 @@
...
@@ -3,12 +3,18 @@
This project contains the siIUS corpus containing historical Slovene
This project contains the siIUS corpus containing historical Slovene
legal texts. The complex TEI encoding of the source siIUS digital
legal texts. The complex TEI encoding of the source siIUS digital
library is here simplified, and its text tokenised.
library available from https://dihur.si/si-ius/ is here simplified,
and its text tokenised.
The complete Git sources are stored in the gitignored GitGroup/ directory,
just the TEI XML documents in the DARIAH/ directory,
the conversion scripts in the bin/ directory (and the Makefile),
and the output corpus files in the CLARIN/ directory.
As further work it will be PoS tagged, and lemmatised with annotated
As further work it will be PoS tagged, and lemmatised with annotated
term candidates, converted to vertical file and made available under
term candidates, converted to vertical file and made available under
the CLARIN.SI concordancers.
the CLARIN.SI concordancers.
The compilation of the siIUS digital library
wa
s supported by
The compilation of the siIUS digital library
i
s supported by
DARIAH-SI, and the compliation of the siIUS annotated corpus by
DARIAH-SI, and the compliation of the siIUS annotated corpus by
CLARIN.SI.
CLARIN.SI.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment