This is not currently part of the peer-reviewed material of the project. Do not cite as a research publication.
This month I made some minor updates to the variant lemmatising form, including a button to look up word forms in ONP that have not previously been recorded in the database.
I also worked a bit with ONP's data on compounds and links to dictionaries/glosses with a view to incorporating and / or using the information in LP.
Progress at 28/8/17
|Stanzas in corpus:||5797|
|Stanzas entered in database:||4845||(83.6%)|
|Words in corpus:||150501|
|Stanzas with lexical variants:||752||(13.0%)|
|Lexical variants added:||6120|
|Lexical variants lemmatised:||5010||(81.9%)|
|Headwords linked to corpus:||14268|