Here is the link to another project now available on the internet: a digital version of Du Cange’s Glossarium mediae et infimae latinitatis.
Under the heading “documentation”, they explain how they encoded the text. They also discuss fragments to some extent, since some of the entries contain quotations. They describe the difficulties that arise, for instance, with quotations in prose versus those from poetry, with interrupted quotations, and with quotations that are too short.