Load the libraries you’ll use, and create a digital text corpus as explained in this tutorial, and tokenize the text (as explained in this tutorial../tokenize)).
import edu.holycross.shot.cite._
import edu.holycross.shot.ohco2._
import edu.holycross.shot.mid.latinmodel._
import edu.holycross.shot.latin._
val catalog = "editions/catalog.cex"
val citation = "editions/citation.cex"
val editions = "editions"
val repo = TextRepositorySource.fromFiles(catalog,citation,editions)
Select a text identified by URN, tokenize this token, and read it. The example below chooses a diplomatic reading.
val urn = CtsUrn("urn:cts:chant:antiphonary.einsiedeln121.text_xml:")
val eins121 = repo.corpus ~~ urn
val tokens = LatinTeiReader.fromCorpus(eins121)
val diplomaticTokens = tokens.map(_.readWithDiplomatic)
val alphabet = edu.holycross.shot.latin.Latin23Alphabet
val diplomaticLC = diplomaticTokens.map(_.text.toLowerCase).map(_.replaceAll("v", "u"))
val syllables = for (diplToken <- diplomaticLC) yield {
val latinString = LatinString(diplToken, alphabet)
latinString.syllabify
}