Unstructured documents can be converted into structured formats using the open-source annotation tool markup for NLP and ML applications like named-entity recognition. When you annotate, the markup learns to anticipate and recommend complicated annotations.