Splits a character vector based on a tag. The function tokenizes MDA-tagged texts by splitting on each space that is not followed by an <MDA> tag. By default, it also works on _ST tags.
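The splitting rule described above can be sketched with a negative lookahead. This is only an illustration in Python, not the function's actual implementation: the tag shape (an angle-bracketed label such as <PRIV> placed after the word it annotates) and the names TAG, SPLIT and split_on_untagged_spaces are assumptions made for the example.

```python
import re

# Assumed tag shape: an angle-bracketed label with no spaces, e.g. <PRIV>.
# The real tag format may differ; adjust this pattern accordingly.
TAG = r"<[^<>\s]+>"

# Match a single space only when it is NOT immediately followed by a tag.
SPLIT = re.compile(r" (?!" + TAG + r")")

def split_on_untagged_spaces(text):
    """Split text on spaces, keeping each tag attached to the preceding word."""
    return [tok for tok in SPLIT.split(text) if tok]

tokens = split_on_untagged_spaces("I think <PRIV> that <THVC> it works")
print(tokens)
```

Because the space before a tag is never a split point, each tag stays in the same token as the word it follows, while every other space acts as a token boundary.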