TDM starts with breaking up text into units (or tokens) that get fed into the mining software. One year of our open access journal Chemical Science’s articles are available as space-separated ...