Chilin has many specialist databases in store. Two of them are recently launched on TAUS.
One dataset of contains 12,947 segments: 475,509 en-US words, and 401,629 zh-CN words. It is based on the CPC Patent Classification category A61K which covers pharmaceuticals.
The second dataset contains 10,377 segments: 379,898 en-US words, and 327,637 zh-CN words. It is based on the CPC Patent Classification C12N which contains many biotechnology filings.
Sample data is shown below.
Suitable buffers include boric acid, sodium and potassium bicarbonate, sodium and potassium borates, sodium and potassium carbonate, sodium acetate, sodium biphosphate and the like, in amounts sufficient to maintain the pH at between about pH 6 and pH 8, and preferably, between about pH 7 and pH 7.5.