The ChEMBL-og - Open Data For Drug Discovery: A sort of H-index for the coverage of bioactivity databases
peter.suber's bookmarks 2013-08-26
Summary:
"So here's a little idea about quantifying the coverage/diversity of the contents of a bioactivity database (like ChEMBL, but also the internal knowledge of a company in its screening and lead optimisation programs, etc). Essentially, it's applying the H-index, regularly used for citation analysis, to bioassay results. There's a lot of criticism of the H-index in its use for comparing researchers, and plenty of problems in cross-field comparison, but that is not for here. However, the H-index is a pretty robust statistic capturing the structure of a frequency-class distribution...."
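
To make the idea concrete, here is a minimal sketch of an H-index computed over a frequency-class distribution. The excerpt does not say which classes the original post counts, so the choice below (one class per target, with a count of bioactivity records per target) is an assumption, and the names `h_index`, `class_h_index`, and the sample labels are hypothetical.

```python
from collections import Counter


def h_index(counts):
    """Return the largest h such that at least h classes have a count >= h."""
    ranked = sorted(counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break  # counts only get smaller from here on
    return h


def class_h_index(labels):
    """H-index of the frequency distribution of a list of class labels."""
    return h_index(Counter(labels).values())


# Hypothetical data: one label per bioactivity record, naming its target.
record_targets = ["A", "A", "A", "B", "B", "C"]

print(h_index([10, 8, 5, 4, 3]))      # 4: four classes have at least 4 records
print(class_h_index(record_targets))  # 2: two targets have at least 2 records
```

Under this reading, a database with many records concentrated on a handful of targets scores low, while one with many well-populated targets scores high.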