SCIENCE CHINA Information Sciences, Volume 59, Issue 11: 113101 (2016) https://doi.org/10.1007/s11432-015-0934-9

Similarity assessment for scientific workflow clustering and recommendation

  • Received: May 18, 2016
  • Accepted: Aug 22, 2016
  • Published: Oct 14, 2016


This article proposes a method for identifying and recommending scientific workflows for reuse and repurposing. A scientific workflow is represented as a layer hierarchy that captures the hierarchical relations among the workflow, its sub-workflows, and its activities. Semantic similarity is then computed between the layer hierarchies of workflows, and a graph-skeleton-based clustering technique groups the layer hierarchies into clusters. In each cluster, barycenters are identified that serve as its core workflows; they facilitate cluster identification as well as workflow ranking and recommendation with respect to scientists' requirements.
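The barycenter-based recommendation step can be illustrated with a minimal Python sketch. It is not the article's implementation. The nested-dictionary workflow encoding, the Jaccard measure over (depth, name) pairs (a stand-in for the semantic similarity), and the function names `flatten`, `similarity`, `barycenter`, and `recommend` are all assumptions made for illustration. Clusters are taken as given, so the graph-skeleton-based clustering itself is not reproduced. A barycenter is approximated here as the cluster member with the highest mean similarity to the other members.

```python
def wf(name, *children):
    """Build a hypothetical layer hierarchy: a workflow with sub-workflows/activities."""
    return {"name": name, "children": list(children)}


def flatten(hierarchy, depth=0):
    """Flatten a hierarchy into a set of (depth, name) pairs."""
    pairs = {(depth, hierarchy["name"])}
    for child in hierarchy.get("children", []):
        pairs |= flatten(child, depth + 1)
    return pairs


def similarity(a, b):
    """Jaccard similarity of flattened hierarchies (assumed stand-in measure)."""
    fa, fb = flatten(a), flatten(b)
    union = fa | fb
    return len(fa & fb) / len(union) if union else 1.0


def barycenter(cluster):
    """Return the member with the highest mean similarity to the other members."""
    if len(cluster) == 1:
        return cluster[0]

    def mean_similarity(w):
        others = [o for o in cluster if o is not w]
        return sum(similarity(w, o) for o in others) / len(others)

    return max(cluster, key=mean_similarity)


def recommend(query, clusters, top_k=3):
    """Pick the cluster whose barycenter best matches the query, then rank its members."""
    best = max(clusters, key=lambda c: similarity(query, barycenter(c)))
    ranked = sorted(best, key=lambda w: similarity(query, w), reverse=True)
    return [w["name"] for w in ranked[:top_k]]


# Toy data: two pre-computed clusters of made-up workflows.
align_a = wf("align_a", wf("fetch"), wf("align", wf("blast")))
align_b = wf("align_b", wf("fetch"), wf("align", wf("blast")), wf("plot"))
align_c = wf("align_c", wf("fetch"), wf("align", wf("blast")), wf("plot"), wf("export"))
stats_a = wf("stats_a", wf("load"), wf("regress"))
stats_b = wf("stats_b", wf("load"), wf("regress"), wf("report"))
clusters = [[align_a, align_b, align_c], [stats_a, stats_b]]

query = wf("query", wf("fetch"), wf("align", wf("blast")), wf("plot"))
top_two = recommend(query, clusters, top_k=2)
```

Comparing the query only against one barycenter per cluster keeps the first matching step proportional to the number of clusters rather than the number of workflows; the full ranking is then computed only inside the selected cluster.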



This work was supported in part by the National Natural Science Foundation of China (Grant Nos. 61379126, 61662021).


