Using Binary Strings for Comparing Products from Software-intensive Systems Product Lines

The volume, variety and velocity of products in software-intensive systems product lines is increasing. One challenge is to understand the range of similarity between products to evaluate its impact on product line management. This paper contributes to product line management by presenting a product similarity evaluation process in which (i) a product configured from a product line feature model is represented as a weighted binary string (ii) the overall similarity between products is compared using the Jaccard Coefficient similarity metric (iii) the significance of individual features and feature combinations to product similarity is explored by modifying the weights. We propose a method for automatically allocating weights to features depending on their position in a product line feature model, although we do not claim that this allocation method nor the use of the Jaccard Coefficient is optimal. We illustrate our ideas with mobile phone worked examples.

[1]  Konrad Rieck,et al.  Harry: A Tool for Measuring String Similarity , 2016, J. Mach. Learn. Res..

[2]  Juha Kuusela,et al.  Developing platforms for multiple software product lines , 2012, SPLC '12.

[3]  Heiko Paulheim,et al.  A machine learning approach for product matching and categorization , 2018, Semantic Web.

[4]  Robyn R. Lutz,et al.  The Role of Similarity in Detecting Feature Interaction in Software Product Lines , 2018, 2018 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW).

[5]  H. Yazdani,et al.  New similarity functions , 2016, 2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR).

[6]  Matthew A. Jaro,et al.  Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida , 1989 .

[7]  Eduardo Figueiredo,et al.  A Method to Derive Metric Thresholds for Software Product Lines , 2015, 2015 29th Brazilian Symposium on Software Engineering.

[8]  Pierre-Yves Schobbens,et al.  Search-based Similarity-driven Behavioural SPL Testing , 2016, VaMoS.

[9]  C. Raasch,et al.  The Choice between Uniqueness and Conformity in Mass Customization , 2018, R&D Management.

[10]  Jingpeng Li,et al.  Improved binary similarity measures for software modularization , 2017, Frontiers of Information Technology & Electronic Engineering.

[11]  Sascha El-Sharkawy,et al.  Metrics for analyzing variability and its implementation in software product lines: A systematic literature review , 2019, Inf. Softw. Technol..

[12]  Alexander Egyed,et al.  A systematic mapping study of search-based software engineering for software product lines , 2015, Inf. Softw. Technol..

[13]  Gunter Saake,et al.  Effective product-line testing using similarity-based product prioritization , 2016, Software & Systems Modeling.

[14]  G. Walsh,et al.  Measuring consumer vulnerability to perceived product-similarity problems and its consequences , 2010 .

[15]  Plavini Punyatoya Consumer Evaluation of Brand Extension for Global and Local Brands: The Moderating Role of Product Similarity , 2013 .

[16]  He Jiang,et al.  Search Based Software Engineering , 2012, Lecture Notes in Computer Science.

[17]  H. ElMaraghy,et al.  Product family formation for reconfigurable assembly systems , 2014 .

[18]  C. Tappert,et al.  A Survey of Binary Similarity and Distance Measures , 2010 .

[19]  Product Similarity and Cross-Price Elasticity , 2018 .

[20]  Onder Coban,et al.  A Comparison of Similarity Metrics for Sentiment Analysis on Turkish Twitter Feeds , 2015, 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity).

[21]  Sergio Segura,et al.  Automated analysis of feature models 20 years later: A literature review , 2010, Inf. Syst..

[22]  Hermann Kaindl,et al.  A Feature-Similarity Model for Product Line Engineering , 2015, ICSR.

[23]  Yan Li,et al.  Enabling automated requirements reuse and configuration , 2019, Software & Systems Modeling.

[24]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[25]  Tomasz F. Stepinski,et al.  On using landscape metrics for landscape similarity search , 2016 .

[26]  Sergio Segura,et al.  A Comparison of Test Case Prioritization Criteria for Software Product Lines , 2014, 2014 IEEE Seventh International Conference on Software Testing, Verification and Validation.

[27]  Hermann Kaindl,et al.  Using similarity metrics for mining variability from software repositories , 2014, SPLC '14.

[28]  Shahliza Abd Halim,et al.  An experiment of different similarity measures on test case prioritization for software product lines , 2017 .

[29]  V. Mitchell,et al.  Marketing causes and implications of consumer confusion , 1999 .

[30]  Han M. Shih Product structure (BOM)-based product similarity measures using orthogonal procrustes approach , 2011, Comput. Ind. Eng..

[31]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[32]  Bin Ma,et al.  On the similarity metric and the distance metric , 2009, Theor. Comput. Sci..

[33]  Zibin Zheng,et al.  Configuring Software Product Lines by Combining Many-Objective Optimization and SAT Solvers , 2018, ACM Trans. Softw. Eng. Methodol..

[34]  P. Jaccard Distribution de la flore alpine dans le bassin des Dranses et dans quelques régions voisines , 1901 .