Triclustering in Big Data Setting

doi:10.48550/arXiv.2010.12933

Triclustering in Big Data Setting

In this paper, we describe versions of triclustering algorithms adapted for efficient calculations in distributed environments with MapReduce model or parallelisation mechanism provided by modern programming languages. OAC-family of triclustering algorithms shows good parallelisation capabilities due to the independent processing of triples of a triadic formal context. We provide the time and space complexity of the algorithms and justify their relevance. We also compare performance gain from using a distributed system and scalability.

Publication:

arXiv e-prints

Pub Date:

October 2020

DOI:

10.48550/arXiv.2010.12933

arXiv:

arXiv:2010.12933

Bibcode:

2020arXiv201012933E

Keywords:

Computer Science - Distributed;
Parallel;
and Cluster Computing;
Computer Science - Machine Learning;
68T09;
05C65;
62H30;
I.5.3;
G.2.2;
I.2.6;
H.2.8;
D.1.3

E-Print:

The paper contains an extended version of the prior work presented at the workshop on FCA in the Big Data Era held on June 25, 2019 at Frankfurt University of Applied Sciences, Frankfurt, Germany

NASA/ADS

Triclustering in Big Data Setting

Abstract