Complexity issues in Vertex-Colored Graph Pattern Matching

Abstract : Searching for motifs in graphs has become a crucial problem in the analysis of biological networks. In the context of metabolic network analysis, Lacroix et al [V. Lacroix, C.G. Fernandes and M.-F. Sagot, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 3 (2006), no. 4, 360368 ] introduced the NP-hard general problem of nding occurrences of motifs in vertexcolored graphs, where a motif M is a multiset of colors and an occurrence of M in a vertex-colored graph G, called the target graph, is a subset of vertices that induces a connected graph and the multiset of colors induces by this subset is exactly the motif. Pursuing the line of research pioneered by Lacroix et al. and aiming at dealing with approximate solutions, we consider in this paper the above-mentioned problem in two of its natural optimization forms, referred hereafter as the Min- CC and the Maximum Motif problems. The Min CC problem seeks for an occurrence of a motif M in a vertex-colored graph G that induces a minimum number of connected components whereas the Maximum Motif problem is concerned with nding a maximum cardinality submotif M0 M that occurs as a connected motif in G. We prove the Min-CC problem to be APX-hard even in the extremal case where the motif is a set and the target graph is a path. We complement this result by giving a polynomial-time algorithm in case the motif is built upon a xed number of colors and the target graph is a path. Also, extending [M. Fellows, G. Fertin, D. Hermelin, and S. Vialette, Proc. 34th International Colloquium on Automata, Languages and Programming (ICALP), Lecture Notes in Computer Science, vol. 4596, Springer, 2007, pp. 340351 ], we prove the Min-CC problem to be xed-parameter tractable when parameterized by the size of the motif, and we give a faster algorithm in case the target graph is a tree. Furthermore, we prove the Min-CC problem for trees not to be approximable within ratio c log n for some constant c > 0, where n is the order of the target graph, and to be W[2]-hard when parameterized by the number of connected components in the occurrence of the motif. Finally, we give an exact exponential-time algorithm for the Min-CC problem in case the target graph is a tree. We prove that the Maximum Motif problem is APX-hard even in the case where the target graph is a tree of maximum degree 3, the motif is actually a set and each color occurs at most twice in the tree. Next, we strengthen this result by proving that the problem is not approximable within factor 2log n, for any constant < 1, unless NP DTIME(2poly log n). We complement these results by presenting two xed-parameter algorithms for the problem, where the parameter is the size of the solution. Finally, we give exact exponential-time algorithms for this problem.
Liste complète des métadonnées

Cited literature [29 references]  Display  Hide  Download
Contributor : Guillaume Fertin <>
Submitted on : Tuesday, July 5, 2011 - 3:22:33 PM
Last modification on : Wednesday, May 23, 2018 - 3:44:02 PM
Document(s) archivé(s) le : Monday, November 12, 2012 - 10:10:37 AM


Files produced by the author(s)



Riccardo Dondi, Guillaume Fertin, Stéphane Vialette. Complexity issues in Vertex-Colored Graph Pattern Matching. Journal of Discrete Algorithms, Elsevier, 2011, 9 (1), pp.82-99. ⟨10.1016/j.jda.2010.09.002⟩. ⟨hal-00606154⟩



Record views


Files downloads