Two metrics for attributed hypergraphs
Introduction
Graph models have long been ubiquitous tools for investigating systems of related objects. Given their versatility and effectiveness, novel, more powerful algorithms for efficient graph processing are continuously sought-after. It is often assumed that the relationship between such objects is pairwise, but many complex systems exhibit higher-order relations. By limiting the model of a system to the constraints of a graph-based one an information loss is inevitabile. On the other hand, hypergraphs, a generalization of graphs where the relationships can contain an arbitrary number of vertices, exhibit a much broader modelling capability. Their group relationships grant more expressiveness, which is more suitable for analytical purposes.
Nowadays, in many applied fields the use of hyper-graphical models to give an abstract representation of objects is well-established. Besides performing analytics on single instances, especially in fields such as computer vision, it is crucial to match and recognize different instances, and a lot of work has been put into developing novel, robust algorithms to perform these tasks. However, once we assign an abstract hyper-graphical representation to the objects to be processed we can reduce those problems to the problem of hypergraph matching.
One of the main applications of hypergraphs is the computer-aided design of very-large-scale integration (VLSI CAD, see [11]), i.e. the process of combining hundreds of thousands of transistors or devices into a single chip to create an integrated circuit using algorithms run by a computer.
Computer vision has also drawn a lot of benefits by employing hypergraph models on images in many applications. Motion segmentation [8], [15], matching [12], [13] and object detection [24] are among the most widespread tasks subject to active research, and many techniques for image clustering based on hypergraphs have also been developed [1], [25]. A lot of work has been done to find ways to embed hypergraphs in low-dimensional vector spaces, as their full representation is very sparse and high-dimensional. In [7] a hypergraph is leveraged to depict the attribute relations in the data, and an embedding scheme is proposed to perform clustering. In [20] the authors present a classification framework where low-level features of hyperspectral images are extracted as hypergraphs, which are embedded in a low dimensional space and fed to a SVM for classification. Many algorithms fail to keep the high-order relations in the embedding process. In [22] an algorithm to overcome this limitation is given. A powerful theoretical result for clustering -regular hypergraphs was proven in Rota Bulò and Pelillo [18] for which an efficient heuristic based on a discrete-time high-order replicator dynamics can be used to perform the optimization.
Clearly, in order to assess the fitness of a match we must be able to tell how similar, or rather how different, two objects are. Many applications depend upon some properties of this distance measure , and it is often desirable that it fulfills all the following properties: nonnegativity (, identity (), simmetry () and the triangle inequality (). If it is the case, then the distance is called a metric and we are able to use it to perform numerical comparisons because it establishes a partial order over the set of objects. Moreover, if the distance is a metric then the set of all hypergraphs endowed with such distance is a metric space. This is relevant from a theoretical point of view because it enables the notion of convergence in such space, and allows us to interpret hypergraphs as points of a metric space. On the other hand, the importance of metrics has been clearly put forward in Chávez et al. [4], which shows how searching among unstructured data, a fundamental problem in modern applications, is possible if the set of objects can be given a metric structure. By the same token, another key benefit of the metric property is that it overcomes the necessity of performing embedding on hypergraphs, which is an expensive, lossy operation.
In this paper, we propose two metrics for attributed hypergraphs which naturally generalize the structure-based approach introduced in Hidović and Pelillo [6] in the context of graphs. These have been shown to be robust with respect to both structural distortions and perturbation of node attributes. The proposed metrics are based on the maximal similarity common subhypergraph of two hypergraphs.
Section snippets
Related work
The problem of defining a distance between hypergraphs arise indirectly from the need of establishing a distance between partitions in different areas. In the context of shape recognition, [1] gives a two-step algorithm based on hypergraph distance to solve the clustering problem in domains where the affinity relations are not pairwise. In the context of hypergraph matching, represented as a convex optimization problem, a different approach is presented in Zass and Shashua [23]. Under a
Hypergraph definitions
A hypergraph is a pair , where is the set of vertices and:is the set of hyperedges. We refer to the number of its vertices as its order and to the number of its hyperedges as its size. Sometimes, given a hypergraph , we will refer to its order with . Two vertices are called adjacent if there exists an hyperedge such that and . If , i.e. every nonempty subset of belongs to , then is called a complete hypergraph. Of course,
Metrics for hypergraphs
The problem of matching relational structures, a generalization of traditional graphs, is of pivotal significance in computer vision and pattern recognition. In [16] a unifying framework to solve the problem by a reformulation thereof is proposed, which consists of finding the maximum clique in the association graph of the two graphs representing the structures. Many of these matching problems arising in computer vision are between relational structures endowed with a hierarchical organization,
Conclusions and future works
We have presented two generalizations of normalized metrics for vertex-attributed graphs to the case of vertex-attributed hypergraphs, both of which are derived from the maximum similarity common subhypergraph and the similarity between the pairs of vectors of attributes made correspondent by the subhypergraph isomorphism, which induces said common subhypergraph. They differ by their normalizing elements, which are the maximum of the sizes of the two hypergraphs in the first one and the size of
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References (25)
- et al.
A graph distance metric based on the maximal common subgraph
Pattern Recognit. Lett.
(1998) - et al.
Hypergraph isomorphism using association hypergraphs
Pattern Recognit. Lett.
(2019) - et al.
Graph distances using graph union
Pattern Recognit. Lett.
(2001) - et al.
Beyond pairwise clustering
2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05)
(2005) - et al.
Theoretical and algorithmic framework for hypergraph matching
International Conference on Image Analysis and Processing
(2005) - et al.
Searching in metric spaces
ACM Comput. Surv.
(2001) - et al.
A survey of graph edit distance
Pattern Anal. Appl.
(2009) - et al.
Metrics for attributed graphs based on the maximal similarity common subgraph
Int. J. Pattern Recognit. Artif. Intell.
(2004) - et al.
Learning hypergraph-regularized attribute predictors
2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
(2015) - et al.
Video object segmentation by hypergraph cut
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
(2009)
On Marczewski–Steinhaus type distance between hypergraphs
Appl. Math.
Algorithm 69. Two distances between hypergraphs
Appl. Math.
Cited by (4)
Editorial paper for pattern recognition letters VSI on advances in graph-based recognition for pattern recognition
2022, Pattern Recognition LettersHypergraph co-optimal transport: metric and categorical properties
2023, Journal of Applied and Computational Topology