Elsevier

Pattern Recognition Letters

Volume 149, September 2021, Pages 143-149
Pattern Recognition Letters

Two metrics for attributed hypergraphs

https://doi.org/10.1016/j.patrec.2021.06.007Get rights and content

Highlights

  • We present two novel (normalized) distances measures between attributed hypergraphs.

  • The measures are based on the notion of the maximal common subhypergraph.

  • The two measures are normalized and differ by their normalization criterion.

Abstract

Modern quantitative challenges require to tackle problems on increasingly complex systems in which the relationships between the comprised entities cannot be modelled in a simple pairwise fashion, as graphs do. Such approximation of higher-order relations may lead to a substantial loss of information, hence the need to use more general models than graphs. The most natural choice is to use hypergraphs, discrete structures able to capture k-adic relationships among the entities participating in the problem, modelled as vertices, by grouping them in non-empty sets which constitute the hyperedges of the hypergraph. Since one of the most desirable abilities in this context is to quantify the difference between two such high-order systems, devising distance metrics between hypergraphs becomes of the utmost importance. In this paper, we aim at tackling precisely this problem. Motivated by our previous work on graphs, we propose two distance measures between attributed hypergraphs and we prove that they satisfy the properties of a metric. Both metrics are based on the notion of the maximal common subhypergraph.

Introduction

Graph models have long been ubiquitous tools for investigating systems of related objects. Given their versatility and effectiveness, novel, more powerful algorithms for efficient graph processing are continuously sought-after. It is often assumed that the relationship between such objects is pairwise, but many complex systems exhibit higher-order relations. By limiting the model of a system to the constraints of a graph-based one an information loss is inevitabile. On the other hand, hypergraphs, a generalization of graphs where the relationships can contain an arbitrary number of vertices, exhibit a much broader modelling capability. Their group relationships grant more expressiveness, which is more suitable for analytical purposes.

Nowadays, in many applied fields the use of hyper-graphical models to give an abstract representation of objects is well-established. Besides performing analytics on single instances, especially in fields such as computer vision, it is crucial to match and recognize different instances, and a lot of work has been put into developing novel, robust algorithms to perform these tasks. However, once we assign an abstract hyper-graphical representation to the objects to be processed we can reduce those problems to the problem of hypergraph matching.

One of the main applications of hypergraphs is the computer-aided design of very-large-scale integration (VLSI CAD, see [11]), i.e. the process of combining hundreds of thousands of transistors or devices into a single chip to create an integrated circuit using algorithms run by a computer.

Computer vision has also drawn a lot of benefits by employing hypergraph models on images in many applications. Motion segmentation [8], [15], matching [12], [13] and object detection [24] are among the most widespread tasks subject to active research, and many techniques for image clustering based on hypergraphs have also been developed [1], [25]. A lot of work has been done to find ways to embed hypergraphs in low-dimensional vector spaces, as their full representation is very sparse and high-dimensional. In [7] a hypergraph is leveraged to depict the attribute relations in the data, and an embedding scheme is proposed to perform clustering. In [20] the authors present a classification framework where low-level features of hyperspectral images are extracted as hypergraphs, which are embedded in a low dimensional space and fed to a SVM for classification. Many algorithms fail to keep the high-order relations in the embedding process. In [22] an algorithm to overcome this limitation is given. A powerful theoretical result for clustering k-regular hypergraphs was proven in Rota Bulò and Pelillo [18] for which an efficient heuristic based on a discrete-time high-order replicator dynamics can be used to perform the optimization.

Clearly, in order to assess the fitness of a match we must be able to tell how similar, or rather how different, two objects are. Many applications depend upon some properties of this distance measure d, and it is often desirable that it fulfills all the following properties: nonnegativity (d(x,y)0), identity (d(x,y)=0x=y), simmetry (d(x,y)=d(y,x)) and the triangle inequality (d(x,z)d(x,y)+d(y,z)). If it is the case, then the distance is called a metric and we are able to use it to perform numerical comparisons because it establishes a partial order over the set of objects. Moreover, if the distance is a metric then the set of all hypergraphs endowed with such distance is a metric space. This is relevant from a theoretical point of view because it enables the notion of convergence in such space, and allows us to interpret hypergraphs as points of a metric space. On the other hand, the importance of metrics has been clearly put forward in Chávez et al. [4], which shows how searching among unstructured data, a fundamental problem in modern applications, is possible if the set of objects can be given a metric structure. By the same token, another key benefit of the metric property is that it overcomes the necessity of performing embedding on hypergraphs, which is an expensive, lossy operation.

In this paper, we propose two metrics for attributed hypergraphs which naturally generalize the structure-based approach introduced in Hidović and Pelillo [6] in the context of graphs. These have been shown to be robust with respect to both structural distortions and perturbation of node attributes. The proposed metrics are based on the maximal similarity common subhypergraph of two hypergraphs.

Section snippets

Related work

The problem of defining a distance between hypergraphs arise indirectly from the need of establishing a distance between partitions in different areas. In the context of shape recognition, [1] gives a two-step algorithm based on hypergraph distance to solve the clustering problem in domains where the affinity relations are not pairwise. In the context of hypergraph matching, represented as a convex optimization problem, a different approach is presented in Zass and Shashua [23]. Under a

Hypergraph definitions

A hypergraph is a pair H=(V,E), where V={1,,n} is the set of vertices and:Ek=1n(Vk)=P(V)is the set of hyperedges. We refer to the number of its vertices as its order and to the number of its hyperedges as its size. Sometimes, given a hypergraph H, we will refer to its order with |H|. Two vertices u,vV are called adjacent if there exists an hyperedge eE such that ue and ve. If E=k=1n(Vk), i.e. every nonempty subset of V belongs to E, then H is called a complete hypergraph. Of course,

Metrics for hypergraphs

The problem of matching relational structures, a generalization of traditional graphs, is of pivotal significance in computer vision and pattern recognition. In [16] a unifying framework to solve the problem by a reformulation thereof is proposed, which consists of finding the maximum clique in the association graph of the two graphs representing the structures. Many of these matching problems arising in computer vision are between relational structures endowed with a hierarchical organization,

Conclusions and future works

We have presented two generalizations of normalized metrics for vertex-attributed graphs to the case of vertex-attributed hypergraphs, both of which are derived from the maximum similarity common subhypergraph and the similarity between the pairs of vectors of attributes made correspondent by the subhypergraph isomorphism, which induces said common subhypergraph. They differ by their normalizing elements, which are the maximum of the sizes of the two hypergraphs in the first one and the size of

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References (25)

  • H. Bunke et al.

    A graph distance metric based on the maximal common subgraph

    Pattern Recognit. Lett.

    (1998)
  • G. Sandi et al.

    Hypergraph isomorphism using association hypergraphs

    Pattern Recognit. Lett.

    (2019)
  • W.D. Wallis et al.

    Graph distances using graph union

    Pattern Recognit. Lett.

    (2001)
  • S. Agarwal et al.

    Beyond pairwise clustering

    2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05)

    (2005)
  • H. Bunke et al.

    Theoretical and algorithmic framework for hypergraph matching

    International Conference on Image Analysis and Processing

    (2005)
  • E. Chávez et al.

    Searching in metric spaces

    ACM Comput. Surv.

    (2001)
  • X. Gao et al.

    A survey of graph edit distance

    Pattern Anal. Appl.

    (2009)
  • D. Hidović et al.

    Metrics for attributed graphs based on the maximal similarity common subgraph

    Int. J. Pattern Recognit. Artif. Intell.

    (2004)
  • S. Huang et al.

    Learning hypergraph-regularized attribute predictors

    2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

    (2015)
  • Y. Huang et al.

    Video object segmentation by hypergraph cut

    Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on

    (2009)
  • M. Karoński et al.

    On Marczewski–Steinhaus type distance between hypergraphs

    Appl. Math.

    (1977)
  • M. Karoński et al.

    Algorithm 69. Two distances between hypergraphs

    Appl. Math.

    (1980)
  • View full text