1 Introduction

Coalition Structure Generation (\({\mathsf {CSG}}\)) [1, 18, 23] is a key issue for various applications related to multi-agent cooperation, e.g., distributed vehicle routing [25], waste-water treatment system [7], forming rescue groups in disaster areas [30] and multi-sensor networks [5]. \({\mathsf {CSG}}\) involves partitioning a set of agents into coalitions (where each coalition is a subset of the available set of agents) such that the social surplus is maximized. A partition is also called a coalition structure. In traditional \({\mathsf {CSG}}\), the value of a coalition is assumed to be given by a black box function called the characteristic function, and the value of a coalition structure is provided by the sum of the values of all coalitions. It is well-known that the \({\mathsf {CSG}}\) problem is equivalent to the complete set partitioning problem [33]. Various sophisticated algorithms have been proposed for the \({\mathsf {CSG}}\) problem, for instance, an anytime algorithm with a worst-case guarantee [24], a dynamic programming based algorithm [33], search/dynamic programming based anytime algorithms [19, 22, 27], and a hybrid algorithm called ODP-IP which is a state-of-the-art algorithm that works by combining dynamic programming and anytime algorithms [14].

Let us consider the following simple scenario. There is a service company dispatching interpreters with three employees, Ali, Bob and Chan. Now, this service company has received seven job requests for simultaneous interpretation, each requiring some specific language skills. Table 1 shows the rewards of each job request.

Table 1 The employees necessary for completing a request and the associated reward

Assume that you are the manager of this service company and want to assign employees to jobs such that the sum of the rewards is maximized. Then, this problem can be represented as an instance of the \({\mathsf {CSG}}\) problem. If you assign three employees Ali, Bob and Chan to job requests 1, 2 and 3 separately, the sum of the rewards obtained by the coalition structure \(\{\{Ali\}, \{Bob\}, \{Chan\}\}\) is \(\$20 + \$50 + \$10 = \$80\). When you assign Ali to request 1, and Bob and Chan to request 6, the social surplus is maximized. In this case, the coalition structure is \(\{\{Ali\}, \{Bob, Chan\}\}\) and the service company earns \(\$20 + \$100 = \$120\).

In what follows, we are interested in the uncertainty of agents’ attendances. In traditional \({\mathsf {CSG}}\), one does need to worry about whether all coalitions will be established or not, i.e., the probability of each agent to join any coalition he is assigned to is assumed to be 1.0. However, if we want to mimic the real world, it is natural to consider the uncertainty of agents’ availabilities. For instance, it might happen that an agent is only available on some days or the week or cannot commit in advance with certainty.

In our example of a service company, it is natural that the manager asks the three employees about their schedules before assigning them to jobs. Also, setting an exact probability of each agent to join a coalition is not realistic, e.g., when Ali thinks that he can probably fit job request 1 into his schedule, he will report to the manager “I can probably work for request 1” but will not answer “I can work for it with probability \(66\%\)”. Here, or more specifically in the experimental part, we assume that (1) each agent chooses in advance one of the following attendance types (including the probabilities/attendance rates) according to his/her own scheduleFootnote 1:

  • Type 1: {available (\(90\%\))},

  • Type 2: {probably available (\(70\%\))},

  • Type 3: {unsure (\(50\%\))},

  • Type 4: {probably not available (\(30\%\))},

  • Type 5: {not available (\(10\%\))},

and (2) the \({\mathsf {CSG}}\) maker (e.g. the manager of the service company in our example) knows all the information about the agents’ types. Such simplification leads to a lack of generality in the sense that we allow the agents’ availabilities to be chosen only from a small set of possible types, while the \({\mathsf {PCSG}}\) model allows for an infinite number of agent types. Still, it is arguably reasonable when considering applications, since allowing the agents/the \({\mathsf {CSG}}\) maker to be able to use the whole interval [0, 1] when setting the availability probabilities does not seem sensible, as already discussed above.

In this paper, the main focus is laid on the Probabilistic Coalition Structure Generation (\({\mathsf {PCSG}}\)) problem which is an extension of \({\mathsf {CSG}}\) where the availability type of each agent is considered. First, a formal framework for \({\mathsf {PCSG}}\) which generalizes the one from [26] is introduced. The aim is to find a coalition structure that maximizes the sum of the expected values of all coalitions. Any such coalition structure will be called optimal. What exactly is meant under the term “expected value of a coalition” is an important issue here. In our framework, the expected value of a coalition is parameterized by the parameter k. If up to k agents are missing from a coalition, the contribution of the remaining agents can be read off from the characteristic function. If the number of missing agents is higher than k, we assume that the contribution of the remaining agents is 0, as their (sub)coalition differs too much from the coalition originally planned. Note that when \(k=n-1\), any subset of the original coalition gives its contribution to the expected coalition value, and this corresponds to Flexible \({\mathsf {PCSG}}\) [26]. On the other hand, if \(k=0\), the model corresponds to Cautious \({\mathsf {PCSG}}\) [26]. To illustrate the influence of k, let us consider the expected value of the coalition \(\{Ali, Bob, Chan\}\) for \(k=1\) (i.e., we allow one agent to be missing). The expected value is computed by taking into consideration the following cases (1) Ali, Bob and Chan are available, (2) Ali and Bob are available, (3) Ali and Chan are available and (4) Bob and Chan are available.

Furthermore, we present approximation algorithms for solving the \({\mathsf {PCSG}}\) problem called Bounded Approximation Algorithm based on Attendance Types (\({\mathsf {BAAAT}}\)) and Involved Bounded Approximation Algorithm based on Attendance Types (\({\mathsf {IBAAAT}}\)). In the \({\mathsf {PCSG}}\) problem, since finding the optimal coalition structure becomes easily intractable, it is important to consider fast but approximate algorithms. The basic idea of the proposed algorithm \({\mathsf {BAAAT}}\) is that for a given parameter \({\tilde{p}}\), (1) agents whose attendance probability is below the parameter \({\tilde{p}}\) are placed into singleton coalitions, (2) an optimal coalition structure of the relaxed problem with the remaining agents is computed. On the other hand, \({\mathsf {IBAAAT}}\) has the following two steps: (1) agents are split into two sets such that one is the set of agents whose attendance probability is below the parameter \({\tilde{p}}\) and the other is the set of all the remaining agents, (2) optimal coalition structures of the two relaxed problems are computed. We also prove an upper bound on the error of the solution returned by \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) with respect to the optimum. The error bound is a theoretical worst-case bound that is obtained a priori, that is, before actually running the algorithm. Finally, the performances of \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) are evaluated on a number of benchmarks.

The rest of the paper is organized as follows. In Sect. 2, the Coalition Structure Generation (\({\mathsf {CSG}}\)) problem is briefly described. Section 3 introduces a formal framework for the Probabilistic Coalition Structure Generation (\({\mathsf {PCSG}}\)) problem that will be used in the rest of the paper. In Sect. 4, approximation algorithms for the \({\mathsf {PCSG}}\) problem called Bounded Approximation Algorithm based on Attendance Types (\({\mathsf {BAAAT}}\)) and Involved Bounded Approximation Algorithm based on Attendance Types (\({\mathsf {IBAAAT}}\)) are presented, together with their approximation guarantees. In Sect. 5, \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) are evaluated on a number of benchmarks. Section 6 discusses related work, and, finally, Sect. 7 concludes the paper.

2 Coalition structure generation

We briefly describe the Coalition Structure Generation (\({\mathsf {CSG}}\)) problem [1, 18, 23]. \({\mathsf {CSG}}\) involves partitioning a set of agents into coalitions such that the social surplus (i.e., the sum of the values of all coalitions) is maximized. Let us start with some preliminary definitions.

Let \({\mathcal {A}} = \{a_1, a_2, \dots , a_n\}\) be a finite set of agents. A coalition from \({\mathcal {A}}\), denoted by C, is a non-empty subset of \({\mathcal {A}}\). A coalition structure on \({\mathcal {A}}\), denoted by \({{\mathrm{{CS}}}}\), is a partition of \({\mathcal {A}}\), that is, a jointly exhaustive set of pairwise disjoint coalitions from \({\mathcal {A}}\). More formally, a coalition structure on \({\mathcal {A}}\) is a finite set of coalitions satisfying the following two conditions:

  1. 1.

    \(\forall i, j \in \{1, 2, \dots , m\},\, i \ne j,\, C_i \cap C_j = \emptyset ,\)

  2. 2.

    \(\bigcup _{C_i \in {{\mathrm{{CS}}}}} C_i = {\mathcal {A}}.\)

In other words, each agent belongs to exactly one coalition. Note that some agents may be alone in their coalitions. In our running example of a service company with three employees, there exist seven possible coalitions (i.e., \(\{Ali\}\), \(\{Bob\}\), \(\{Chan\}\), \(\{Ali, Bob\}\), \(\{Ali, Chan\}\), \(\{Bob, Chan\}\), \(\{Ali\), Bob, \(Chan\}\)) and the following five coalition structures:

$$\begin{aligned}&\{\{Ali\}, \{Bob\}, \{Chan\}\}, \{\{Ali\}, \{Bob, Chan\}\}, \{\{Bob\}, \{Ali, Chan\}\},\\&\quad \{\{Chan\}, \{Ali, Bob\}\}, \{\{Ali, Bob, Chan\}\}. \end{aligned}$$

The Coalition Structure Generation problem description is defined as follows:

Definition 1

(\({\mathsf {CSG}}\)problem description) A coalition structure generation problem description is defined by a pair \({\mathsf {CSG}} = \langle {\mathcal {A}}, v \rangle\) where \({\mathcal {A}} = \{a_1, a_2, \dots , a_n\}\) is a set of agents and \(v:2^{{\mathcal {A}}} \rightarrow \mathbb {R}\) is a function called the characteristic function.

The value of a coalition C, denoted by v(C), is given by the characteristic function v. The value of a coalition structure \({{\mathrm{{CS}}}}\), denoted by \(V({\mathrm{{CS}}})\), is provided by the sum of the values of all coalitions, i.e.,

$$\begin{aligned} V({\mathrm{{CS}}}) = \sum _{C_i \in {\mathrm{{CS}}}} v(C_i). \end{aligned}$$
(1)

A coalition structure is said to be optimal, denoted by \({\mathrm{{CS}}}^*\), if it maximizes the social surplus, that is, if it satisfies the following condition:

$$\begin{aligned} \forall {\mathrm{{CS}}},\, V({\mathrm{{CS}}}) \le V({\mathrm{{CS}}}^*). \end{aligned}$$

Example 1

(\({\mathsf {CSG}}\)) Consider the service company with three employees Ali, Bob and Chan introduced in the previous section. Assume that you are the manager of this service company and want to assign the employees to jobs such that the sum of the rewards from Table 1 is maximized. Then, this problem can be represented as an instance of the \({\mathsf {CSG}}\) problem: let \({\mathsf {CSG}} = \langle {\mathcal {A}}, v \rangle\) be a \({\mathsf {CSG}}\) problem description with \({\mathcal {A}} = \{Ali, Bob, Chan\}\), and the function v is then characterized as follows:

$$\begin{aligned}&v(\{Ali\})=20, \,v(\{Bob\})=50, \,v(\{Chan\})=10,\\&v(\{Ali, Bob\})=70,\, v(\{Ali, Chan\})=60, \,v(\{Bob, Chan\})=100,\\&v(\{Ali, Bob, Chan\})=110. \end{aligned}$$

Table 2 shows the rewards associated with all possible coalition structures. The optimal coalition structure in this example is \({\mathrm{{CS}}}^* = \{\{Ali\}, \{Bob, Chan\}\}\), and the obtained reward is \(V({\mathrm{{CS}}}^*) = v(\{Ali\}) + v(\{Bob, Chan\}) = \$20 + \$100 = \$120\). As illustrated by this running example, the characteristic function v is supposed to be represented extensively, as the set of pairs \(\{(C, v(C)) \mid C \subseteq {\mathcal {A}} \text{ and } C \ne \emptyset \}\).

Table 2 The obtained rewards of all possible coalition structures

The \({\mathsf {CSG}}\) problem is defined as follows:

Definition 2

(\({\mathsf {CSG}}\)problem)

  • Input A \({\mathsf {CSG}}\) problem description \({\mathsf {CSG}} = \langle A, v \rangle\),

  • Question Find an optimal coalition structure \({\mathrm{{CS}}}^*\).

Let us now focus on the specific cases of the \({\mathsf {CSG}}\) problem, when the characteristic function v is subadditive or superadditive. For a \({\mathsf {CSG}}\) problem description \({\mathsf {CSG}} = \langle {\mathcal {A}}, v \rangle\), the characteristic function v is said to be subadditive if for any coalitions \(C_i\) and \(C_j\) with \(C_i \cap C_j = \emptyset\), it holds that \(v(C_i) + v(C_j) \ge v(C_i \cup C_j)\). Conversely, v is called superadditive if it holds that \(v(C_i) + v(C_j) \le v(C_i \cup C_j)\). It is well-known that in case v is subadditive, the coalition structure formed by singleton coalitions is optimal, i.e., \({\mathrm{{CS}}}^* = \{\{a_i\} \;|\; a_i \in {\mathcal {A}}\}\). For the superadditive case, the grand coalition is optimal, namely \({\mathrm{{CS}}}^* = \{{\mathcal {A}}\}\) [24].

3 Probabilistic coalition structure generation

We now introduce the Probabilistic Coalition Structure Generation (\({\mathsf {PCSG}}\)) problem, which will be in focus of this work. As opposed to the traditional \({\mathsf {CSG}}\), where we are guaranteed that all coalitions will be established, we want to have the real world in mind. So, it is natural to consider the uncertainty of agents’ availabilities. \({\mathsf {PCSG}}\) is an extension of \({\mathsf {CSG}}\) where exactly the attendance type of each agent is considered. The aim is to find an optimal coalition structure that maximizes the sum of the expected values of all coalitions. Note that it is not a priori clear how one should arrive at the expected value of a coalition. Therefore, an important issue in \({\mathsf {PCSG}}\) is how exactly the expected value of a coalition is computed. In our framework, the computation of the expected value depends on the parameter k. If up to k agents are missing from a coalition, the contribution of the remaining agents can be read off from the characteristic function. If the number of missing agents is higher than k, we assume that the contribution of the remaining agents is 0, as their (sub)coalition differs too much from the coalition originally planned. When \(k=n-1\), any subset of the original coalition gives its contribution to the expected coalition value, and this corresponds to Flexible \({\mathsf {PCSG}}\) [26]. On the other hand, if \(k=0\), the model corresponds to Cautious \({\mathsf {PCSG}}\) [26].

In the experimental part, we in addition assume that (1) each agent chooses in advance one of the following attendance types (including the probabilities/attendance rates) according to his/her own schedule:

  • Type 1: {available (\(90\%\))},

  • Type 2: {probably available (\(70\%\))},

  • Type 3: {unsure (\(50\%\))},

  • Type 4: {probably not available (\(30\%\))},

  • Type 5: {not available (\(10\%\))},

and (2) the \({\mathsf {CSG}}\) maker (e.g. the manager of the service company in our example) knows all the information about the agents’ types.

The Probabilistic Coalition Structure Generation problem description is then defined as follows:

Definition 3

(\({\mathsf {PCSG}}\)problem description) A probabilistic coalition structure generation problem description is defined by a tuple \({\mathsf {PCSG}} = \langle {\mathcal {A}}, v, f, k \rangle\) where \({\mathcal {A}} = \{a_1, a_2,\ldots ,a_n\}\) is a set of agents, \(v: 2^{{\mathcal {A}}} \rightarrow \mathbb {R}\) is a characteristic function, \(f: A \rightarrow [0,1]\) is a function that gives the probability/attendance rate of each agent and k is an integer, \(0\le k\le n-1\).

Here, the participation of each agent a is a binary random variable that takes value 1 with probability f(a) and value 0 with the remaining probability. Furthermore, we assume these random variables to be mutually independent, that is, the participation of one agent has no influence on those of other agents.

For a coalition C, let \({\tilde{A}} \subseteq C\) be the set of absent agents where \(|{\tilde{A}}| \le k\), \(0 \le k \le n-1\). For any such \({\tilde{A}}\), the coalition that remains after removing \({\tilde{A}}\) from C is denoted by \(C\setminus {\tilde{A}}\), and the value of this coalition is given by \(v(C \setminus {\tilde{A}})\). The contribution of this coalition to the expected value of coalition C, denoted by \(v^k_{e}(C, {\tilde{A}})\), is

$$\begin{aligned} v^k_{e}(C, {\tilde{A}}) = v(C \setminus {\tilde{A}}) \cdot \prod _{a \in C \setminus {\tilde{A}}} f(a) \cdot \prod _{a' \in {\tilde{A}}} (1- f(a')). \end{aligned}$$
(2)

If \(\vert {\tilde{A}}\vert >k\), then we define \(v^k_{e}(C, {\tilde{A}})=0\).

Now, the expected value of a coalition C, denoted by \(v_{e,k}(C)\), is given by

$$\begin{aligned} v_{e,k}(C) = \sum _{{\tilde{A}} \in 2^{C}} v^k_e(C, {\tilde{A}}). \end{aligned}$$
(3)

Finally, the expected value of a coalition structure \({\mathrm{{CS}}}\), denoted by \(V_{e,k}({\mathrm{{CS}}})\), is computed as the sum of the expected values of all coalitions, i.e.

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}) = \sum _{C \in {\mathrm{{CS}}}} v_{e,k}(C). \end{aligned}$$
(4)

A coalition structure is said to be optimal, denoted by \({\mathrm{{CS}}}_{e,k}^*\), if it maximizes the sum of the expected values of all coalitions, i.e. if the following condition holds:

$$\begin{aligned} \forall {\mathrm{{CS}}},\, V_{e,k}({\mathrm{{CS}}}) \le V_{e,k}({\mathrm{{CS}}}_{e,k}^*). \end{aligned}$$
Table 3 The expected values of all possible coalition structures when \(k=1\)

Example 1

(continued) Consider our running example of a service company with three employees. Assume that Ali reported Type 1 (i.e., {available (\(90\%\))}), Bob chose Type 3 (i.e., {unsure (\(50\%\))}), and Chan selected Type 4 (i.e., {probably not available (\(30\%\))}). Moreover, the manager sets the parameter \(k=1\), that is, he/she wants to maximize the expected value of a coalition structure where at most one employee may be absent from a coalition in order for the remaining agents to still have a positive contribution to the expected coalition value. The expected value of each coalition is then computed by using the rewards from Table 1 as in Eqs. (2) and (3):

$$\begin{aligned}&v_{e,1}(\{Ali\})=18, \,v_{e,1}(\{Bob\})=25, \,v_{e,1}(\{Chan\})=3,\\&v_{e,1}(\{Ali, Bob\})=43, \,v_{e,1}(\{Ali, Chan\})=29.1, \,v_{e,1}(\{Bob, Chan\})=34,\\&v_{e,1}(\{Ali, Bob, Chan\})=46.5. \end{aligned}$$

Notice that when compared to Flexible \({\mathsf {PCSG}}\) [26], the parameter \(k=1\) only has an influence on the expected value of the grand coalition in this example. All other coalitions have size of at most 2, such that if more than one agents is missing, the set of the remaining agents is empty, and the contribution of the empty set is 0, independently of k. Consider now the grand coalition (i.e., \(\{Ali, Bob, Chan\}\)). The positive contributions to the expected value of the grand coalition are given as follows:

  • Ali, Bob and Chan are available: \(v(\{Ali, Bob, Chan\})\cdot f(Ali)\cdot f(Bob) \cdot f(Chan)= 110\cdot (0.9\cdot 0.5\cdot 0.3)=14.85\)

  • Ali and Bob are available: \(v(\{Ali, Bob\}) \cdot f(Ali) \cdot f(Bob) \cdot (1-f(Chan))= 70 \cdot (0.9 \cdot 0.5\cdot 0.7) = 22.05\).

  • Ali and Chan are available: \(v(\{Ali, Chan\})\cdot f(Ali)\cdot (1-f(Bob))\cdot f(Chan) = 60\cdot (0.9\cdot 0.5\cdot 0.3)=8.1\)

  • Bob and Chan are available: \(v(\{Bob, Chan\})\cdot (1-f(Ali))\cdot f(Bob)\cdot f(Chan)=100\cdot (0.1\cdot 0.5\cdot 0.3)=1.5\)

The expected value of the grand coalition is then \(v_{e,1}(\{Ali, Bob, Chan\}) = 14.85 + 22.05 + 8.1 + 1.5 = 46.5\).

Table 3 shows the expected values of all possible coalition structures for \(k=1\). Compared to the optimal coalition structure \({\mathrm{{CS}}}^*=\{\{Ali\}, \{Bob, Chan\}\}\) in our \({\mathsf {CSG}}\) example, the optimal coalition structure here is \({\mathrm{{CS}}}_{e,1}^* = \{\{Bob\}, \{Ali, Chan\}\}\), and the expected value obtained by \({\mathrm{{CS}}}_{e,1}^*\) is \(V_e({\mathrm{{CS}}}_{e,1}^*) = v_{e,1}(\{Bob\}) + v_{e,1}(\{Ali, Chan\}) = 25 + 29.1 = 54.1\).

The \({\mathsf {PCSG}}\) problem is defined as follows:

Definition 4

(\({\mathsf {PCSG}}\)problem)

  • Input A \({\mathsf {PCSG}}\) problem description \({\mathsf {PCSG}} = \langle {\mathcal {A}}, v, f, k\rangle\)

  • Question Find an optimal coalition structure \({\mathrm{{CS}}}^*_{e,k}\).

The \({\mathsf {PCSG}}\) problem is a generalization of the \({\mathsf {CSG}}\) problem. In case the attendance rate of each agent is 1.0, that is, it is guaranteed that every agent will join any coalition he/she is assigned to, then the \({\mathsf {PCSG}}\) problem reduces to the standard \({\mathsf {CSG}}\) problem. This fact is independent of the choice of parameter k, i.e., it holds for any k, \(0\le k\le n-1\).

Let us now focus on the special case of the \({\mathsf {PCSG}}\) problem when the characteristic function v is subadditive and \(k=n-1\). In that case, the analog of the result in the standard \({\mathsf {CSG}}\) framework also holds in the \({\mathsf {PCSG}}\) framework (which for \(k=n-1\) corresponds to Flexible \({\mathsf {PCSG}}\) [26]). More precisely, if v is subadditive, the coalition structure formed by singleton coalitions is optimal (i.e., \({\mathrm{{CS}}}_{e,n-1}^* = \{\{a_i\} | a_i \in {\mathcal {A}}\}\)) [26].

We now investigate the influence of the parameter k on the optimal solution in the \({\mathsf {PCSG}}\) problem, when the characteristic function v is subadditive.

Proposition 1

Let\({\mathsf {PCSG}}=\langle A, v, f, k\rangle\)be a probabilistic coalition structure generation problem description. If v is subadditive, then the coalition structure formed by singleton coalitions is optimal for any\(k\in [0,n-1]\). More precisely,

$$\begin{aligned} {\mathrm{{CS}}}_{e,k}^*=\{\{a_i\} | a_i \in {\mathcal {A}}\}, \;\forall k\in [0,n-1]. \end{aligned}$$

Proof

Since v is subadditive, we know that for any \(C_1, C_2\subseteq {\mathcal {A}}\) such that \(C_1\cap C_2=\emptyset\), \(v(C_1)+v(C_2)\ge v(C_1\cup C_2)\). We want to compare \(v_{e,k}(C_1)+v_{e,k}(C_2)\) and \(v_{e,k}(C_1\cup C_2)\). Namely, if we prove that \(v_{e,k}(C_1) + v_{e,k}(C_2)\ge v_{e,k}(C_1\cup C_2)\), the claim will follow. We know that

$$\begin{aligned} v_{e,k}(C_1\cup C_2) = \sum _{\begin{array}{c} {\tilde{C}}\subseteq (C_1\cup C_2)\\ \vert {\tilde{C}}\vert \le k \end{array}} \left( v((C_1\cup C_2)\setminus {\tilde{C}})\cdot \prod _{c\in (C_1\cup C_2)\setminus {\tilde{C}}} f(c) \cdot \prod _{c'\in {\tilde{C}}}(1-f(c')) \right) . \end{aligned}$$
(5)

Now, notice that for every \({\tilde{C}}\subseteq (C_1\cup C_2), \vert {\tilde{C}}\vert \le k\), there exist exactly one \({\tilde{C}}_1\subseteq C_1\) and exactly one \({\tilde{C}}_2\subseteq C_2\) such that \({\tilde{C}}={\tilde{C}}_1\cup {\tilde{C}}_2\). Then,

$$\begin{aligned}&v((C_1\cup C_2)\setminus {\tilde{C}})\cdot \prod _{c\in (C_1\cup C_2)\setminus {\tilde{C}}} f(c) \cdot \prod _{c'\in {\tilde{C}}}(1-f(c')) \\&\quad = v((C_1\cup C_2)\setminus ({\tilde{C}}_1\cup {\tilde{C}}_2)\cdot \prod _{c\in (C_1\cup C_2)\setminus ({\tilde{C}}_1\cup {\tilde{C}}_2)} f(c) \cdot \prod _{c'\in {\tilde{C}}_1\cup {\tilde{C}}_2}(1-f(c')) \\&\quad \le \left( v(C_1\setminus {\tilde{C}}_1) + v(C_2\setminus {\tilde{C}}_2) \right) \cdot \prod _{c\in (C_1\cup C_2)\setminus ({\tilde{C}}_1\cup {\tilde{C}}_2)} f(c) \cdot \prod _{c'\in {\tilde{C}}_1\cup {\tilde{C}}_2}(1-f(c')) \\&\quad \le v(C_1\setminus {\tilde{C}}_1)\cdot \prod _{c\in C_1\setminus {\tilde{C}}_1} f(c) \cdot \prod _{c'\in {\tilde{C}}_1}(1-f(c')) \\&\qquad + v(C_2\setminus {\tilde{C}}_2)\cdot \prod _{c\in C_2\setminus {\tilde{C}}_2} f(c) \cdot \prod _{c'\in {\tilde{C}}_2}(1-f(c')). \end{aligned}$$
(6)

But now, since \(\vert {\tilde{C}}_1\vert \le k\) and \(\vert {\tilde{C}}_2\vert \le k\), the two summands in the last expression of Eq. (6) must also appear as summands in \(v_{e,k}(C_1)\) and \(v_{e,k}(C_2)\), respectively. This means that every summand in Eq. (5) is upper bounded by one summand of \(v_{e,k}(C_1)\) and one summand of \(v_{e,k}(C_2)\). Therefore, we can conclude that \(v_{e,k}(C_1) + v_{e,k}(C_2)\ge v_{e,k}(C_1\cup C_2)\). \(\square\)

When v is superadditive, on the other hand, we cannot come to an immediate conclusion that \({\mathrm{{CS}}}^*_{e,k}=\{{\mathcal {A}}\}\). As a preliminary example, let us focus on the case when \(k=0\), which corresponds to Cautious \({\mathsf {PCSG}}\) [26]. Going back to our running example where \(v(\{Ali\})=20\), \(v(\{Chan\})=10\), \(v(\{Ali, Chan\})=60\) and \(f(Ali)=0.9,\; f(Chan)=0.3\), we see that v is superadditive on the domain consisting just of Ali and Chan. However, for \(k=0\), we can easily check that \(v_{e,0}(\{Ali\})=20\cdot 0.9=18\), \(v_{e,0}(\{Chan\})=10\cdot 0.3=3\) and

$$\begin{aligned} v_{e,0}(\{Ali, Chan\})=60\cdot 0.9\cdot 0.3=16.2 < 21 = v_{e,0}(\{Ali\})+v_{e,0}(\{Chan\}), \end{aligned}$$

meaning that superadditivity of v does not transfer to superadditivity of \(v_{e,0}\), and the grand coalition is not an optimal coalition structure. We prove a more general statement in Proposition 2.

Proposition 2

For every\(k\in [0,n-2]\), there exists an instance of the\({\mathsf {PCSG}}\)problem with problem description\({\mathsf {PCSG}}=\langle {\mathcal {A}}, v, f, k \rangle\)such thatvis superadditive and the grand coalition is not an optimal solution.

Proof

Let \(n=\vert {\mathcal {A}}\vert\) and \(f(a_i)=p>0, \forall a_i\in {\mathcal {A}}\). Furthermore, let \(v(C)=\frac{\vert C\vert }{n}, \forall C\subseteq {\mathcal {A}}\). Note that v is superadditive. Then,

$$\begin{aligned} v_{e,k}({\mathcal {A}})&=\sum _{\begin{array}{c} C\subseteq {\mathcal {A}}\\ \vert C\vert \le k \end{array}} v({\mathcal {A}}\setminus C)\cdot \prod _{c\in {\mathcal {A}}\setminus C} f(c) \cdot \prod _{c'\in C} (1-f(c'))\\&\le \sum _{\begin{array}{c} C\subseteq {\mathcal {A}}\\ \vert C\vert \le n-2 \end{array}} v({\mathcal {A}}\setminus C)\cdot \prod _{c\in {\mathcal {A}}\setminus C} f(c) \cdot \prod _{c'\in C} (1-f(c'))\\&=\sum _{i=1}^{n-2} {n\atopwithdelims ()i} p^{n-i}(1-p)^i\\&=p\cdot (1-(1-p)^{n-1})\\&<p\\&= \sum _{a_i\in {\mathcal {A}}} v_{e,k}(\{a_i\}), \end{aligned}$$

where the strict inequality follows from \((1-p)^{n-1}>0, \forall p>0\). Since the coalition structure that consists of all singletons has a higher value than the grand coalition, we know that \({\mathrm{{CS}}}_{e,k}^*\ne \{{\mathcal {A}}\}, \forall k\in [0,n-2]\). \(\square\)

The case \(k=n-1\) needs to be treated separately, as it yields a different result, that is aligned with the one for superadditivity in the standard \({\mathsf {CSG}}\) framework.

Proposition 3

Let \({\mathsf {PCSG}}=\langle A, v, f, n-1\rangle\) be a probabilistic coalition structure generation problem description. If v is superadditive, then the grand coalition is the optimal solution. More precisely,

$$\begin{aligned} {\mathrm{{CS}}}_{e,n-1}^*=\{\{a_i\} | a_i \in {\mathcal {A}}\}. \end{aligned}$$

Proof

Let \({\mathrm{{CS}}}=\{C_1,\dots , C_m\}\) be any coalition structure different from the grand coalition. The result follows by observing that for any realisation \(A\subseteq {\mathcal {A}}\) of the set of the available agents, because of superadditivity of v, we know that

$$\begin{aligned} V(\{{\mathcal {A}}\})=v(A)\ge v(A_1) + \dots + v(A_m) = V({\mathrm{{CS}}}), \end{aligned}$$

where \(A_i=C_i\cap A\). \(\square\)

Finally, let us note that an instance of the \({\mathsf {PCSG}}\) problem can be represented as a zero-one integer program in the same way as for the \({\mathsf {CSG}}\) problem, if we know the \(v_{e,k}\) values of all coalitions. For simplicity, we show the zero-one integer programming formulation of our running example for \({\mathsf {PCSG}}\). The objective function and the constraints are formalized as follows:

$$\begin{aligned}&\begin{array}{l} \max \,(18\cdot a_1 + 25 \cdot a_2 + 3 \cdot a_3 + 43 \cdot a_{12} + 29.1 \cdot a_{13} + 34 \cdot a_{23} + 46.5 \cdot a_{123}), \end{array} \end{aligned}$$
(7)
$$\begin{aligned}&s.t\,\, a_1 + a_{12} + a_{13} + a_{123} = 1, \end{aligned}$$
(8)
$$\begin{aligned}&a_2 + a_{12} + a_{23} + a_{123} = 1, \end{aligned}$$
(9)
$$\begin{aligned}&a_3 + a_{13} + a_{23} + a_{123} = 1, \end{aligned}$$
(10)
$$\begin{aligned}&a_1, a_2, a_3, a_{12}, a_{13}, a_{23}, a_{123} \in \{0,1\}. \end{aligned}$$
(11)

Variables \(a_1, a_2, \ldots , a_{123}\) represent all possible coalitions, e.g., \(a_1\) is the coalition \(\{Ali\}\), and \(a_{123}\) represents the grand coalition (i.e., \(\{\{Ali, Bob, Chan\}\}\)) and can take values 0 or 1 [Eq. (11)]. Equation (8) describes that Ali belongs to one of the coalitions \(\{Ali\}, \{Ali, Bob\}, \{Ali, Chan\}, \{Ali, Bob, Chan\}\), and he cannot belong to more than one coalition simultaneously. Similarly, Eqs. (9) and (10) show the constraints for Bob and Chan. Equation (7) represents the objective function which maximizes the sum of the expected values of all coalitions, and each coefficient shows the expected value obtained by the corresponding coalition, e.g., 18 is the expected value of the coalition \(\{Ali\}\) and 46.5 is the expected value of the grand coalition.

4 Bounded approximation algorithms

In this section, we present two approximation algorithms for solving the \({\mathsf {PCSG}}\) problem called Bounded Approximation Algorithm based on Attendance Types (\({\mathsf {BAAAT}}\)) and Involved Bounded Approximation Algorithm based of Attendance Types (\({\mathsf {IBAAAT}}\)). Even though our algorithms can be seen as simple heuristics, we term them algorithms as they offer a theoretical bound on the quality of the returned solutions.

Let us for a moment assume that we can get the \(v_{e,k}\) function as input, instead of just the characteristic function v and parameter k. Even though this assumption saves a lot of computation that would be necessary to arrive at \(v_{e,k}\), still the input, i.e., the representation size, is exponential in the number of agents. Thus, it is important to consider approximation algorithms, i.e., to consider a trade-off between the quality of the returned solution and tractability. To this end, we prove an a priori upper bound on the error of the solution returned by both \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\), i.e., the error bound is obtained before actually running the algorithm.

4.1 Approximation algorithm \({\mathsf {BAAAT}}\)

\({\mathsf {BAAAT}}\) has the following two phases:

  1. Phase 1:

    For a given parameter \({\tilde{p}} \in \mathbb {R}\) and the attendance rates of all agents, form singleton coalitions for every agent a whose attendance rate p is such that \(p\le {\tilde{p}}\).

  2. Phase 2:

    Find an optimal coalition structure of the relaxed \({\mathsf {PCSG}}\) problem with the remaining agents.

The basic idea of \({\mathsf {BAAAT}}\) is that for a given parameter \({\tilde{p}}\), the singleton coalition is formed for an agent who chooses the attendance type where the given probability/attendance rate is less than or equal to the parameter \({\tilde{p}}\). Then, an optimal coalition structure of the relaxed problem with the remaining agents is computed. We denote the coalition structure on the subset of agents that remain after singletons are formed in Phase 1, which is obtained in Phase 2, by \({\mathrm{{CS}}}^-_{e,k}\) and the coalition structure provided by \({\mathsf {BAAAT}}\), which is a coalition structure on the whole set of agents, by \({\mathrm{{CS}}}^+_{e,k}\).

Let us further explain how \({\mathsf {BAAAT}}\) computes an approximate solution by using our running example. We are given the attendance type (that includes the attendance rate) of each agent, i.e., \(f(Ali)=0.9\) (Type1: available), \(f(Bob)=0.5\) (Type 3: unsure), \(f(Chan)=0.3\) (Type 4: probably not available), the parameter \(k=1\), and the following expected values of coalitions:

$$\begin{aligned}&v_{e,1}(\{Ali\})=18, \,v_{e,1}(\{Bob\})=25, \,v_{e,1}(\{Chan\})=3,\\&v_{e,1}(\{Ali, Bob\})=43, \,v_{e,1}(\{Ali, Chan\})=29.1, \,v_{e,1}(\{Bob, Chan\})=34,\\&v_{e,1}(\{Ali, Bob, Chan\})=46.5. \end{aligned}$$

Let \({\tilde{p}}=0.3\), i.e., the manager does not count on agents who chose attendance Types 4 (i.e., probably not available (\(30\%\))) and 5 (i.e., not available (\(10\%\))). Since Chan reported attendance Type 4, i.e., \(f(Chan) = 0.3 = {\tilde{p}}\), the singleton coalition is formed for Chan in Phase 1 of \({\mathsf {BAAAT}}\). Then, the relaxed \({\mathsf {PCSG}}\) problem with the remaining agents (i.e., Ali and Bob) is solved in Phase 2 of \({\mathsf {BAAAT}}\)Footnote 2 That is, the coalitions which include Chan can be ignored in the simplified problem and only the following is still relevant:

$$\begin{aligned} v_{e,1}(\{Ali\})=18, \,v_{e,1}(\{Bob\})=25, v_{e,1}(\{Ali, Bob\})=43. \end{aligned}$$

Since the expected value of the coalition formed by Ali and Bob is equal to the sum of the expected values of singleton coalitions with Ali and Bob, that is, \(v_{e,1}(\{Ali, Bob\}) = 43 = v_{e,1}(\{Ali\}) + v_{e,1}(\{Bob\})\), the optimal coalition structures of the relaxed problem are \(\{Ali, Bob\}\) and \(\{\{Ali\}, \{Bob\}\}\). The solutions obtained by \({\mathsf {BAAAT}}\) are \({\mathrm{{CS}}}^{+1}_{e,1} = \{\{Chan\}, \{Ali, Bob\}\}\) and \({\mathrm{{CS}}}^{+2}_{e,1} = \{\{Ali\}, \{Bob\}, \{Chan\}\}\). On this instance, there are two possible solutions, but their expected value is the same, namely, \(V_e({\mathrm{{CS}}}^+_{e,1}) = 46\).

4.2 Quality guarantee of \({\mathsf {BAAAT}}\)

We show that we can provide an upper bound on the error of the solution returned by \({\mathsf {BAAAT}}\) a priori, i.e., the error bound is obtained before actually running the algorithm. Let us denote by \({\mathcal {C}}_a\) the set of all coalitions that contain agent a as a member, and let \(\tilde{{\mathcal {A}}}\) be the set of agents whose probability is lower than or equal to a given parameter \({\tilde{p}}\) such that they form their own coalition in Phase 1. Furthermore, let \(r_a^{\max } = \max \{v_{e,k}(C) \,|\, C \in {\mathcal {C}}_a\}\), i.e., \(r_a^{\max }\) is the maximal expected value of all coalitions which include agent a.

Lemma 1

Let\({\mathsf {PCSG}} = \langle {\mathcal {A}}, v, f, k\rangle\)be a probabilistic coalition structure generation problem description. For an optimal coalition structure\({\mathrm{{CS}}}_{e,k}^*\)and a coalition structure\({\mathrm{{CS}}}_{e,k}^{-}\)obtained by\({\mathsf {BAAAT}}\)in Phase 2, the following inequality holds:

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^-) \;\le \; \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} r_{{\tilde{a}}_i}^{\max }. \end{aligned}$$
(12)

Proof

We prove the claim by induction on the size of \(\tilde{{\mathcal {A}}}\). In the base case, \(\tilde{{\mathcal {A}}} = \emptyset\). Then, since no agents are removed in Phase 1 of \({\mathsf {BAAAT}}\), the whole instance is solved to optimality in Phase 2. This means that

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) = V_{e,k}({\mathrm{{CS}}}_{e,k}^-). \end{aligned}$$

On the other hand, \(\sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} r_{{\tilde{a}}_i}^{\max } = 0\), so

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^-) = 0 \le 0 = \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} r_{{\tilde{a}}_i}^{\max } \end{aligned}$$

and inequality (12) holds.

Let us now assume that inequality (12) holds for all \(\tilde{{\mathcal {A}}}\) such that \(|\tilde{{\mathcal {A}}}| = \ell\). More specifically, we assume that

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^-) \le \sum _{i=1}^{\ell } r_{{\tilde{a}}_i}^{\max },\, \forall \tilde{{\mathcal {A}}},\, |\tilde{{\mathcal {A}}}|=\ell . \end{aligned}$$
(13)

Next, we consider the case where \(\tilde{{\mathcal {A}}} = \{{\tilde{a}}_1,\ldots ,{\tilde{a}}_\ell , {\tilde{a}}_{\ell +1}\}\). We first observe the coalitions that form the expected optimal coalition structure \({\mathrm{{CS}}}_{e,k}^*\) and denote these by \({\mathrm{{CS}}}_{e,k}^* = \{C_{e,k}^{*1},\ldots ,C_{e,k}^{*m}\}\). We also know that \(\tilde{a_1}\) must be in one of these coalitions and without loss of generality we can assume that \(C_{e,k}^{*1} = \{{\tilde{a}}_1, b_1,\ldots ,b_q\}\). It might happen that \(b_i = {\tilde{a}}_j\) for some \(i \in [q]\), \(j \in [\ell +1] \setminus \{1\}\) but this does not influence the following inequalities:

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*)&= v_{e,k}(C_{e,k}^{*1}) + v_{e,k}(C_{e,k}^{*2}) + \cdots + v_{e,k}(C_{e,k}^{*m}) \\&\le r_{{\tilde{a}}_1}^{\max } + v_{e,k}(C_{e,k}^{*2}) + \cdots + v_{e,k}(C_{e,k}^{*m}) \\&\le r_{\tilde{a_1}}^{\max } + v_{e,k}(\{b_{1}\}) + \cdots + v_{e,k}(\{b_m\}) + v_{e,k}(C_{e,k}^{*2}) + \cdots + v_{e,k}(C_{e,k}^{*m}) \\&\le r_{{\tilde{a}}_1}^{\max } + V_{e,k}({\mathrm{{CS}}}_{e,k}^{-{\tilde{a}}_1}), \end{aligned}$$

where \(V_{e,k}({\mathrm{{CS}}}_{e,k}^{-{\tilde{a}}_1})\) denotes the optimal coalition structure on the set of agents \({\mathcal {A}} \setminus \{\tilde{a_1}\}\). The last inequality follows from the fact that \(\{\{b_1\},\ldots ,\{b_m\},\)\(C_{e,k}^{*2},\ldots ,C_{e,k}^{*m}\}\) is a coalition structure over the agents in \({\mathcal {A}} \setminus \{\tilde{a_1}\}\) and \({\mathrm{{CS}}}_{e,k}^{-\tilde{a_1}}\) is an optimal such coalition structure. Now, we need to still remove agents \({\tilde{a}}_2,\ldots ,{\tilde{a}}_n\) to reach Phase 2 of \({\mathsf {BAAAT}}\) and to be able to compare \(V_{e,k}({\mathrm{{CS}}}_{e,k}^*)\) with \(V_{e,k}({\mathrm{{CS}}}_{e,k}^{-})\). However, by the induction hypothesis (13) on \({\mathrm{{CS}}}_{e,k}^{-{\tilde{a}}_1}\), we have

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^{-{\tilde{a}}_1}) - V_{e,k}({\mathrm{{CS}}}_{e,k}^-) \le \sum _{i=2}^{\ell +1} r_{{\tilde{a}}_i}^{\max } \end{aligned}$$

Thus, it holds that

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*)&\le r_{{\tilde{a}}_1}^{\max } + V_{e,k}({\mathrm{{CS}}}_{e,k}^{-{\tilde{a}}_1}) \\&\le r_{{\tilde{a}}_1}^{\max } + \sum _{i=2}^{\ell +1} r_{{\tilde{a}}_1}^{\max } + V_{e,k}({\mathrm{{CS}}}_{e,k}^-) \\&\le \sum _{i=1}^{\ell +1} r_{{\tilde{a}}_i}^{\max } + V_{e,k}({\mathrm{{CS}}}_{e,k}^{-}), \end{aligned}$$

which concludes the proof. \(\square\)

Theorem 1

Let\({\mathsf {PCSG}} = \langle {\mathcal {A}}, v, f, k\rangle\)be a probabilistic coalition structure generation problem description. For an optimal coalition structure\({\mathrm{{CS}}}_{e,k}^*\)and any coalition structure\({\mathrm{{CS}}}_{e,k}^{+}\)obtained by\({\mathsf {BAAAT}}\), the following holds:

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^+) \le \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} \left( r_{{\tilde{a}}_i}^{\max } - v_{e,k}({{\tilde{a}}_i})\right) . \end{aligned}$$
(14)

Proof

By Lemma 1, it holds that

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^-) \le \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} r_{{\tilde{a}}_i}^{\max }. \end{aligned}$$

Now, by subtracting \(\sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} v_{e,k}({{\tilde{a}}_i})\) from both sides, we get

$$\begin{aligned} \left( V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^-)\right) - \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} v_{e,k}({{\tilde{a}}_i})&\le \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} r_{{\tilde{a}}_i}^{\max } - \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} v_{e,k}({{\tilde{a}}_i}) \\ V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - \left( V_{e,k}({\mathrm{{CS}}}_{e,k}^-) + \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} v_{e,k}({{\tilde{a}}_i})\right)&\le \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} \left( r_{{\tilde{a}}_i}^{\max } - v_{e,k}({{\tilde{a}}_i})\right) \\ V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^+)&\le \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} \left( r_{{\tilde{a}}_i}^{\max } - v_{e,k}({{\tilde{a}}_i})\right) . \end{aligned}$$

\(\square\)

Let us now focus on a special case of the \({\mathsf {PCSG}}\) problem in which we can give a better bound on the quality of the solution returned by \({\mathsf {BAAAT}}\).

Proposition 4

Let \({\mathsf {PCSG}} = \langle {\mathcal {A}}, v, f, k \rangle\) be a probabilistic coalition structure generation problem description. In case the expected values of all coalitions satisfy subadditivity, i.e., \(v_{e,k}\) is a subadditive function, the coalition structure \({\mathrm{{CS}}}_{e,k}^+\) obtained by \({\mathsf {BAAAT}}\) is optimal, i.e., it holds that

$$\begin{aligned} {\mathrm{{CS}}}_{e,k}^* = {\mathrm{{CS}}}_{e,k}^+ \end{aligned}$$
(15)

Proof

In the case where \(v_{e,k}\) is subadditive, the optimal coalition structure \({\mathrm{{CS}}}_{e,k}^*\) is formed by singletons. So, we just need to show that \({\mathrm{{CS}}}_{e,k}^+\) has the form \({\mathrm{{CS}}}_{e,k}^+ = \{\{a_i\} | a_i \in {\mathcal {A}}\}\). In Phase 1 of \({\mathsf {BAAAT}}\), only singleton coalitions are formed, independently of \(v_{e,k}\). In Phase 2, since \(v_{e,k}\) is subadditive, the optimal coalition structure \({\mathrm{{CS}}}_{e,k}^-\) of the relaxed problem is formed by singleton coalitions. Thus, the solution \({\mathrm{{CS}}}_{e,k}^+\) obtained by \({\mathsf {BAAAT}}\) is formed by singleton coalitions. \(\square\)

4.3 Approximation algorithm \({\mathsf {IBAAAT}}\)

Let us now try to give a better approximation algorithm for \({\mathsf {PCSG}}\) by having a more involved approach to agents that have a low attandance rate. Instead of immediately forming singleton coalitions, we will find an optimal coalition structure on the subset of such agents. We will do the same on the remaining set of agents (as in \({\mathsf {BAAAT}}\)) and the final solution will be the union of these two optima.

\({\mathsf {IBAAAT}}\) has the following two phases:

  1. Phase 1:

    For a given parameter \({\tilde{p}} \in \mathbb {R}\) and the attendance rates of all agents, split the agents into two sets such that every agent a whose attendance rate p is such that \(p\le {\tilde{p}}\) is in \(A_1\) and all other agents are in \(A_2\).

  2. Phase 2:

    Find optimal coalition structures of the two relaxed \({\mathsf {PCSG}}\) problems on \(A_1\) and \(A_2\) and output their union.

Going back to our running example, let us compare the solutions returned by \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\). Recall that the attendance types are given by \(f(Ali)=0.9\) (Type1: available), \(f(Bob)=0.5\) (Type 3: unsure), and \(f(Chan)=0.3\) (Type 4: probably not available). We still assume \(k=1\), and the expected values of coalitions can be seen in Table 3. If we, as before, choose \({\tilde{p}}=0.3\), then only Chan is placed in \(A_1\). Previously we already found that the optimal solutions of the relaxed \({\mathsf {PCSG}}\) problem on \(A_2=\{Ali, Bob\}\) are \(\{Ali, Bob\}\) and \(\{\{Ali\},\{Bob\}\}\). The optimal solution on \(A_1\) has to be \(\{Chan\}\), as this is the only possible solution. Thus, the solutions obtained by \({\mathsf {IBAAAT}}\) are identical to those obtained by \({\mathsf {BAAAT}}\), i.e., \(\{\{Chan\}, \{Ali, Bob\}\}\) and \(\{\{Ali\}, \{Bob\}, \{Chan\}\}\) with an expected value of 46. Now, let us set \({\tilde{p}}=0.5\). In this case \(A_1=\{Bob, Chan\}\) and the solutions returned by \({\mathsf {IBAAAT}}\) and \({\mathsf {BAAAT}}\) do not necessarily coincide. Indeed, \(v_{e,1}(\{Bob, Chan\})=34 > 28 = v_{e,1}(\{Bob\}) + v_{e,1}(\{Chan\})\). Thus, the solution returned by \({\mathsf {IBAAAT}}\) is \(\{\{Ali\}, \{Bob, Chan\}\}\) and has expected value 52.

4.4 Quality guarantee of \({\mathsf {IBAAAT}}\)

Lemma 2

Let\({\mathsf {PCSG}} = \langle {\mathcal {A}}, v, f, k\rangle\)be a probabilistic coalition structure generation problem description. For an optimal coalition structure\({\mathrm{{CS}}}_{e,k}^*\)and a coalition structure\({\mathrm{{CS}}}_{e,k}^{+-}\)on\(A_2\)obtained by\({\mathsf {IBAAAT}}\)in Phase 2, the following inequality holds:

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{+-}) \;\le \; \sum _{a\in A_1} r_{a}^{\max }. \end{aligned}$$
(16)

Proof

Since \(A_1=\tilde{{\mathcal {A}}}\) and \({\mathrm{{CS}}}^{+-}_{e,k}\) returned by \({\mathsf {IBAAAT}}\) is exactly the same as \({\mathrm{{CS}}}_{e,k}^{-}\) returned by \({\mathsf {BAAAT}}\), Lemma 1 proves the claim. \(\square\)

Theorem 2

Let\({\mathsf {PCSG}} = \langle {\mathcal {A}}, v, f, k\rangle\)be a probabilistic coalition structure generation problem description. For an optimal coalition structure\({\mathrm{{CS}}}_{e,k}^*\)and any coalition structure\({\mathrm{{CS}}}_{e,k}^{++}\)obtained by\({\mathsf {IBAAAT}}\), the following holds:

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{++}) \le \sum _{a \in A_1} \left( r_{a}^{\max } - v_{e,k}({a})\right) . \end{aligned}$$
(17)

Proof

By Lemma 2, it holds that \(V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{+-}) \le \sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} r_{{\tilde{a}}_i}^{\max }\). The coalition structure \({\mathrm{{CS}}}_{e,k}^{A_1}\) obtained by \({\mathsf {IBAAAT}}\) when dealing with the relaxed problem on \(A_1\) is in fact the optimal solution on \(A_1\), so \(V_{e,k}({\mathrm{{CS}}}_{e,k}^{A_1})\ge \sum _{a\in A_1}v_{e,k}(a)\). Then,

$$\begin{aligned} V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{++})&= V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - \left( V_{e,k}({\mathrm{{CS}}}_{e,k}^{A_1}) -V_{e,k}({\mathrm{{CS}}}_{e,k}^{+-})\right) \\&= \left( V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{+-})\right) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{A_1})\\&\le \left( V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{+-})\right) - \sum _{a\in A_1}v_{e,k}(a)\\&\le \sum _{a\in A_1} (r_a^{\max } - v_{e,k}(a)) \end{aligned}$$

\(\square\)

4.5 Tightness of the quality guarantee

Next, we show that the bounds given by Theorems 1 and 2 are in fact tight, meaning that there is an instance for which (14) and (17) hold with equality.

Proposition 5

The quality guarantees for\({\mathsf {BAAAT}}\)and\({\mathsf {IBAAAT}}\)given by Theorems 1and 2, respectively, are tight.

Proof

Let us construct an instance where \({\mathcal {A}}=\{a_1,\dots ,a_n, b_1,\dots ,b_n\}\) and the characteristic function is such that all subsets of the agent set have value 0, other than \(C\in \{\{a_1,b_1\},\dots ,\{a_n,b_n\}\}\) for which \(v(C)=1\). Furthermore, let the attendance of any agent \(a_i, i\in [n]\) be higher than \({\tilde{p}}\) and the attendance of any agent \(b_i, i\in [n]\) be lower than \({\tilde{p}}\).

Then, both \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) will return a solution of value 0 for any \(k\in [n-1]\), while the optimum has value n. Additionally, the loss of the algorithms is exactly described by the term \(\sum _{{\tilde{a}}_i\in \tilde{{\mathcal {A}}}} r^{\max }_{{\tilde{a}}_i} = \sum _{a\in A_1}r_a^{\max }\). In summary,

$$\begin{aligned} n&=n-0 = V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{+}) = V_{e,k}({\mathrm{{CS}}}_{e,k}^*) - V_{e,k}({\mathrm{{CS}}}_{e,k}^{++}) \end{aligned}$$
(18)
$$\begin{aligned}&\le \sum _{{\tilde{a}}_i\in \tilde{{\mathcal {A}}}} (r^{\max }_{{\tilde{a}}_i}-v_{e,k}({\tilde{a}}_i)) = \sum _{a\in A_1} (r_a^{\max } - v_{e,k}(a))\end{aligned}$$
(19)
$$\begin{aligned}&= \sum _{{\tilde{a}}_i\in \tilde{{\mathcal {A}}}} r^{\max }_{{\tilde{a}}_i} = \sum _{a\in A_1}r_a^{\max }=n \end{aligned}$$
(20)

\(\square\)

5 Experimental evaluation

In the experiments, \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) are evaluated on a number of benchmarks. We note that since this is the very first work that proposes non-complete algorithms for \({\mathsf {PCSG}}\), a comparison to existing algorithms from the literature is not possible. The only possible comparison is the one to the complete algorithm that solves the problem optimally, and this is, indeed, one of the benchmark used. In our experiments, \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) are implemented in Python and all experiments are carried out on a 6 core running at 3.3GHz with 32GB of RAM.

As preliminary experiments, we first investigate the influence of the value k, i.e., the number of agents who may be absent from a coalition such that the remaining agents still give a contribution to the expected coalition value, on the runtime of \({\mathsf {BAAAT}}\). We also look into the number of singleton coalitions in the optimal coalition structure, and also what are the agent types that tend to form singleton coalitions. Then, the quality of the solutions returned by \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) is evaluated with respect to the optimal solutions.

In order to compute optimal coalition structures, we used the CPLEX solver. The attendance type of each agent was randomly chosen from Type 1: {available (\(90\%\))} to Type 5: {not available (\(10\%\))}. For each setting, 50 problem instances were generated, each based on one of the several probability distributions for the characteristic function v that are commonly used in the literature:

  • (1) Uniform: \(v(C) = \textit{Uniform}(0, |C|)\) [12]

  • (2) Normal: \(v(C) = \textit{Normal}(\mu = 10*|C|, \sigma ^2 = 0.1)\) [21]

  • (3) Modified uniform: \(v(C) = \textit{Uniform}(0, 10*|C|) + a\), where \(a = \textit{Uniform}(0, 50)\) with probability 0.2 and \(a=0\) otherwise [28]

  • (4) Modified normal: \(v(C)= \textit{Normal}(10*|C|, 0.01) + a\) (a is as above) [20]

  • (5) Beta: \(v(C) = |C|*\textit{Beta}(\alpha = \beta = 0.5)\) [14]

  • (6) Gamma: \(v(C) = |C|*\textit{Gamma}(k = \theta = 2)\) [14]

5.1 Preliminary experiments

Fig. 1
figure 1

Runtime of \({\mathsf {BAAAT}}\) for different k. The x-axis shows the number of agents and the y-axis represents the computation time in seconds

Table 4 The average number of singletons in the optimal coalition structure \({\mathrm{{CS}}}_{e,k}^*\)
Fig. 2
figure 2

The ratio of Types of agents of singleton coalitions where the number of agent is 14

The influence of the value k is investigated. More precisely, we set \({\tilde{p}}=0\) and compare the runtime for finding an optimal coalition structure by varying the value k. Figure 1 represents the average runtime of all distributions for different k (i.e., \(k=0,1,2,3,4,5\)). As one can see, we observed similar results for all distributions (1)–(6). When the number of agents is small, the influence of k on the runtime is not significant. However, the difference quickly becomes larger when the number of agents increases, e.g., in the case when the number of agents is 14 and the distribution is uniform, the average runtime is 35.4 s for \(k=1\) and 809 s for \(k=5\). The experimental results thus indeed confirmed what was expected, that is, the influence of k on the runtime becomes larger when the number of agents increases. This is the case, because it is necessary to consider \(\varTheta ( {{\vert C\vert }\atopwithdelims (){k}})\) (which is equal to \(\varTheta ({{n}\atopwithdelims (){k}})= \varTheta (n^k)\) in the worst case) summands in Eq. (3) in order to compute the expected value for each coalition C. For instance, in our running example of a service company dispatching interpreters with three employees, when we set \(k=2\), it is required to consider \({3}\atopwithdelims (){0}\) + \({3}\atopwithdelims (){1}\) + \({3}\atopwithdelims (){2}\) = 7 cases to compute the expected value for the coalition \(\{Ali, Bob, Chan\}\). That is, we consider the case where all employees are available, one of them might be absent, and two of them might be absent. However, in case the number of agents increases from 3 to 4, we need to consider \({4}\atopwithdelims (){0}\) + \({4}\atopwithdelims (){1}\) + \({4}\atopwithdelims (){2}\) = 11 possible cases for computing the expected value for \(k=2\).

Next, we look into the number of singleton coalitions in the optimal coalition structure. Table 4 shows the average number of singletons in the optimal coalition structure \({\mathrm{{CS}}}_{e,k}^*\) where \(k \in [0,5]\) for all considered distributions. We can see that the average number of singleton coalitions differs for different distributions. For the Normal case of Cautious \({\mathsf {PCSG}}\) problems, i.e., for \(k=0\), all of the agents form singleton coalitions in the optimum, while this is not the case with other considered distributions. Additionally, the number of singletons formed drastically decreases when the parameter k changes from 0 to 1. For instance, for the Beta distribution it changes from 7.6 to 1.4 when the number of agents is 14. For all distributions we can see that there are few singleton coalitions in the optimal coalition structure when \(k\ne 0\).

Lastly, we analyze which agent types tend to form singleton coalitions in the optimal solution. More specifically, building upon the result of Table 4, we count the number of agents of each types that form a singleton coalition in the optimal coalition structure \({\mathrm{{CS}}}_{e,0}^*\). Figure 2 shows the ratio of each type among the agents that formed singleton coalitions in the optimum, where the number of agents is 14 and \(k=0\). We observed that in most cases the agents with Type5, that is, the agents whose attendance rate is lowest, tend to form singletons in the optimal coalition structure. A notable exception is the Normal case. Since the optimal coalition structure completely consists of singleton coalitions in the Normal case (see Table 4), one cannot see a difference among types.

5.2 Performance of the approximation algorithms

Table 5 Solution quality of \({\mathsf {BAAAT}}\) ((a) and (c)) and \({\mathsf {IBAAAT}}\) ((b) and (d)) with respect to the optimum (a value close to 1 is desirable)
Fig. 3
figure 3

Runtime of \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) for \({\tilde{p}}=0.3\) and \({\tilde{p}}=0.7\), while \(k=1\). The x-axis shows the number of agents and the y-axis represents the computation time in seconds

We set \(k=1\) and test the two algorithms, while using two different values of the parameter \({\tilde{p}}\), 0.3 and 0.7, to see the needed computation time and the quality of the returned solutions with respect to the optima. In Table 5, (a) and (b) show the actual quality of the solution obtained by \({\mathsf {BAAAT}}\), i.e.,

$$\begin{aligned} \frac{V_{e,1}({\mathrm{{CS}}}_{e,1}^*)}{ V_{e,1}({\mathrm{{CS}}}_{e,1}^+)}, \end{aligned}$$
(21)

and \({\mathsf {IBAAAT}}\), i.e.,

$$\begin{aligned} \frac{V_{e,1}({\mathrm{{CS}}}_{e,1}^*)}{V_{e,1}({\mathrm{{CS}}}_{e,1}^{++})} \end{aligned}$$
(22)

in the experiments, respectively. On the other hand, (c) and (d) represent the theoretical a priori worst-case error bound on the solution quality for \({\mathsf {BAAAT}}\), i.e.,

$$\begin{aligned} 1 + \frac{\sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} \left( r_{{\tilde{a}}_i}^{\max } - v_{e,1}({{\tilde{a}}_i})\right) }{ V_{e,1}({\mathrm{{CS}}}_{e,1}^+)} \end{aligned}$$
(23)

which is an upper bound on (21) by Theorem 1, and for \({\mathsf {IBAAAT}}\), i.e.,

$$\begin{aligned} 1 + \frac{\sum _{{\tilde{a}}_i \in \tilde{{\mathcal {A}}}} \left( r_{\tilde{a_i}}^{\max } - v_{e,1}({{\tilde{a}}_i})\right) }{ V_{e,1}({\mathrm{{CS}}}_{e,1}^{++})} \end{aligned}$$
(24)

which is an upper bound on (22) by Theorem 2, respectively. All of (a), (b), (c), (d) are always at least 1.0 and a value close to 1 is desirable.

From the results for (a) and (b), we can see that when \({\tilde{p}}=0.3\), \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) have an observed approximation ratio of less than 1.05 in most cases. For instance, when the number of agents is 16, the result of (a) with the uniform distribution is 1.034, and the result of (b) is 1.019. However, when we set \({\tilde{p}}=0.7\), in most cases we can see that the values obtained by \({\mathsf {BAAAT}}\) are more than 1.3, while the ones of \({\mathsf {IBAAAT}}\) are less than 1.2. For instance, when the number of agents is 12, the results of (a) and (b) with modified uniform distribution are 1.444 and 1.156, respectively. Regarding (c) and (d), we can see that the value (i.e., the a priori bound) increases gradually in all cases as the number of agents increases for both algorithms. For instance, with the uniform distribution the result of (c) for the parameter \({\tilde{p}}=0.3\) increases from 2.013 to 2.806 when the number of agents increases from 6 to 16, and the result of (d) for the parameter \({\tilde{p}}=0.7\) increases from 3.339 to 5.255.

Figure 3 represents the average runtime of \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) for all distributions (1)–(6). The x-axis shows the number of agents and the y-axis represents the average runtime. As one can see, we observed similar results for all distributions.

When the parameter \({\tilde{p}}\) is 0.3 and the number of agents is small, the complete algorithm, \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) can solve the problems very quickly. However, as the number of agents increases, the difference between the complete algorithm and the approximation algorithms becomes significant. For instance, when the number of agents is 16 and the distribution is uniform, the average runtime of \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) is 1.8 s and 2.1 s, respectively, while it is 593.3 s for the complete algorithm. The average runtime of \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) are very similar.

However, when the parameter \({\tilde{p}}\) is 0.7, we can see that the average runtime of \({\mathsf {IBAAAT}}\) increases and the one of \({\mathsf {BAAAT}}\) decreases. For example, with uniform distribution the average runtime of \({\mathsf {IBAAAT}}\) increases from 2.1 to 74.8 s when the parameter \({\tilde{p}}\) increases from 0.3 to 0.7, while the average runtime of \({\mathsf {BAAAT}}\) decreases from 1.8 to 0.1 s.

In total, the experimental results are not very surprising but instead confirm and make precise the natural intuition of what kind of impact on the performance should changing some of the parameters of our model and the algorithms’ parameter \({\tilde{p}}\) have. In summary, they reveal that (1) when the parameter \({\tilde{p}}\) is set to 0.3, the approximation ratio of \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) is less than 1.05 in most cases, and (2) when the parameter \({\tilde{p}}\) increases, \({\mathsf {BAAAT}}\) can solve the problems faster than \({\mathsf {IBAAAT}}\), while \({\mathsf {IBAAAT}}\) has a better solution quality than \({\mathsf {BAAAT}}\). Therefore, the user can decide how to set the paremeter \({\tilde{p}}\) according to his preference and if most of the agents have participation rates that are higher than \({\tilde{p}}\), both algorithms have a very similar computation time, but the solution returned by \({\mathsf {IBAAAT}}\) is always at least as good as the solution returned by \({\mathsf {BAAAT}}\). Furthermore, if many agents have participation probabilities smaller or equal to \({\tilde{p}}\), the user chooses which algorithm to use according to whether his priority is to get an approximately optimal solution fast or he is more interested in the solution quality being as good as possible.

6 Related work

Our work is most closely related to the probabilistic \({\mathsf {CSG}}\) model introduced in [26]. Our framework includes the two variants of probabilistic \({\mathsf {CSG}}\) from [26], namely Cautious and Flexible \({\mathsf {PCSG}}\), as special cases for values of parameter \(k=0\) and \(k=n-1\), respectively. For the experiments, however, we restrict our attention to a limited number of attendance types, while in [26] there is no limitation placed on the number of types.

In coalition formation, which also includes \({\mathsf {CSG}}\), many works have been devoted to the uncertainty of forming a coalition. Chalkiadakis and Boutilier [3] focused on the uncertainty of the types (capabilities) of the agents, and proposed a Bayesian reinforcement learning framework for repeated coalition formation under type uncertainty. In this framework, the agents maintain and update beliefs about the types of others through the experience gained by repeated interaction, and through this process improve their ability to form useful coalitions. Compared to this work, we rather focus on the attendance type (i.e., the uncertainty of agents’ attendances), which is different from capability uncertainty. Also, the coalition values depend on the capabilities and the actions in the aforementioned work, while we compute them with the payoffs given by the characteristic function and the attendance rate of the attendance type.

Kraus et al. [11] worked on coalition formation under coalitional value uncertainty. In this framework, a set of tasks is given, and each task is performed by a different agent. The agents do not know the value of a task of another agent or the cost of performing it, but they know the overall payoff associated with performing a set of tasks and the capabilities of other agents. Faye et al. [8] worked on dynamic coalition formation in dynamic uncertain environments. This work investigates dynamic, uncertain environments in which tasks may evolve during execution, and agents and resource availability may vary rapidly and unpredictably. None of those works actually considers the attendance rate of each agent. Also, how the coalition values are computed in \({\mathsf {PCSG}}\) makes our model quite different from previous work.

Moreover, related to our work is the Team Formation Problem (\(\mathsf {TF}\)) [15, 32]. \(\mathsf {TF}\) is the problem of forming the best possible team to perform some tasks of interest, given limited resources. Nair and Tambe [15] worked on forming a team with the maximum expected value, under the constraint that it has all the required skills to accomplish the tasks of interest. To give a comparison between the \(\mathsf {TF}\) problem and the \({\mathsf {CSG}}\) problem, it is useful to keep in mind that \({\mathsf {CSG}}\) (and \({\mathsf {PCSG}}\)) is similar to the complete set partition problem [33], while \(\mathsf {TF}\) is equivalent to the set cover problem [10]. Okimoto et al. [16, 17] worked on the robustness issue in team formation problems. In these papers, a set of agents and a set of tasks are given, and the aim is to form a team which is robust, i.e., which can achieve the given tasks even if some agents break down. The parameter k in \({\mathsf {PCSG}}\) (i.e., the number of agents who may be absent from a coalition) has a somewhat similar role as the robustness considered in team formation.

In traditional \({\mathsf {CSG}}\), since the value of each coalition is assumed to be given by a characteristic function, the representation size is exponential in the number of agents [24, 25]. In order to solve this problem, several compact representation schemes for characteristic functions have been proposed [4, 9, 29, 31], e.g., concise representation scheme based on agent types [31], marginal contribution nets (MC-nets) [9] and synergy coalition groups (SCGs) [4]. The attendance type used in this paper is based on the idea introduced in [31], where the authors consider a situation where multiple agents have similar capabilities/skills called recognizable types, and the number of possible types is small. In our work, we use this idea for the attendance rate of each agent.

7 Conclusion

How to form a coalition is a major issue for many applications related to multi-agent cooperation. Coalition Structure Generation (\({\mathsf {CSG}}\)) involves partitioning a set of agents into coalitions such that the social surplus is maximized. Probabilistic Coalition Structure Generation (\({\mathsf {PCSG}}\)) is an extension of \({\mathsf {CSG}}\) where the aim is to find an optimal coalition structure that maximizes the sum of the expected values of all coalitions. The contributions of this paper are as follows:

  • A formal framework for the Probabilistic Coalition Structure Generation (\({\mathsf {PCSG}}\)) is introduced where the attendance type of each agent is considered. This framework is a generalization of the framework introduced in [26], namely it includes the Cautious and Flexible \({\mathsf {PCSG}}\) treated in [26] as special cases.

  • Approximation algorithms for solving the \({\mathsf {PCSG}}\) problem called Bounded Approximation Algorithm based on Attendance Types (\({\mathsf {BAAAT}}\)) and Involved Bounded Approximation Algorithm based on Attendance Types (\({\mathsf {IBAAAT}}\)) are presented. The characteristics of these algorithms are as follows: (1) there is an upper bound on the error of the returned solution and this bound can be obtained a priori, and (2) one can use any complete algorithm for solving to optimality the relaxed problems in Phase 2 of both algorithms.

  • The performances of \({\mathsf {BAAAT}}\) and \({\mathsf {IBAAAT}}\) are evaluated on a number of benchmarks. Our experimental results revealed that (1) both algorithms can solve the \({\mathsf {PCSG}}\) problems very quickly and provide high solution quality when the parameter \({\tilde{p}}\) is low, and (2) we can see a trade-off between these algorithms by varying the parameter \({\tilde{p}}\), i.e., \({\mathsf {BAAAT}}\) can solve the problems faster than \({\mathsf {IBAAAT}}\) for larger \({\tilde{p}}\), while \({\mathsf {IBAAAT}}\) has the better solution quality.

Regarding future work, a potential direction is investigating concise representations of the characteristic function in \({\mathsf {PCSG}}\). Since the number of coalitions is exponential in the number of agents, it is reasonable to try to reduce the representation size of the characteristic function which provides the value of each coalition. One could try to apply the existing concise representations of the characteristic function from [4, 9, 29, 31] in our framework, and then try to solve large-scale problem instances.

Furthermore, what could be of interest is applying our \({\mathsf {PCSG}}\) framework to real-world problems such as the nurse scheduling problem (NSP) [2] and the distributed vehicle routing problem [25]. Forming effective working groups by considering the nurses’s attendances, amounts to solving the \({\mathsf {PCSG}}\) problem. Also, drivers’ attendances in the vehicle routing problem can be represented in the \({\mathsf {PCSG}}\) framework.

Lastly, our framework could potentially be extended to a dynamic setting in which the set of agents \({\mathcal {A}}\) may change over time. The objective here would be to apply such an extended framework to the distributed robot team reconfiguration problem [6].