Performance Analysis of Group Recommendation Systems in TV Domains
- Author: Kim Noo-ri, Lee Jee-Hyong
- Publish: International Journal of Fuzzy Logic and Intelligent Systems Volume 15, Issue1, p45~52, 25 March 2015
Although researchers have proposed various recommendation systems, most recommendation approaches are for single users and there are only a small number of recommendation approaches for groups. However, TV programs or movies are most often viewed by groups rather than by single users. Most recommendation approaches for groups assume that single users’ profiles are known and that group profiles consist of the single users’ profiles. However, because it is difficult to obtain group profiles, researchers have only used synthetic or limited datasets. In this paper, we report on various group recommendation approaches to a real large-scale dataset in a TV domain, and evaluate the various group recommendation approaches. In addition, we provide some guidelines for group recommendation systems, focusing on home group users in a TV domain.
Group recommendation system , Group modeling , Home group , TV domain
In various domains, such as movies, music, books, and TV, the amount of content is increasing rapidly. Although this vast content provides a wide range of choices to users, they often have difficulty with choosing or finding what they want to consume due precisely to this huge amount of content. For this reason, researchers have proposed various recommendation systems for providing convenience in choice [1-7] and these recommendation systems are successful in various services, such as movies, music, and e-commerce . Some services are for single users and some for group users. For example, e-commerce services are usually for single users, and movies or TV programs are for single or group users. However, most recommendation approaches are designed for single users and there are only a small number of approaches for group users.
General group recommendation systems need to model the preference of groups for the recommendation. There are two main approaches. The first is the group profile-based approach. If there is a group of users
A, B, and C, the system collects the list of items consumed by the group and recommends items based on the profile . Because it is difficult to obtain profiles of groups, the other approach is based on the profiles of individual group members. It is a consensus function-based approach [10-12] that creates the pseudo-preferences of a group by merging the preference of each member in the group using consensus functions.
Because group profile-based approaches are very similar to recommendations for single users, most approaches for single users can be directly applied and do not have many variants. On the other hand, many consensus function-based approaches have been proposed. McCarthy and Anagnost  proposed a group preference arbitration system. It merged group member preference into a preference and recommended pieces of music that the group would like to listen to. Boratto and Carta  proposed how to make synthetic virtual group users by using clustering and then applied several consensus functions to a movie domain. Baltrunas et al.  proposed a group recommendation system using rank aggregation. In order to build preferences of group users, it used ranks instead of ratings. This method was applied to a movie domain . Gorla et al.  proposed a probabilistic group recommendation via information matching. It made a group user preference model from each member’s preference, based on probability. In addition, it applied to a movie domain.
Researchers evaluate and verify group recommendation approaches, however, with synthetic or limited datasets [8-16] because real large-scale datasets of group profiles are rare. Regarding design group recommendation systems, we need to consider group and domain characteristics simultaneously. Group user characteristics may change, depending on domain characteristics . For example, we could ignore minor opinions in a group on casual domains, such as movies or TV programs.
In addition, to provide recommendation in many domains, we need to consider making user profiles use general recommendation methods. The general recommendation methods use the users’ profiles, such as rating. In many domains, however, it is difficult to collect the user rating information because of lack of interaction or additional cost. Therefore, we need to consider basing the user profiles on consumed histories.
In this paper, we make users’ profiles from users’ consumed histories and analyze the effectiveness of group recommendation systems on real large-scale datasets regarding TV domain. In addition, we provide some guides for group recommendation systems, focusing on home group users in TV domains. We have organized the remainder of this paper as follows. Section 2 overviews methods for group modeling. Section 3 describes real-world datasets and pre-processing. Section 4 presents experimental results and Section 5 provides conclusions.
To suggest items to a single user, most recommendation systems adopt a simple approach based on user profiles, such as ratings on items. For example, if a target user has high ratings on the items
A, B, and C, the recommendation system suggests similar items to A, B, and C, or items popular to users who like A, Band C.
Unlike the recommendation system for single users, the group recommendation system needs consideration of the group profile and the group members’ profile together.
There are two major approaches for group recommendations: the group profile-based approach and the consensus function-based approach. Each approach uses only one type of profiles, group profiles or group member profiles. The following paragraphs provide details on this approach.
The group user approach makes group profiles from group histories only. In other words, this approach treats a group as a single user. For example, there is a group user
g1 that is composed of u1, u2, and u3. If this group watches the TV contents c1, c2, c3, c4, c5, and c6 and rates 3, 4, 2, 5, 1, and 2 together, these ratings are threaded to a group profile, such as < 3, 4, 2, 5, 1, 2 >. This approach makes the group profile for the group recommendation system. However, if group histories are insufficient to use or cannot be collected, this approach cannot be used.
In the case where group profiles are hard to obtain but group members’ profiles are available, the consensus function approach can be applicable. These approaches make a group profile from group members’ profiles by using consensus functions. The consensus functions merge group members’ profiles into a pseudo-group profile and the profile is used to recommend. There are various consensus function strategies, such as least misery, most pleasure, and average. These consensus functions imitate decision making processes in the group . The pseudo-group profiles that are made by each consensus function have difference characteristics, leading to different performances of the group recommendation system. Let us assume that there are group members’ profiles as follows:
2.2.1 Least misery strategy
The least misery strategy (
Min) makes a pseudo-group profile by selecting a minimum rating on each item from the group members’ profiles. Therefore, group g1 is made as follows:
The recommendation system could avoid suggesting TV programs that are similar to
c1 and c6, because g1’s profile shows the lowest ratings on c1 and c6. This strategy leads the recommendation system to avoid dissatisfaction for group members. However, this strategy could ignore satisfaction from other members.
2.2.2 Most pleasure strategy
The most pleasure strategy (
Max) makes a pseudo-group profile by selecting a maximum rating on each item from the group members’ profiles. The following vector shows g1’s profile:
The recommendation system could suggest TV content, such as TV programs similar to
c2 and c4, because g1’s profile shows the highest ratings on c2 and c4. This strategy supposes that the most pleasure user determines the group users’ opinions. However, this strategy could ignore dissatisfaction from other members.
2.2.3 Average strategy
The average strategy (
Avg) makes a pseudo-group profile by selecting the average rating on each item from group members’ profiles. The following vector shows g1’s profile:
The recommendation system could suggest TV content, such as items similar to
c4, because g1’s preference model shows the highest rating on c4. This strategy supposes that all group members have the same influence to determine the group’s opinion. However, if group members have dissimilar preferences, this strategy is not suitable.
In the case where both group profiles and group members’ profiles are available, the consensus function with group profile based approaches can be applicable. The approach makes pseudo-group profiles from group profiles and group members’ profiles. This approach is an extension of the previously mentioned consensus function-based approach. Let us assume that there are a group profile and group members’ profiles as follows:
g1’s pseudo-group profiles are as follows:
This approach considers group preferences and group members’ preferences simultaneously. However, this approach could be suitable when we know the agreement process among the group members and select an appropriate consensus function.
In order to make group profiles, we should consider when we apply the collaborative filtering to predict unknown ratings . To make group profiles, we can apply the collaborative filtering before or after applying the consensus function-based approach and the consensus function with group profile-based approach. If a collaborative filtering is applied first, recommendation lists for each group members are generated and then the recommendation list for the group is generated by a consensus based approach. Therefore, the recommendation list for the group may reflect more each group member’s preference. If a consensus function-based approach is applied first, the pseudo-group profiles are made from the group members’ profiles by a consensus based approach, and then the recommendation list for the group is generated by applying a collaborative filtering to the pseudo-group profile. Therefore, this approach may reflect more of the group preference, or the consensus between group members.
In order to analyze the performance of the group recommendation systems, we use the TNmS dataset. TNmS is the company that collects users’ watching histories and provides program rating information. To collects the users’ watching histories, TNmS uses the devices (peoplemeter) that are set on the panel homes. The TNmS dataset consists of users’ watching histories from 1,000 homes in Korea between 1/1/2013 and 6/30/2013. The number of total personal users is 3,318, with total TV content at 107,570. This TV content is on four public channels (KBS1, KBS2, MBC, and SBS). If a user watches TV content for less than 15 minutes, it is regarded as an invalid view and ignored.
In order to apply the previously mentioned approaches, ratings on the TV program should be made from watching histories. We use the following equation.
In Eq. (7),
r u, U Pnis a rating on the unique TV program U Pnby a user u. For example, if a user watches TV programs in a unique TV program repeatedly, the rating on the unique TV programs is higher. The number of TV programs and unique TV programs are 21,480 and 1,958, respectively. In other words, each user has ratings vector that consists of 1,958 dimensions.
We experiment with five types of group recommendation systems. The first is the group profile-based approach (
GP). The second is the collaborative filtering-consensus approaches ( CF- Con), which applies collaborative filtering to individual members and makes a consensus of the individual group members. In CF- Con, there are three approaches with different consensus strategies: CF- Min, CF- Max, and CF- Avg. The third is the collaborative filtering-consensus with group profile approach ( CF- Con/ G), which applies collaborative filtering to the individual members and applies a consensus function to the individual members including the group profile. Also, there are three approaches with different consensus strategies: CF- Min/ G, CF- Max/ G, and CF- Avg/ G. The fourth is the consensus-collaborative filtering approach ( Con- CF), which makes a consensus of the individual members and applies collaborative filtering. There are three approaches with different consensus strategies: Min- CF, Max- CF, and Avg- CF. The last is the consensus with group profile-collaborative filtering approach ( Con/ G- CF), which applies a consensus function to the individual members, including the group profile, and applies collaborative filtering. There are three approaches: Min/ G- CF, Max/ G- CF, and Avg/ G- CF.
We use the user-based collaborative filtering to predict ratings. In collaborative filtering, we use the Pearson correlation coefficient to calculate similarity between users. It is as follows:
In Eq. (8),
r u1, iis a rating of TV program iby user u1 and is an average of ratings by user u1. The value of similarity is from -1 to 1. If user u1 and u2 have more similar ratings, the similarity is closer to 1. In order to predict the unknown ratings, we used the adjusted weighted aggregation. It is as follows:
In Eq. (9), is the predicted rating on item
iby user u. KNNuis a set of users similar to the user u. Therefore, is predicted from the average of user u’s ratings and similar users’ ratings. We set the neighborhood size to 50.
The dataset is divided into a training set and a testing set. The training set is first the 80% watching histories and the other 20% of watching histories were used as the testing set. The number of TV programs in the testing set is 133,383.
The recommendation process is as follows. If there is a group user with a test item, the recommendation system finds the existing ratings or predicts the unknown ratings on available TV programs by using the group user’s profile or the pseudo-group profile. A TV program with the highest rating is then offered to the group user.
In order to analyze performances of the group recommendation systems, we use precision as a metric. It is defined as follows:
The number of correctly recommended TV programs is the number of programs viewed by users in the recommended TV programs. If the value of the precision is higher, it means that the group recommendation system has a higher performance.
In this section, we analyze the experimental results in two aspects. The first is the overall comparison of performance between each group recommendation system. The second is the performance depending on history sizes.
4.3.1 Overall comparison of performance
Table 2 shows the performance of various group recommendation approaches. First,
GPshows a better performance than traditional consensus function-based approaches: CF- Min, CF- Max, CF- Avg, Min- CF, Max- CF, and Avg- CF. It implies that group user preferences built from group user profiles are more valuable than those built by merging individual group member profiles. However, most CF- Con/ Gapproaches, CF- Max/ G, CF- Avg/ G, Max/ G- CF, and Avg/ G- CF, show better performances than GP. That is, the best performances were achieved by using both the preference of groups and the preferences of individual group members, implying that the preferences of individual group members are valuable even though the preferences of groups are available. We cannot find significant differences between CF- Conapproaches and Con- CFapproaches. The order of collaborative filtering and consensus function may not make a big difference in recommendation. However, the strategies for consensus show different performances. Max and Avg strategies show a better performance than the Min strategy, and thus are more suitable for home group users in TV domains.
4.3.2 Comparison of performance depending on history size
Figures 1-3 show performances of various group recommendation systems depending on the history sizes of the groups. We do not compare with the experiment results of
Con- CFapproaches, because their performances are about the same as those of CF- Conapproaches. The horizontal axis indicates the group history sizes and the vertical axis indicates the precision.
First, Figure 1 shows the performance of
GPand CF- Conapproaches. The performance of GPincreases along with the group history size. It shows that the more histories a group has, the better performance the recommendation system has. In the CF- Conapproaches, it is difficult to find a strong relation between the performances and history sizes, because they do not directly use group histories.
When a group has more than 300 histories, the performance of
GPis better than that of CF- Conapproaches. When a group has fewer than 300 histories, CF- Conapproaches with Max and Avg strategies show better performances than GP. Especially when a group has fewer than 25 histories, the performance of CF- Maxand CF- Avgare almost twice of that of GP. Therefore, if a group has fewer than 300 histories, CF- Conapproaches may be more suitable to the group recommendation systems. In CF- Conapproaches, Min strategy shows the worst performance. It means that most home group users choose TV programs that give more pleasure to group members.
Second, Figure 2 shows the performance of
GPand CF- Con/ Gapproaches. CF- Max/ Gand CF- Avg/ Gshow better performances than GPfor any history sizes, unlike CF- Conapproaches. In other words, using group preferences is helpful to improve the performance of CF- Conapproaches. Because CF- Con/ Gapproaches use group preferences, their performance increases as the history size increases.
Last, Figure 3 shows the comparison of the
bests, that is, the comparison of GP, CF- Avg, and CF- Avg/G. CF- Avgand CF- Avg/ Gshow the best performance in CF- Conand CF- Con/ Gapproaches, respectively. For all the histories, CF- Avg/ Gshows the best performance. The gap between CF- Avgand CF- Avg/ Gincreases more as the history size increases. In other words, if a group has more histories, CF- Con/ Gapproaches can provide better performances.
These results tell us which approach is better depending on available information. If the system cannot use group profiles,
CF- Avgis better. If the system can use group profiles, however, CF- Avg/ Gis better. In summary, for group recommendation system design, we should consider availability of group profiles.
In this study, we evaluated various group recommendation approaches with a real-world dataset, and provided some guidelines for group recommendation systems, focusing on home group users in TV domains.
We evaluated the five types of approaches:
GP, CF- Con, CF- Con/ G, Con- CF, and Con- CF/ G. The experimental results showed characteristics of each approach.
The performance of the
GPincreases along with the group history size and when a group has more than 300 histories, it is better than the performance of the consensus function-based approaches. In the consensus function with a GP, Max/ Gand Avg/ Gshow the best performance for all the histories. The order of collaborative filtering and consensus function may not make a big difference on home group users in TV domains. If a recommendation system cannot use group profiles, Maxand Avgare more suitable than other approaches. If the recommendation system can use group profiles or a group has more than sufficient history for recommendation, however, Max/ Gand Avg/ Gare more suitable.
For a better recommendation, we may use profiles of groups with preferences that are similar to those of target groups. In future work, we will extend group recommendation approaches with similar group profiles and more various consensus functions.
[Table 1.] Dataset information
[Table 2.] Overall performance comparison
[Figure 1.] Performance comparison between GP and CF-Con approaches. GP, group profile; CF-Con, collaborative filtering-consensus.
[Figure 2.] Performance comparision between GP and CF-Con/G approaches. GP, group profile; CF-Con/G, collaborative filtering-consensus with group profile.
[Figure 3.] Performance comparision between GP, CF-Avg, and CF-Avg/G approaches. GP, group profile; CF, collaborative filtering.