검색 전체 메뉴
PDF
맨 위로
OA 학술지
Performance Analysis of Group Recommendation Systems in TV Domains
  • 비영리 CC BY-NC
  • 비영리 CC BY-NC
ABSTRACT

Although researchers have proposed various recommendation systems, most recommendation approaches are for single users and there are only a small number of recommendation approaches for groups. However, TV programs or movies are most often viewed by groups rather than by single users. Most recommendation approaches for groups assume that single users’ profiles are known and that group profiles consist of the single users’ profiles. However, because it is difficult to obtain group profiles, researchers have only used synthetic or limited datasets. In this paper, we report on various group recommendation approaches to a real large-scale dataset in a TV domain, and evaluate the various group recommendation approaches. In addition, we provide some guidelines for group recommendation systems, focusing on home group users in a TV domain.


KEYWORD
Group recommendation system , Group modeling , Home group , TV domain
  • 1. Introduction

    In various domains, such as movies, music, books, and TV, the amount of content is increasing rapidly. Although this vast content provides a wide range of choices to users, they often have difficulty with choosing or finding what they want to consume due precisely to this huge amount of content. For this reason, researchers have proposed various recommendation systems for providing convenience in choice [1-7] and these recommendation systems are successful in various services, such as movies, music, and e-commerce [8]. Some services are for single users and some for group users. For example, e-commerce services are usually for single users, and movies or TV programs are for single or group users. However, most recommendation approaches are designed for single users and there are only a small number of approaches for group users.

    General group recommendation systems need to model the preference of groups for the recommendation. There are two main approaches. The first is the group profile-based approach. If there is a group of users A, B, and C, the system collects the list of items consumed by the group and recommends items based on the profile [9]. Because it is difficult to obtain profiles of groups, the other approach is based on the profiles of individual group members. It is a consensus function-based approach [10-12] that creates the pseudo-preferences of a group by merging the preference of each member in the group using consensus functions.

    Because group profile-based approaches are very similar to recommendations for single users, most approaches for single users can be directly applied and do not have many variants. On the other hand, many consensus function-based approaches have been proposed. McCarthy and Anagnost [8] proposed a group preference arbitration system. It merged group member preference into a preference and recommended pieces of music that the group would like to listen to. Boratto and Carta [10] proposed how to make synthetic virtual group users by using clustering and then applied several consensus functions to a movie domain. Baltrunas et al. [11] proposed a group recommendation system using rank aggregation. In order to build preferences of group users, it used ranks instead of ratings. This method was applied to a movie domain [11]. Gorla et al. [13] proposed a probabilistic group recommendation via information matching. It made a group user preference model from each member’s preference, based on probability. In addition, it applied to a movie domain.

    Researchers evaluate and verify group recommendation approaches, however, with synthetic or limited datasets [8-16] because real large-scale datasets of group profiles are rare. Regarding design group recommendation systems, we need to consider group and domain characteristics simultaneously. Group user characteristics may change, depending on domain characteristics [12]. For example, we could ignore minor opinions in a group on casual domains, such as movies or TV programs.

    In addition, to provide recommendation in many domains, we need to consider making user profiles use general recommendation methods. The general recommendation methods use the users’ profiles, such as rating. In many domains, however, it is difficult to collect the user rating information because of lack of interaction or additional cost. Therefore, we need to consider basing the user profiles on consumed histories.

    In this paper, we make users’ profiles from users’ consumed histories and analyze the effectiveness of group recommendation systems on real large-scale datasets regarding TV domain. In addition, we provide some guides for group recommendation systems, focusing on home group users in TV domains. We have organized the remainder of this paper as follows. Section 2 overviews methods for group modeling. Section 3 describes real-world datasets and pre-processing. Section 4 presents experimental results and Section 5 provides conclusions.

    2. Group Modeling

    To suggest items to a single user, most recommendation systems adopt a simple approach based on user profiles, such as ratings on items. For example, if a target user has high ratings on the items A, B, and C, the recommendation system suggests similar items to A, B, and C, or items popular to users who like A, B and C.

    Unlike the recommendation system for single users, the group recommendation system needs consideration of the group profile and the group members’ profile together.

    There are two major approaches for group recommendations: the group profile-based approach and the consensus function-based approach. Each approach uses only one type of profiles, group profiles or group member profiles. The following paragraphs provide details on this approach.

       2.1 Group Profile-Based Approach

    The group user approach makes group profiles from group histories only. In other words, this approach treats a group as a single user. For example, there is a group user g1 that is composed of u1, u2, and u3. If this group watches the TV contents c1, c2, c3, c4, c5, and c6 and rates 3, 4, 2, 5, 1, and 2 together, these ratings are threaded to a group profile, such as < 3, 4, 2, 5, 1, 2 >. This approach makes the group profile for the group recommendation system. However, if group histories are insufficient to use or cannot be collected, this approach cannot be used.

       2.2 Consensus Function-Based Approach

    In the case where group profiles are hard to obtain but group members’ profiles are available, the consensus function approach can be applicable. These approaches make a group profile from group members’ profiles by using consensus functions. The consensus functions merge group members’ profiles into a pseudo-group profile and the profile is used to recommend. There are various consensus function strategies, such as least misery, most pleasure, and average. These consensus functions imitate decision making processes in the group [9]. The pseudo-group profiles that are made by each consensus function have difference characteristics, leading to different performances of the group recommendation system. Let us assume that there are group members’ profiles as follows:

    image

    2.2.1 Least misery strategy

    The least misery strategy (Min) makes a pseudo-group profile by selecting a minimum rating on each item from the group members’ profiles. Therefore, group g1 is made as follows:

    image

    The recommendation system could avoid suggesting TV programs that are similar to c1 and c6, because g1’s profile shows the lowest ratings on c1 and c6. This strategy leads the recommendation system to avoid dissatisfaction for group members. However, this strategy could ignore satisfaction from other members.

    2.2.2 Most pleasure strategy

    The most pleasure strategy (Max) makes a pseudo-group profile by selecting a maximum rating on each item from the group members’ profiles. The following vector shows g1’s profile:

    image

    The recommendation system could suggest TV content, such as TV programs similar to c2 and c4, because g1’s profile shows the highest ratings on c2 and c4. This strategy supposes that the most pleasure user determines the group users’ opinions. However, this strategy could ignore dissatisfaction from other members.

    2.2.3 Average strategy

    The average strategy (Avg) makes a pseudo-group profile by selecting the average rating on each item from group members’ profiles. The following vector shows g1’s profile:

    image

    The recommendation system could suggest TV content, such as items similar to c4, because g1’s preference model shows the highest rating on c4. This strategy supposes that all group members have the same influence to determine the group’s opinion. However, if group members have dissimilar preferences, this strategy is not suitable.

       2.3 Consensus Function with Group Profile-Based Approach

    In the case where both group profiles and group members’ profiles are available, the consensus function with group profile based approaches can be applicable. The approach makes pseudo-group profiles from group profiles and group members’ profiles. This approach is an extension of the previously mentioned consensus function-based approach. Let us assume that there are a group profile and group members’ profiles as follows:

    image

    Then, g1’s pseudo-group profiles are as follows:

    image

    This approach considers group preferences and group members’ preferences simultaneously. However, this approach could be suitable when we know the agreement process among the group members and select an appropriate consensus function.

    In order to make group profiles, we should consider when we apply the collaborative filtering to predict unknown ratings [12]. To make group profiles, we can apply the collaborative filtering before or after applying the consensus function-based approach and the consensus function with group profile-based approach. If a collaborative filtering is applied first, recommendation lists for each group members are generated and then the recommendation list for the group is generated by a consensus based approach. Therefore, the recommendation list for the group may reflect more each group member’s preference. If a consensus function-based approach is applied first, the pseudo-group profiles are made from the group members’ profiles by a consensus based approach, and then the recommendation list for the group is generated by applying a collaborative filtering to the pseudo-group profile. Therefore, this approach may reflect more of the group preference, or the consensus between group members.

    3. Dataset

    In order to analyze the performance of the group recommendation systems, we use the TNmS dataset. TNmS is the company that collects users’ watching histories and provides program rating information. To collects the users’ watching histories, TNmS uses the devices (peoplemeter) that are set on the panel homes. The TNmS dataset consists of users’ watching histories from 1,000 homes in Korea between 1/1/2013 and 6/30/2013. The number of total personal users is 3,318, with total TV content at 107,570. This TV content is on four public channels (KBS1, KBS2, MBC, and SBS). If a user watches TV content for less than 15 minutes, it is regarded as an invalid view and ignored.

    Table 1 shows information of the dataset. In Table 1, groups have fewer watching histories than single users. Furthermore, about 50% of groups have fewer than 25 histories.

    [Table 1.] Dataset information

    label

    Dataset information

    In order to apply the previously mentioned approaches, ratings on the TV program should be made from watching histories. We use the following equation.

    image

    In Eq. (7), ru,U Pn is a rating on the unique TV program U Pn by a user u. For example, if a user watches TV programs in a unique TV program repeatedly, the rating on the unique TV programs is higher. The number of TV programs and unique TV programs are 21,480 and 1,958, respectively. In other words, each user has ratings vector that consists of 1,958 dimensions.

    4. Experiment

       4.1 Experiment Design

    We experiment with five types of group recommendation systems. The first is the group profile-based approach (GP). The second is the collaborative filtering-consensus approaches (CF-Con), which applies collaborative filtering to individual members and makes a consensus of the individual group members. In CF-Con, there are three approaches with different consensus strategies: CF-Min, CF-Max, and CF-Avg. The third is the collaborative filtering-consensus with group profile approach (CF-Con/G), which applies collaborative filtering to the individual members and applies a consensus function to the individual members including the group profile. Also, there are three approaches with different consensus strategies: CF-Min/G, CF-Max/G, and CF-Avg/G. The fourth is the consensus-collaborative filtering approach (Con-CF), which makes a consensus of the individual members and applies collaborative filtering. There are three approaches with different consensus strategies: Min-CF, Max-CF, and Avg-CF. The last is the consensus with group profile-collaborative filtering approach (Con/G-CF), which applies a consensus function to the individual members, including the group profile, and applies collaborative filtering. There are three approaches: Min/G-CF, Max/G-CF, and Avg/G-CF.

    We use the user-based collaborative filtering to predict ratings. In collaborative filtering, we use the Pearson correlation coefficient to calculate similarity between users. It is as follows:

    image

    In Eq. (8), ru1,i is a rating of TV program i by user u1 and is an average of ratings by user u1. The value of similarity is from -1 to 1. If user u1 and u2 have more similar ratings, the similarity is closer to 1. In order to predict the unknown ratings, we used the adjusted weighted aggregation. It is as follows:

    image

    In Eq. (9), is the predicted rating on item i by user u. KNNu is a set of users similar to the user u. Therefore, is predicted from the average of user u’s ratings and similar users’ ratings. We set the neighborhood size to 50.

    The dataset is divided into a training set and a testing set. The training set is first the 80% watching histories and the other 20% of watching histories were used as the testing set. The number of TV programs in the testing set is 133,383.

    The recommendation process is as follows. If there is a group user with a test item, the recommendation system finds the existing ratings or predicts the unknown ratings on available TV programs by using the group user’s profile or the pseudo-group profile. A TV program with the highest rating is then offered to the group user.

       4.2 Metric

    In order to analyze performances of the group recommendation systems, we use precision as a metric. It is defined as follows:

    image

    The number of correctly recommended TV programs is the number of programs viewed by users in the recommended TV programs. If the value of the precision is higher, it means that the group recommendation system has a higher performance.

       4.3 Experiment Result

    In this section, we analyze the experimental results in two aspects. The first is the overall comparison of performance between each group recommendation system. The second is the performance depending on history sizes.

    4.3.1 Overall comparison of performance

    Table 2 shows the performance of various group recommendation approaches. First, GP shows a better performance than traditional consensus function-based approaches: CF-Min, CF-Max, CF-Avg, Min-CF, Max-CF, and Avg-CF. It implies that group user preferences built from group user profiles are more valuable than those built by merging individual group member profiles. However, most CF-Con/G approaches, CF-Max/G, CF-Avg/G, Max/G-CF, and Avg/G-CF, show better performances than GP. That is, the best performances were achieved by using both the preference of groups and the preferences of individual group members, implying that the preferences of individual group members are valuable even though the preferences of groups are available. We cannot find significant differences between CF-Con approaches and Con-CF approaches. The order of collaborative filtering and consensus function may not make a big difference in recommendation. However, the strategies for consensus show different performances. Max and Avg strategies show a better performance than the Min strategy, and thus are more suitable for home group users in TV domains.

    [Table 2.] Overall performance comparison

    label

    Overall performance comparison

    4.3.2 Comparison of performance depending on history size

    Figures 1-3 show performances of various group recommendation systems depending on the history sizes of the groups. We do not compare with the experiment results of Con-CF approaches, because their performances are about the same as those of CF-Con approaches. The horizontal axis indicates the group history sizes and the vertical axis indicates the precision.

    First, Figure 1 shows the performance of GP and CF-Con approaches. The performance of GP increases along with the group history size. It shows that the more histories a group has, the better performance the recommendation system has. In the CF-Con approaches, it is difficult to find a strong relation between the performances and history sizes, because they do not directly use group histories.

    When a group has more than 300 histories, the performance of GP is better than that of CF-Con approaches. When a group has fewer than 300 histories, CF-Con approaches with Max and Avg strategies show better performances than GP. Especially when a group has fewer than 25 histories, the performance of CF-Max and CF-Avg are almost twice of that of GP. Therefore, if a group has fewer than 300 histories, CF-Con approaches may be more suitable to the group recommendation systems. In CF-Con approaches, Min strategy shows the worst performance. It means that most home group users choose TV programs that give more pleasure to group members.

    Second, Figure 2 shows the performance of GP and CF-Con/G approaches. CF-Max/G and CF-Avg/G show better performances than GP for any history sizes, unlike CF-Con approaches. In other words, using group preferences is helpful to improve the performance of CF-Con approaches. Because CF-Con/G approaches use group preferences, their performance increases as the history size increases.

    Last, Figure 3 shows the comparison of the bests, that is, the comparison of GP, CF-Avg, and CF-Avg/G. CF-Avg and CF-Avg/G show the best performance in CF-Con and CF-Con/G approaches, respectively. For all the histories, CF-Avg/G shows the best performance. The gap between CF-Avg and CF-Avg/G increases more as the history size increases. In other words, if a group has more histories, CF-Con/G approaches can provide better performances.

    These results tell us which approach is better depending on available information. If the system cannot use group profiles, CF-Avg is better. If the system can use group profiles, however, CF-Avg/G is better. In summary, for group recommendation system design, we should consider availability of group profiles.

    5. Conclusions

    In this study, we evaluated various group recommendation approaches with a real-world dataset, and provided some guidelines for group recommendation systems, focusing on home group users in TV domains.

    We evaluated the five types of approaches: GP, CF-Con, CF-Con/G, Con-CF, and Con-CF/G. The experimental results showed characteristics of each approach.

    The performance of the GP increases along with the group history size and when a group has more than 300 histories, it is better than the performance of the consensus function-based approaches. In the consensus function with a GP, Max/G and Avg/G show the best performance for all the histories. The order of collaborative filtering and consensus function may not make a big difference on home group users in TV domains. If a recommendation system cannot use group profiles, Max and Avg are more suitable than other approaches. If the recommendation system can use group profiles or a group has more than sufficient history for recommendation, however, Max/G and Avg/G are more suitable.

    For a better recommendation, we may use profiles of groups with preferences that are similar to those of target groups. In future work, we will extend group recommendation approaches with similar group profiles and more various consensus functions.

참고문헌
  • 1. Bobadilla J., Serradilla F., Bernal J. 2010 “A new collaborative filtering metric that improves the behavior of recommender systems,” [Knowledge-Based Systems] Vol.23 P.520-528 google cross ref
  • 2. Shin H., Lee M., Kim E. Y. 2009 “Personalized digital TV content recommendation with integration of user behavior profiling and multimodal content rating,” [IEEE Transactions on Consumer Electronics] Vol.55 P.1417-1423 google cross ref
  • 3. Kim J., Kim K., You K. H., Lee J. H. 2012 “An approach for music recommendation using content-based analysis and collaborative filtering,” [Information] Vol.15 P.1985-1996 google
  • 4. Chung Y., Jung H.W., Kim J., Lee J. H., Perner P. 2013 “Personalized expert-based recommender system: training C-SVM for personalized expert identification,” [Machine Learning and Data Mining in Pattern Recognition] P.434-441 google cross ref
  • 5. Kim D. M., Kim K. S., Park K. H., Lee J. H., Lee K. M. 2007 “A music recommendation system with a dynamic k-means clustering algorithm,” [Proceedings of the 6th International Conference on Machine Learning and Applications (ICMLA 2007)] P.399-403 google cross ref
  • 6. Bang S. W., Jung H. W., Kim J., Lee J. H. 2010 “An auto playlist generation system with one seed song,” [International Journal of Fuzzy Logic and Intelligent Systems] Vol.10 P.19-24 google cross ref
  • 7. Lee S. I., Lee S. Y. 2011 “A recommendation system using context-based collaborative filtering,” [Journal of Korean Institute of Intelligent Systems] Vol.21 P.224-229 google cross ref
  • 8. McCarthy J. F., Anagnost T. D. 1998 “MusicFX: an arbiter of group preferences for computer supported collaborative workouts,” [Proceedings of the 1998 ACM Conference on Computer Supported Cooperative Work (CSCW)] P.363-372 google cross ref
  • 9. Yu Z., Zhou X., Hao Y., Gu J. 2006 “TV program recommendation for multiple viewers based on user profile merging,” [User Modeling and User-Adapted Interaction] Vol.16 P.63-82 google cross ref
  • 10. Boratto L., Carta S. 2014 “Modeling the preferences of a group of users detected by clustering: a group recommendation case-study,” [Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS 2014)] P.1-7 google cross ref
  • 11. Baltrunas L., Makcinskas T., Ricci F. 2010 “Group recommendations with rank aggregation and collaborative filtering,” [Proceedings of the 4rth ACM Conference on Recommender Systems (RecSys)] P.119-126 google cross ref
  • 12. Pizzutilo S., Carolis B. D., Cozzolongo G., Ambruoso F. 2005 “Group modeling in a public space: methods, techniques, experiences,” [Proceedings of the 5th WSEAS International Conference on Applied Informatics and Communications] P.175-180 google
  • 13. Gorla J., Lathia N., Robertson S., Wang J. 2013 “Probabilistic group recommendation via information matching,” [Proceedings of the 22nd International Conference on World Wide Web] P.495-504 google
  • 14. Sotelo R., Blanco-Fernandez Y., Lopez-Nores M., Gil-Solla A., Pazos-Arias J. J. 2009 “TV program recommendation for groups based on muldimensional TV-anytime classifications,” [IEEE Transactions on Consumer Electronics] Vol.55 P.248-256 google cross ref
  • 15. O’Connor M., Cosley D., Konstan J. A., Riedl J. 2001 “PolyLens: a recommender system for groups of users,” [Proceedings of the 7th European Conference on Computer Supported Cooperative Work (ECSCW 2001)] P.199-218 google cross ref
  • 16. Quijano-Snchez L., Bridge D., Daz-Agudo B., Recio-Garca J., Agudo B. D., Watson I. 2012 “A case-based solution to the cold-start problem in group recommenders,” [Case-Based Reasoning Research and Development] P.342-356 google cross ref
이미지 / 테이블
  • [ ] 
  • [ ] 
  • [ ] 
  • [ ] 
  • [ ] 
  • [ ] 
  • [ Table 1. ]  Dataset information
    Dataset information
  • [ ] 
  • [ ] 
  • [ ] 
  • [ ] 
  • [ Table 2. ]  Overall performance comparison
    Overall performance comparison
  • [ Figure 1. ]  Performance comparison between GP and CF-Con approaches. GP, group profile; CF-Con, collaborative filtering-consensus.
    Performance comparison between GP and CF-Con approaches. GP, group profile; CF-Con, collaborative filtering-consensus.
  • [ Figure 2. ]  Performance comparision between GP and CF-Con/G approaches. GP, group profile; CF-Con/G, collaborative filtering-consensus with group profile.
    Performance comparision between GP and CF-Con/G approaches. GP, group profile; CF-Con/G, collaborative filtering-consensus with group profile.
  • [ Figure 3. ]  Performance comparision between GP, CF-Avg, and CF-Avg/G approaches. GP, group profile; CF, collaborative filtering.
    Performance comparision between GP, CF-Avg, and CF-Avg/G approaches. GP, group profile; CF, collaborative filtering.
(우)06579 서울시 서초구 반포대로 201(반포동)
Tel. 02-537-6389 | Fax. 02-590-0571 | 문의 : oak2014@korea.kr
Copyright(c) National Library of Korea. All rights reserved.