展示广告中点击率预估和动态竞价策略的研究与实现
发布时间:2018-06-23 13:52
本文选题:展示广告 + 实时竞价 ; 参考:《华东师范大学》2017年硕士论文
【摘要】:近年来,网络流量急剧上升,将网页上的部分资源当作广告位出售已经成为越来越多的媒体进行流量变现的一个重要手段。计算广告学研究的焦点问题是为一组用户与网页上下文环境的组合,找到与之最匹配的广告。精准的广告投放对用户、媒体和广告主均有利,作为一种新型的展示广告投放模式,实时竞价的出现推动了广告位由线下定价到线上售卖模式的转变,改变了广告市场的格局,也极大地拓展了计算广告学的研究领域,实时竞价算法研究也因此受到了学术界和工业界的广泛关注。本文将实时竞价算法研究划分为两大关键问题:点击率预测和竞价策略的设计。一方面,点击率预估关系到媒体、广告主和用户三方的利益;另一方面广告主需要参考点击率来制定合理的竞价策略。然而,广告历史日志本身存在严重的数据稀疏性,传统机器学习方法构建的预测模型难以达到较高的准确率。本文抓住广告投放是面向用户的商业活动这一重要特征,提出了一种基于用户相似度和特征分化的点击率预估组合模型。该模型首先分析了用户历史行为特征的相似性并据此将其划分为不同子集,接着训练各子集对应的分类子模型,对于所需预测用户、广告、媒体的组合,首先模型需要评估用户与各用户子集的相似度并将其作为子分类器权重,然后统计在各子分类器下的点击概率,最后通过对权重和各子概率的加权组合确定用户的点击率。根据实时竞价的运行模式,广告主通过拍卖的方式获得广告曝光的机会。受广告主预算的限制,合理的竞价策略直接影响到广告主的投资回报。出价偏高会导致广告主预算消耗过快,出价太低将无法获得广告曝光机会。当前主流策略研究主要集中在静态或持续反馈模型上,考虑到互联网环境的复杂性,本文在点击率预估模型的基础上,提出了一种基于概率反馈的动态竞价策略。该策略引入偏离率评估当前算法的有效性,此外,针对需要修正的状态,我们结合拍卖反馈信息给出了修正函数对其进行调整。最后,本文在真实数据集上对提出的模型分别进行实验,并与目前主流方法进行了详细的对比分析。实验结果表明,本文提出的点击率预估模型在Logloss、PR曲线均有突出的性能表现并且相对AUC值最优情况下提升了约5%。此外,通过分析各子集的特征权重可以证明该模型能够挖掘特征对不同群体的差异性影响。在竞价实验中,对比各模型下广告主的KPI和消耗可得,本文提出的竞价策略在广告主预算受限的情况下可以提高广告主的投资回报率,且平均提升在三倍左右;从预算消耗趋势上看,该策略与市场真实消耗情况误差最小,并与其保持相同的消耗趋势。
[Abstract]:In recent years, network traffic has risen sharply, and it has become an important means for more and more media to sell some resources on web pages as advertising space. The focus of computational advertising is to find the most appropriate advertisement for a group of users and web context. Accurate advertising delivery is beneficial to users, media and advertisers. As a new display advertising mode, the emergence of real-time bidding has promoted the transformation of advertising positions from offline pricing to online selling, and has changed the pattern of the advertising market. The research field of computational advertising has also been greatly expanded, and the research of real-time bidding algorithm has been widely concerned by academia and industry. In this paper, the real-time bidding algorithm is divided into two key issues: click rate prediction and bidding strategy design. On the one hand, the prediction of click rate is related to the interests of the media, advertisers and users; on the other hand, advertisers need to refer to the click rate to formulate a reasonable bidding strategy. However, there is serious data sparsity in advertising history log itself, and the prediction model constructed by traditional machine learning method is difficult to achieve high accuracy. This paper takes advantage of the important feature that advertising is a user-oriented business activity, and proposes a combination model of click rate prediction based on user similarity and feature differentiation. The model firstly analyzes the similarity of users' historical behavior characteristics and divides them into different subsets, and then trains the corresponding submodels of each subset to predict the combination of users, advertisements and media. First of all, the model needs to evaluate the similarity between users and subsets of users and take them as sub-classifier weights, then count the click probability under each sub-classifier, and finally determine the click rate of users by the weighted combination of the weights and the sub-probability. According to the real-time bidding mode, advertisers get exposure by auction. Due to the limitation of advertisers' budget, reasonable bidding strategy directly affects the advertisers' return on investment. High bids lead advertisers to spend too fast on their budgets and too low bids to get exposure. The current mainstream strategy research is mainly focused on the static or continuous feedback model. Considering the complexity of the Internet environment, this paper proposes a dynamic bidding strategy based on probability feedback based on the click rate prediction model. This strategy introduces the deviation rate to evaluate the effectiveness of the current algorithm. In addition, for the states that need to be modified, we propose a modified function to adjust the proposed algorithm in combination with the auction feedback information. Finally, the proposed model is experimented on the real data set and compared with the current mainstream methods in detail. The experimental results show that the model presented in this paper has outstanding performance in the Loglosser PR curve and is improved by about 5% relative to the optimal AUC value. In addition, by analyzing the feature weights of each subset, it can be proved that the model can mine the influence of feature on the difference of different groups. In the bidding experiment, comparing the KPI and consumption of the advertisers under each model, the bidding strategy proposed in this paper can improve the advertisers' return on investment under the condition of limited budget, and the average increase is about three times; From the point of view of the trend of budget consumption, the error between the strategy and the real consumption of the market is minimum, and the same consumption trend is maintained.
【学位授予单位】:华东师范大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:F713.8
【参考文献】
相关期刊论文 前1条
1 纪文迪;王晓玲;周傲英;;广告点击率估算技术综述[J];华东师范大学学报(自然科学版);2013年03期
相关硕士学位论文 前1条
1 王孝舒;广告点击率预估的深层神经网络模型研究[D];北京邮电大学;2015年
,本文编号:2057327
本文链接:https://www.wllwen.com/jingjilunwen/guojimaoyilunwen/2057327.html