- Veronika Balážová: Prediction of users’ personality traits during task solving in information system
- Patrik Berger: User Retention in online enviroment
- Patrik Blanárik: Prediction of Project Success on Crowdfunding Portals
- Martin Černák: Personalized recommendation based on item visual characteristics
- Juraj Flamík: Recognition of Similarities in User Behavior in Data Stream
- Peter Gašpar: Personalized Hybrid Recommendation enhanced by Visual Features
- Mário Hunka:Scalable personalized recommendation
- Jakub Janeček: Interpretabilty of Machine Learning Models created by Clustering Algorithms
- Kamil Janeček: Predicting Customer Satisfaction Based on Help Desk Data
- Tomáš Jendrejčák: Prediction of website user churn rate
- Matúš Kalafut: Purchase prediction in e-shop
- Ondrej Kaššák: User Modelling for Session End Intent Prediction
- Matej Končál: Purchase prediction in e-shop
- Júlia Krajčoviechova: Prediction of user return to website
- Michal Kren: Improving Robustness Against Websites’ Changes During Web Data Extraction
- Rastislav Krchňavý: Aspect-Based Sentiment Analysis
- Michaela Kolesíková: Predicting customer satisfaction based on data from customer support centre
- Ľubomír Koprla: Interpretability and explainability of machne learning models
- Dung Lam Tuan:User’s behaviour prediction in eshop
- Tomáš Matlovič: Providing feedback in the domain of programming
- Peter Ocelík: Reconstruction of text for Slovak language
- Marek Roštár: Personalized recommendation considering visual influences
- Matúš Salát:User Segmentation for Personalization of Newsletters in CQA Systems
- Elena Štefancová: Recommendation Taking into Account the Time Aspects of Users and Items
- Matúš Tunder: Analysis of source code reading
Prediction of users’ personality traits based on task solving on the Web
Veronika Balážová
master study, supervised by Róbert Móro
Abstract: Nowadays, people use web applications and systems to a greater extent than in the past. They want to gain information, realize different services or socialize. But how each user uses an application (a system), does not depend only on the user interface and user experience, but also on user himself and his personality characteristics.
Personality characteristics can help explain how users look at the webpage and how they work with it. Automatic prediction of personality characteristics could be useful for example in personalization of the applications. Specifically, in e-shop domain it could be used to recommend suitable products for each user based on the users’ personality characteristics.
In our work, we build a user model based on user’s interactions with a web page. We use this model as an input for automatic classification of user’s personality characteristics. We work with dataset from a Slovak e-shop. In this dataset there are users’ interactions with the e-shop, their purchases, ratings and views of products and many other actions. Also included in the dataset, there are completed questionnaires of these users, which reveal their personalities according to the Big Five model (5 dimensions of personality).
Besides using interaction features, we plan to extend this model and thus improve the predictions by using eye-tracking. Basic metrics of eye-tracking (number of fixations, time to first fixation, …) will be new input to the classification.
Predicting User Retention in online enviroment
Patrik Berger
master study, supervised by Michal Kompan
Abstract: The recent growth of market and technology advancement led to the increse of amount of competitors providing online services to its users. In those circumstances acquiring a new user is multiple times more expensive than keeping the existing ones. That makes user retention one of the key metrics of success for such an online service (e-shops, bank services, insurrance companies etc.). Successful prediction of churn of a specific user provides an opportunity to change his decision by for instance giving him a special offer. This kind of prevention and identification of churn reasons create huge motivation to explore this area. In our work we focus on identification of the set of features to create a user model for further use for the churn prediction. In first stage of our work we plan to build a user model in selected domain and explore the possibilities of automatic feature extraction from the data. As a next step we want to select classifiers and build a structure of a learning ensemble. Finally, we are planning to test our model with a nontrivial dataset from the selected domain.
Prediction of Project Success on Crowdfunding Portals
Patrik Blanárik
bachelor study, supervised by Ivan Srba
Abstract: Crowdfunding is a way of funding with the support of large number of people who help by contributing small amounts of money. For some people crowdfunding can be considered a simple way to get a capital needed to bring their creative projects to life. Crowdfunding portals such as Kickstarter and Indiegogo provide an opportunity for these people to achieve their goals. Naturally, there are projects that were successfully funded but also projects that were not.
Our task is to predict whether a project created on Kickstarter will get funded as it might be very useful for project creator as a way of feedback. To do so, we will use data about projects from Kickstarter recorded in different time periods together with relevant data gathered from Facebook, assuming social networks might have great impact on project success. Besides many other methods, we will use NLP to extract features from structured texts (e.g. project description).
Personalized Recommendation Taking into Account Visual Impacts
Martin Černák
master study, supervised by Michal Kompan
Abstract: Recommender system is important part of the web nowadays. Amount of information on the web is just unsearchable and to find useful informations and keep track about interesting topic is too time consuming. Well placed and selected recommendations can improve user experience which often transforms into the revenue.
Lists of results generated by recommenders are often accompanied with graphical elements such as images. These elements and their attributes in some domains significantly influence user preferences. Despite state-of-the-art approaches are focused on processing textual content of items, or observing and modeling behavioral relationships and interactions between users, they ignore graphical elements.
In our work we aim at proposing novel method for recommendation which takes into account the influence of graphical elements and analyze its viability.
We decided to use this method to recommend hotels. This domain have some specific problems like not enough data for individual users but also there are plenty of images for individual hotels. We believe that using our method we can ease problem with data.
Recognition of similarities in user behavior in data stream
Juraj Flamík
master study, supervised by Ondrej Kaššák
Abstract: It would seem, that web site user behavior is highly unique and different from other users behavior. It is based on user current intention and previous experiences with the web site. But the web site itself offers only finite number of possibilities, in which users can behave. Thanks to this fact, we can find users, who behave similarly. Then, we can use this information in tasks like personalization, user modeling, recommendation or prediction.
In our work, we analyze possibilities of user behavior clustering. Because we work with a lot of data in web sites with dynamically changing content, we focus on clustering in data stream. We are solving subtasks like feature engineering, distance and cluster quality measurements. Then we want to use these obtained clusters / behavior similarities to improve task of recommendation. At the end, we want to test our method on nontrivial real dataset and show, that clustering can help to get better results for task of recommendation.
Personalized Hybrid Recommendation enhanced by Visual Features
Peter Gašpar
doctoral study
Abstract: Recommender systems have become an essential part of the Web in various domains. They provide suggestions for users about which things to read, which products to buy, or which movies to watch. They try to perfectly tailor to user preferences in order to improve overall user experience. However, in some scenarios, we need to deal with the problem of insufficient amount of information about users, items, or they interactions.
Another essential part of many Web portals are images. They play an important role in extending or even replacing an information about items (such as movie posters, product photos). Users’ decision process may be led by the visual stimuli and thus, an important information about the item is hidden in the image. Moreover, these images may also contain features that can be useful during the process of user-modelling and recommendation.
In our work, we study the problem of incorporation of features extracted from the images in a recommender systems domain. We examine hybrid recommendation approaches that combine basic recommender systems techniques and incorporate images during the ranking process of output recommendations.
Scalable Personalized Recommendation
Mário Hunka
master study, supervised by Michal Kompan
Abstract: Do you know that feeling when you dont know which movie to watch? The more movies are in offer the harder it is to find suitable movie. Personalized recommendation systems are trying to fit users needs and give him relevant options to choose from, but there are many things that can influence whats relevant to you in that particular moment, e.g. other users opinion, context, popularity of actors, category…
In our works, we want to determine if quality is relevant for users most of the time. Therefore, we want to see how users are influenced by opinion of experts – in this case – movie critics. We use collaborative filtering approach that boosts or decreases the final estimation of rating based on correlation between user and critic ratings.
Final method should be used only for certain best performing user segments based on algorithm evaluation through multiple segments. Segments can be formed by many features like demography, number of reviews and so on. Combining this with other classic methods can result in robust and scalable movie recommender.
Interpretabilty of Machine Learning Models created by Clustering Algorithms
Jakub Janeček
master study, supervised by Jakub Ševcech
Abstract: Interpretability is a key characteristics of machine learning model, if our goal is to persuade experts from chosen domain, for which we propose our model, to accept it and use it. The better we are able to explain the behaviour of our model, the greater is the chance it will be accpeted.
That’s why it is importatnt to strive not just for better results of our model, but also its interpretability. One without other lacks the meaning. Sometimes maybe even model with worse result has better chances of being accepted if we are able to explain it better in a way that is comprehensible for people.
We are focusing on clustering and models created by clustering algorithms. Our goal is to use feature importance of these models to raise their interpretability.
Predicting Customer Satisfaction Based on Help Desk Data
Kamil Janeček
bachelor study, supervised by Eduard Kuric
Abstract: Customer satisfaction is critical for any business. Good customer care is very important and can lead to increased sales and profit. Customers with any problems contact a customer support in hopes of getting a positive resolution of their problems.Unfortunately, after finishing their conversations with the support, customers tend not to leave any feedback. This fact creates a hard problem of evaluating the customer support effectiveness. In this work, we propose method, which should predict whether a customers problem was successfully solved during an online chat session or not
Prediction of website user churn rate
Tomáš Jendrejčák
bachelor study, supervised by Ondrej Kaššák
Abstrakt: The problem of customer return is problem that is often being solved in various domains. Getting user we lost to return is often unsuccessful, which is why it’s important to design a method that would be able to reveal customers with higher risk of churning early. If we manage to identify users like this early, we can take steps to avoid their churning.
This work deals with the problem of predicting user’s return to a web site with the use of machine learning.
Purchase prediction in e-shop
Matúš Kalafut
bachelor study, supervised by Ondrej Kaššák
Abstract: The topic of our bachelor thesis is to predict user’s behaviour during his visit in eshop, specifically, whether he is going to buy something at the end of his visit or not. Our task is to create a method for this prediction.
We can predict user’s behaviour based on his behaviour in the past or behaviour during his present session. This type of information can be very useful for merchants and start era of personalized advertisements in their eshop. For example they will be able to create personalized newsletters for groups of customers, which showed interest in particular category of products in the past.
To solve this type of problem we need to use machine learning. Machine learning gives computers ability to learn without being explicitly programmed. For training our machine learning model we need to extract features from data, on which it will learn. Features can describe customers,products or sessions.
We use multiple algorithms for training and then compare reached results. After we modify features and try to boost models to get better results.
For training and validation our model we use data provided by discount portal zlavadna.sk
User Modelling for Session End Intent Prediction
Ondrej Kaššák
Abstract: User behaviour in the web site can be modelled from two basic points of view. The first one is the short term behaviour, which reflect user’s actual intent, preferences, goal etc. It captures user’s most actual behaviour and actions but it is typically very noisy, because of influence of user’s actual context, mood and more unpredictable conditions.
The second point of view – long-term behaviour is characterized by more stable preferences identification and capturing user typical customs. On the other side, this kind of behaviour is not so adaptable to changes, it learn trends and hot topics of user behaviour only after longer time period.
To be able to model user preferences and predict future behaviour, it is suitable to combine both data sources and consider them when estimating next user actions. In our research, we focus on task of user session exit intent prediction. This task require to be able to recognize subtle changes in user behaviour in comparison to previous behaviour in different time periods as well as characteristics of actual user session.
Purchase prediction in e-shop
Matej Končál
bachelor study, supervised by Ondrej Kaššák
Abstract: Shopping in an eshop is in virtual space, not in the physical stores of companies with ability to influence customer behavior. Therefore, there is a requirement to monitor customer behavior while visiting the store’s website to positively affect the outcome of visit. Supervised machine learning is one of the most used methods to predict customer behaviour nowadays. The goal of this thesis is to analyze input data and use it for training machine learning model. Two independent algorithms will be used for model training: gradient increasing decision tree and logistic regression. After training, results will be compared and better model will be picked. The picked model will be experimentally tested on data from the discount portal.
Prediction of user return to website
Júlia Krajčoviechová
bachelor study, supervised by Ondrej Kaššák
Abstract: In this work, we‘re discussing the topic prediction of user return to website. At present, the casual and one-time users are a group that represents an idle potential for increasing the number of web site’s visits. The main goals is to determine the behaviour of the individual and then to use this findings to predict whether or not the user will return to the site or if the customer has lost interest in using the services. An important part of this work is the use of relatively large amount of data to analyze and track user behaviour while using the web site or look for similarities between the behaviour of users. Then, it is necessary to design a custom method, using existing methods for prediction of user return, the output of which will be the answer to whether the user will visit the site in the future again or not. Our method could be helpful also in the practise, as it can detect the deficiencies of web site because of what the web site loses its users, or it can find out the so-called occasional visitors who need to take another way to visit the site again.
Improving Robustness Against Websites’ Changes During Web Data Extraction
Michal Kren
master study, supervised by Ivan Srba
Abstract: General approach to dealing with changes to the websites’ structure during web extraction is to optimize the XPath expressions before executing the wrapper. We propose a novel approach to wrapper robustness based on machine learning, applied during, or more precisely, after the extraction. When an XPath expression fails as a result of a new change to the web page’s structure, we apply binary classification to identify the desired HTML element. Based on this element a new XPath expression is generated. We will evaluate our method on a series of snapshots of selected webpages, measuring not only the accuracy of our classificator, but also the duration until our self-repairing wrapper definitivelly fails.