ABSTRACT. A multi-label Naive Bayes (NB) classi er was implemented to predict di erent genres of a movie given its synopsis (plot summary). Di erent methods of implementing the NB such as Multinomial NB and Bernoulli NB were explored along with two types of feature extractor- count and tf-idf (text frequency-inverse document frequency). The use of a pro posed feature selection system using the trained weights of a Naive-Bayes classi er was also explored. It was shown that the Multinomial NB classi er with NB-trained tf-idf features performed better than the Bernoulli NB and the Multino mial NB with count features. Furthermore, the proposed feature selection scheme gave an increased performance in the the Multinomial NB classi er by up to approximately 4% (in the precision, recall, and F1-score).
Keywords naive bayes, machine learning, feature extraction, document classification.
Modeling the relationship between a human’s stress level and their physiological responses can pave the way for numerous advances in neuropsychology. This work follows suit in tackling such prob lem by adapting a multi-modal framework in determining stress response. In this paper, we modeled the valences and arousals of human subjects as they were put into an impromptu job interview session; visual, acoustic, text, and physiological features were ob tained fromsubjects.Weproposeatemporal-sensitivedeeplearning network framework that consists of a CNN-LSTM and a CNN-CNN module with late fusion. Various combinations of modalities were tested and it was determined that using a combination of acoustic and visual features yielded the best concordance correlation coeffi cient (CCC) for valence–reaching 0.708; on the other hand, fusing all the acoustic, visual, and text features had the highest CCC for arousal, equaling to 0.3692.
Human activity recognition aims to recognize the actions of the person from a series of observations (usually obtained from sensors attached or being used by the person). Incorporating action recognition on these wearables and designing softwares that utilizes this information for monitoring and managing physical activity. In this paper, we present using convolutional neural networks (CNN) on signals from mobile phones sensors for activity recognition. Generalized model (input data without disambiguating per person) using the accelerometer input data gave a 84-85% accuracy while the Per-person model (input data is limited to per-person information) yielded a 98.91% accuracy for 20 subjects.
An anonymized recommendation system of anime shows that uses neural network and federated learning was implemented. This is performed without storing user information via federated learning to address anonimity in using services that require personal information. Federated learning uses model averaging to preserve data privacy by not requiring the user to send personal information directly to the recommendation system but instead updates the model on the device level. The paper implemented three (3) collaborative filtering recommendation systems: vanilla using matrix slower rate taking into account the small batch implemented with the limitation on the computing system available for simulation.
Using 8 years worth of Top 100 Billboard music videos -- text and image analytics was applied using the Rekognition, Comprehend, and SageMaker. With the Recognition and Comprehend, top themes as well as top objects/people present in the MVs were identified and used to create a model that helps predict the probability of a new music video to hit the billboards. Aside from probabilities, it can be expanded to show similarities in previously released MVs that have already reached high popularity for further content analysis -- in terms of video, music, and text.