Market-Required Competence Topic Dynamics

Tigran Topchyan

Outline

Market-Required Competence Topic Dynamics

Tigran Topchyan

2014

Sign up for access to the world's latest research

checkGet notified about relevant papers

checkSave papers to use in your research

checkJoin the discussion with peers

checkTrack your impact

Abstract

The Job market is an ever moving and evolving entity. So are the competencies and qualifications it demands of prospective employees. In our previous work we modelled these competencies using topic models, but in order to have a more effective understanding of the market, we have to take into account the dynamics of the system as well. Here we propose using dynamic topic modelling as a means of analysing the job market for competencies and qualifications stemming from our previous research.

spyretta leivaditi

2017

The Labour Market domain is a relatively narrow domain in terms of concept types that appear in it (as it typically consists of professions, skills and qualifications) but a very broad one in terms of actual concepts (as these professions and skills can be in all kinds of domains such as Technology, Education, Finance, etc). More importantly, it is a quite volatile domain in the sense that the meaning of many concepts changes (at different rates) over time. This phenomenon, known as semantic or concept drift, poses a challenge for the maintenance and evolution of knowledge graphs that represent such domains, and requires dedicated approaches for tackling it so as to prevent such graphs from becoming irrelevant. With that in mind, in this paper we describe our experiences from dealing with concept drift in an in-house developed labour market knowledge graph, and provide insights on: i) how concept drift can be effectively defined and modeled for labour market concepts, and ii) how it...

downloadDownload free PDF View PDFchevron_right

Analysis of Research Data in Information Science Using the Topic Modeling Method

Sompejch Junlabuddee

Journal of Mekong Societies, 2021

Most research data in the modern world are in digital format, and there is therefore a need to develop high-efficiency tools that can provide access to and an understanding of these data. Computerization technology based on natural language processing with a capacity for topic extraction and categorization would enable us to identify new topics and future directions for research in several fields of study. The aim of this research was to analyze and categorize information science research data obtained from journals listed in an international database between 2013 and 2019. The research methodology applied here was data analysis based on the topic modeling method, a technique used to locate word groups or topics from a corpus containing complicated and difficult works. This method yields reliable and high-accuracy outcomes. The data analyzed here were drawn from research articles published in information science journals, the names of which were listed in the Scimago Journal and Country Rank between 2013 and 2019. Only journals in the Web of Science and articles written in English were included. A total of 30,571 research articles obtained from 677 volumes of 99 journals were analyzed using the topic modeling method, and topics were assigned by experts in the field. The findings revealed that over the past seven years, research was carried out on 30 topics in information science. The five most frequently researched topics were competency development, data management, social media analytics, public and community services, and bioinformatics. A comparison with other research data analyzed in the field of information science over the past five years using other techniques showed clear differences and a tendency of the research topics to change. The results of this research can greatly benefit the identification of research directions for the future.

downloadDownload free PDF View PDFchevron_right

Topic Modeling in Management Research: Rendering New Theory from Textual Data

Hovig Tchalian

Academy of Management Annals

downloadDownload free PDF View PDFchevron_right

Hierarchical Topic Modeling for Analysis of Time-Evolving Personal Choices

David Dunson

The nested Chinese restaurant process is extended to design a nonparametric topic-model tree for representation of human choices. Each tree path corresponds to a type of person, and each node (topic) has a corresponding probability vector over items that may be selected. The observed data are assumed to have associated temporal covariates (corresponding to the time at which choices are made), and we wish to impose that with increasing time it is more probable that topics deeper in the tree are utilized. This structure is imposed by developing a new "change point" stick-breaking model that is coupled with a Poisson and productof-gammas construction. To share topics across the tree nodes, topic distributions are drawn from a Dirichlet process. As a demonstration of this concept, we analyze real data on course selections of undergraduate students at Duke University, with the goal of uncovering and concisely representing structure in the curriculum and in the characteristics of the student body.

downloadDownload free PDF View PDFchevron_right

The researchers profile with topic modeling

Smail Boussaadi, Hassina Aliane, Abdeldjalil Ouahabi

2020

Modeling the interests of researchers in academic social networks is a crucial step in a process of recommending scientific articles, linked to their areas of competence and expertise. In this context, a researcher profile constructed from non-observable variables on the basis of articles which interests him by the LDA (Latent Dirichlet Allocation) topic modeling technique allows the system to capture knowledge about his area of competence and skills, in order to predict these needs in terms of relevant research articles. In this article we are interested in the results produced by two different implementations of LDA Gensim and Mallet on the basis of information provided by the researchers (explicit information), in order to compare their interpretability and checked if they are reliable sources for model the areas of competence and expertise of scientists. Keywords-1 st Topic Modeling, 2 nd LDA , 3 rd Mallet, 4 th Researcher Profil.

downloadDownload free PDF View PDFchevron_right

Continuous-Time Infinite Dynamic Topic Models

Wesam Elshamy

Risk Management and Decision-Making, 2014

Topic models are probabilistic models for discovering topical themes in collections of documents. In real world applications, these models provide us with the means of organizing what would otherwise be unstructured collections. They can help us cluster a huge collection into different topics or find a subset of the collection that resembles the topical theme found in an article at hand.

downloadDownload free PDF View PDFchevron_right

Evaluation of the trends in jobs and skill-sets using data analytics: a case study

Atif Omar

Journal of Big Data, 2022

Introduction Fast-emerging technologies are making the job market dynamic, causing desirable skills to evolve continuously. It is therefore important to understand the transitions in the job market to proactively identify skill sets required. Case description A novel data-driven approach is developed to identify trending jobs through a case study in the oil and gas industry. The proposed approach leverages a range of data analytics tools, including Latent Semantic Indexing (LSI), Latent Dirichlet Allocation (LDA), Factor Analysis and Non-Negative Matrix Factorization (NMF), to study changes in the market. Further, our approach is capable of identifying disparities between skills that are covered by the educational system, and the skills that are required in the job market. Discussion and evaluation The results of the case study show that, while the jobs most likely to be replaced are generally low-skilled, some high-skilled jobs may also be at risk. In addition, mismatches are ident...

downloadDownload free PDF View PDFchevron_right

What Topic Modeling Could Reveal about the Evolution of Economics

Marco Guerzoni

Journal of Economics Methdology, 2018

The paper presents the topic modeling technique known as Latent Dirichlet Allocation (LDA), a form of text-mining aiming at discovering the hidden (latent) thematic structure in large archives of documents. By applying LDA to the full text of the economics articles stored in the JSTOR database, we show how to construct a map of the discipline over time, and illustrate the potentialities of the technique for the study of the shifting structure of economics in a time of (possible) fragmentation

downloadDownload free PDF View PDFchevron_right

Topic Modeling: Perspectives From a Literature Review

Sebastian Robledo

IEEE Access

Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews. INDEX TERMS Literature review, machine learning, natural language processing, scientometrics, topic modeling.

downloadDownload free PDF View PDFchevron_right

Skills and Vacancy Analysis with Data Mining Techniques (text mining)

Izabela A Wowczko

Through recognizing the importance of a qualified workforce, skills research has become one of the focal points in economics, sociology, and education. Great effort is dedicated to analyzing labor demand and supply, and actions are taken at many levels to match one with the other. In this work we concentrate on skills needs, a dynamic variable dependent on many aspects such as geography, time, or the type of industry. Historically, skills in demand were easy to evaluate since transitions in that area were fairly slow, gradual, and easy to adjust to. In contrast, current changes are occurring rapidly and might take an unexpected turn. Therefore, we introduce a relatively simple yet effective method of monitoring skills needs straight from the source—as expressed by potential employers in their job advertisements. We employ open source tools such as RapidMiner and R as well as easily accessible online vacancy data. We demonstrate selected techniques, namely classification with k-NN and information extraction from a textual dataset, to determine effective ways of discovering knowledge from a given collection of vacancies.

downloadDownload free PDF View PDFchevron_right

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.

References (7)

Blei, 2003] David M. Blei, Andrew Y. Ng, and Michael I. Jordan. In: Latent dirichlet allocation. J. Mach. Learn. Res, 3:993-1022, mar 2003.
Blei, 2006] Blei, David M and Lafferty, John D. In: Dynamic topic models. Proceedings of the 23rd international conference on Machine learning, 113-12, 2006.
Wang, 2006] Wang, Chong and Blei, David and Heckerman, David. In: Continuous time dynamic topic models. arXiv preprint arXiv:1206.3298, 2012.
Zhang, 2006] Zhang, Jianwen and Song, Yangqiu and Zhang, Changshui and Liu, Shixia. In: Evolutionary hierarchical dirichlet processes for multiple correlated time-varying corpora. Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, 1079-1088, 2010.
Topchyan, 2014] Tigran Topchyan. In: Engineering competence frameworks and topic modelling. Mathematical Problems of Computer Science, 2014.
Topchyan, 2014] Tigran Topchyan. In: Job Market requirements and qualities extraction for qualification analysis. Mathematical Problems of Computer Science, 2014.
Crawley, 20] Edward F Crawley, Johan Malmqvist, William A Lucas, and Doris R Brodeur. In: The cdio syllabus v2. 0 an updated statement of goals for engineering education. Proceedings of 7th International CDIO Conference, Copenhagen, Denmark, 2011.

Chris Ballenger

2020

Attracting technology talent in today’s hiring climate is more complicated than ever. Recruiting for technology talent in non-technology industries is even more challenging. This intense hiring landscape is motivating companies to attract the right talent and create a culture that can retain and grow that talent. In this paper, we developed algorithms and present insights that use data provided in reviews to glean information employers can use to address or even change their priorities to meet the demands of an ever-changing job market. The core of our research is to investigate and attribute the role of company reviews in explaining the critical dimensions through which employees perceive their job. To provide more in-depth and targeted insights, we limit our focus to Technology related job reviews. Our contributions include building an IT Professional profile that can help create an edge for recruiters. We achieved our research by conducting a comprehensive topic modeling on emplo...

downloadDownload free PDF View PDFchevron_right

A Comprehensive Review of the Three Main Topic Modeling Algorithms and Challenges in Albanian Employability Skills

Milena Shehu, European Scientific Journal ESJ

European Scientific Journal, ESJ

Today’s jobseekers face many obstacles while trying to find a career that aligns with their interests, employability soft skills, and professional experience. In Albania, jobseekers frequently initiate their job search by actively exploring job vacancies listed on various online job portals. The analysis of job vacancies posted online provides an added advantage to the labour market actors compared to traditional survey-based analyses. This is because it enables a faster analytical process, promotes decision-making based on accurate data, and should be carefully considered by every country when formulating their Labor Market Policies. Since the data posted online are unlabelled, it has been proven that the potential of unsupervised learning techniques, more precisely the Topic Modelling algorithms, is outstanding when applied to analysing job vacancies, mainly with regard to assessing employability soft skills. Algorithms in topic modelling are essential for uncovering hidden patterns in texts, facilitating the extraction of important data, generating document summaries, and enhancing content comprehension. This paper analyses and compares the three primary methodologies and algorithms used in topic modelling, which can be applied to analyse employability soft-skills: Latent Semantic Analysis (LSA), Latent Dirichlet Allocation (LDA), and BERTopic. At the end of the paper, conclusions are drawn regarding superior performance and optimal algorithm applicability, challenges, and limitations through a review of studies conducted in the Albanian job market.

downloadDownload free PDF View PDFchevron_right

A Latent Dirichlet Allocation Framework to Analyse and Forecast Employability Skills

Milena Shehu

International Journal of Innovative Technology and Interdisciplinary Sciences, 2025

Globalization, rapid technological advancement, Albania's EU integration process are reshaping labour market dynamics, creating urgent needs for timely skill intelligence. Traditional survey-based statistics often lag behind these changes, while online job postings provide a real-time source of employer demand. A Latent Dirichlet Allocation (LDA)-based framework is introduced in this paper, applied to 1,500 vacancies collected from five major Albanian job portals (July-September 2024), to extract, categorize, and forecast employability skills. The model is implemented in a rolling/windowed LDA Model, enabling the tracking of skill dynamics over time and alignment with the European Skills, Competences, and Occupations (ESCO) taxonomy. Findings show that Albanian employers predominantly demand transversal soft skills, especially Responsibility, Communication, Collaboration, Networking, and Presentation, while green and digital skills appear only gradually. An interactive "Skills Forecast" Shiny application operationalizes results, forecasting the top ten indemand skills for specific vacancies that the user wants to test, and offering validation metrics for policymakers, educators, and employers.

downloadDownload free PDF View PDFchevron_right

The Ideal Candidate. Analysis of Professional Competences through Text Mining of Job Offers

Maria Gabriella Grassia

2006

Summary. The aim of this paper is to propose analytical tools for identifying peculiar aspects of the job market for graduates. The main objective is to reduce the complexity of the phenomenon, both on the variable side, by transforming the collected information into latent factors, and on the unit side, by classifying observations. We propose a strategy for dealing with data that have different source and nature. The dependence structure is investigated to identify potential evolutionary paths.

downloadDownload free PDF View PDFchevron_right

TOPIC MODELING IN MANAGEMENT RESEARCH: RENDERING NEW THEORY FROM TEXTUAL DATA Journal: Academy of Management Annals

Hovig Tchalian

Academy of Management Annals, 2019

Increasingly, management researchers are using topic modeling, a new method borrowed from computer science, to reveal phenomenon-based constructs and grounded conceptual relationships in textual data. By conceptualizing topic modeling as the process of rendering constructs and conceptual relationships from textual data, we demonstrate how this new method can advance management scholarship without turning topic modeling into a black box of complex computer-driven algorithms. We begin by comparing features of topic modeling to related techniques (content analysis, grounded theorizing, and natural language processing). We then walk through the steps of rendering with topic modeling and apply rendering to management articles that draw on topic modeling. Doing so enables us to identify and discuss how topic modeling has advanced management theory in five areas: detecting novelty and emergence, developing inductive classification systems, understanding online audiences and products, analyzing frames and social movements, and understanding cultural dynamics. We conclude with a review of new topic modeling trends and revisit the role of researcher interpretation in a world of computer-driven textual analysis.

downloadDownload free PDF View PDFchevron_right

Topic Modeling: A Comprehensive Review

pooja kherwa

ICST Transactions on Scalable Information Systems

Topic modelling is the new revolution in text mining. It is a statistical technique for revealing the underlying semantic structure in large collection of documents. After analysing approximately 300 research articles on topic modeling, a comprehensive survey on topic modelling has been presented in this paper. It includes classification hierarchy, Topic modelling methods, Posterior Inference techniques, different evolution models of latent Dirichlet allocation (LDA) and its applications in different areas of technology including Scientific Literature, Bioinformatics, Software Engineering and analysing social network is presented. Quantitative evaluation of topic modeling techniques is also presented in detail for better understanding the concept of topic modeling. At the end paper is concluded with detailed discussion on challenges of topic modelling, which will definitely give researchers an insight for good research.

downloadDownload free PDF View PDFchevron_right

Identifying Labour Market Skills Demand with RapidMiner (web mining)

Izabela A Wowczko

This paper introduces a three-step methodology of identifying skills demand on labour markets. By accessing publicly available vacancy data, with web and text mining tools, we are able to extract valuable facts about competences and abilities sought by employers. This easily applicable technique provides a new dimension in labour market research. It supplements occupational analysis with detailed information related to specialised knowledge expected from prospective employees. By example of IT jobs publicised through IrishJobs.ie domain, we present how web based data can be successfully acquired and pre-processed to suit our research needs, and how meaningful information can be extracted in efficient and reasonably quick manner. Evidence obtained through this process is a straightforward reflection of current needs, and can be acted on by educational and labour market bodies to bridge the gap between skills demand and supply.

downloadDownload free PDF View PDFchevron_right

2010-KBS Journal-Temporal expert finding through generalized time topic modeling.pdf

Ali Daud Associate Professor

This paper addresses the problem of semantics-based temporal expert finding, which means identifying a person with given expertise for different time periods. For example, many real world applications like reviewer matching for papers and finding hot topics in newswire articles need to consider time dynamics. Intuitively there will be different reviewers and reporters for different topics during different time periods. Traditional approaches used graph-based link structure by using keywords based matching and ignored semantic information, while topic modeling considered semantics-based information without conferences influence (richer text semantics and relationships between authors) and time information simultaneously. Consequently they result in not finding appropriate experts for different time periods. We propose a novel Temporal-Expert-Topic (TET) approach based on Semantics and Temporal Information based Expert Search (STMS) for temporal expert finding, which simultaneously models conferences influence and time information. Consequently, topics (semantically related probabilistic clusters of words) occurrence and correlations change over time, while the meaning of a particular topic almost remains unchanged. By using Bayes Theorem we can obtain topically related experts for different time periods and show how experts' interests and relationships change over time. Experimental results on scientific literature dataset show that the proposed generalized time topic modeling approach significantly outperformed the non-generalized time topic modeling approaches, due to simultaneously capturing conferences influence with time information.

downloadDownload free PDF View PDFchevron_right

A Survey of Topic Modeling in Text Mining

Khalid Alfalqi

International Journal of Advanced Computer Science and Applications, 2015

Topic models provide a convenient way to analyze large of unclassified text. A topic contains a cluster of words that frequently occur together. A topic modeling can connect words with similar meanings and distinguish between uses of words with multiple meanings. This paper provides two categories that can be under the field of topic modeling. First one discusses the area of methods of topic modeling, which has four methods that can be considerable under this category. These methods are Latent semantic analysis (LSA), Probabilistic latent semantic analysis (PLSA), Latent Dirichlet allocation (LDA), and Correlated topic model (CTM). The second category is called topic evolution models, which model topics by considering an important factor time. In the second category, different models are discussed, such as topic over time (TOT), dynamic topic models (DTM), multiscale topic tomography, dynamic topic correlation detection, detecting topic evolution in scientific literature, etc.

downloadDownload free PDF View PDFchevron_right

Big Data Software Engineering: Analysis of Knowledge Domains and Skill Sets Using LDA-Based Topic Modeling

Nergiz Cagiltay

IEEE Access, 2019

Software engineering is a data-driven discipline and an integral part of data science. The introduction of big data systems has led to a great transformation in the architecture, methodologies, knowledge domains, and skills related to software engineering. Accordingly, education programs are now required to adapt themselves to up-to-date developments by first identifying the competencies concerning big data software engineering to meet the industrial needs and follow the latest trends. This paper aims to reveal the knowledge domains and skill sets required for big data software engineering and develop a taxonomy by mapping these competencies. A semi-automatic methodology is proposed for the semantic analysis of the textual contents of online job advertisements related to big data software engineering. This methodology uses the latent Dirichlet allocation (LDA), a probabilistic topic-modeling technique to discover the hidden semantic structures from a given textual corpus. The output of this paper is a systematic competency map comprising the essential knowledge domains, skills, and tools for big data software engineering. The findings of this paper are expected to help evaluate and improve IT professionals' vocational knowledge and skills, identify professional roles and competencies in personnel recruitment processes of companies, and meet the skill requirements of the industry through software engineering education programs. Additionally, the proposed model can be extended to blogs, social networks, forums, and other online communities to allow automatic identification of emerging trends and generate contextual tags. INDEX TERMS Big data software engineering, competency map, knowledge domains and skill sets, topic modeling, latent Dirichlet allocation. with the Department of Informatics, Karadeniz Technical University, from 2001 to 2014, where he has been an Instructor with the Center for Research and Application in Distance Education, since 2015. His research interests include trend analysis, sentiment analysis, statistical topic modeling, engineering education, data mining, machine learning, big data analytics, and text mining. NERGIZ ERCIL CAGILTAY received the degree in computer engineering and the Ph.D. degree in instructional technologies from Middle East Technical University, Turkey. She worked for commercial and government organizations as a Project Manager for more than eight years in Turkey. She was also with the Indiana University Digital Library Program as a System Analysis and a Programmer for four years. She has been with the Software Engineering Department, Atilim University, Turkey, since 2003, as an Associate Professor. Her main research interests include information systems, medical information systems, engineering education, instructional systems technologies, distance education, e-learning, and medical education.

downloadDownload free PDF View PDFchevron_right

Market-Required Competence Topic Dynamics

Sign up for access to the world's latest research

Abstract

Related papers

References (7)

Related papers

Related topics