Papers by Francesco Poggi

International Journal of Software Engineering and Knowledge Engineering, 2020
In recent years, new classes of highly dynamic, complex systems are gaining momentum. These class... more In recent years, new classes of highly dynamic, complex systems are gaining momentum. These classes include, but are not limited to IoT, smart cities, cyber-physical systems and sensor networks. These systems are characterized by the need to express behaviors driven by external and/or internal changes, i.e. they are reactive and context-aware. A desirable design feature of these systems is the ability of adapting their behavior to environment changes. In this paper, we propose an approach to support adaptive, reactive systems based on semantic runtime representations of their context, enabling the selection of equivalent behaviors, i.e. behaviors that have the same effect on the environment. The context representation and the related knowledge are managed by an engine designed according to a reference architecture and programmable through a declarative definition of sensors and actuators. The knowledge base of sensors and actuators (hosted by an RDF triplestore) is bound to the real...

An Initial Investigation of Concurrency Bugs in Open Source Systems
In the last 10 years CPUs have evolved focusing on performance improvements based on the introduc... more In the last 10 years CPUs have evolved focusing on performance improvements based on the introduction of multi-core architectures forcing developers to build software in a completely different way. Concurrent programming is now the main approach to improve performances in any software product. Unfortunately, this paradigm is prone to bugs which are particularly hard to fix, since their occurrence depends on specific thread interleaving. The paper investigates bugs related to concurrency analyzing their characteristics with machine learning methods to automatically distinguish them from other kinds of bugs based on the data available in the issue tracking systems and in the code repositories. The best model we developed for Apache HTTP Server has a precision of 0.97 and a recall of 0.843 when considering linked bugs (bug reports information in bug repository and the corresponding fix in the version control system).

Resilient Reactive Systems Based on Runtime Semantic Models
2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW), 2019
IoT, smart cities, cyber-physical systems and sensor networks represent new classes of highly dyn... more IoT, smart cities, cyber-physical systems and sensor networks represent new classes of highly dynamic, complex systems. The behavior of these systems should be designed in order to react to external changes, i.e. they are reactive and context-aware, and also to internal ones to be able to reconfigure themselves for handling possible anomalies. These requirements ask for a runtime representation of application logic and its context, enriched with variation points that associate different behaviors to possible changes. In this paper, we extend our previous work on the design of reactive, context-aware systems with the support for resilience. According to our model, sensors and actuators can be physical, virtual or logical ones; the last two can be semantically described and dynamically configured to react with a proper behavior to context changes (e.g. faults). The proposal is validated with a use case aimed at designing an edge node for smart buildings dedicated to cultural heritage preservation.

Scientometrics
The aim of this paper is to study the role of citation network measures in the assessment of scie... more The aim of this paper is to study the role of citation network measures in the assessment of scientific maturity. Referring to the case of the Italian national scientific qualification (ASN), we investigate if there is a relationship between citation network indices and the results of the researchers’ evaluation procedures. In particular, we want to understand if network measures can enhance the prediction accuracy of the results of the evaluation procedures beyond basic performance indices. Moreover, we want to highlight which citation network indices prove to be more relevant in explaining the ASN results, and if quantitative indices used in the citation-based disciplines assessment can replace the citation network measures in non-citation-based disciplines. Data concerning Statistics and Computer Science disciplines are collected from different sources (ASN, Italian Ministry of University and Research, and Scopus) and processed in order to calculate the citation-based measures us...

Quantitative Science Studies
The importance of open bibliographic repositories is widely accepted by the scientific community.... more The importance of open bibliographic repositories is widely accepted by the scientific community. For evaluation processes, however, there is still some skepticism: Even if large repositories of open access articles and free publication indexes exist and are continuously growing, assessment procedures still rely on proprietary databases, mainly due to the richness of the data available in these proprietary databases and the services provided by the companies they are offered by. This paper investigates the status of open bibliographic data of three of the most used open resources, namely Microsoft Academic Graph, Crossref, and OpenAIRE, evaluating their potentialities as substitutes of proprietary databases for academic evaluation processes. We focused on the Italian National Scientific Qualification (NSQ), the Italian process for University Professor qualification, which uses data from commercial indexes, and investigated similarities and differences between research areas, discipl...
Communications of the ACM, 2022
Personalized learning models can cut student dropout rates, boost student success, improve the in... more Personalized learning models can cut student dropout rates, boost student success, improve the integration of online and on-site students, better support teachers in mixed-teaching modalities, enhance accessibility, and more.

Datasets and results of the paper titled "Are citation networks relevant to explain academic promotions? An empirical analysis of the Italian national scientific qualification
These are the <strong>input datasets</strong> and the <strong>results of the an... more These are the <strong>input datasets</strong> and the <strong>results of the analyses</strong> reported on the paper titled <strong>"Are citation networks relevant to explain academic promotions? An empirical analysis of the Italian national scientific qualification"</strong>. <strong>Abstract:</strong> The aim of this paper is to study the role of citation network measures in the assessment of scientific maturity. Referring to the case of the Italian national scientific qualification (ASN), we investigate if there is a relationship between citation network indices and the results of the researchers' evaluation procedures. In particular, we want to understand if network measures can enhance the prediction accuracy of the results of the evaluation procedures beyond basic performance indices. Moreover, we want to highlight which citation network indices prove to be more relevant in explaining the ASN results, and if quantit...

Data and code of the article "Do open citations inform the qualitative peer-review evaluation in research assessments? An analysis of the Italian National Scientific Qualification
Dataset containing code and data for our <strong>"Do open citations inform the qualita... more Dataset containing code and data for our <strong>"Do open citations inform the qualitative peer-review evaluation in research assessments? An analysis of the Italian National Scientific Qualification"</strong> paper. Introduction Resulting data Reproducing the data collection Analyzing coverage Reproducing the citation network visualization Machine learning experiment 1. Introduction Our study is rooted in the context of the Italian National Scientific Qualification (NSQ). The NSQ is a national assessment exercise that qualifies scholars to the positions of Associate Professor and Full Professor. It consists of a quantitative and qualitative evaluation process, making use of both bibliometrics and a peer-review process. In the NSQ, academic disciplines are divided into two categories, i.e. citation-based disciplines (CDs) and non-citation-based disciplines (NDs). This division affects the bibliometrics used in the first part of the process. This study aims at e...

Predicting the Results of Evaluation Procedures of Academics: Additional Materials
Additional materials containing the results of the analyses described in the paper entitled "... more Additional materials containing the results of the analyses described in the paper entitled "Predicting the Results of Evaluation Procedures of Academics".In the tables, Precision (P), Recall (R) and F-Measure (FM) values are reported for each Recruitment Field (RF) and Area. The results are ordered in descending order with respect to the F-measure values. Non-bibliometric disciplines have a gray background.The data are organized as follows:<b>-Table 1:</b> contains the performance of the SVM algorithm for academic level I (Full Professor) using 291 predictors. Analysis of the 184 recruitment fields;<b>-Table 2:</b> contains the performance of the SVM algorithm for academic level II (Associate Professor) using 291 predictors. Analysis of the 184 recruitment fields;<b>-Table 3:</b> contains the performance of the SVM algorithm for academic level I (full Professor) and II (Associate Professor). Analysis of the scientific areas;<b>-Ta...
Prediction of the Results of the ASN 2012 using the 15 top predictors - Performance of the SVM algorithm for academic level I (Full Professor)
Performance of the SVM algorithm for academic level I (Full Professor) using the 15 top predictor... more Performance of the SVM algorithm for academic level I (Full Professor) using the 15 top predictors. Precision (P), Recall (R) and F-Measure (FM) values are reported for each Recruitment Field (RF). The results are ordered in descending order with respect to the F-measure values. Non-bibliometric disciplines have a gray background.
Prediction of the Results of the ASN 2012 - Performance of the SVM algorithm for academic level I (Full Professor)
Performance of the SVM algorithm for academic level I (Full Professor)}. Precision (P), Recall (R... more Performance of the SVM algorithm for academic level I (Full Professor)}. Precision (P), Recall (R) and F-Measure (FM) values are reported for each Recruitment Field (RF). The results are ordered in descending order with respect to the F-measure values. Non-bibliometric disciplines have a gray background.

International Conferences on Software Engineering and Knowledge Engineering, 2019
IoT, smart cities, cyber-physical systems and sensor networks are context-aware, highly dynamic a... more IoT, smart cities, cyber-physical systems and sensor networks are context-aware, highly dynamic and reactive systems. Their implementation should take into account the heterogeneity of their components and make easy the management of events unplanned at design time. According to these requirements, in this paper we propose an ontology-based approach to provide runtime models of the physical entities characterizing context-aware reactive systems. We extend SSN, a W3C standard ontology, to support complex reactive behaviors through the modeling of Logical Sensors and Actuators (LSA ontology); we also present a software architecture in which a knowledge base, structured coherently with this semantic model, is bound to real world entities by grounding (via web services) semantic elements to physical sensors and actuators. To validate the approach we discuss a case study related to smart buildings for cultural heritage preservation.

Analysing and Discovering Semantic Relations in Scholarly Data
Communications in Computer and Information Science, 2017
Scholarly publishing has seen an ever increasing interest in Linked Open Data (LOD). However, mos... more Scholarly publishing has seen an ever increasing interest in Linked Open Data (LOD). However, most of the existing datasets are designed as flat translation of legacy data sources into RDF. Although that is a crucial step to address, a lot of useful information is not expressed in RDF, and humans are still required to infer relevant knowledge by reading and making sense of texts. Examples are the reasons why authors cite other papers, the rhetorical structure of scientific discourse, bibliometric measures, provenance information, and so on. In this paper we introduce the Semantic Lancet Project, whose goal is to make available a LOD which includes the formalisation of some useful knowledge hidden within the textual content of papers. We have developed a toolchain for reengineering and enhancing data extracted from some publisher’s legacy repositories. Finally, we show how these data are immediately useful to help humans to address relevant tasks, such as data browsing, expert finding, related works finding, and identification of data inconsistencies.

ArXiv, 2021
In the past, several works have investigated ways for combining quantitative and qualitative meth... more In the past, several works have investigated ways for combining quantitative and qualitative methods in research assessment exercises. Indeed, the Italian National Scientific Qualification (NSQ), i.e. the national assessment exercise which aims at deciding whether a scholar can apply to professorial academic positions as Associate Professor and Full Professor, adopts a quantitative and qualitative evaluation process: it makes use of bibliometrics followed by a peer-review process of candidates’ CVs. The NSQ divides academic disciplines into two categories, i.e. citation-based disciplines (CDs) and non-citation-based disciplines (NDs), a division that affects the metrics used for assessing the candidates of that discipline in the first part of the process, which is based on bibliometrics. In this work, we aim at exploring whether citation-based metrics, calculated only considering open bibliographic and citation data, can support the human peer-review of NDs and yield insights on how...
ArXiv, 2021
In the past, several works have investigated ways for combining quantitative and qualitative meth... more In the past, several works have investigated ways for combining quantitative and qualitative methods in research assessment exercises. In this work, we aim at introducing a methodology to explore whether citation-based metrics, calculated only considering open bibliographic and citation data, can yield insights on how human peer-review of research assessment exercises is conducted. To understand if and what metrics provide relevant information, we propose to use a series of machine learning models to replicate the decisions of the committees of the research assessment exercises.

The need for scholarly open data is ever increasing. While there are large repositories of open a... more The need for scholarly open data is ever increasing. While there are large repositories of open access articles and free publication indexes, there are still a few examples of free citation networks and their coverage is partial. One of the results is that most of the evaluation processes based on citation counts rely on commercial citation databases. Things are changing under the pressure of the Initiative for Open Citations (I4OC), whose goal is to campaign for scholarly publishers to make their citations as totally open. This paper investigates the growth of open citations with an experiment on the Italian Scientific Habilitation, the National process for University Professor qualification which instead uses data from commercial indexes. We simulated the procedure by only using open data and explored similarities and differences with the official results. The outcomes of the experiment show that the amount of open citation data currently available is not yet enough for obtaining ...

ArXiv, 2021
The need for open scientific knowledge graphs is ever increasing. While there are large repositor... more The need for open scientific knowledge graphs is ever increasing. While there are large repositories of open access articles and free publication indexes, there are still few free knowledge graphs exposing citation networks, and often their coverage is partial. Consequently, most evaluation processes based on citation counts rely on commercial citation databases. Things are changing thanks to the Initiative for Open Citations (I4OC, https://i4oc.org) and the Initiative for Open Abstracts (I4OA, https://i4oa.org), whose goal is to campaign for scholarly publishers to open the reference lists and the other metadata of their articles. This paper investigates the growth of the open bibliographic metadata and open citations in two scientific knowledge graphs, OpenCitations’ COCI and Crossref, with an experiment on the Italian National Scientific Qualification (NSQ), the National process for University Professor qualification which uses data from commercial indexes. We simulated the p...
IFIP Advances in Information and Communication Technology, 2020
Tools are of paramount importance in automating software engineering tasks; although the Agile Ma... more Tools are of paramount importance in automating software engineering tasks; although the Agile Manifesto prefers "individuals and their interactions over processes and tools", some agile development activities make no exception and can be automated effectively and successfully. In process frameworks like Scrum or similar ones some activities are in fact quite structured and need specific tool support. Hence, it is interesting to study the combination of specific agile practices with OSS tools. In this paper we introduce the Compositional Agile System (CAS), an environment created to support iAgile and automate some of its tasks using OSS tools. iAgile is a Scrum-like model designed to develop critical systems in the military domain.
2017 International Joint Conference on Neural Networks (IJCNN), May 1, 2017
Proceedings of the 2013 ACM symposium on Document engineering, 2013
Recognising textual structures (paragraphs, sections, etc.) provides abstract and more general me... more Recognising textual structures (paragraphs, sections, etc.) provides abstract and more general mechanisms for describing documents independent of the particular semantics of specific markup schemas, tools and presentation stylesheets. In this paper we propose an algorithm that allows us to identify the structural role of each element in a set of homogeneous scientific articles stored as XML files.
Uploads
Papers by Francesco Poggi