Academia.eduAcademia.edu

Automated Evaluation

description16 papers
group21 followers
lightbulbAbout this topic
Automated evaluation refers to the use of algorithms and software tools to assess and score responses, performances, or outputs in various domains, such as education, natural language processing, and software development, without human intervention. It aims to enhance efficiency, consistency, and objectivity in the evaluation process.
lightbulbAbout this topic
Automated evaluation refers to the use of algorithms and software tools to assess and score responses, performances, or outputs in various domains, such as education, natural language processing, and software development, without human intervention. It aims to enhance efficiency, consistency, and objectivity in the evaluation process.

Key research themes

1. How can automated evaluation enhance programming education through precise and fair assessment with feedback?

This research area investigates the development and implementation of automated tools to accurately assess programming assignments, aiming to reduce manual grading errors, improve efficiency, and standardize evaluation. It focuses on both correctness and qualitative assessment dimensions, such as code structure, style, and performance, and addresses the challenge of providing detailed, consistent feedback to learners.

Key finding: This paper presents JavAssess, a Java library enabling both black-box and white-box assessment methods that automatically inspect, test, mark, and correct student Java code. The controlled university study with 535 students... Read more
Key finding: The described system fully automates testing, grading, and feedback generation for student programming assignments (currently C language), incorporating multiple test dimensions including random and user-defined inputs,... Read more
Key finding: This research introduces a web-based system for automated grading of programming assignments with instructor-created questions and automatic generation of test cases. A survey with 30 teachers showed significant positive... Read more
Key finding: Through a bibliometric study (2010–2022), this work identified that continuous, individualized, and timely feedback is critical in automated programming assessment to enhance learner progress. The analysis underscores recent... Read more
Key finding: This project develops a digital platform leveraging Optical Character Recognition (OCR) and keyword-based methods to automate handwritten answer evaluation, aiming to significantly reduce teacher workload and eliminate bias.... Read more

2. How can implicit user behavior and interaction data be utilized for automated evaluation of intelligent assistants' effectiveness?

This research theme examines methods for automatically evaluating voice-activated intelligent assistants by leveraging implicit user feedback such as interaction patterns, satisfaction metrics, and acoustic signals. The goal is to create consistent, scalable, and task-agnostic evaluation frameworks that overcome the challenges posed by diverse, evolving tasks and reduce reliance on costly, manual ground-truth annotations.

Key finding: The paper introduces a novel model that categorizes user-system interactions into task-independent dialog actions and uses Markov models plus features from requests, responses, clicks, and acoustic signals to predict user... Read more

3. What are the challenges and solutions in automating subjective evaluation and feedback for written responses and complex open-ended answers?

This theme focuses on automating the evaluation of subjective, open-ended responses—such as essays, summaries, and diagrams—using natural language processing, semantic similarity measures, and diagrammatic analyses. It addresses the tension between capturing writing quality, providing instructional feedback, and ensuring evaluation fairness, especially when using indirect measures and avoiding superficial text features.

Key finding: The paper critically assesses early automated essay scoring methods that predominantly relied on surface textual features (e.g., essay length, word frequency) shown to correlate moderately with human scores (multiple R =... Read more
Key finding: This study proposes a fully automated evaluation framework that compares student-generated diagrammatic summaries against concept graphs extracted directly from the source scientific text, requiring no expert input. It uses... Read more
Key finding: The work develops an intelligent evaluation system (IES) that measures semantic and syntactic similarity between student answers and model answers using techniques like TF-IDF weighting and cosine similarity. It enables... Read more
Key finding: This paper surveys existing AI-based technologies for automating evaluation of subjective answers in online exams, comparing approaches using keyword matching, cosine similarity, Jaccard similarity, and transformer-based... Read more

All papers in Automated Evaluation

Nowadays, manually assessing students’ programming exercises has been identified as among of the toughest tasks to lecturers of programming courses on top of their high routine workloads.Thus, Automatic Programming Assessment (or APA) has... more
Introduction. Writing foreign-language creative writing assignments is one of the goals of foreign language teaching in higher education. Modern AI tools (ChatGPT 4.0) are able to provide learners with evaluative feedback and... more
Introduction. The development of students' foreign language creative writing skills is a component of the goal of foreign language teaching in higher education. The effectiveness of the development of students' writing skills is largely... more
Technology is defined as the use of scientific knowledge to solve practical problems. However, educators’ initiatives to integrate technology have been mostly prohibitively expensive. In this context, researchers proposed the automation... more
Technology is defined as the use of scientific knowledge to solve practical problems. However, educators’ initiatives to integrate technology have been mostly prohibitively expensive. In this context, researchers proposed the automation... more
Technology is defined as the use of scientific knowledge to solve practical problems. However, educators’ initiatives to integrate technology have been mostly prohibitively expensive. In this context, researchers proposed... more
Computer programming captivates the attention of both professionals and young learners due to its multidisciplinary applications and high paid jobs. However, mastering computer programming requires critical thinking and consistent... more
This paper proposes a solution to evaluate summary of a scientific article through diagram analysis. The model diagram used for evaluation is constructed solely base on the reading text, and does not require extra input from human... more
Within this study, the authors want to address the problem of overworking of teachers in Philippine schools due to their excessive clerical responsibility, which could lead to teacher attrition. The authors propose to automate the... more
Being the most used method for dissemination of information, especially for public services, it is of paramount importance that the Web is made accessible as to allow all its users to access the content of its pages. In this paper, we... more
Within this study, the authors want to address the problem of overworking of teachers in Philippine schools due to their excessive clerical responsibility, which could lead to teacher attrition. The authors propose to automate the... more
This paper proposes a new idea for grading multiple-choice test which is to develop a method to use a personal computer plus a scanner and a program based application, that will grade a specially designed MCQ exam test and feedback... more
Within this study, the authors want to address the problem of overworking of teachers in Philippine schools due to their excessive clerical responsibility, which could lead to teacher attrition. The authors propose to automate the... more
Within this study, the authors want to address the problem of overworking of teachers in Philippine schools due to their excessive clerical responsibility, which could lead to teacher attrition. The authors propose to automate the... more
A preliminary version of a platform for automated, remote, insitu user experience measurement called TUMCAT was evaluated. The use of peer-to-peer software was monitored with it during five weeks and subjective data were gathered with the... more
Within this study, the authors want to address the problem of overworking of teachers in Philippine schools due to their excessive clerical responsibility, which could lead to teacher attrition. The authors propose to automate the... more
A preliminary version of a platform for automated, remote, insitu user experience measurement called TUMCAT was evaluated. The use of peer-to-peer software was monitored with it during five weeks and subjective data were gathered with the... more
El programa EssA fue creado en 2004 como un instrumento para ayudar a valorar narraciones en el área de lengua extranjera. El análisis de regresión múltiple, según variables léxico-gramaticales, determinó una ecuación de regresión que... more
Download research papers for free!