Test Data Generation

description613 papers

group32 followers

lightbulbAbout this topic

Test Data Generation is the process of creating data sets that are used to validate the functionality, performance, and security of software applications. This involves producing synthetic or real data that mimics expected user inputs and system behaviors to ensure comprehensive testing and quality assurance.

lightbulbAbout this topic

Key research themes

1. How can constraint-based and symbolic execution techniques be used to automate fault-revealing test data generation?

This research area focuses on leveraging constraint logic, algebraic constraints, and symbolic execution to derive test data that satisfies specific fault detection criteria, improving testing effectiveness and automation. It matters because manual test data generation remains labor-intensive, and these formal methods provide systematic ways to approximate test set adequacy, including mutation adequacy, to detect faults more reliably.

Constraint-Based Automatic Test Data Generation

by Richard A DeMillo

2016

Key finding: Presented a mutation analysis-driven technique that formulates algebraic constraints representing test cases designed to detect specific fault types. The approach approximates relative adequacy (mutation adequacy) by... Read more

articleView Paper downloadDownload

ATGen: Automatic Test Data Generation using Constraint Logic Programming and Symbolic Execution

by Christophe Meudec

2016

Key finding: Proposed a novel approach combining symbolic execution with constraint logic programming to overcome traditional symbolic execution challenges in test data generation. The ATGen tool produces test inputs automatically by... Read more

articleView Paper downloadDownload

Enhancing path-oriented test data generation using adaptive random testing techniques

by saeed parsa

2023, 2015 2nd International Conference on Knowledge-Based Engineering and Innovation (KBEI)

Key finding: Introduced a divide-and-conquer approach based on adaptive random testing (ART) that computes tight over-approximations of input sub-domains for feasible paths via dynamic domain partitioning. The approach reduces invalid... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. How can evolutionary and search-based algorithms improve test data generation for object-oriented and path coverage testing?

This research theme explores applying evolutionary computing paradigms—such as genetic algorithms (GA), simulated annealing, and other metaheuristics—to automatically generate or select test data that maximize code coverage under structural criteria like statement, branch, and path coverage. These techniques are particularly relevant for the complexity of object-oriented programming features and hard-to-cover paths, enabling efficient search in large input spaces with optimization guidance.

Evolutionary Approaches to Test Data Generation for Object-Oriented Software

by Ana Filipa Nogueira

2022, Incorporating Nature-Inspired Paradigms in Computational Applications

Key finding: Surveyed metaheuristic search-based testing methods specialized for OO software, highlighting how evolutionary algorithms (EAs) such as genetic algorithms (GA) optimize test data to increase coverage criteria (e.g., statement... Read more

articleView Paper downloadDownload

Automatic Test Data Generation for Java Card Applications Using Genetic Algorithm

by Dr-Mohmmad Alshraideh

2016

Key finding: Applied genetic algorithms specifically to Java Card (JSC) applets to automatically generate minimal test data sets satisfying branch coverage criteria. The GA approach substantially reduced the number of test data needed and... Read more

articleView Paper downloadDownload

Search-Based Software Test Data Generation for Path Coverage Based on a Feedback-Directed Mechanism

by stuart semujju

2024, Complex system modeling and simulation

Key finding: Developed a feedback-directed search technique that dynamically groups paths and temporarily removes groups with stagnant fitness improvements, thus avoiding wasted search efforts on infeasible or difficult paths during... Read more

articleView Paper downloadDownload

Automatic Test Data Generation Using the Activity Diagram and Search-Based Technique

by aman jaffari

2021, Applied Sciences

Key finding: Proposed AutoTDGen, which automates test data generation by extracting data flow (definition-use pairs) information from UML activity diagrams as test bases, combined with a genetic algorithm to maximize coverage of these... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. What roles do combinatorial and pairwise testing strategies play in efficient test data set generation under resource constraints?

This theme addresses strategies for mitigating combinatorial explosion in test input spaces by focusing on covering combinations of input parameters to a specified interaction strength, primarily pairwise (2-wise) coverage. Efficient algorithms and benchmarking frameworks facilitate selecting minimal test suites that maximize testing effectiveness under practical constraints such as limited time and computational power.

IRPS – An Efficient Test Data Generation Strategy for Pairwise Testing

by Kamal Zuhairi Zamli

2023, Lecture Notes in Computer Science

Key finding: Proposed the IRPS algorithm as an efficient and deterministic strategy for pairwise test data generation that outperforms established methods such as AETG, IPO, simulated annealing, and ant colony optimization in producing... Read more

articleView Paper downloadDownload

An environment for benchmarking combinatorial test suite generators

by Andrea BOMBARDA

2025, 2021 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW)

Key finding: Developed a comprehensive benchmarking framework that integrates multiple combinatorial test suite generators (e.g., ACTS, CAgen, CASA, Medici, PICT), enabling fair and systematic evaluation across key metrics: test suite... Read more

articleView Paper downloadDownload

All papers in Test Data Generation

The INTUITION design process: structuring military multimodal interactive cockpits design according to the MVC design pattern

by didier bazalgette

2025

This article is concerned with the design and implementation of multimodal user interfaces. The use of multiple modalities such as vision, speech and gesture opens a vast world of possibilities in user interface design. Although the... more

descriptionView Paper arrow_downwardDownload

Evolutionary algorithm for prioritized pairwise test data generation

by Enrique Alba

2025

Combinatorial Interaction Testing (CIT) is a technique used to discover faults caused by parameter interactions in highly configurable systems. These systems tend to be large and exhaustive testing is generally impractical. Indeed, when... more

descriptionView Paper arrow_downwardDownload

Highly efficient evaluation design (HEED) for comparing algorithms used to detect nuclear materials

by Fred Roberts

2025

ID: 1750 Paul Kantor, Christie Nelson, Fred Roberts, William M. Pottenger CCICADA, Rutgers University Piscataway, New Jersey

descriptionView Paper arrow_downwardDownload

Genetic algorithm based test data generator

by Moataz Ahmed

2025, The 2003 Congress on Evolutionary Computation, 2003. CEC '03.

descriptionView Paper arrow_downwardDownload

On the Inference of Stochastic Regular Grammars

by Antony Van der Mude

2025, Information and Control

The relevance of grammatical inference techniques to the semiautomatic construction from empirical data, of a model of human decision making, is outlined. A grammatical inference problem is presented in which the least complex stochastic... more

descriptionView Paper arrow_downwardDownload

Autonomous Test Case Generation Using GenAI for Life Insurance Applications

by Chandra Shekhar Pareek

2025, International Journal of Multidisciplinary Research and Growth Evaluation

As the Life Insurance industry undergoes rapid digital transformation, the need for more intelligent and adaptive testing methods has become crucial. Traditional test case generation methods often struggle to keep pace with the industry's... more

descriptionView Paper arrow_downwardDownload

The limitations of genetic algorithms in software testing

by Mohammed ElSaid Ibrahim El-Telbany

2025, ACS/IEEE International Conference on Computer Systems and Applications - AICCSA 2010

Software test-data generation is the process of identifying a set of data, which satisfies a given testing criterion. For solving this difficult problem there were a lot of research works, which have been done in the past. The most... more

descriptionView Paper arrow_downwardDownload

WSDL-Based Automatic Test Case Generation for Web Services Testing

by Yinong Chen

2025

Web Services promote the specification-based cooperation and collaboration among distributed applications in an open environment. To ensure the quality of the services that are published, bound, invoked and integrated at runtime, test... more

descriptionView Paper arrow_downwardDownload

Continuous Testing in CI/CD Pipelines

by Vivek Jain

2025, INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH AND CREATIVE TECHNOLOGY

The rapid evolution of software development methodologies has placed increasing emphasis on the need for efficiency, reliability, and speed in delivering high-quality applications. Continuous Integration and Continuous Deployment (CI/CD)... more

descriptionView Paper arrow_downwardDownload

A Survey of the use of Genetic Algorithms in Structural Testing

by Megan Cifuentes

2025

This is a survey paper of the user of Genetic Algorithms in Structural Testing. Structural testing like statement coverage or branch coverage can be automated through a variety of algorithms. Genetic Algorithms can be very useful in... more

descriptionView Paper arrow_downwardDownload

Automated test data generation using an iterative relaxation method

by Mary Lou Soffa

2025, ACM Sigsoft Software Engineering Notes

An important problem that arises in path oriented testing is the generation of test data that causes a program to follow a given pat.h. In this paper, we present a novel program execution based approach using an iterative relaxation... more

descriptionView Paper arrow_downwardDownload

Constraint Reasoning in FocalTest

by Catherine Dubois

2025, HAL (Le Centre pour la Communication Scientifique Directe)

Property-based testing implies selecting test data satisfying coverage criteria on user-specified properties. However, current automatic test data generation techniques adopt direct generate-and-test approaches for this task. In... more

descriptionView Paper arrow_downwardDownload

GENERATIVE ADVERSARIAL NETWORKS IN BUSINESS ANALYTICS SIMULATING MARKET DYNAMICS FOR STRATEGIC CONSULTING

by Daria Kalishina

2024, International Journal of Business Quantitative Economics and Applied Management Research

This paper explores the integration of Generative Adversarial Networks (GANs) into business analytics, particularly in simulating market dynamics for strategic consulting. By leveraging GANs' ability to generate high-fidelity simulations... more

descriptionView Paper arrow_downwardDownload

Automatic Test Data Generation for Data Flow Testing Using a Genetic Algorithm

by Moheb Girgis

2024, Zenodo (CERN European Organization for Nuclear Research)

One of the major difficulties in software testing is the automatic generation of test data that satisfy a given adequacy criterion. This paper presents an automatic test data generation technique that uses a genetic algorithm (GA), which... more

descriptionView Paper arrow_downwardDownload

Synthetic Transactions in Financial Systems - A Pathway to Real-Time Transaction Simulation

by Chandra Shekhar Pareek

2024, International Journal of Computer Techniques

Synthetic transactions have become a pivotal enabler for financial institutions striving to achieve unparalleled system integrity, unwavering operational performance, and exceptional customer satisfaction in a landscape dominated by... more

descriptionView Paper arrow_downwardDownload

Test Data Management - Trends Charting the Future of Software Quality Assurance

by Chandra Shekhar Pareek

2024, International Journal of Computer Techniques

Test data plays a pivotal role in the software testing lifecycle, enabling quality assurance teams to emulate real-world scenarios while safeguarding sensitive information. It is essential for ensuring comprehensive test coverage,... more

descriptionView Paper arrow_downwardDownload

Case Studies with Lurette V2

by Philippe Baufreton

2024, Leveraging Applications of Formal Methods

Lurette is an automated testing tool dedicated to reactive programs. The test process is automated at two levels: given a formal description of the System Under Test (SUT) environment, Lurette generates realistic input sequences; and,... more

descriptionView Paper arrow_downwardDownload

Compositional CLP-Based Test Data Generation for Imperative Languages

by José Siles

2024, Lecture Notes in Computer Science

Glass-box test data generation (TDG) is the process of automatically generating test input data for a program by considering its internal structure. This is generally accomplished by performing symbolic execution of the program where the... more

descriptionView Paper arrow_downwardDownload

Using Continuous Code Change Analysis to Understand the Practice of Refactoring

by Ralph Johnson

2024

Despite the enormous success that manual and automated refactoring has enjoyed during the last decade, we know little about the practice of refactoring. Understanding the refactoring practice is important for developers, refactoring tool... more

descriptionView Paper arrow_downwardDownload

An Intelligent Apitesting: Unleashing the Power of AI

by Rohit khankhoje

2024, International journal of software engineering and applications

In the continually evolving domain of software development, guaranteeing the dependability and functionality of Application Programming Interfaces (APIs) is of utmost importance. Traditional approaches to API testing frequently encounter... more

descriptionView Paper arrow_downwardDownload

Case Studies with Lurette V2

by Philippe Baufreton

2024, HAL (Le Centre pour la Communication Scientifique Directe)

descriptionView Paper arrow_downwardDownload

Comparison of Back Stepping Optimized via PSO Algorithm and LQR Controllers for a Quadrotor

by Niloofar Parhizkar

2024

هلاقم تاعلاطا هدیکچ لماک یشهوژپ هلاقم :تفایرد 17 یدرورف ن 1396 :شریذپ 05 ادرخ د 1396 :تیاس رد هئارا 13 دادرم 1396 هنیهب بقع هب ماگ رلرتنک ود درکلمع هسیاقم هب هلاقم نیا رلرتنک و تارذ ماحدزا متیروگلا اب هدش LQR رواه تلاح رد روتورداوک کی یور... more

descriptionView Paper arrow_downwardDownload

1 Automatic Generation of Test Sequences form EFSM Models Using Evolutionary Algorithms

by AbdulSalam Kalaji

2024

Automated test data generation through evolutionary testing (ET) is a topic of interest to the software engineering community. While there are many ET-based techniques for automatically generating test data from code, the problem of... more

descriptionView Paper arrow_downwardDownload

Review Article APPLICATIONS OF GENETIC ALGORITHM IN SOFTWARE TESTING

by shivam pandey

2024

The applicability of evolutionary algorithms in software testing has been an area of importance for many researchers. In this paper, we have studied the implementation of one such evolutionary algorithm namely genetic algorithm. Genetic... more

descriptionView Paper arrow_downwardDownload

Lutess

by Lydie du Bousquet

2024

Several studies have shown that automated testing is a promising approach to save significant amounts of time and money in the industry of reactive software. But automated testing requires a formal framework and adequate means to generate... more

descriptionView Paper arrow_downwardDownload

Observations in using parallel and sequential evolutionary algorithms for automatic software testing

by Enrique Alba

2024, Computers & Operations Research

In this paper we analyze the application of parallel and sequential evolutionary algorithms to the automatic test data generation problem. The problem consists of automatically creating a set of input data to test a program. This is a... more

descriptionView Paper arrow_downwardDownload

An applicable test data generation algorithm for domain errors

by Istvan Forgacs

2024, Proceedings of the 1998 ACM SIGSOFT international symposium on Software testing and analysis

An integrated tes ing criterion is proposed that extends traditional criteria to ~e effective to reveal domain errors. The method requires many fevJer test cases and is, applicable for any kind of predicates. An au :omated test data... more

descriptionView Paper arrow_downwardDownload

Search-based Testing for Embedded Telecommunication Software with Complex Input Structures: An Industrial Case Study

by Sigrid Eldh

2024

In this paper, we discuss the application of search-based software testing techniques for unit level testing of a real-world telecommunication middleware at Ericsson. Input data for the system under test consists of nested data... more

descriptionView Paper arrow_downwardDownload

Search-Based Testing for Embedded Telecom Software with Complex Input Structures

by Sigrid Eldh

2024, Springer eBooks

In this paper, we discuss the application of search-based software testing techniques for unit level testing of a real-world telecommunication middleware at Ericsson. Our current implementation analyzes the existing test cases to handle... more

descriptionView Paper arrow_downwardDownload

MC-MIPOG: A Parallel t-Way Test Generation Strategy for Multicore Systems

by Kamal Zuhairi Zamli

2024, ETRI Journal

Fig. 3. Generation of test set using IPOG.

Fig. 4. Generation of test set using MIPOG. the most uncovered way combinations whenever possible. This is efficiently done when there are don’t care values. This step, while improving the test size, also increases the overall computation of MIPOG

Table 1. Size ratio results for 5 to 15 parameters with 5 values in 4-way testing. Table 2. Size ratio results for 10 parameters with 2 to 10 values in 4-way testing.

Table 3. Size ratio results for 10 parameters with 5 values for ¢= 2 to 7.

Table 4. Speedup results for 5 to 15 parameters with 5 values in 4-way testing.

Table 5. Speedup results for 10 parameters with 2 to 10 values in 4-way testing.

Table 6. Speedup results for 10 parameters with 5 values for ¢ = 2 to 7.

Table 7. Comparative test size results using the TCAS module for t= 2 to 12. Table 8. Comparative test generation time using the TCAS module for t= 2 to 12.

terms of test size and the number of generated test sets, we adopt a common configuration system, the TCAS module. The TCAS module is an aircraft collision avoidance system developed by the Federal Aviation Administration which has been used as case study in other related works [2], [11]. The TCAS module has twelve parameters; seven parameters have 2 values, two parameters have three values, one parameter has four values, and two parameters have 10 values. As seen in Table 4, the speedup increases linearly as the number of parameters increases. Here, extra overhead is added for the fifth parameters due to the need to start and shut down the corresponding threads. As seen in Table 5, the speedup gain also increases quadratically as the number of values increases. Extrapolating and performing curve fitting of the results from Table 6, we observe that the speedup increases logarithmically as the strength of coverage increases. In this case, there is also no speedup gain for this strategy when ¢ = 2, possibly due to the overhead required for creation, synchronization, and deletion of threads for a small degree of interaction.

descriptionView Paper arrow_downwardDownload

A Deterministic T-Way Strategy for Test Data Minimization

by Kamal Zuhairi Zamli

2024, Proc. International Conference on IT to Celebrate S. Charmonman's 72nd Birthday

Abstract-In order to meet market demands for quality software products, software engineers are increasingly under pressure to test more lines of codes. To maintain acceptable test coverage, software engineers need to consider a... more

descriptionView Paper arrow_downwardDownload

Leveraging user-session data to support Web application testing

by Marc Fisher II

2024, IEEE Transactions on Software Engineering

Web applications are vital components of the global information infrastructure, and it is important to ensure their dependability. Many techniques and tools for validating web applications have been created, but few of these have... more

descriptionView Paper arrow_downwardDownload

Atmospheric correction of ocean color imagery: use of the Junge power-law aerosol size distribution with variable refractive index to handle aerosol absorption

by Roman Chomko

2024, Applied Optics

When strongly absorbing aerosols are present in the atmosphere, the usual two-step procedure of processing ocean color data-͑1͒ atmospheric correction to provide the water-leaving reflectance ͑ w ͒, followed by ͑2͒ relating w to the water... more

descriptionView Paper arrow_downwardDownload

Review Article APPLICATIONS OF GENETIC ALGORITHM IN SOFTWARE TESTING

by Shivam Pandey

2024

descriptionView Paper arrow_downwardDownload

A REVIEW OF DEEP GENERATIVE MODELS FOR SYNTHETIC FINANCIAL DATA GENERATION

by IAEME Publication

2024, IAEME PUBLICATION

In today's financial landscape, the availability of high-quality data is essential for decision-making, risk management, and innovation. However, accessing real-world financial data can be challenging due to privacy concerns, data access... more

descriptionView Paper arrow_downwardDownload

Contract-based testing for PHP with Praspel

by Fabrice Bouquet

2024, Journal of Systems and Software

We summarize several contributions related to the PHP Realistic Annotation and SPEcification Language (Praspel). This language extends PHP programs with annotations for the formal specification of the behavior of their functions and for... more

descriptionView Paper arrow_downwardDownload

Experiment and comparison of automated static code analyzers and automated dynamic tests

by Karen J Smiley

2024

Code review and inspection techniques are considered vital for defect detection during analysis and design. Automated static code analyzers are essentially an approach to performing code reviews and inspections in an efficient and timely... more

descriptionView Paper arrow_downwardDownload

Experiment and comparison of automated static code analyzers and automated dynamic tests

by Karen J Smiley

2024

descriptionView Paper arrow_downwardDownload

Search Based Software Engineering Techniques

by aman jatain

2024

Based Software Engineering (SBSE) is the field of Software Engineering that helps in solving the problems using metaheuristic approach rather than solving the problems manually i.e. it helps in providing the automated solution for the... more

descriptionView Paper arrow_downwardDownload

Mutation Analysis for Reactive System Environment Properties

by Vũ Khao Đỗ

2024, Second Workshop on Mutation Analysis (Mutation 2006 - ISSRE Workshops 2006)

Reactive systems used in safety-critical domains demand high level of confidence. The development of these systems, which are submitted to several normative recommendations, is complex and expensive. Reactive systems can be developed by... more

descriptionView Paper arrow_downwardDownload

Feasible test path selection by principal slicing

by Istvan Forgacs

2024, Lecture Notes in Computer Science

We propose to improve current path-wise methods for automatic test data generation by using a new method named principal slicing. This method statically derives program slices with a near minimum number of influencing predicates, using... more

descriptionView Paper arrow_downwardDownload

Qex: Symbolic SQL Query Explorer

by Jonathan Halleux

2024, Lecture Notes in Computer Science

We describe a technique and a tool called Qex for generating input tables and parameter values for a given parameterized SQL query. The evaluation semantics of an SQL query is translated into a specific background theory for a... more

descriptionView Paper arrow_downwardDownload

A Novel Fitness function of metaheuristic algorithms for test data generation for simulink models based on mutation analysis

by Lê Mỹ Hạnh

2024, Journal of Systems and Software

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of... more

Formular to compute the value of Fitness Function

value of the signal is true then after being passed through the VNO mutant

Fitness Function for the Compare to Constant Block of the CRO Operator fr, = 0 VbiasConst 4 0, with biasConst being the constant value of!

Fitness Function for the CCO Operator this mutant is killed when at least one input is equal to 0 but not all

The stubborn Mutants Description In additon, Zhan used the SA whereas we utilized the multi-parent crossover

The results of Experiment 1 fitness function is better than the SAin guidance for the process of generating

based on the mutation score criterion in difficult test-data generation cases

descriptionView Paper arrow_downwardDownload

Software Testing Using Genetic Algorithms

by Mitrabinda Ray

2024, Advances in Computer Science and Engineering: Texts

This paper presents a set of methods that uses a genetic algorithm for automatic test-data generation in software testing. For several years researchers have proposed several methods for generating test data which had different drawbacks.... more

descriptionView Paper arrow_downwardDownload

Software Testing Using Genetic Algorithms

by Akshat Sharma

2024, Advances in Computer Science and Engineering: Texts

descriptionView Paper arrow_downwardDownload

Interactivity in the Generation of Test Cases with Evolutionary Computation

by Kevin J. Valle-Gomez

2024, 2021 IEEE Congress on Evolutionary Computation (CEC)

Test generation is a costly but necessary testing activity to increase the quality of software projects. Automated testing tools based on evolutionary computation principles constitute an appealing modern approach to support testing... more

descriptionView Paper arrow_downwardDownload

Programming Without Refinement

by Wided Ghardallou

2024

To derive a program for a given specification R means to find an artifact P that satisfies two conditions: P is executable in some programming language; and P is correct with respect to R. Refinementbased program derivation achieves this... more

descriptionView Paper arrow_downwardDownload

Validation of information system models: Petri nets and test case generation

by Jörg Desel

2024

High-level Petri nets are a graphical language for the modeling of distributed information systems. Petri nets can be validated by simulation. In this paper, a technique is proposed which generates test cases for the simulation of... more

descriptionView Paper arrow_downwardDownload

Comparative Analysis of Various Testing Techniques used for Aspect-Oriented Software System

by Susheela Hooda

2024, Indonesian Journal of Electrical Engineering and Computer Science

Nowadays, Aspect-Oriented Programming (AOP) paradigm is getting more popularity in the field of software development. But testing an Aspect-oriented software system (AOSS) is not well matured. Therefore, many researchers have been... more

descriptionView Paper arrow_downwardDownload

PROTEUM/IM: Uma Ferramenta de Apoio ao Teste de Integração

by Jose Maldonado

2024, Anais do XI Simpósio Brasileiro de Engenharia de Software (SBES 1997)

descriptionView Paper arrow_downwardDownload