Academia.edu

Decision Tree Algorithm

45 papers
1 follower
About this topic
The Decision Tree Algorithm is a supervised machine learning technique used for classification and regression tasks. It models decisions and their possible consequences as a tree structure, where each internal node represents a feature, each branch represents a decision rule, and each leaf node represents an outcome, facilitating interpretability and decision-making.
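The node, branch, and leaf structure described above can be sketched in a few lines of Python. This is a minimal illustration only, not taken from any paper listed here: the `Node` and `Leaf` classes, the feature names, and the thresholds are all hypothetical, and the decision rule is assumed to be a simple numeric threshold test.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    outcome: str  # the outcome (class label) stored at this leaf


@dataclass
class Node:
    feature: str      # feature tested at this internal node
    threshold: float  # assumed decision rule: go left if value <= threshold
    left: Union["Node", Leaf]
    right: Union["Node", Leaf]


def predict(tree, sample):
    """Follow the decision rules from the root until a leaf is reached."""
    while isinstance(tree, Node):
        if sample[tree.feature] <= tree.threshold:
            tree = tree.left
        else:
            tree = tree.right
    return tree.outcome


# Hypothetical hand-built tree; feature names and thresholds are invented.
toy_tree = Node(
    "humidity", 70.0,
    left=Leaf("play"),
    right=Node("wind_speed", 20.0, left=Leaf("play"), right=Leaf("stay_in")),
)

print(predict(toy_tree, {"humidity": 80.0, "wind_speed": 25.0}))
```

Because every prediction is a single root-to-leaf path of explicit tests, the path itself serves as the explanation for the outcome, which is the interpretability property mentioned above.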

Key research themes

1. How can decision tree algorithms be optimized and extended to improve classification accuracy and computational efficiency?

This research theme explores methods to enhance traditional decision tree algorithms like ID3, C4.5, and CART by integrating optimization techniques such as genetic algorithms and analyzing computational complexity. The goal is to achieve higher classification accuracy, reduce tree size and overfitting, and improve algorithm runtime performance especially on large and complex datasets.
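For orientation, the entropy-based attribute selection that ID3-style algorithms rely on can be sketched as follows. This is a generic textbook formulation, not the procedure of any specific paper listed under this theme; the function names and the toy data are assumptions made for illustration. C4.5 is usually described as using a normalized variant (gain ratio), and CART commonly uses the Gini index instead.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def information_gain(rows, labels, feature):
    """Entropy reduction obtained by splitting on a categorical feature."""
    total = len(labels)
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[feature], []).append(label)
    # Weighted average entropy of the subsets produced by the split.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Invented toy data: "outlook" separates the classes, "windy" does not.
rows = [
    {"outlook": "sunny", "windy": True},
    {"outlook": "sunny", "windy": False},
    {"outlook": "rain", "windy": True},
    {"outlook": "rain", "windy": False},
]
labels = ["no", "no", "yes", "yes"]

best = max(["outlook", "windy"], key=lambda f: information_gain(rows, labels, f))
print(best)
```

A top-down learner repeats this selection at every node on the subset of data reaching that node, so the gain computation is evaluated many times during tree construction.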

Key finding: This study showed that integrating a genetic algorithm (GA) with the C4.5 decision tree algorithm can optimize and simplify the search rules, leading to improved accuracy (with an average of 75%) and reduced complexity in the... Read more
Key finding: The paper demonstrates that a genetic algorithm can be effectively applied to prune the C4.5 decision trees across four UCI datasets, resulting in reduced tree size and mitigating overfitting while maintaining or improving... Read more
Key finding: This survey presented enhanced versions of the basic ID3 algorithm (e.g., FID3 and VPRSFID3) incorporating rough set theory to handle uncertainty and imprecision, which improved attribute selection and classification... Read more
Key finding: Through theoretical and experimental analysis, this work identified that classical top-down decision tree algorithms (such as C4.5 and CART) have computational bottlenecks primarily due to entropy gain calculations and data... Read more
Key finding: Utilizing evolutionary multi-objective optimization with genetic programming, this paper established decision trees that minimize empirical error rates for each tree size, enabling structural risk minimization. The approach... Read more

2. What novel algorithmic and representational approaches enhance decision tree learning especially in online and multi-class classification contexts?

This theme focuses on the development of new frameworks and hybrid models that extend classical decision tree approaches to better handle online learning scenarios, multi-class problems, and feature selection. It includes reinforcement learning formulations for adaptive tree induction, combination of support vector machines with decision trees, and techniques for efficient split point determination to improve the scalability and flexibility of decision trees.

Key finding: This study introduced RLDT, an online decision tree learning algorithm that models tree growth as a Markov Decision Process solved via reinforcement learning. RLDT actively selects a minimal set of features for... Read more
Key finding: This work proposed the SVM-based Binary Decision Tree Architecture (SVM-DTA) that integrates support vector machines (SVMs) within a binary decision tree framework to solve multiclass classification problems. By clustering... Read more
Key finding: The paper proposed alternative dynamic discretization methods that generate a small, computationally efficient set of candidate split points for continuous predictor variables during tree construction. These methods eliminate... Read more
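To make the idea of a reduced candidate split set concrete, the sketch below shows one widely used heuristic: only midpoints between adjacent distinct values where the class labels are not uniformly the same are kept as candidates. This is an assumption-laden illustration, not the discretization method proposed in the paper summarized above (its details are truncated here), and the function name `candidate_splits` is hypothetical.

```python
def candidate_splits(values, labels):
    """Return midpoints between adjacent distinct values of a continuous
    feature, keeping only those where the two neighbouring value groups
    are not both pure and of the same class."""
    by_value = {}
    for v, y in zip(values, labels):
        by_value.setdefault(v, set()).add(y)
    distinct = sorted(by_value)
    cuts = []
    for lo, hi in zip(distinct, distinct[1:]):
        # More than one label across the two groups marks a class boundary.
        if len(by_value[lo] | by_value[hi]) > 1:
            cuts.append((lo + hi) / 2)
    return cuts


# Invented toy data: five distinct values give four possible midpoints,
# but only two lie on a class boundary.
print(candidate_splits([1.0, 2.0, 3.0, 4.0, 5.0], ["a", "a", "b", "b", "a"]))
```

Evaluating a split criterion only at these boundary midpoints, rather than between every pair of adjacent values, reduces the number of evaluations per continuous feature.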

3. How are decision tree algorithms applied effectively in domain-specific contexts such as medical diagnosis, environmental prediction, and industrial classification?

This theme synthesizes research applying decision tree algorithms in practical real-world scenarios. It highlights domain-adapted decision tree implementations that combine appropriate variants or enhancements of decision trees to model complex phenomena such as disease diagnosis, rainfall-induced landslides, industrial material classification, and fall detection, demonstrating interpretability and competitive performance in respective applications.

Key finding: Using the QUEST decision tree algorithm on 750 patients with serum PSA levels between 0 and 10 ng/mL, the study identified five distinctive risk node groups that stratify prostate cancer probabilities. The derived decision... Read more
Key finding: Decision Tree modeling was applied to rainfall and landslide data from Western Serbia (2001–2014) to identify threshold conditions leading to massive landslides. The analysis indicated that mid-term rainfall (2–3 days... Read more
Key finding: This research developed a classification strategy integrating Laser-Induced Breakdown Spectroscopy (LIBS) spectral data with a decision tree-based machine learning algorithm to categorize waste refractories into 10 classes... Read more
Key finding: Leveraging depth camera input, a pose-invariant randomized decision tree algorithm was developed to extract human body joints, coupled with a support vector machine classifier analyzing 3D head joint trajectories for fall... Read more
Key finding: Applying decision tree classification on breast cancer datasets from Kaggle, the proposed method achieved classification accuracy of up to 100% in initial trials and 97.9% upon validation with larger samples. The decision... Read more

All papers in Decision Tree Algorithm

Hydroponics is a new agricultural production system in which production takes place in a soilless medium using water. The hydroponic system requires a controlled environment for the proper growth of plants, less chance of diseases and... more
In this paper, we present a model based on the Neural Network (NN) for classifying Arabic texts. We propose the use of Singular Value Decomposition (SVD) as a preprocessor of NN to reduce the data in terms of both size as well as... more
Data mining refers to extracting knowledge from large amounts of data. Real life data mining approaches are interesting because they often present a different set of problems for data miners. The process of designing a model helps to... more
As per the National Employability Report 2016, and looking at the current trend, it is very clear that unemployability among young IT graduates is increasing, and the major reason can be related to the lack of skill set among IT students... more
The elderly population is increasing rapidly all over the world. One major risk for elderly people is fall accidents, especially for those living alone. In this paper, we propose a robust fall detection approach by analyzing the... more
Information advances have appeared to progress rural efficiency in several ways. A procedure that's rising as a valuable device in picture handling. This report presents a brief overview of the use of picture handling strategies to... more
Players are the most important and valuable assets of sports clubs; their contracts cover most of the clubs' budgets. The present study aimed to investigate the role of those factors related to players’ valuation and predict the amount of... more
During the course of decision-making in case-based planning,... potential states of objects is defined first before operator definition. To acquire completely all states of an object is usually very difficult. Moreover, new situations always... more
This paper focuses on modeling rainfall-induced massive landsliding in Western Serbia in the 2001-2014 period. The motivation for conducting the study was the rainfall-induced flooding and landsliding that took place across most of... more
This proposed work concerns the energy crisis, a major problem in India. Presently, most of our street lighting systems are manually operated, which leads to energy loss. This research deals with the design and implementation of simple yet... more
The movie industry is one of the most important branches of the entertainment industry, which generates a lot of revenue. The person playing a big role in this aspect is the producer as they are in charge of funding needed to produce the... more
Pulsars are rapidly rotating neutron stars. These very interesting objects have fascinated astronomers ever since they were first discovered. Over time, telescopes have become better and better, and so the number of discovered pulsars has grown. Manually... more
Refractories are materials that can withstand high temperatures and maintain their mechanical functions and properties over a long time, even in contact with corrosive liquids or gases. These materials are indispensable for all... more
Parkinson's disease is one of the most common nervous system disorders after Alzheimer's. It is a neurodegenerative disorder. Early detection of Parkinson's disease can be helpful in preventing it but it cannot be completely cured. In... more
This paper presents a methodology which classifies urine samples into normal hydration or dehydration labels using image processing techniques and the XGBoost ensemble boosting model. Images of urine samples were captured by the digital... more
A number of evolutionary algorithms such as genetic algorithms, simulated annealing, particle swarm optimization, etc., have been used by researchers in order to optimize different manufacturing processes. In many cases these algorithms... more
These data address student achievement in secondary education at two Portuguese schools. The data attributes include student grades and demographic, social, and school-related features, and they were collected by using school reports and... more
Construction projects are taken from the past literature review. The literature reviews are summarized, and the delay framework is built to support the literature review outline.
Improving community mobility is a common goal for persons with stroke. Measuring daily physical activity is helpful to determine the effectiveness of rehabilitation interventions. In our previous studies, a novel wearable shoe-based... more
Paani Foundation is a non-profit, non-governmental organization active in the area of drought prevention and watershed management in the state of Maharashtra, India. The organization was established by Indian actor Amir... more
Maintaining the environment for the growth of crops in a greenhouse is difficult. In a world of global warming, it is hard to maintain environmental factors. A system needs to be developed for monitoring and controlling greenhouse tasks. Use of... more
As malware is increasing nowadays, it is necessary to safeguard your system from it. Traditional methods offer protection, but only against malware whose signature is known. So we aim to prepare a... more
Microbial secondary metabolites are one of the sources of therapeutic molecules in the pharmaceutical industry. Product quality and high yields of secondary metabolites are the main goals for the commercial success of a fermentation... more
Wheat is an important cash crop in the Rwandan market. The wheat market in Rwanda is made up of small and medium holder farmers who are not participating in the market adequately, despite the attention given by the Government to increase the... more
This research describes the water dispenser volume monitoring system inside room H118. The volume of water is obtained from weight measurements using a load cell. To determine the feasibility of the room used for weight measurement, DHT-11... more
This paper describes the implementation of an automatic control system for aeroponics. This system contains a LattePanda as the main processing unit and an adjustable interface to suit the needs of the cultivated plant. It gets input from... more
Food production is an important factor in our day-to-day life. Farming is the method of food production. There are many problems that affect food production. One of the main solutions for the problems of the farmers is good crop... more
In the present study, we tried to assess the feasibilities of possible effective and safe utilization of fly ash as soil amendment in north Rajasthan wheat field and its impact on wheat plants, especially at Biochemical (Protein, Starch... more
This paper outlines a remote monitoring system of temperature, humidity, gas and light control for cold storage warehouses. In these warehouses, it is important that farm produce stays fresh. Unfortunately, when an administrator... more
In India, agriculture provides employment opportunities for rural people on a large scale. It is an important source of livelihood, yet our agriculture sector is still facing challenges. One of those challenges is productivity of... more
This paper presents the prototyped design of a wireless sensor network for shrimp pool in aquaculture. The system design in this paper includes a Raspberry Pi 3 connected to Arduino Uno, Zbee S2C module, temperature sensor and other... more
Agriculture plays a key role in the development of human civilization. A lot of research has been done to protect crops. The important aspect of agriculture is pest management. When a farmer has best strategies to encounter pests and... more
Nowadays, marketing demands are increasing continuously. If you are unable to deliver the required results, it will be hard to succeed. Our project meets this need through data analysis using high-end technology. The... more
Medical science makes significant use of data mining and machine learning. In different domains of medical science, data mining techniques are helpful for research and planning. A number of applications are possible... more
Iteration is the process of solving a problem by defining a set of steps that are repeated with different values. This survey article describes a method for finding the roots of equations. This method is... more
The 1974 World Food Conference resolved that food security ensures adequate supply and availability. Food security has become a major issue across the globe, predominantly in developing countries like India. During a... more
Soil moisture is a critical process in the water cycle. In agricultural production, the spatial variability of soil moisture can be responsible for low or spatially variable crop yields, as soil moisture is required to make nutrients... more
Since time immemorial, mankind has used plants for their medicinal value. Several branches of science, such as Ayurvedic medicine and Unani medicine, place paramount importance on plant extracts for the cure of a multitude of... more
The objective of this review paper is to review the concept of errors and their computation including different types of errors such as absolute error, relative error, random error, percentage error, etc. Errors play an important role in... more
This paper describes an approach to the development of an autonomous vehicle, which can navigate in indoor environments like warehouses and assembly lines. Specifically, the paper focuses on phases from prototyping the vehicle to... more