Syracuse University
Computer Science
Abstract—Semantic values in kernel data structures are critical to many security applications, such as virtual machine introspection, malware analysis, and memory forensics. However, malware, or more specifically a kernel rootkit, can... more
We propose a new approach for outlier detection, based on a ranking measure that focuses on the question of whether a point is ‘central’ for its nearest neighbours. Using our notations, a low cumulative rank implies that the point is... more
Density-based methodology that exploits k-neighborhood of a data point has many good features. For instance, it is independent of the distribution of the data and is capable of detecting isolated objects. However it has some shortcomings:
Anomaly or outlier detection problems are of considerable importance, arising frequently in diverse real-world applications such as finance and cyber-security. Several algorithms have been formulated for such problems, usually based on... more
Rank-based algorithms provide a promising approach for outlier detection, but currently used rank-based measures of outlier detection suffer from two deficiencies: first they take a large value from an object near a cluster whose density... more
The density-based methodology discussed in the preceding chapter, which examines the k-neighborhood of a data point, has many good features. For instance, it is independent of the distribution of the data and is capable of detecting... more
Many practical problems involve data that arrive over time, and are hence in a strict temporal sequence. As discussed in Chap. 5, treating the data as a set, while ignoring the time-stamp, loses information essential to the problem.... more
Anomaly Detection Anomaly detection problems arise in multiple applications, as discussed in the preceding chapter. This chapter discusses the basic ideas of anomaly detection, and sets up a framework within which various algorithms can... more
MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce... more
The E-Learning System is a web based system that focuses on the use of graphical simulation to help students learn the basic concepts of programming easily. The intended project would be used by the students as a web based tutorial for... more
Time series data abound in many realistic domains. The proper study and analysis of time series data help to make important decisions. Study of such data is very useful in many applications where there are trendy changes with time or... more