
John F Betak
Rutgers, The State University of New Jersey, Center for Advanced Infrastructure & Transportation, Research Fellow - Freight & Maritime Program
Senior researcher addressing transportation systems risk management, organizational decision making and marketing strategies.
Address: Albuquerque, New Mexico, United States
Address: Albuquerque, New Mexico, United States
less
Uploads
Papers by John F Betak
States and the Transportation Safety Board of Canada publish
reports about major railroad accidents. The text from these
accident reports were analyzed using the text mining techniques
of probabilistic topic modeling and k-means clustering to
identify the recurring themes in major railroad accidents. The
output from these analyses indicates that the railroad accidents
can be successfully grouped into different topics. The output
also suggests that recurring accident types are track defects,
wheel defects, grade crossing accidents, and switching
accidents. A major difference between the Canadian and U.S.
reports is the finding that accidents related to bridges are found
to be more prominent in the Canadian reports.
database contains numerous interrelated variables.
Understanding of how the variables are interrelated can be
enhanced using modern visualization techniques. These
techniques can allow managers from railroads and government
agencies to find complex variables relationships not usually
provided by routine statistical analyses. For this research we
have developed several dashboards of linked visualizations
using the Weave data visualization software [5]. Our
visualizations explore various accident types of concern to the
railroad industry including trespassing and pedestrian accidents,
passenger train accidents, actions of highway users involved in
accidents, and the effect of different types of warning devices
on grade crossing accidents. In addition, we are currently
developing an advanced visualization system that views the
accident data as time varying events occurring over a fixed
grade crossings topology. This view allows the application of a
recent network data abstraction termed “Graph Cards.” We present initial examples of the advanced system that provides a
variety of filtering mechanisms to view statistical distributions
and their time varying behavior over the grade crossings
topology.
contains text comment fields that may provide additional
information about grade crossing accidents. New text mining
algorithms provide the potential to automatically extract
information from text that can enhance traditional numeric
analyses. Topic modeling algorithms are statistical methods
that analyze the words of original texts to automatically
discover the themes that run through them. A frequently used
topic-modeling algorithm is Latent Dirichlet Analysis (LDA).
In this paper we will show several examples of how labeled
LDA can be applied to the FRA grade crossing data to better
understand categories of words and phrases that are associated
with various types of grade crossing accidents