Academia.eduAcademia.edu

Data Compilation

description16 papers
group1 follower
lightbulbAbout this topic
Data compilation is the systematic process of gathering, organizing, and integrating data from various sources to create a comprehensive dataset. This process involves ensuring data accuracy, consistency, and relevance, facilitating subsequent analysis and interpretation for research or decision-making purposes.
lightbulbAbout this topic
Data compilation is the systematic process of gathering, organizing, and integrating data from various sources to create a comprehensive dataset. This process involves ensuring data accuracy, consistency, and relevance, facilitating subsequent analysis and interpretation for research or decision-making purposes.

Key research themes

1. How can program synthesis be leveraged to automate data extraction and transformation in data compilation pipelines?

This theme investigates the development of program synthesis techniques, particularly programming-by-example (PBE) and predictive synthesis, to automate data extraction and transformation tasks within data compilation workflows. It addresses the challenge of generating accurate, reusable programs from incomplete or input-only specifications, aiming to reduce manual effort in data wrangling and preprocessing, which are often time-consuming and require programming expertise.

Key finding: This paper formalizes an interactive model of program synthesis tailored to programming-by-example scenarios, addressing both efficiency and correctness challenges. It introduces incremental, step-based, and feedback-oriented... Read more
Key finding: The study presents a predictive program synthesis algorithm that automatically generates extraction programs solely from input data (input-only examples), without requiring explicit output examples. It demonstrates... Read more

2. What methodologies and architectures enable automatic generation of dependable and scalable programs for data acquisition and control systems in data compilation?

This theme focuses on the design and implementation of program generators and compiler-compilers that automate the generation of software artifacts, specifically for data acquisition, control systems, and general program compilation. It explores architectural frameworks, extended formal automata models, and attribute grammar-based compilers that facilitate scalable, customizable, and error-free software production essential for integrating diverse data sources and processing logic in data compilation.

Key finding: The paper proposes an extended hybrid automata-based architecture for program generators tailored to data acquisition and control systems, highlighting benefits such as support for dependable operation, real-time constraints,... Read more
Key finding: This work introduces VisualLISA, a visual programming front-end for the attribute grammar-based compiler generator LISA, enabling intuitive graphical construction of attribute grammar productions and generating valid compiler... Read more

3. How can comprehensive data preparation workflows and tools enhance the efficiency and quality of data compilation?

This theme explores approaches, tools, and workflows designed to support comprehensive data preparation, including data cleaning, integration, profiling, matching, and transformation to facilitate effective data compilation. It looks at workflow-based, programmatic, dataset-centric, and automation-driven tools that help minimize manual effort, accommodate heterogeneous data sources, and ensure reusable, repeatable pipelines that underpin reliable compiled datasets for subsequent analysis.

Key finding: This review systematically categorizes data preparation approaches into program-based, workflow-based, dataset-based, and automation-driven paradigms, illustrating how each handles key steps such as profiling, matching, and... Read more
Key finding: The paper presents the Data Analysis Workbench (DAWB), a platform combining data visualization, scripting, and workflow engines to facilitate both online and offline data analysis. It enables construction and execution of... Read more

All papers in Data Compilation

for the valuable time they took to share and educate me, and extra help from my dear friend Akay Izat, June Rinker, Matt Gangi, and Martin Rodriguez for the valued roof information. I would like to thank the Architecture Program Office... more
CERN has been archiving data on tapes in its Computer Center for decades and its archive system is now holding more than 135 PB of HEP data in its premises on high density tapes. For the last 20 years, tape areal bit density has been... more
Electron-capture and-loss cross sections have been measured for highly charged (q =13+) sulfur ions with energies 2.5-200 MeV colliding with helium. Electron capture varies by nearly six orders of magnitude over the energy range... more
In the decade since the first pan-European testate amoeba-based transfer function for peatland palaeohydrological reconstruction was published, a vast amount of additional data collection has been undertaken by the research community.... more
From our work done under contracts with JAERI, I talk about the following topics: (1) functional forms useful for fits to atomic collision (including charge transfer) cross sections, and (2) a method to optimize adjustable parameters. As... more
In the decade since the first pan-European testate amoeba-based transfer function for peatland palaeohydrological reconstruction was published, a vast amount of additional data collection has been undertaken by the research community.... more
Improved empirical formulas are given for the number-reflection coefficient of light ions (H, D. He) normally incident on the solid surface and the mean fractional energy of reflected particles.
Analytic expressions fitted to Barnett's recommended data are given for the reaction cross sections of H, H2 and He atoms and ions colliding with atoms and molecules. The reactions treated are dissociative collisions and particle... more
Correlations and clustering are of great importance in the study of the Nuclear Equation of State. Information on these items/aspects can be obtained using heavy-ion reactions which are described by dynamical theories. We propose a... more
Thin repurIwevpcwmred eamvncarunlwfwvxh qanmurwdbymr~rwy Mlhc[JnlWd.S!mm Owerrrmcnt. Nciikrthc (Jnitd Slalm(h~rnmnl mcnnyngwrxy thcmvf, nvrcmryd Ihwlr cmployea, m-kc mIy wnrranly, csprcas ur lmpikvl, or uaurna qny la~l Iidklily tw... more
An experimental study has been carried out on the reactions of state selected O + (4 S, 2 D, 2 P) ions with methane with the aims of characterizing the effects of both the parent ion internal energy and collision energy on the reaction... more
The ratio between the cross sections for single-electron capture by highly charged ions colliding with atomic and molecular hydrogen has been investigated within the framework of the Bohr-Lindhard theory. It is shown that cr(H2)/a (H) can... more
Absolute cross sections for electron-impact single ionization, dissociative excitation and dissociative ionization of the ethynyl radical ion (C2D +) have been measured for electron energies ranging from the corresponding reaction... more
Absolute cross sections for electron-impact single ionization, dissociative excitation and dissociative ionization of the ethynyl radical ion (C2D +) have been measured for electron energies ranging from the corresponding reaction... more
Purpose The purpose of this paper is to analyse if citizens’ searches on the internet coincide with the services that municipal websites offer. In addition, the authors examine municipal webpage rankings in search engines and the factors... more
Absolute cross sections are reported for electron-impact ionization and dissociation of CN + ions. Simple ionization to CN 2+ ions and formation of singly charged C + and N + and doubly charged C 2+ and N 2+ fragments have been... more
Atomic rates relevant to fusion plasma modelling are calculated from a simplified collisional radiative model, using recommended atomic data. The rates calculated are the radiative loss and the electron cooling rates for individual... more
A high-charge-state plasma neutralizer for a beam of energetic H-ions offers the potential of high optimum neutralization efficiency (-85%) relative to a gas target (50-60%), and considerably reduced target thickness. We have calculated... more
This is a Library Circulating Copy which may be borrowed for two weeks. For a personal retention copy, call Tech. Info. Division, Ext. 6782.
This is a Library Circulating Copy which may be borrowed for two weeks. For a personal retention copy~ call Tech. Info. Division~ Ext. 6782.
This is a Library Circulating Copy which may be borrowed for two weeks. For a personal retention copy~ call Tech. Info. Division~ Ext. 6782.
Progress on the study of H formation by charge transfer in alkaline-earth vapors is reported. The H equilibrium yield in strontium vapor reaches a maximum of 50% at an energy of 250 eV/amu, which is the highest H yield reported to date.
Total electron-capture cross-section measurements are reported for C~+ (3 & q & 6) and Q~+ (2&q &6) ions colliding with hydrogen atoms and molecules in the energy range (0.01 &E & 10) keV/amu. The cross sections range from (0.5-7) &10 "... more
The present volume of Atomic and Plasma-Material Interaction Data for Fusion includes critical reviews and results of original experimental and theoretical studies on inelastic collision processes among the basic and dominant impurity... more
The present volume of Atomic and Plasma-Material Interaction Data for Fusion includes critical reviews and results of original experimental and theoretical studies on inelastic collision processes among the basic and dominant impurity... more
The present volume contains two papers, published by Tatsuo Tabata and his coworkers in 2002 and 2006, in the form of the post-print re-edited by the use of LATEX. The studies described were made at Osaka Prefecture University and... more
The present volume contains two papers, published by Tatsuo Tabata and his coworkers in 2000 and 2001, in the form of the post-print re-edited by the use of LATEX. The studies described were made at Osaka Prefecture University and... more
The present volume contains four papers, published by Tatsuo Tabata and his coworkers from 1987 to 1992, in the form of the post-print re-edited by the use of LATEX. The studies described were made at the Radiation Center of Osaka... more
​Recent results are briefly described of the joint work made at the Institute of Plasma Physics, Nagoya University, to compile the data on the backscattering coefficients of ions and to develop empirical formulas for these coefficients.... more
Cross sections for 74 processes in collisions of electrons with nitrogen molecules (N2) and singly ionized nitrogen molecules (N2+) have been collected. The literature has been surveyed through the middle of 2004. The data sets collected... more
Refers to: Analytic cross sections for electron impact collisions with nitrogen molecules, Atomic Data and Nuclear Data Tables, Volume 92, Issue 3, May 2006, Pages 375-406, Tatsuo Tabata, Toshizo Shirai, Masao Sataka, Hirotaka Kubo Eq.... more
Due to the present interest in modeling and diagnosing the edge and divertor plasma regions in magnetically confined fusion devices, we have sought to provide new calculations regarding the elastic, excitation, ionization, and charge... more
Due to the present interest in modeling and diagnosing the edge and divertor plasma regions in magnetically confined fusion devices, we have sought to provide new calculations regarding the elastic, excitation, ionization, and charge... more
Download research papers for free!