This is an outdated version published on 2022-12-20.
Preprint / Version 3

Random Forest Identification of Pulsars

Authors

  • Ankhita Sathanur, Eastlake High School

DOI:

https://doi.org/10.58445/rars.17

Keywords:

Machine Learning, Random Forest, Astrophysics, Astronomy, Pulsars

Abstract

Pulsars are rotating neutron stars that emit beams of radio emission; when a beam sweeps across Earth, it is detected as a regular, repeating pulse. Traditionally, pulsar candidates have been identified through manual signal processing. As data volumes increase, automated methods, such as artificial neural networks, have been proposed. In this study, a random forest classifier, an algorithm that takes the majority output of multiple decision trees, was used to separate real pulsar signals from radio frequency interference (RFI) and other noise. The candidates it selects can then be studied further and allotted telescope time to confirm them as pulsars. The model was built from 1,639 real pulsar examples and 16,259 samples of RFI/noise from the HTRU2 data set. The features used were the mean, standard deviation, excess kurtosis, and skewness of both the integrated pulse profile and the DM-SNR curve. The model identified pulsars with 98% accuracy. The excess kurtosis, skewness, and mean of the integrated profile were found to be the most important features for differentiating pulsars from interference. This tool could be used to filter data from future surveys and reduce the number of candidates that need to be processed by humans.
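The two ingredients named in the abstract can be illustrated with a minimal sketch, assuming population (biased) moment definitions for the statistics and a plain majority vote over per-tree labels. The function names `profile_statistics` and `majority_vote`, the toy profile, and the example votes are hypothetical and are not taken from the study or from the HTRU2 data; the study itself cites scikit-learn's `RandomForestClassifier`, which this sketch does not reproduce.

```python
import math
from collections import Counter


def profile_statistics(values):
    """Return mean, standard deviation, skewness, and excess kurtosis.

    Population (biased) central moments are used here as an assumption;
    the exact estimator used to build HTRU2 may differ.
    """
    n = len(values)
    mean = sum(values) / n
    # Central moments of order 2, 3, and 4.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    if m2 == 0:
        raise ValueError("skewness and kurtosis are undefined for constant data")
    return {
        "mean": mean,
        "std": math.sqrt(m2),
        "skewness": m3 / m2 ** 1.5,
        # Excess kurtosis subtracts 3 so that a normal distribution scores 0.
        "excess_kurtosis": m4 / m2 ** 2 - 3.0,
    }


def majority_vote(tree_predictions):
    """Return the label predicted by the largest number of trees."""
    return Counter(tree_predictions).most_common(1)[0][0]


# Toy, made-up profile: one sharp peak on a flat baseline.
toy_profile = [0, 0, 0, 0, 10, 0, 0, 0, 0, 0]
print(profile_statistics(toy_profile))

# Hypothetical votes from five trees (1 = pulsar, 0 = RFI/noise).
print(majority_vote([1, 0, 1, 1, 0]))
```

A sharply peaked profile like the toy one yields positive skewness and positive excess kurtosis, which gives some intuition for why those features can help separate narrow pulses from flatter noise. Using an odd number of trees avoids ties in a two-class vote.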



Posted

2022-10-03 — Updated on 2022-12-20
