Price discovery algorithm for free-form PHILGEPS dataset
PHILGEPS is the official online platform used in government procurement of products and services. In this work, we aim to perform price discovery on the products listed in the free-form PHILGEPS 2016 procurement report. The text listings were first lemmatized to determine product keywords. Afterwards, product price distributions were generated by kernel density estimation in conjunction with two-sample Kolmogorov-Smirnov test. Lastly, keywords were clustered into three groups based on their threshold bandwidth hth. It was found that general product terms cluster together, with the specific product identifiers having bandwidths between 0.25 and 1.50.
By submitting their manuscript to the Samahang Pisika ng Pilipinas (SPP) for consideration, the Authors warrant that their work is original, does not infringe on existing copyrights, and is not under active consideration for publication elsewhere.
Upon acceptance of their manuscript, the Authors further agree to grant SPP the non-exclusive, worldwide, and royalty-free rights to record, edit, copy, reproduce, publish, distribute, and use all or part of the manuscript for any purpose, in any media now existing or developed in the future, either individually or as part of a collection.
All other associated economic and moral rights as granted by the Intellectual Property Code of the Philippines are maintained by the Authors.