Managing Data Acquisition, Cleansing and Transformation in an Agriculture Data Warehouse

Ahsan Abdullah, Fuad Bajaber


Pakistan is the world’s fourth largest cotton producer. The country relies heavily on cotton yield to sustain and enhance its export and economic growth. Several state run organizations have been monitoring the cotton crop for decades through pest-scouting, agriculture and meteorological data-gathering processes. This non-digitized and non-standardized dirty data is of little use for strategic analysis and decision support. This paper is based on the data collection and cleansing issues of that cotton pest-scouting data consisting of approximately 15,000 sheets from 20 cotton-growing districts of Punjab province. Various real-life agriculture data management and data quality problems are discussed and explained in this paper using several real examples. 

Full Text:




  • There are currently no refbacks.

Copyright (c) 2015 Ahsan Abdullah, Fuad Bajaber

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Journal of Agricultural Studies   ISSN 2166-0379


Copyright © Macrothink Institute

To make sure that you can receive messages from us, please add the '' domain to your e-mail 'safe list'. If you do not receive e-mail in your 'inbox', check your 'bulk mail' or 'junk mail' folders.

If you have any questions, please contact