Search Results
Showing 1 - 1 of 1 results.
- Search terms can be anywhere in the study: title, description, variables, etc.
- Because our holdings are large, we recommend using at least two query terms:
rural economy
home ownership
higher education
- Keywords help delimit the breadth of results. Therefore, use as many as required to achieve your desired results:
elementary education federal funding
- Our search will find studies with derivative expressions of your query terms: A search for
"nation"
will find results containing "national" - Use quotes to search for an exact expression:
"social mobility"
- You can combine exact expressions with loose terms:
"united states" inmates
- Exclude results by using a MINUS sign:
elections -sweden -germany
will exclude swedish and german election studies - On the results page, you will be able to sort and filter to further refine results.
Hidden
Study Title/Investigator
Released/Updated
1.
Synthetic Data Generation of Health and Demographic Surveillance Systems Dataset, Kenya, 2019-2020 (ICPSR 39209)
Waljee, Akbar K.
Waljee, Akbar K.
Surveillance data play a vital role in estimating the burden of diseases, pathogens, exposures, behaviors, and susceptibility in populations, providing insights that can inform the design of policies and targeted public health interventions. The use of Health and Demographic Surveillance System (HDSS) collected from the Kilifi region of Kenya, has led to the collection of massive amounts of data on the demographics and health events of different populations. This has necessitated the adoption of tools and techniques to enhance data analysis to derive insights that will improve the accuracy and efficiency of decision-making. Machine Learning (ML) and artificial intelligence (AI) based techniques are promising for extracting insights from HDSS data, given their ability to capture complex relationships and interactions in data. However, broad utilization of HDSS datasets using AI/ML is currently challenging as most of these datasets are not AI-ready due to factors that include, but are not limited to, regulatory concerns around privacy and confidentiality, heterogeneity in data laws across countries limiting the accessibility of data, and a lack of sufficient datasets for training AI/ML models. Synthetic data generation offers a potential strategy to enhance accessibility of datasets by creating synthetic datasets that uphold privacy and confidentiality, suitable for training AI/ML models and can also augment existing AI datasets used to train the AI/ML models. These synthetic datasets, generated from two rounds of separate data collection periods, represent a version of the real data while retaining the relationships inherent in the data. For more information please visit The Aga Khan University Website.
2024-10-01