An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data

Guillaume Derval; Frédéric Docquier; Pierre Schaus

doi:https://doi.org/10.1007/978-3-030-46133-1_40

An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data

Guillaume Derval, Frédéric Docquier, Pierre Schaus

Crossing Borders

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Census data provide detailed information about population characteristics at a coarse resolution. Nevertheless, fine-grained, high-resolution mappings of population counts are increasingly needed to characterize population dynamics and to assess the consequences of climate shocks, natural disasters, investments in infrastructure, development policies, etc. Dissagregating these census is a complex machine learning, and multiple solutions have been proposed in past research. We propose in this paper to view the problem in the context of the aggregate learning paradigm, where the output value for all training points is not known, but where it is only known for aggregates of the points (i.e. in this context, for regions of pixels where a census is available). We demonstrate with a very simple and interpretable model that this method is on par, and even outperforms on some metrics, the state-of-the-art, despite its simplicity.

Original language	English
Title of host publication	Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019
Subtitle of host publication	Würzburg, Germany, September 16–20, 2019, Proceedings, Part III
Editors	Ulf Brefeld, Elisa Fromont, Andreas Hotho, Arno Knobbe, Marloes Maathuis, Céline Robardet
Publisher	Springer
Pages	672-687
ISBN (Electronic)	978-3-030-46133-1
ISBN (Print)	978-3-030-46132-4
DOIs	https://doi.org/10.1007/978-3-030-46133-1_40
Publication status	Published - 30 Apr 2020

Publication series

Name	Lecture Notes in Computer Science book series (LNCS)
Publisher	Springer
Volume	11908
ISSN (Print)	0885-6125

Bibliographical note

Accepted at ECML-PKDD 2019 Data on Zenodo: https://zenodo.org/record/3260713

Keywords

Disaggregation
Aggregate learning
GIS

Access to Document

https://doi.org/10.1007/978-3-030-46133-1_40

https://arxiv.org/abs/1907.00270

Cite this

Derval, G., Docquier, F., & Schaus, P. (2020). An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data. In U. Brefeld, E. Fromont, A. Hotho, A. Knobbe, M. Maathuis, & C. Robardet (Eds.), Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019: Würzburg, Germany, September 16–20, 2019, Proceedings, Part III (pp. 672-687). (Lecture Notes in Computer Science book series (LNCS); Vol. 11908). Springer. https://doi.org/10.1007/978-3-030-46133-1_40

Derval, Guillaume ; Docquier, Frédéric ; Schaus, Pierre. / An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data. Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019: Würzburg, Germany, September 16–20, 2019, Proceedings, Part III. editor / Ulf Brefeld ; Elisa Fromont ; Andreas Hotho ; Arno Knobbe ; Marloes Maathuis ; Céline Robardet. Springer, 2020. pp. 672-687 (Lecture Notes in Computer Science book series (LNCS)).

@inproceedings{48833aafefeb4f86afd6e3c5bc02aec1,

title = "An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data",

abstract = "Census data provide detailed information about population characteristics at a coarse resolution. Nevertheless, fine-grained, high-resolution mappings of population counts are increasingly needed to characterize population dynamics and to assess the consequences of climate shocks, natural disasters, investments in infrastructure, development policies, etc. Dissagregating these census is a complex machine learning, and multiple solutions have been proposed in past research. We propose in this paper to view the problem in the context of the aggregate learning paradigm, where the output value for all training points is not known, but where it is only known for aggregates of the points (i.e. in this context, for regions of pixels where a census is available). We demonstrate with a very simple and interpretable model that this method is on par, and even outperforms on some metrics, the state-of-the-art, despite its simplicity. ",

keywords = "Disaggregation, Aggregate learning, GIS",

author = "Guillaume Derval and Fr{\'e}d{\'e}ric Docquier and Pierre Schaus",

note = "Accepted at ECML-PKDD 2019 Data on Zenodo: https://zenodo.org/record/3260713",

year = "2020",

month = apr,

day = "30",

doi = "https://doi.org/10.1007/978-3-030-46133-1_40",

language = "English",

isbn = "978-3-030-46132-4",

series = "Lecture Notes in Computer Science book series (LNCS)",

publisher = "Springer",

pages = "672--687",

editor = "Ulf Brefeld and Fromont, {Elisa } and Hotho, {Andreas } and Knobbe, {Arno } and Maathuis, {Marloes } and Robardet, {C{\'e}line }",

booktitle = "Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019",

address = "Germany",

}

Derval, G, Docquier, F & Schaus, P 2020, An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data. in U Brefeld, E Fromont, A Hotho, A Knobbe, M Maathuis & C Robardet (eds), Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019: Würzburg, Germany, September 16–20, 2019, Proceedings, Part III. Lecture Notes in Computer Science book series (LNCS), vol. 11908, Springer, pp. 672-687. https://doi.org/10.1007/978-3-030-46133-1_40

An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data. / Derval, Guillaume; Docquier, Frédéric; Schaus, Pierre.
Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019: Würzburg, Germany, September 16–20, 2019, Proceedings, Part III. ed. / Ulf Brefeld; Elisa Fromont; Andreas Hotho; Arno Knobbe; Marloes Maathuis; Céline Robardet. Springer, 2020. p. 672-687 (Lecture Notes in Computer Science book series (LNCS); Vol. 11908).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data

AU - Derval, Guillaume

AU - Docquier, Frédéric

AU - Schaus, Pierre

N1 - Accepted at ECML-PKDD 2019 Data on Zenodo: https://zenodo.org/record/3260713

PY - 2020/4/30

Y1 - 2020/4/30

N2 - Census data provide detailed information about population characteristics at a coarse resolution. Nevertheless, fine-grained, high-resolution mappings of population counts are increasingly needed to characterize population dynamics and to assess the consequences of climate shocks, natural disasters, investments in infrastructure, development policies, etc. Dissagregating these census is a complex machine learning, and multiple solutions have been proposed in past research. We propose in this paper to view the problem in the context of the aggregate learning paradigm, where the output value for all training points is not known, but where it is only known for aggregates of the points (i.e. in this context, for regions of pixels where a census is available). We demonstrate with a very simple and interpretable model that this method is on par, and even outperforms on some metrics, the state-of-the-art, despite its simplicity.

AB - Census data provide detailed information about population characteristics at a coarse resolution. Nevertheless, fine-grained, high-resolution mappings of population counts are increasingly needed to characterize population dynamics and to assess the consequences of climate shocks, natural disasters, investments in infrastructure, development policies, etc. Dissagregating these census is a complex machine learning, and multiple solutions have been proposed in past research. We propose in this paper to view the problem in the context of the aggregate learning paradigm, where the output value for all training points is not known, but where it is only known for aggregates of the points (i.e. in this context, for regions of pixels where a census is available). We demonstrate with a very simple and interpretable model that this method is on par, and even outperforms on some metrics, the state-of-the-art, despite its simplicity.

KW - Disaggregation

KW - Aggregate learning

KW - GIS

UR - https://www.mendeley.com/catalogue/a0c9b486-35f2-3ebf-9f33-9f3c75596854/

U2 - https://doi.org/10.1007/978-3-030-46133-1_40

DO - https://doi.org/10.1007/978-3-030-46133-1_40

M3 - Conference contribution

SN - 978-3-030-46132-4

T3 - Lecture Notes in Computer Science book series (LNCS)

SP - 672

EP - 687

BT - Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019

A2 - Brefeld, Ulf

A2 - Fromont, Elisa

A2 - Hotho, Andreas

A2 - Knobbe, Arno

A2 - Maathuis, Marloes

A2 - Robardet, Céline

PB - Springer

ER -

Derval G, Docquier F, Schaus P. An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data. In Brefeld U, Fromont E, Hotho A, Knobbe A, Maathuis M, Robardet C, editors, Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2019: Würzburg, Germany, September 16–20, 2019, Proceedings, Part III. Springer. 2020. p. 672-687. (Lecture Notes in Computer Science book series (LNCS)). doi: https://doi.org/10.1007/978-3-030-46133-1_40

An Aggregate Learning Approach for Interpretable Semi-supervised Population Prediction and Disaggregation Using Ancillary Data

Abstract

Publication series

Bibliographical note

Keywords

Access to Document

Other files and links

Cite this