Abstract
![CDATA[Plant traits are a key to understanding and predicting the adaptation of ecosystems to environmental changes, which motivates the TRY project aiming at constructing a global database for plant traits and becoming a standard resource for the ecological community. Despite its unprecedented coverage, a large percentage of missing data substantially constrains joint trait analysis. Meanwhile, the trait data is characterized by the hierarchical phylogenetic structure of the plant kingdom. While factorization based matrix completion techniques have been widely used to address the missing data problem, traditional matrix factorization methods are unable to leverage the phylogenetic structure. We propose hierarchical probabilistic matrix factorization (HPMF), which effectively uses hierarchical phylogenetic information for trait prediction. We demonstrate HPMF’s high accuracy, effectiveness of incorporating hierarchical structure and ability to capture trait correlation through experiments.]]
Original language | English |
---|---|
Title of host publication | Proceedings of the 29th International Conference on Machine Learning (ICML 2012), June 26-July 1, 2012, University of Edinburgh, Scotland, United Kingdom |
Publisher | International Machine Learning Society |
Pages | 1303-1310 |
Number of pages | 8 |
ISBN (Print) | 9781450312851 |
Publication status | Published - 2012 |
Event | International Conference on Machine Learning - Duration: 27 Jun 2012 → … |
Conference
Conference | International Conference on Machine Learning |
---|---|
Period | 27/06/12 → … |
Keywords
- databases
- functional diversity
- plants
- sampling