Published on 02 June 2025
LeafMachine2 Data: Urticaceae Leaf Outlines and ECT
Description
Urticaceae - 224,532 leaves - 31.3 GB

This repository contains HDF5 (.h5) files with leaf morphological data for species in the Urticaceae family. Each file represents a single leaf specimen and contains leaf shape outlines, the 128x128 ECT matrix, and associated metadata.

File Contents
This dataset contains 224,532 leaves from taxa in the family Urticaceae. Some leaves may be partial, predated, broken, or incomplete. Each .h5 file contains the following datasets:
ECT_matrices/ - Euler Characteristic Transform (ECT) matrices capturing topological features of leaf shapes
shapes/shape_0 - Coordinate array (x, y) defining the leaf outline boundary
component_names - Original filename identifier (without extension)
group_labels - Taxonomic classification dictionary containing:
    family: Taxonomic family name
    genus: Genus classification
    genus_species: Binomial species name
    fullname: Complete taxonomic identifier

Data Format
Files are organized with standardized naming: [Herbarium][ID][Family][Genus][Species]__[LeafID].h5
Shape coordinates are normalized to a unit circle centered at the origin (-0.5 to 0.5 range), vertically oriented.
Uncompressed Size: 31.3 GB

Usage
Code for reading, processing, and analyzing these files is available at: https://github.com/Gene-Weaver/LM2-Data-Tools
The repository includes functions for extracting data and generating visualizations.

Citation: Please cite the LeafMachine2 paper and this dataset.
Weaver, W. N., & Smith, S. A. (2023). From leaves to labels: Building modular machine learning networks for rapid herbarium specimen analysis with LeafMachine2. Applications in Plant Sciences, 11(5), e11548. https://doi.org/10.1002/aps3.11548
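As a rough illustration of the file layout described under File Contents and Data Format, the sketch below shows one way a file might be inspected in Python. It is not taken from the LM2-Data-Tools repository, and both function names (split_leaf_id, list_h5_contents) are hypothetical. The only part of the naming pattern it relies on is the double underscore before the leaf ID. It also assumes the third-party h5py package is installed, and it lists whatever groups and datasets a file holds rather than assuming how ECT_matrices/ or group_labels are stored internally.

```python
from pathlib import Path


def split_leaf_id(filename):
    """Split '<specimen part>__<LeafID>.h5' into (specimen part, LeafID).

    Only the double underscore before the leaf ID is taken from the naming
    pattern; the specimen part is returned unparsed.
    """
    stem = Path(filename).stem  # drop any directory and the '.h5' suffix
    specimen, sep, leaf_id = stem.rpartition("__")
    if not sep:
        raise ValueError(f"no '__' separator in {filename!r}")
    return specimen, leaf_id


def list_h5_contents(path):
    """Return {object name: shape or None} for every group and dataset."""
    import h5py  # third-party; imported here so split_leaf_id works without it

    contents = {}
    with h5py.File(path, "r") as f:
        def record(name, obj):
            # Datasets expose a shape; groups are recorded as None.
            contents[name] = obj.shape if isinstance(obj, h5py.Dataset) else None

        f.visititems(record)
    return contents


if __name__ == "__main__":
    # "specimen__7.h5" is a made-up file name used only for illustration.
    print(split_leaf_id("specimen__7.h5"))
```

Calling list_h5_contents on a downloaded file should reveal names such as shapes/shape_0, after which the outline array can be read with ordinary h5py indexing.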
Publication Details
Subfield
Ecology, Evolution, Behavior and Systematics
Field
Agricultural and Biological Sciences
Domain
Life Sciences
Confidence Score
51%
Source
Scholar Data Model