FAQ - Camtrap DP

How to describe bounding boxes of detected objects?
How to describe multiple events related to a single resource?
How to handle multi-camera deployments?
Can I describe plant or fungus observations using Camtrap DP?
How to include measurements in a data package?
- Using tags
- Using a custom table
How to merge data packages describing different projects?
Do I need to use CSV files?
- gzipped CSV files
- Apache parquet
Have a question?

How to describe bounding boxes of detected objects?

In the observations table there are four terms used to describe bounding boxes: bboxX, bboxY, bboxWidth, and bboxHeight. The values for all these fields are numbers between 0 and 1, relative to the image size.

The bboxX and bboxY fields represent the coordinates of the top-left corner of the bounding box. bboxX is measured from the left edge of the image, while bboxY is measured from the top edge. bboxWidth represents the width of the bounding box, measured from its left edge to its right edge. Similarly, bboxHeight represents the height of the bounding box, measured from its top edge to its bottom edge.

How to describe multiple events related to a single resource?

Multiple records in the observations table can reference the same media. See this GitHub issue.

How to handle multi-camera deployments?

See this GitHub issue.

Can I describe plant or fungus observations using Camtrap DP?

Currently, possible values for the observationType field in the observations table are: animal, human, vehicle, blank, unknown and unclassified. This definition does not allow for observations of plants or fungi.

If you have a use case for describing non-animal observations using Camtrap DP, please let us know in this GitHub issue.

How to include measurements in a data package?

There are two ways to include additional information (values not covered by the standard fields) in a Camtrap DP:

Using tags

Deployment and observation tables include deploymentTags and observationTags fields. You can use these fields to store additional information as key:value pairs, separated by a pipe character (|). For example, this is how temperature and snow cover information could be represented in the deployment table:

deploymentID	deploymentTags
dep1	temperature:20 \| snow_cover:false
dep2	temperature:-5 \| snow_cover:true

There are some drawbacks to using this method. Storing additional information in the media table is not possible, since it does not contain a tags field. Additionally, data represented this way is difficult to parse.

Using a custom table

You can add a custom table to the data package to store additional information. This requires providing a schema for the additional table. The schema must include a foreign key to the referenced table (deploymentID, observationID, or mediaID) and the additional fields. Here is an example schema for the deployment measurement table:

{
  "name": "deployment-measurements",
  "title": "Deployment measurements",
  "description": "Table with weather measurements for deployments. Associated with deployments (`deploymentID`).",
  "fields": [
    {
      "name": "deploymentID",
      "description": "Identifier of the deployment. Foreign key to `deployments.deploymentID`.",
      "skos:broadMatch": "http://rs.tdwg.org/dwc/terms/parentEventID",
      "type": "string",
      "constraints": {
        "required": true
      },
      "example": "dep1"
    },
    {
      "name": "temperature",
      "description": "Temperature (in Celsius) at the time of the observation.)",
      "type": "number",
      "constraints": {
        "required": false,
        "minimum": -50,
        "maximum": 100
      },
      "example": 19.5
    },
    {
      "name": "snowCover",
      "description": "Snow cover present at the time of the observation.",
      "type": "boolean",
      "constraints": {
        "required": false
      },
      "example": true
    }
  ],
  "foreignKeys": [
    {
      "fields": "deploymentID",
      "reference": {
        "resource": "deployments",
        "fields": "deploymentID"
      }
    }
  ]
}

You need to add this table to the datapackage.json file in the resources field.

This is an example table following the schema above:

deploymentID	temperature	snowCover
dep1	20	false
dep2	-5	true

We recommend this approach for storing additional information. It allows for easier parsing and merging of tables and is more flexible than using tags.

For more details, see this GitHub issue.

How to merge data packages describing different projects?

By design, a single Camtrap DP data package describes a single project. However, there are some use cases (for example, a meta-analysis) where merging multiple data packages could be beneficial.

We provide an R package to read and manipulate Camtrap DP. The R package includes the merge function that lets you combine two data packages into a single valid Camtrap DP.

Consult the merge function documentation to understand exactly how specific fields are merged to avoid information loss. Please note that when merging data packages x and y, the project$samplingDesign field in the resulting package will be set to the value of project$samplingDesign from data package x. Therefore, we recommend merging data packages only for projects that use the same sampling design.

Do I need to use CSV files?

No. Some studies have media and observations tables with over a million records, which may be hard to produce or consume as CSV files. Here are two approaches for formatting large files:

gzipped CSV files

By compressing a CSV file, you can often reduce its size by a factor. We recommend gzip over zip, as it allows direct file reading. Compressed CSV files are supported in all versions of Camtrap DP, by frictionless-py and the camtrapdp R package.

Compress the file:
```
 gzip media.csv
```

Refer to the compressed CSV file in the datapackage.json as follows:

 {
   "name": "media",
   "path": "media.csv.gz",
   "profile": "tabular-data-resource",
   "format": "csv",
   "mediatype": "text/csv",
   "encoding": "UTF-8",
   "schema": "https://raw.githubusercontent.com/tdwg/camtrap-dp/1.0.2/media-table-schema.json"
 }

Apache parquet

Apache Parquet is an open source data file format, designed for efficient data storage and retrieval. Parquet files are supported in Camtrap DP 1.0.2, by the frictionless-py after installing an extension, but not by the camtrapdp R package (as it is not yet supported by its dependency).

Create the parquet file (e.g. with the arrow R package).

Refer to the parquet file in the datapackage.json as follows:

 {
   "name": "media",
   "path": "media.parquet",
   "profile": "tabular-data-resource",
   "format": "parquet",
   "mediatype": "application/vnd.apache.parquet",
   "encoding": "UTF-8",
   "schema": "https://raw.githubusercontent.com/tdwg/camtrap-dp/1.0.2/media-table-schema.json"
 }

Have a question?

Don’t see your question answered here?

Ask it in our discussion forum

On this page