Dog Breeds Dataset

Our Dog Breeds Dataset provides structured, expert-verified data to support breed research, comparison, and professional canine insights.

The Pawsome Authority Dog Breeds Dataset is a structured, scientifically backed dataset of dog breeds designed to support a wide range of canine-related projects. Whether you are developing an app, conducting academic research, or comparing breeds for suitability, the dataset provides a consistent, detailed foundation.

The dataset is built on our comprehensive Dog Breed Rating Methodology, which evaluates each breed across 30 dimensions, and currently features 60 internationally recognized breeds from major kennel clubs. The dataset is available in JSON-LD and CSV formats.

Dataset

Each breed profile includes 60 structured data points. Of these, 30 are Rating Dimensions evaluated using our standardized 1 to 5 scale. The remaining attributes include measurements, classifications, and descriptive fields:

| CATEGORY | DESCRIPTION |
| --- | --- |
| Taxonomy | Breed name, pronunciation, alternate names, country of origin, etc. |
| Appearance & Characteristics | Height, weight, coat length, coat type, double coat presence, and hypoallergenic status |
| Temperament & Behavior | Rating Dimensions such as affection, playfulness, protectiveness, and interactions with children, pets, and strangers |
| Training & Exercise | Rating Dimensions such as obedience, intelligence, trainability, and attention span |
| Grooming & Maintenance | Rating Dimensions covering shedding level, grooming frequency, and drooling tendency |
| Health & Lifespan | Lifespan and Rating Dimensions covering predispositions to dental, ear, or eye issues |
| Breed Suitability | First-time owner compatibility and Rating Dimensions covering experience level required and apartment living |

The 30 Rating Dimensions use a standardized 1 to 5 scale to reflect general tendencies within each breed.

A rating of 1 indicates a lower presence or tendency, such as minimal protectiveness, low energy, or fewer health concerns. A rating of 5 indicates a stronger presence or tendency, such as high playfulness, strong prey drive, or greater grooming needs. Ratings of 2 to 4 represent moderate levels, offering a nuanced view for comparison.

These ratings represent general breed tendencies. Individual dogs may differ based on genetics, environment, upbringing, and other factors.
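As an illustration of the scale described above, the following Python sketch maps a rating to the band the text assigns it. The function name `describe_rating` and the band labels are illustrative assumptions and are not part of the dataset, which stores only the number.

```python
def describe_rating(rating: int) -> str:
    """Return the band a 1-to-5 rating falls in, per the scale description.

    The labels are illustrative assumptions, not dataset values.
    """
    if not isinstance(rating, int) or not 1 <= rating <= 5:
        raise ValueError(f"rating must be an integer from 1 to 5, got {rating!r}")
    if rating == 1:
        return "lower tendency"
    if rating == 5:
        return "stronger tendency"
    return "moderate tendency"  # ratings 2, 3, and 4


for value in (1, 3, 5):
    print(value, describe_rating(value))
```

Because the bands describe breed-level tendencies, a label like this is best read as a comparison aid rather than a prediction for an individual dog.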

Technical Details

The dataset is structured for consistency, interoperability, and long-term utility in research, development, and canine profiling. Key technical specifications include:

| SPECIFICATION | DESCRIPTION |
| --- | --- |
| Subject | Pawsome Authority Dog Breeds Dataset |
| Description | A structured dataset of dog breeds including breed origins, physical measurements, coat characteristics, temperament ratings, health indicators, and lifestyle suitability, based on breed standards from major international kennel clubs |
| Total Breeds | 60 internationally recognized breeds |
| Attributes per Breed | 60 structured attributes (30 are Rating Dimensions) |
| Total Data Points | 3,600 (60 breeds × 60 attributes) |
| Version | v20250723 |
| Publication Date | 2025-07-23 |
| Last Updated | 2025-07-23 |
| Temporal Coverage | 2023–2026 |
| Spatial Coverage | Global |
| Taxonomy | Based on AKC, UKC, and FCI classifications |
| Data Sources | AKC, UKC, FCI, AVMA, veterinary literature, expert surveys, and proprietary analysis by Pawsome Authority |
| Validation Status | Verified by breed specialists using a multi-source validation process |
| Formats | Available as JSON-LD and CSV |
| Units | Imperial (inches, lbs) and Metric (cm, kg) |
| Encoding | UTF-8 for universal character support |
| Identifiers | Stable web URLs and @id identifiers for each breed (used in JSON-LD) |
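The Units and Total Data Points rows can be checked with a few lines of Python. This is a minimal sketch: the conversion factors are the exact international definitions, but the helper names and the one-decimal rounding are assumptions, and the dataset's own metric values may be rounded differently.

```python
INCH_TO_CM = 2.54           # exact by definition
POUND_TO_KG = 0.45359237    # exact by definition


def inches_to_cm(inches: float, digits: int = 1) -> float:
    """Convert inches to centimetres, rounded to `digits` decimals (assumed)."""
    return round(inches * INCH_TO_CM, digits)


def pounds_to_kg(pounds: float, digits: int = 1) -> float:
    """Convert pounds to kilograms, rounded to `digits` decimals (assumed)."""
    return round(pounds * POUND_TO_KG, digits)


# Total data points as stated in the specification table.
breeds, attributes_per_breed = 60, 60
total_data_points = breeds * attributes_per_breed

print(total_data_points)
print(inches_to_cm(26), pounds_to_kg(100))
```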

Methodology

The dataset is built on a structured, multi-source rating methodology designed for consistency, transparency, and cross-breed comparability. Each breed is evaluated using 60 structured attributes, including 30 Rating Dimensions scored on a standardized 1 to 5 scale. Our rating process incorporates four primary sources:

| CATEGORY | DESCRIPTION |
| --- | --- |
| Official Standards | Breed standards and definitions from major kennel clubs (AKC, UKC, FCI) |
| Veterinary Literature | Published research on breed-specific health, behavior, and care needs |
| Expert Input | Insights from breed specialists, veterinarians, dog trainers, and dog owners |
| Internal Research | Proprietary analysis, synthesis, and normalization by Pawsome Authority |

All attribute data is normalized to a common scale to support side-by-side breed comparison and structured data applications.

Validation

Each dataset release is versioned and undergoes our five-step fact-checking and review process to ensure accuracy, consistency, and trustworthiness:

| CATEGORY | DESCRIPTION |
| --- | --- |
| Step 1 (Collecting) | Breed data is gathered through automated systems and manual methods, then reviewed to identify traits needing verification |
| Step 2 (Verifying) | Each attribute is categorized by type and assessed using our internal credibility ranking system to focus on the most reliable information |
| Step 3 (Cross-Referencing) | All claims are checked against global sources and official standards to ensure consistency |
| Step 4 (Documenting) | Every verification step is recorded, and ratings are updated in real time as new data becomes available |
| Step 5 (Reporting) | Profiles are compiled into detailed reports and peer-reviewed by experts and fellow fact-checkers before public release |

Only data labeled as “Verified” is included in public dataset releases. This designation indicates the attribute has passed cross-checking, discrepancy resolution, and expert review across all five stages. Visit the Fact-Checking Process page for more information.

Dataset Formats

The dataset is available in JSON-LD and CSV formats. The JSON-LD format is a machine-readable, semantically rich format ideal for developers and structured data applications. The CSV format is a flat file for spreadsheets, data analysis, or importing into databases or statistical tools.

The JSON-LD distribution adheres to the schema.org Dataset specification and includes metadata for versioning, licensing, and attribution.
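To make the metadata description concrete, here is a minimal Python sketch that reads dataset-level metadata from a JSON-LD document. The property names (`version`, `license`, `distribution`, `encodingFormat`, `contentUrl`) are real schema.org terms, but the embedded document is a hypothetical stand-in: the published file may use different properties, and the `example.com` URLs are placeholders.

```python
import json

# Hypothetical metadata document; not copied from the published dataset.
DATASET_JSONLD = """
{
  "@context": "https://schema.org",
  "@type": "Dataset",
  "name": "Pawsome Authority Dog Breeds Dataset",
  "version": "v20250723",
  "datePublished": "2025-07-23",
  "license": "https://creativecommons.org/licenses/by/4.0/",
  "distribution": [
    {"@type": "DataDownload", "encodingFormat": "application/ld+json",
     "contentUrl": "https://example.com/dog-breeds.jsonld"},
    {"@type": "DataDownload", "encodingFormat": "text/csv",
     "contentUrl": "https://example.com/dog-breeds.csv"}
  ]
}
"""


def summarize_dataset(raw: str) -> dict:
    """Extract versioning, licensing, and format metadata from a Dataset node."""
    doc = json.loads(raw)
    if doc.get("@type") != "Dataset":
        raise ValueError("expected a schema.org Dataset node")
    formats = [item["encodingFormat"] for item in doc.get("distribution", [])]
    return {
        "name": doc["name"],
        "version": doc.get("version"),
        "license": doc.get("license"),
        "formats": formats,
    }


summary = summarize_dataset(DATASET_JSONLD)
print(summary)
```

Plain `json` parsing is enough for reading fixed keys like these. A dedicated JSON-LD processor only becomes necessary when a document needs expansion or compaction against its context.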

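| BREED NAME | ORIGIN | GROUP | SIZE | MALE HEIGHT MIN (IN) | MALE HEIGHT MAX (IN) | MALE WEIGHT MIN (LBS) | MALE WEIGHT MAX (LBS) | COAT LENGTH | AFFECTION | PLAYFULNESS | GOOD WITH CHILDREN | SHEDDING | HEALTH | EXPERIENCE LEVEL |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Akita | Japan | Working | Extra Large | 26 | 28 | 100 | 130 | Medium | 3/5 | 3/5 | 3/5 | 3/5 | 4/5 | 5/5 |
| American Cocker Spaniel | United States | Sporting | Small | 14.5 | 15.5 | 25 | 30 | Medium | 5/5 | 4/5 | 5/5 | 3/5 | 4/5 | 3/5 |
| Australian Cattle Dog | Australia | Herding | Medium | 18 | 20 | 35 | 50 | Short | 3/5 | 4/5 | 3/5 | 3/5 | 3/5 | 4/5 |
| Australian Shepherd | United States | Herding | Medium | 20 | 23 | 50 | 65 | Medium | 4/5 | 4/5 | 5/5 | 4/5 | 3/5 | 4/5 |
| Basset Hound | France | Hound | Medium | 12 | 15 | 40 | 65 | Short | 3/5 | 3/5 | 4/5 | 3/5 | 4/5 | 3/5 |
| Beagle | England | Hound | Small | 14 | 16 | 22 | 30 | Short | 4/5 | 4/5 | 4/5 | 4/5 | 3/5 | 2/5 |
| Belgian Malinois | Belgium | Herding | Large | 24 | 26 | 60 | 80 | Short | 1/5 | 3/5 | 4/5 | 3/5 | 3/5 | 5/5 |
| Bernese Mountain Dog | Switzerland | Working | Extra Large | 25 | 27.5 | 80 | 115 | Medium | 4/5 | 3/5 | 4/5 | 5/5 | 4/5 | 3/5 |
| Bichon Frisé | Spain | Non-Sporting | Small | 9.5 | 11.5 | 12 | 18 | Medium | 5/5 | 4/5 | 5/5 | 1/5 | 2/5 | 1/5 |
| Bloodhound | Belgium | Hound | Large | 25 | 27 | 90 | 110 | Short | 3/5 | 3/5 | 4/5 | 3/5 | 4/5 | 3/5 |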

Note: The full dataset, in both JSON-LD and CSV formats, includes all 60 breeds and all of their traits and characteristics.
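The preview rows can be loaded with Python's standard `csv` module. The sketch below embeds two rows from the preview and converts heights to floats and `3/5`-style ratings to integers. The snake_case column names are assumptions, and the real CSV may store ratings as bare numbers rather than in the `n/5` display form, so check the header row and a few values of the actual file first.

```python
import csv
import io

# Two rows copied from the preview table; header names are assumed.
SAMPLE_CSV = """\
breed_name,origin,group,size,male_height_min_in,male_height_max_in,affection,shedding
Akita,Japan,Working,Extra Large,26,28,3/5,3/5
American Cocker Spaniel,United States,Sporting,Small,14.5,15.5,5/5,3/5
"""

NUMERIC_COLUMNS = ("male_height_min_in", "male_height_max_in")
RATING_COLUMNS = ("affection", "shedding")


def parse_rating(text: str) -> int:
    """Turn a rating such as '3/5' (or a bare '3') into the integer 3."""
    score = int(text.split("/")[0])
    if not 1 <= score <= 5:
        raise ValueError(f"rating out of range: {text!r}")
    return score


def load_breeds(handle) -> list:
    """Read breed rows and convert measurement and rating columns."""
    breeds = []
    for row in csv.DictReader(handle):
        for column in NUMERIC_COLUMNS:
            row[column] = float(row[column])
        for column in RATING_COLUMNS:
            row[column] = parse_rating(row[column])
        breeds.append(row)
    return breeds


breeds = load_breeds(io.StringIO(SAMPLE_CSV))
print(breeds[1]["breed_name"], breeds[1]["male_height_min_in"], breeds[1]["affection"])
```

For a file on disk, pass `open(path, encoding="utf-8", newline="")` instead of the `io.StringIO` wrapper, which matches the UTF-8 encoding listed in the technical details.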

The JSON-LD format is for web applications, semantic web projects, and structured data integration. Features include:

  • Schema.org markup for maximum compatibility
  • Stable URL identifiers for each breed
  • Hierarchical data structure
  • Machine-readable format for APIs and databases
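As a sketch of how stable identifiers and the hierarchical structure might be used together, the following Python example indexes breed nodes by `@id` and reads a nested rating. The `@id` URLs are placeholders, and the node layout (a `temperament` object holding `affection`) is an assumption modeled on the dotted attribute names in the Developer Notes, not a confirmed schema.

```python
import json

# Hypothetical breed nodes; the real @id URLs and nesting may differ.
BREEDS_JSONLD = """
[
  {"@id": "https://example.com/breeds/akita", "name": "Akita",
   "temperament": {"affection": "3"}},
  {"@id": "https://example.com/breeds/beagle", "name": "Beagle",
   "temperament": {"affection": "4"}}
]
"""


def index_by_id(nodes: list) -> dict:
    """Build a lookup table keyed by each node's @id, rejecting duplicates."""
    index = {}
    for node in nodes:
        node_id = node["@id"]
        if node_id in index:
            raise ValueError(f"duplicate @id: {node_id}")
        index[node_id] = node
    return index


breeds = index_by_id(json.loads(BREEDS_JSONLD))
akita = breeds["https://example.com/breeds/akita"]
print(akita["name"], akita["temperament"]["affection"])
```

Keying on `@id` rather than on the breed name keeps lookups stable if a display name or alternate spelling changes between releases.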

The CSV format is for data analysis, spreadsheet applications, and statistical research. Features include:

  • Universal compatibility with analysis tools
  • Flat structure optimized for filtering and sorting
  • Easy import into statistical software
  • Human-readable format for quick reference
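The filtering and sorting use case can be sketched with the standard library alone. The example below keeps breeds whose experience-level rating is at most 3 and orders them by shedding rating. The values are taken from the preview table, while the column names, the bare-integer rating format, and the threshold of 3 are assumptions chosen for illustration.

```python
import csv
import io

# Ratings taken from the preview table; header names are assumed.
SAMPLE_CSV = """\
breed_name,shedding,experience_level
Akita,3,5
Basset Hound,3,3
Beagle,4,2
Bernese Mountain Dog,5,3
Bichon Frisé,1,1
"""


def low_experience_by_shedding(handle, max_experience: int = 3) -> list:
    """Keep breeds at or below `max_experience`, sorted by shedding then name."""
    rows = [
        {
            "breed_name": row["breed_name"],
            "shedding": int(row["shedding"]),
            "experience_level": int(row["experience_level"]),
        }
        for row in csv.DictReader(handle)
    ]
    suitable = [row for row in rows if row["experience_level"] <= max_experience]
    return sorted(suitable, key=lambda row: (row["shedding"], row["breed_name"]))


result = low_experience_by_shedding(io.StringIO(SAMPLE_CSV))
print([row["breed_name"] for row in result])
```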

## [v20250723] - 2025-07-23

### Added
- Initial release.

Archived versions are also available at:

Developer Notes

Each of the 60 structured attributes in the dataset fits into one of three main data types. This classification supports schema mapping, API integration, and proper data handling in applications. Examples for each data type include:

| FIELD TYPE | DESCRIPTION | EXAMPLES |
| --- | --- | --- |
| Ordinal | Values on a 1 to 5 rating scale indicating ranked tendencies (not equidistant) | `temperament.affection = "3"`, `training.obedience = "4"`, `grooming.shedding = "5"` |
| Quantitative | Measurable numeric ranges, typically stored using schema.org `QuantitativeValue` objects | `heightMaleInches.minValue = 26`, `lifespan.maxYears = 15`, `weightFemaleKilograms.maxValue = 45.5` |
| Categorical | Non-numeric, discrete classifications or binary values | `coat.doubleCoat = "Yes"`, `ownerSuitability.firstTimeOwnerSuitable = "Not Suitable"`, `coat.hypoallergenic = "No"` |
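The three field types suggest three small parsers. The Python sketch below is one possible mapping, with all helper and class names invented for illustration. It assumes ordinal values arrive as strings (as in the examples), that ranges carry the schema.org `minValue` and `maxValue` keys, and that binary fields use `"Yes"` and `"No"`. Other categorical values, such as `"Not Suitable"`, are best kept as plain strings.

```python
from dataclasses import dataclass
from statistics import median


@dataclass(frozen=True)
class Range:
    """Minimum and maximum taken from a QuantitativeValue-style object."""
    min_value: float
    max_value: float


def parse_ordinal(raw: str) -> int:
    """Convert a string rating such as "3" to an integer from 1 to 5."""
    value = int(raw)
    if not 1 <= value <= 5:
        raise ValueError(f"ordinal rating out of range: {raw!r}")
    return value


def parse_range(node: dict) -> Range:
    """Read minValue/maxValue from a dict shaped like a QuantitativeValue."""
    low, high = float(node["minValue"]), float(node["maxValue"])
    if low > high:
        raise ValueError("minValue exceeds maxValue")
    return Range(low, high)


def parse_yes_no(raw: str) -> bool:
    """Map "Yes"/"No" to a boolean; any other value is rejected."""
    try:
        return {"yes": True, "no": False}[raw.strip().lower()]
    except KeyError:
        raise ValueError(f"expected Yes or No, got {raw!r}") from None


ratings = [parse_ordinal(value) for value in ("3", "4", "5")]
print(median(ratings))
print(parse_range({"@type": "QuantitativeValue", "minValue": 26, "maxValue": 28}))
print(parse_yes_no("Yes"))
```

The example summarizes ratings with a median rather than a mean because the scale is ranked but not equidistant, so averaging the numbers would imply spacing the data does not promise.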

Licensing & Attribution

The Dog Breeds Dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).

You are free to use, share, adapt, and redistribute the dataset for any purpose, even commercially, as long as proper attribution is provided.

Contribute & Collaborate

We welcome input from developers, researchers, and canine experts. To submit corrections, request new breed additions, suggest a new data field, or use the dataset in a research paper or product, reach out through the Contact Us page. Your feedback helps strengthen the dataset and support the canine research and development community.

Editorial Standards:

Our team of experts independently writes all dog breed facts and information to ensure they are trustworthy, accurate, and up-to-date. Learn about our fact-checking process and editorial standards.