The Pawsome Authority Dog Breeds Dataset is a structured, scientifically-backed dataset of dog breeds designed to support a wide range of canine-related projects. Whether you are developing an app, conducting academic research, or comparing breeds for suitability, the dataset provides a consistent, detailed foundation.
The dataset is built on our comprehensive Dog Breed Rating Methodology, which evaluates each breed across 30 dimensions, and currently features 60 internationally recognized breeds from major kennel clubs. The dataset is available in JSON-LD and CSV formats.
Dataset
Each breed profile includes 60 structured data points. Of these, 30 are Rating Dimensions evaluated using our standardized 1 to 5 scale. The remaining attributes include measurements, classifications, and descriptive fields:
CATEGORY
DESCRIPTION
Taxonomy
Breed name, pronunciation, alternate names, country of origin, etc.
Appearance & Characteristics
Height, weight, coat length, coat type, double coat presence, and hypoallergenic status
Temperament & Behavior
Rating Dimensions such as affection, playfulness, protectiveness, and interactions with children, pets, and strangers
Training & Exercise
Rating Dimensions such as obedience, intelligence, trainability, and attention span
Grooming & Maintenance
Rating Dimensions covering shedding level, grooming frequency, and drooling tendency
Health & Lifespan
Lifespan and Rating Dimensions covering predispositions to dental, ear, or eye issues
Breed Suitability
First-time owner compatibility and Rating Dimensions covering experience level required and apartment living
The 30 Rating Dimensions use a standardized 1 to 5 scale to reflect general tendencies within each breed.
A rating of 1 indicates a lower presence or tendency, such as minimal protectiveness, low energy, or fewer health concerns. A rating of 5 indicates a stronger presence or tendency, such as high playfulness, strong prey drive, or greater grooming needs. Ratings of 2 to 4 represent moderate levels, offering a nuanced view for comparison.
These ratings represent general breed tendencies. Individual dogs may differ based on genetics, environment, upbringing, and other factors.
Technical Details
The dataset is structured for consistency, interoperability, and long-term utility in research, development, and canine profiling. Key technical specifications include:
SPECIFICATION
DESCRIPTION
Subject
Pawsome Authority Dog Breeds Dataset
Description
A structured dataset of dog breeds including breed origins, physical measurements, coat characteristics, temperament ratings, health indicators, and lifestyle suitability, based on breed standards from major international kennel clubs
Total Breeds
60 internationally recognized breeds
Attributes per Breed
60 structured attributes (30 are Rating Dimensions)
Total Data Points
3,600 (60 breeds × 60 attributes)
Version
v20250723
Publication Date
2025-07-23
Last Updated
2025-07-23
Temporal Coverage
2023–2026
Spatial Coverage
Global
Taxonomy
Classification based on AKC, UKC, and FCI classifications
Data Sources
AKC, UKC, FCI, AVMA, veterinary literature, expert surveys, and proprietary analysis by Pawsome Authority
Validation Status
Verified by breed specialists using multi-source validation process
Formats
Available as JSON-LD and CSV
Units
Imperial (inches, lbs) and Metric (cm, kg)
Encoding
UTF-8 for universal character support
Identifiers
Stable web URLs and @id identifiers for each breed (used in JSON-LD)
Methodology
The dataset is built on a structured, multi-source rating methodology designed for consistency, transparency, and cross-breed comparability. Each breed is evaluated using 60 structured attributes, including 30 Rating Dimensions scored on a standardized 1 to 5 scale. Our rating process incorporates four primary sources:
CATEGORY
DESCRIPTION
Official Standards
Breed standards and definitions from major kennel clubs (AKC, UKC, FCI)
Veterinary Literature
Published research on breed-specific health, behavior, and care needs
Expert Input
Insights from breed specialists, veterinarians, dog trainers, and dog owners
Internal Research
Proprietary analysis, synthesis, and normalization by Pawsome Authority
All attribute data is normalized to a common scale to support side-by-side breed comparison and structured data applications.
Validation
Each dataset release is versioned and undergoes our five-step fact-checking and review process to ensure accuracy, consistency, and trustworthiness:
CATEGORY
DESCRIPTION
Step 1 (Collecting)
Breed data is gathered through automated systems and manual methods, then reviewed to identify traits needing verification
Step 2 (Verifying)
Each attribute is categorized by type and assessed using our internal credibility ranking system to focus on the most reliable information
Step 3 (Cross-Referencing)
All claims are checked against global sources and official standards to ensure consistency
Step 4 (Documenting)
Every verification step is recorded, and ratings are updated in real-time as new data becomes available
Step 5 (Reporting)
Profiles are compiled into detailed reports and peer-reviewed by experts and fellow fact-checkers before public release
Only data labeled as “Verified” is included in public dataset releases. This designation indicates the attribute has passed cross-checking, discrepancy resolution, and expert review across all five stages. Visit the Fact-Checking Process page for more information.
Dataset Formats
The dataset is available in JSON-LD and CSV formats. The JSON-LD format is a machine-readable, semantically rich format ideal for developers and structured data applications. The CSV format is a flat file for spreadsheets, data analysis, or importing into databases or statistical tools.
The JSON-LD distribution adheres to the schema.org Dataset specification and includes metadata for versioning, licensing, and attribution.
| BREED NAME | ORIGIN | GROUP | SIZE | MALE HEIGHT MIN (IN) | MALE HEIGHT MAX (IN) | MALE WEIGHT MIN (LBS) | MALE WEIGHT MAX (LBS) | COAT LENGTH | AFFECTION | PLAYFULNESS | GOOD WITH CHILDREN | SHEDDING | HEALTH | EXPERIENCE LEVEL |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Akita | Japan | Working | Extra Large | 26 | 28 | 100 | 130 | Medium | 3/5 | 3/5 | 3/5 | 3/5 | 4/5 | 5/5 |
| American Cocker Spaniel | United States | Sporting | Small | 14.5 | 15.5 | 25 | 30 | Medium | 5/5 | 4/5 | 5/5 | 3/5 | 4/5 | 3/5 |
| Australian Cattle Dog | Australia | Herding | Medium | 18 | 20 | 35 | 50 | Short | 3/5 | 4/5 | 3/5 | 3/5 | 3/5 | 4/5 |
| Australian Shepherd | United States | Herding | Medium | 20 | 23 | 50 | 65 | Medium | 4/5 | 4/5 | 5/5 | 4/5 | 3/5 | 4/5 |
| Basset Hound | France | Hound | Medium | 12 | 15 | 40 | 65 | Short | 3/5 | 3/5 | 4/5 | 3/5 | 4/5 | 3/5 |
| Beagle | England | Hound | Small | 14 | 16 | 22 | 30 | Short | 4/5 | 4/5 | 4/5 | 4/5 | 3/5 | 2/5 |
| Belgian Malinois | Belgium | Herding | Large | 24 | 26 | 60 | 80 | Short | 1/5 | 3/5 | 4/5 | 3/5 | 3/5 | 5/5 |
| Bernese Mountain Dog | Switzerland | Working | Extra Large | 25 | 27.5 | 80 | 115 | Medium | 4/5 | 3/5 | 4/5 | 5/5 | 4/5 | 3/5 |
| Bichon Frisé | Spain | Non-Sporting | Small | 9.5 | 11.5 | 12 | 18 | Medium | 5/5 | 4/5 | 5/5 | 1/5 | 2/5 | 1/5 |
| Bloodhound | Belgium | Hound | Large | 25 | 27 | 90 | 110 | Short | 3/5 | 3/5 | 4/5 | 3/5 | 4/5 | 3/5 |
Note: The full dataset in both JSON-LD and CSV formats includes all 60 breeds and respective traits and characteristics.
The JSON-LD format is for web applications, semantic web projects, and structured data integration. Features include:
- Schema.org markup for maximum compatibility
- Stable URL identifiers for each breed
- Hierarchical data structure
- Machine-readable format for APIs and databases
The CSV format is for data analysis, spreadsheet applications, and statistical research. Features include:
- Universal compatibility with analysis tools
- Flat structure optimized for filtering and sorting
- Easy import into statistical software
- Human-readable format for quick reference
## [v20250707] - 2025-07-23
### Added
- Initial release.
Archived versions are also available at:
- Zenodo (DOI: 10.5281/zenodo.16398461)
- Figshare (DOI: 10.6084/m9.figshare.29634542)
Developer Notes
Each of the 60 structured attributes in the dataset fits into one of three main data types. This classification supports schema mapping, API integration, and proper data handling in applications. Examples for each data type include:
FIELD TYPE
DESCRIPTION
EXAMPLES
Ordinal
Values on a 1 to 5 rating scale indicating ranked tendencies (not equidistant)
temperament.affection = "3"training.obedience = "4"grooming.shedding = "5"
Quantitative
Measurable numeric ranges, typically stored using schema.org `QuantitativeValue` objects
heightMaleInches.minValue = 26lifespan.maxYears = 15weightFemaleKilograms.maxValue = 45.5
Categorical
Non-numeric, discrete classifications or binary values
coat.doubleCoat = "Yes"ownerSuitability.firstTimeOwnerSuitable = "Not Suitable"coat.hypoallergenic = "No"
Licensing & Attribution
The Dog Breeds Dataset is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
You are free to use, share, adapt, and redistribute the dataset for any purpose, even commercially, as long as proper attribution is provided
Contribute & Collaborate
We welcome input from developers, researchers, and canine experts. To submit corrections, request new breed additions, suggest a new data field, or use the dataset in a research paper or product, reach out through the Contact Us page. Your feedback helps strengthen the dataset and support the canine research and development community.





