Skip to content
View axonlab-data's full-sized avatar

Block or report axonlab-data

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
axonlab-data/README.md

Axon Labs — Biometric AI Datasets

Research-driven datasets for face anti-spoofing, liveness detection, face recognition, and voice biometrics

axonlab.ai · sales@axonlabs.pro


About

Axon Labs builds training and evaluation datasets for biometric AI systems. Our data powers face recognition, liveness detection , iBeta certification preparation, KYC / eKYC verification, and voice biometrics.

We are used by 50+ clients — fintech companies, eKYC platforms, biometric SDK vendors, and security teams — and our data has contributed to 21% of iBeta 2025 certified solutions.

  • 180,000+ videos across all datasets
  • 30+ ready-to-license datasets
  • 1,000+ unique participants, diverse demographics and devices
  • All data collected with signed consent and GDPR-compliant

What clients usually ask — and where to find the answer

Every dataset page (repository) in this organization answers the six questions our clients ask most:

  1. Number of unique IDs — how many distinct subjects
  2. Number of videos / images — total volume
  3. Variability — lighting, environments, backgrounds, accessories, attributes
  4. Real-to-spoof pairing — whether genuine face videos are paired with matching attack videos
  5. Demographics — gender, age, ethnicity distribution
  6. Devices — how many and which specific device models were used for capture

Dataset categories

Face Anti-Spoofing & Liveness Detection (PAD)

Training and certification-grade datasets for ISO/IEC 30107-3 compliant systems and iBeta Level 1 / Level 2 / Level 3 preparation.

Additional PAD datasets we can share on request: silicone masks, latex masks, cloth masks, 3D resin masks, photo print attacks, replay display attacks, high-fidelity mask variants.

Face Recognition & Identity Verification

  • Selfie_and_Official_ID_Photo_Dataset — 6,000+ people, 70,000+ images: 10–15 photos per ID (selfies + 2 official ID photos). For face recognition, KYC verification, identity matching, biometric model training. Ages 18–65, balanced demographics.
  • human-faces-dataset-multiple-images — 1,000+ people, 10,000+ files: 8 photos per person + 2 videos each
  • age-estimation-minors-face-dataset — 10,000+ consented selfies of minors and young adults (10–30 years) with verified per-year age labels. Multi-ethnic, phone-captured. For under-18 age gating, age verification, age estimation models.

Browse all repositories in the Repositories tab.


Use cases

  • iBeta PAD certification (Level 1, 2, 3) preparation and auditing
  • eKYC and onboarding — selfie vs. ID matching, liveness gating, age verification
  • Face recognition training for fintech, banking, and telecom
  • Deepfake and presentation attack detection (PAD) research
  • Under-18 age gating for regulated platforms
  • Voice biometric authentication in contact centers

Compliance and ethics

  • All participants sign informed consent before capture
  • GDPR-compliant processing and storage
  • Commercial licensing — no web-scraped data
  • Attack datasets collected in controlled conditions with participant release

Contact

Popular repositories Loading

  1. Selfie_and_Official_ID_Photo_Dataset Selfie_and_Official_ID_Photo_Dataset Public

    6,000+ people, 70,000+ images: 10-15 photos per ID (selfies + 2 official ID photos). Perfect for face recognition, KYC verification, identity matching, and biometric training. Ages 18-65, balanced …

    3

  2. human-faces-dataset-multiple-images human-faces-dataset-multiple-images Public

    1,000+ people, 10,000+ files: 8 photos per person + 2 videos

    2

  3. age-estimation-minors-face-dataset age-estimation-minors-face-dataset Public

    Age estimation face dataset: 10,000+ consented selfies of minors & young adults (10-30 years) with verified per-year age labels. Multi-ethnic, phone-captured. Built for under-18 age gating, age ver…

    2

  4. partial-paper-mask-face-anti-spoofing-dataset partial-paper-mask-face-anti-spoofing-dataset Public

    Partial paper mask attack dataset for face anti-spoofing, liveness detection, and presentation attack detection (PAD). 3,000 videos, 50 participants, dual-device capture.

    2

  5. display-replay-attack-face-anti-spoofing-dataset display-replay-attack-face-anti-spoofing-dataset Public

    Display replay attack dataset for face anti-spoofing and liveness detection. 9,000+ videos from 6,500+ participants across PC monitors and mobile devices

    2

  6. silicone-mask-face-anti-spoofing-dataset silicone-mask-face-anti-spoofing-dataset Public

    Silicone mask attack dataset for face anti-spoofing and liveness detection. 12,500+ videos, 18 silicone masks, 40+ accessory combinations. iBeta Level 2 compliant

    2