Skip to content

TheNude: fix performer data quality issues#2726

Open
FlashSpazzbo wants to merge 1 commit intostashapp:masterfrom
FlashSpazzbo:fix/thenude-improvements
Open

TheNude: fix performer data quality issues#2726
FlashSpazzbo wants to merge 1 commit intostashapp:masterfrom
FlashSpazzbo:fix/thenude-improvements

Conversation

@FlashSpazzbo
Copy link
Copy Markdown
Contributor

Scraper type(s)

  • performerByName
  • performerByURL

Changes

  • URLs: Normalize to canonical /_NNNNN.htm form, eliminating spaces in URLs (e.g. /Alice Mido_53862.htm/_53862.htm)
  • Birthdate: Blank incomplete dates (year-only, month+year, ??-??-YYYY placeholders) before parseDate, instead of silently defaulting day to 01
  • FakeTits: Remove incorrect Fake → Augmented mapping. "Fake" is the correct Stash internal value
  • CareerLength: Add spaces around dash separator ("2022 - 2026")
  • Tattoos/Piercings: Removed — TheNude data is unreliable (reports "None" for performers known to have body art)
  • Image URL: Encode spaces for valid CDN URLs

Test performers

  • Alice Mido: https://www.thenude.com/_53862.htm (partial birthdate ??-??-1996, multiple social links)
  • Carly Lauren: https://www.thenude.com/_28377.htm (month-only birthdate July 1990, fake_tits)

- Normalize URLs to canonical /_NNNNN.htm form (no spaces)

- Blank incomplete birthdates instead of defaulting day to 01

- Fix fake_tits: keep 'Fake' as-is (correct Stash internal value)

- Career length: add spaces around dash separator

- Remove tattoos/piercings (TheNude data unreliable)

- Encode spaces in performer image URLs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant