A sample of production scraping systems built for court records, ecommerce catalogs, real estate listings, image datasets, reviews, monitoring, and lead generation.
Courts · PDF extraction
900K+ PDFs collected
Nevada Supreme Court document pipeline
Built a large-scale scraper to harvest court case documents from the Nevada Supreme Court system, structure the records, and prepare them for searchable legal data workflows.
Source: caseinfo.nvsupremecourt.us
Courts · Historical data
160K+ cases · 220K+ PDFs
Oregon statewide court archive scraper
Extracted statewide Oregon court case records and related PDF documents across a long historical window, turning fragmented public records into structured case data.
Sources: trportal.courts.oregon.gov · cdm17027.contentdm.oclc.org
Courts · Docket data
15K+ historical cases
Montana Supreme Court docket scraper
Collected Montana Supreme Court docket records, each case with multiple PDF documents, covering 1979 to 2026 for legal research and archival use.
Source: supremecourtdocket.mt.gov
Courts · Structured records
37K+ PDFs processed
Missouri judicial portal extraction
Scraped and structured court documents from Missouri judicial portals, converting public case records into clean, organized datasets.
Source: courts.mo.gov
Courts · Case documents
10K+ PDFs extracted
Arizona court document scraper
Built a scraper for Arizona court resources to collect case documents, normalize extracted fields, and deliver clean data for downstream review.
Source: azcourts.gov
AI · Ecommerce data
30+ storefronts scraped
European product catalog pipeline
Built a multi-source ecommerce scraper covering European wholesale and retail storefronts, extracting product names, pricing, EAN/UPC, currency, availability, and category data.
Market: European ecommerce · B2B & retail
Real estate · Daily scraping
Daily property listings
Greece real estate listing scraper
Created a recurring scraper for new sale and rental listings from one of Greece’s largest property portals, including full listing fields, images, and structured database delivery.
Source: xe.gr
Real estate · Image scraping
Listings + photos collected daily
Spitogatos property data pipeline
Built a daily scraper for residential and commercial listings from Spitogatos, collecting property details, listing metadata, and photos into a MySQL database.
Source: spitogatos.gr
Dataset · Image collection
480K+ images collected
Large-scale Baidu image dataset
Collected a large image dataset for machine learning and research workflows, with scalable image downloading, source tracking, and organized dataset delivery.
Source: baidu.com
Google Maps · Lead gen
18K+ business records
Banking lead generation dataset
Built a Google Maps-based business intelligence dataset for targeted outreach, collecting banking-related business records by location and category.
Source: Google Maps
Monitoring · Daily updates
200+ records/day
Automated lottery results monitor
Created an ongoing monitoring system to collect newly published lottery outcomes, update records daily, and keep the dataset fresh without manual tracking.
Source: thelott.com
Lead gen · Retail data
3K+ store records
European Pokémon TCG store dataset
Collected regional store and contact data for the trading card market across Europe, creating a targeted retail dataset for outreach and market mapping.
Market: Europe · Multi-source
Product data · Daily pipeline
100 products/day
Product and video trend scraper
Built a daily extraction pipeline for product details and video data to support trend analysis, product research, and ecommerce decision-making.
Source: Kalodata
Reviews · Sentiment data
1K+ reviews/day
Amazon review monitoring pipeline
Created a high-frequency review scraper for brand and sentiment tracking, collecting fresh customer feedback to support product research and reputation monitoring.
Source: Amazon · Befa Natur