Premium Social Media Datasets
Ditch the scrapers. Get instant access to clean, structured, ready-to-use social media data — pre-processed, deduplicated, and refreshed on your schedule. Purpose-built for AI/ML training, academic research, and enterprise intelligence.
No API rate limits · No infrastructure overhead · No maintenance burden
Refresh Rate
Volume of Records
473KWhy Choose Us
Enterprise-grade data quality
Built for the demands of AI training, quantitative research, and large-scale business intelligence — not weekend side projects.
Up to 102 Fields per Record
Every record ships with rich metadata — engagement metrics, timestamps, content signals, author attributes, and platform-specific fields. No sparse tables.
Multi-Platform Coverage
Data sourced from TikTok, Instagram, Twitter/X, YouTube, Reddit, Amazon, LinkedIn, and more — unified schema, consistent quality across every source.
Flexible Refresh Cadence
Choose one-time delivery or recurring subscriptions (monthly, quarterly, semi-annual). Supports full snapshots, incremental additions, or delta updates only.
Your Infrastructure, Your Way
Deliver directly to your S3 bucket, Azure Blob, GCS, or pull via API or Webhook. We integrate with your existing data stack — no new tools required.
Any Format You Need
Export as JSON, CSV, Parquet, or compressed archives. Column naming conventions and schema documentation included with every delivery.
Data Integrity Insights
Detailed fill-rate reports and per-column statistics shipped alongside every dataset. Validate coverage before you commit to a full integration.
Dataset Catalog
Browse Available Datasets
30 datasets across 8 platforms. New datasets added monthly.
TikTok Creator Index — Global Top 10M
TikTokComprehensive profiles of 10M+ TikTok creators worldwide — follower counts, engagement rates, niche categories, and 12-month growth history.
TikTok Video Metadata Archive — 2025
TikTokFull metadata for 50M+ trending videos: views, shares, comments, audio track, hashtags, publish timestamp, and linked creator info.
TikTok Comment Sentiment Corpus
TikTok200M+ TikTok comments enriched with sentiment scores, language detection, and topic classification. High-quality NLP training corpus.
TikTok Hashtag Performance Index
TikTokWeekly performance snapshots for 2M+ hashtags: reach, engagement velocity, growth rate, and correlated content category distribution.
TikTok Shop Product Review Dataset
TikTokProduct-linked TikTok content with purchase intent signals, review sentiment, product metadata, and creator monetization attributes.
TikTok Sound & Music Trend Data
TikTokTrending audio tracks with usage counts, creator adoption curves, geographic spread, and content category breakdowns over 24 months.
Instagram Influencer Database — 8M+ Profiles
InstagramDetailed data on 8M+ Instagram accounts: follower counts, engagement rates, post frequency, audience demographics, and niche classification.
Instagram Post Engagement Analytics
InstagramEngagement metrics for 300M+ Instagram posts: likes, comments, saves, estimated reach, impression counts, and hashtag performance scores.
FAQ
Common questions
How is the data delivered?
We support direct delivery to your cloud storage (AWS S3, Azure Blob, Google Cloud Storage), download via secure link, or pull via API/Webhook. You choose the method that fits your stack.
What formats are available?
All datasets are available in JSON, CSV, and Parquet. Compressed archives (.gz, .zip) are included at no extra cost. Schema documentation and column definitions ship with every delivery.
How fresh is the data?
Freshness depends on the refresh cadence you select. Monthly datasets are updated within the first week of each month; weekly datasets are updated every Monday. One-time purchases reflect the most recent available snapshot.
Can I get a sample before buying?
Yes. Click the "Sample" button on any dataset card to request a free 1,000-record sample in your preferred format. Samples are delivered within one business day.
Is the data compliant with platform Terms of Service?
Our datasets are sourced from publicly available content and processed in accordance with applicable platform terms, GDPR, and CCPA requirements. We provide a data provenance report with every purchase.
What if I need a dataset that isn't in the catalog?
We build custom datasets to order. Submit a request with your platform, volume, required fields, and delivery format, and we'll respond within one business day with a feasibility assessment and pricing.
Can't find what you need?
Every data project is different. Tell us your exact requirements — platform, volume, fields, refresh cadence, and delivery format — and we'll scope a custom dataset tailored to your use case.
We respond within one business day with a feasibility assessment and quote.
Request a Custom Dataset