
Data Extraction Projects
Looking for freelance Data Extraction jobs and project work? PeoplePerHour has you covered.
LinkedIn Profile Data Extraction
I need comprehensive data extracted from approximately 375 LinkedIn profiles and delivered in a structured JSON format. You will use your own tools and infrastructure, ensuring each profile's roles are individually detailed.
Scope of work
- Extract info from 375 LinkedIn profiles, including full name, headline, location, LinkedIn URL, email, phone, about text, connection count, follower count, profile photo URL, premium status, and verified status.
- For each profile, list every individual role separately with detailed info, including company name, job title, start date, end date, duration, location, employment type, description text, company industry, company size, company website, and LinkedIn URL.
- Capture education details like school, degree, field of study, dates, and activities or societies.
- Extract certifications with name, issuing organization, issue date, and credential ID.
- Compile skills with endorsement counts, languages and proficiency, volunteer work, honors/awards, recommendations, featured section content, groups, and interests/follows.
- Include recent posts/activity with full text, post date, likes, and comments with names.
Additional information
You'll start with a test batch of 5 profiles and proceed with the full list upon approval. Please go through the attached PDF.
21 days ago | 13 proposals | Remote
BNI contact details extraction
I am looking for a freelancer to build a spreadsheet database of BNI members worldwide. The task is to visit the official BNI member directory/website and collect available contact details for members whose profession/speciality matches any of the following categories:
- Ticketing
- Tours/Tour Guide
- Travel Agent
- Visa Consultant
- Travel (Other)
This is for all countries where BNI member listings are publicly available.
The final deliverable should be a clean spreadsheet with the information organized logically, including fields such as:
- Country
- City/Region
- Chapter name
- Member name
- Company name
- Profession/Speciality
- Phone number, if available
- Email, if available
- Website, if available
- Profile URL/source link
- Any notes
Requirements:
- Accuracy is very important.
- Only collect publicly available information.
- Provide source links for verification.
- The spreadsheet should be clean, deduplicated, and easy to filter.
Please mention your estimated timeline, cost, and how many records you expect to collect.
6 days ago | 22 proposals | Remote
I need an expert on scraping with Apollo.io
Hi, I need an expert to help us scrape some data with Apollo.io. We need to scrape and extract contacts in France who work in human resources functions.
14 days ago | 25 proposals | Remote
opportunity
Document Extraction & AI Query Platform (second stage)
Overview
We are building a system that collects and analyses documents from UK council websites. Stage 1 has already been completed and is working. It successfully:
- Scrapes a council website
- Identifies and downloads document files (primarily PDFs)
- Stores those files in a structured format
- Extracts basic text for inspection
Stage 2 is to build on this foundation and develop a scalable backend system that can operate across multiple councils, organise documents, extract useful content, and enable AI-based querying of that data.
Scope
2A(i) – Scraping & Document System
Develop the existing scraper into a system that can:
- Explore council websites and locate documents across multiple sections
- Download and store documents in an organised and structured way
- Track documents over time (new, existing, changed, duplicate)
- Categorise documents (e.g. minutes, agendas, policies)
- Extract basic information (titles, dates, sections where possible)
- Provide clear visibility of what has been found, stored, and processed
2A(ii) – Multi-Council Validation
- Extend the system from a single working example to at least 3 different council websites
- Demonstrate that it adapts to different website structures
2B – Document Processing & Structuring
- Extract readable text from documents
- Clean and structure the content
- Break documents into smaller usable sections
- Link all extracted content back to its source
- Prepare the data for both keyword and semantic search
2C – AI Query Capability
- Accept natural language questions about council documents
- Use AI to identify and retrieve relevant content
- Return clear answers grounded in the documents
- Include references to source material
- Indicate when no reliable answer is available
Core Requirements
- System must build directly on the existing Stage 1 functionality
- Must be usable across multiple councils
- Must be accessible via a backend interface (API)
- Must run reliably and allow monitoring of processes
- Must allow inspection of stored documents and extracted data
- Must be structured so a multi-user frontend can be built on top
Deliverable
A working backend system that:
- Extends the existing Stage 1 scraper into a multi-council system
- Collects, tracks, and organises council documents
- Extracts and structures document content
- Supports AI-based querying with referenced answers
- Has been demonstrated across multiple council websites
Please only provide FIXED bids. Placeholder bids will be immediately rejected. Any bid will be deemed your full and final price for the job. Please add the text 'This is my full and final bid based up your job description' to your message to confirm understanding of this. The budget is only an auto suggestion by PPH and is not reflective of my assessment of the job value. Please take the time to calculate what you believe to be the cost and tailor your bid accordingly. AI responses will be rejected.
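For bidders scoping the "track documents over time (new, existing, changed, duplicate)" requirement, one possible approach is to key each download on a content hash. The sketch below is illustrative only and is not taken from the Stage 1 code: the `DocumentIndex` class, its `classify` method, and the in-memory dictionaries are all hypothetical names, and a real system would presumably persist this state in a database.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class DocumentIndex:
    """Hypothetical in-memory index of downloaded documents."""

    # Last known content hash for each document URL.
    hash_by_url: dict[str, str] = field(default_factory=dict)
    # First URL at which each content hash was seen.
    url_by_hash: dict[str, str] = field(default_factory=dict)

    def classify(self, url: str, content: bytes) -> str:
        """Return 'new', 'existing', 'changed' or 'duplicate' for a download."""
        digest = hashlib.sha256(content).hexdigest()
        previous = self.hash_by_url.get(url)
        if previous == digest:
            # Same URL, same bytes as last time.
            return "existing"
        first_seen_at = self.url_by_hash.get(digest)
        if first_seen_at is not None and first_seen_at != url:
            # Identical bytes already stored under a different URL.
            status = "duplicate"
        elif previous is None:
            status = "new"
        else:
            # Same URL, different bytes.
            status = "changed"
        self.hash_by_url[url] = digest
        self.url_by_hash.setdefault(digest, url)
        return status
```

With this sketch, fetching the same URL twice with identical bytes yields "new" then "existing", a later fetch with different bytes yields "changed", and the same bytes under a second URL yield "duplicate". Hashing raw bytes is the simplest choice; hashing extracted text instead would tolerate PDFs that are regenerated without any change to their content.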
4 days ago | 30 proposals | Remote
I need information off of websites and sheets for CRM set up
We require meticulous consolidation of legacy event and membership data for CRM ingestion. Extract detailed event and attendee records from Eventbrite into a structured spreadsheet, ensuring each event and participant is logged. Cleanse and harmonize historical event CSVs and membership exports, map fields consistently to a master template, resolve duplicates and formatting issues, and deliver a validated, CRM-ready master spreadsheet with clear field mappings and data integrity checks.
21 days ago | 41 proposals | Remote
Data Scrape a Facebook Group
Hi, I need someone to scrape data from a Facebook group.
6 days ago | 24 proposals | Remote
UK Company Data Enrichment Specialist
Summary
We are looking for a skilled data enrichment specialist to help us enhance an existing dataset of UK-based companies. We already have a list of company names (sourced from Companies House). Your role will be to enrich this dataset with accurate and verified business and founder-level information.
Scope of Work:
For each company, extract and validate:
* Company website
* Founder(s) / Director(s) name (if needed, validate existing data)
* Founder email address (preferred: direct, not generic)
* Founder and Company phone number
* Founder LinkedIn profile URL
* Company LinkedIn page URL
* Estimated employee count
Data Quality Expectations:
* High accuracy (no guesswork or random scraping)
* Verified emails
* Avoid generic emails unless no alternative exists
* Provide confidence level or source for each data point
Tools & Approach:
Please clearly explain your approach in your proposal.
Deliverable Format:
Spreadsheet (Google Sheets or Excel) with structured columns:
Company Name | Company Email | Company Phone | Website | Founders Name | Founders Email | Founders Phone | Founders LinkedIn | Company LinkedIn | Employee Count | Source/Notes
What We’re Looking For:
* Proven experience in B2B data enrichment / lead generation
* Strong research skills (especially UK companies)
* Ability to maintain accuracy at scale
* Clear communication and structured delivery
To Apply:
Please include:
1. Your approach to finding verified founder emails
2. Sample of similar work (if available)
3. Turnaround time
Bonus: Experience working with Companies House data or UK-based businesses is a plus.
9 days ago | 18 proposals | Remote
Senior Data Engineer
We are looking for a Senior Data Engineer to serve as a technical leader within our Analytics Engineering team. In this role, you will design and build scalable data platforms and high-impact data products that power critical business decisions, analytics, and machine learning use cases. You will work cross-functionally with engineering, product, data science, and business teams to deliver reliable, high-quality data solutions while setting standards and best practices across the organization.
- Design, build, and maintain scalable data pipelines and data products
- Architect robust data models and transformation frameworks
- Lead end-to-end data platform initiatives (design → development → deployment)
- Define and implement best practices for data quality, testing, and observability
- Collaborate with cross-functional teams to gather requirements and deliver solutions
- Optimize data systems for performance, scalability, and cost-efficiency
- Mentor engineers and contribute to team-wide technical standards
- Drive adoption of modern data tools and frameworks
- Build reusable components and improve overall platform efficiency
- 5+ years of experience in Data Engineering or Analytics Engineering
- Strong expertise in SQL and Python
- Experience building and maintaining large-scale data pipelines
- Hands-on experience with:
  - Cloud platforms (AWS, GCP, or Azure)
  - Data warehouses (Snowflake, BigQuery, Redshift)
  - Data transformation tools (dbt or similar)
  - Workflow orchestration tools (Airflow, Dagster, etc.)
- Strong understanding of data modeling, ETL/ELT, and data architecture
- Experience with CI/CD and DevOps practices for data systems
- Ability to lead complex projects and work across teams
- Strong communication skills (technical + non-technical)
- Experience supporting machine learning workflows
- Knowledge of data governance and data quality frameworks
- Experience with cost optimization (FinOps)
- Background working in startup or high-growth environments
- Experience building internal data platforms or shared infrastructure
- Strong problem-solving and system design skills
- Passion for building scalable and maintainable systems
- Ability to work with ambiguity and drive clarity
- Leadership mindset with a focus on mentoring and collaboration
- Continuous improvement mindset with attention to quality and performance
- Experience with real-time data processing
- Exposure to data observability tools
- Experience designing semantic layers or metrics layers
13 days ago | 21 proposals | Remote
opportunity
PhD Qualitative Data Analysis with NVivo
Seeking an experienced qualitative researcher to conduct thematic analysis for a PhD study using NVivo on 20 interview transcripts. The ideal candidate will possess robust expertise in qualitative methods, demonstrated NVivo proficiency, and a track record of academic-standard analysis. Responsibilities include coding, theme development, analytic memos, and clear presentation of findings aligned with research objectives. Meticulous attention to detail, insightful interpretation of complex data, and adherence to rigorous methodological practices are essential.
5 days ago | 12 proposals | Remote
Looking for a data administrator
We are looking for a data administrator. The requirements for this role are:
- You need to be based in Eastern Europe, the United States or Canada
- This is a part-time position, not full time, with a few hours of work per week
- You need to speak English
If this suits you, please share your CV.
Best regards
21 days ago | 14 proposals | Remote
OGL Software & Sales Vision Stock Data Analysis
Abacus Creative Resources uses a software system called OGL, which has ODBC and uses pivot tables to analyse data such as stock patterns and trends. We are looking for someone who has experience with OGL, or similar, as we want to set up a routine analysis system. Currently I look through daily despatch notes, which takes me an hour each day. This tells me what our daily fulfillment rate is along with new customers, helps identify which products are selling well, and helps me plan for offers. We would like to automate this so that we can analyse the data quickly.
15 hours ago | 18 proposals | Remote
DATA ANALYTIC STRUCTURE
Seeking a skilled data analytics structure specialist to collaborate with a UX designer. Project involves organizing, modeling, and documenting data flows, creating clean, scalable schemas, and defining metrics to inform user experience decisions. Deliverables: annotated data architecture diagrams, data dictionary, ETL blueprint, and recommendations for instrumentation and dashboards. Require clear, concise documentation and pragmatic solutions to support iterative UX research and product design.
20 days ago | 12 proposals | Remote
Audio Data Collection with Wireless Earbuds
I need two participants for an audio data collection project using wireless earbuds. The task involves recording natural conversation between two people in a quiet indoor environment using the Riverside platform.
Project Details:
* Total Sessions: 2 recordings (10 minutes each)
* Participants: Exactly 2 people
* Device Requirement: Wireless earbuds with microphone (e.g., AirPods or similar)
* Audio Format: WAV
* Recording Method: Audio must be captured only through the earbuds microphone (not phone/laptop mic)
Recording Process:
1. Session 1 (10 minutes):
   * Person A wears earbuds (primary speaker)
   * Person B sits 1–3 meters away (secondary speaker)
2. Session 2 (10 minutes):
   * Roles are switched
   * Person B wears earbuds (primary speaker)
   * Person A sits 1–3 meters away
Requirements:
* Continuous recording (no pauses, cuts, or edits)
* Natural conversation (no scripted reading)
* Distance must remain between 1 and 3 meters
* Both participants must be physically present in the same room
* Earbuds must remain in use throughout the session
Additional Requirements:
* Provide metadata including:
   * Earbud brand/model
   * Distance between participants
   * Ages of participants
   * Recording duration
   * Environment details (room setup, objects)
   * Background noise type and level
   * Room size category
   * Links to uploaded WAV files
Important Notes:
* Perform a short test recording before starting
* Ensure devices are fully charged
* Follow all instructions strictly to avoid rejection
Deliverables:
* Two 10-minute WAV audio files (one per primary speaker session)
* Completed metadata sheet with all required details
This is a simple task but requires strict adherence to guidelines and high-quality, natural audio recording.
13 days ago | 0 proposals | Remote
Data Research Assistant – Trucking Companies
I’m looking for a detail-oriented assistant to research trucking companies that hire CDL drivers with 0–12 months of experience. This is strictly a research and data entry task. No outreach, account creation, or posting on external platforms is required.
Responsibilities include:
* Researching trucking company websites
* Identifying hiring requirements for entry-level CDL drivers
* Collecting accurate information from official company career pages
* Organizing data into a structured spreadsheet
All information must be sourced directly from public company websites. A short sample task may be requested to confirm fit and accuracy.
13 days ago | 27 proposals | Remote
Excel & Automation Specialist Needed / Fix Data Tool
We are seeking an experienced Excel and automation specialist to take over, troubleshoot, and enhance an existing data management tool, as well as implement automation for invoicing and customer communications. This project involves improving an Excel-based system currently used to manage student course data, alongside building a streamlined workflow for generating invoices and sending confirmation emails.
**Part 1: Existing Excel Tool (Fix & Optimisation)**
An Excel-based tool has already been developed to extract and organise course sales data. It captures key student information, including:
* Full name
* Date of birth
* Address
* Enrolled course
* Course start and end dates
**Current functionality includes:**
* Generating student name lists for class registers
* Exporting student data for certification purposes
* Structuring data for upload to a governing body platform
**Current issue:**
The data refresh function is not working correctly. When attempting to update the dataset with the latest orders, an error alert appears and the refresh fails.
**Requirements:**
* Diagnose and fix the refresh/data connection issue
* Review and optimise the existing tool
* Ensure reliable and efficient data updates
* Improve usability where necessary
**Part 2: Invoice & Email Automation**
We also require automation of our invoicing and confirmation email process.
**Current workflow:**
* Order/customer data is exported from our WordPress website
* Invoices and confirmation emails are created and sent manually
**Requirements:**
* Automatically generate invoices using order data
* Create professional invoice templates (PDF format preferred)
* Automatically send confirmation emails to customers
* Emails must include accurate course and student details (course name, dates, etc.)
* Attach invoices to emails where applicable
**Integration & Workflow:**
* The solution must work with our existing WordPress data exports (CSV format)
* We are open to the best technical approach (Excel, Power Automate, VBA, Zapier, Make, or other solutions)
* The system should be reliable, easy to use, and suitable for ongoing operational use
**Deliverables:**
* Fully functional and stable Excel tool with working data refresh
* Automated invoice generation system
* Automated email confirmation system
* Clean templates for invoices and emails
* Documentation or handover instructions
Please Include in Your Proposal:
* Relevant experience (Excel automation, Power Query, VBA, APIs, or workflow tools)
* Examples of similar projects
* Your proposed approach/tech stack
* Estimated timeframe
* Cost estimate
We are looking for someone who can take ownership of this project, resolve existing issues, and deliver a reliable, long-term solution.
19 days ago | 76 proposals | Remote
opportunity
AI-Driven Survey Data Analysis App
I need help creating an AI application to analyze survey data. The surveys focus on public sentiment about social issues and government policy. The goal is to glean meaningful insights from survey responses.
Scope of work
- Analyze survey response data in xlsx format
- Identify key findings and trends
- Generate a PowerPoint report with data visualizations
- Develop a user-friendly application with detailed documentation
Additional information
Looking for someone who is based in Singapore and open to face-to-face interactions.
Application Type: Desktop Application
11 days ago | 25 proposals | Remote
opportunity
Data Platform Launching
We are launching our B2B database over the coming weeks and need the documents below drafted. Please specify whether you can provide them, and whether you are covered from an insurance perspective if we use your documents and wording.
- SaaS Terms of Service (B2B subscription)
- Customer contract
- Website Terms & Conditions
- Privacy Policy (UK GDPR compliant)
- Data Processing Addendum (if required)
Thank you
24 days ago | 24 proposals | Remote
Sold UK Property Data
I am looking for an experienced UK property data researcher / scraper to compile a high-quality dataset of sold residential properties in England & Wales (C3 and C4 use classes), over the past 4 years, where the buyer is a limited company. The output must be delivered in Excel or CSV, cleanly formatted and deduplicated.
For each transaction, the data we require:
- Company name of purchaser
- Registered address of company
- Address of sold property
- Sale price
- Completion / sold date
Please explain:
- How you would identify the data
- Timeline
- Examples of similar UK property data projects
This may lead to ongoing repeat work for the right freelancer. Please start your proposal with: “UK property data understood”
20 days ago | 16 proposals | Remote
SketchUp drawings → Cut list + detailed furniture drawings
I need a skilled freelancer to extract all dimensions from my SketchUp models and produce:
- Full cut list (board sizes, quantities, materials)
- Detailed technical drawings for manufacturing
- Clear labeling of all parts
Project details:
- Furniture: wardrobes / cabinets
- Software: SketchUp files provided
- Accuracy is critical (this will be used for production)
Requirements:
- Experience with furniture manufacturing drawings
- Experience with cut lists
- Ability to spot missing parts (doors, panels etc.)
Deliverables:
- Cut list (Excel or PDF)
- Detailed drawings (PDF)
11 days ago | 13 proposals | Remote
Bookkeeper (Care MIS + Accounting), On-site
Care Sector Bookkeeper (Care MIS + Cloud Accounting), On-site, 55 hrs/month retainer
Summary
We’re a domiciliary care business looking for a reliable, experienced bookkeeper with strong experience extracting and reconciling payroll and billing data from care management / Care MIS systems, and using cloud accounting software. This is an ongoing role with fixed monthly capacity and clear controls.
Work pattern / location
On-site attendance required (schedule agreed in advance). Monthly workload is driven by a fortnightly billing cycle plus a monthly payroll cut-off.
Core duties (high level)
- Prepare fortnightly billing submissions using Care MIS outputs and required portal/admin steps.
- Prepare monthly payroll input pack (hours, leave, sickness, overtime etc.) for accountant verification; payroll pack must be ready 3–4 working days before month end.
- Maintain basic sales ledger/admin in the accounting system and keep records audit-ready.
- Bank reconciliation is exception-based only (investigate over/under-payments where they arise), not routine weekly reconciliation.
Volume context (for estimating effort)
- Approx. 45 hourly staff + 8 salaried staff
- Approx. 63 invoices/month
8 days ago | 4 proposals | On-site in Guildford, GB