Agenda 2020

Pierluigi Vinciguerra
Running a Business On Web Scraped Data
Every day we hear sentences like "Data is new oil" or "web data is a gold mine" and that's definitely true. In this talk, see how establishing a business based on web scraped data has much more in common with old traditional mining companies. Pierluigi will cover the processes, tasks, operators and tools needed to run a reliable and modern company and avoid as much as possible the hassles that web scraping can hide.
Presented by Pierluigi Vinciguerra - CTO and Co-Founder of Re Analytics

Bio

Pierluigi Vinciguerra, CTO and Co-Founder of Re Analytics, a data boutique for consumer and luxury goods.  10+ years of expertise in data management, from web data integration and scraping to business intelligence, actually in Re Analytics where we crawl 1+ Billion price points every month and extract insights from them for investors and c-level in Consumer and Luxury goods.
Tues 10th Nov 2020
Time: TBC
carles
How Venture Capital Firms use Web Scraping to find the next Billion-Dollar Company
The goal of any Venture Capital firm is to invest in successful companies to get a return on their investments, so they are on a constant search for promising startups.  But subscribing to a data feed or a data platform that everyone else has access to, doesn’t give firms an edge. That’s where web scraping comes in, bringing much more productivity in the research process. In this talk, Carles explains how Venture Capital firms use web scraping to make data-driven investment decisions, and how they rely on advanced data analytics in their processes.
Presented by Carles Illa, Software Engineer and Data Scientist at Nauta Capital

Bio

Carles Illa is a Software Engineer and Data Scientist at Nauta Capital, a Pan-European Venture Capital Firm investing in early-stage software companies.
He works on the development of the Dealflow Engine, a proprietary tool to automatically extract, structure, and enrich data of potential investment opportunities.

Tues 10th Nov 2020
Time: TBC
rameez
Everyday Low Pricing Strategy with Scraping Hub, Google App Scripts & Heroku
Is your business suffering because your competitors are undercutting your prices? Are you struggling to keep up? Say hello to your very own Price-spy! Made with free tools like Scraping Hub, Google App Scripts, Heroku, you can see how your competitors' prices are changing.
Presented by Rameez Kakodker, Digital Transformation Expert

Bio

With over 30 eCommerce launches in the last 10 years, Rameez has been at the helm of digital transformation within the GCC & Asia region. He features regularly on Medium and supports startups with their roadmap planning and product.
Tues 10th Nov 2020
Time: TBC
ondra
Web Scraping in 2020
As the web evolved from static sites to complex JavaScript applications, even the techniques and tools needed to scrape it have changed. From plain HTTP requests to robotized browsers - This talk will show you all the tricks you need to extract data from the modern web reliably and scalably.
Presented by Ondra Urban, Technical Web Scraping Expert

Bio

Ondra is a hacker of the browser age. He extracts terabytes of publicly available data and translates them to a language that machines can understand. At Apify, Ondra leads a team of fellow hackers who grow their open source projects, break anti-scraping walls, and dabble in AI.
Tues 10th Nov 2020
Time: TBC
amanda
TellFinder Alliance: Tacking Online Explotiaiton with Data
Last year at Extract Summit, we talked about out five-year project to build a bleeding-edge data collection and extraction pipeline to fight human trafficking. This year, we would like to expand on this topic to discuss how we pivot our pipeline to tackle a broader array of online exploitation, how having a solid foundation makes this a tractable, efficient process, and the impacts we can have on the world.
Talk by Amanda Towler, Co-Founder & Principal Investigator Hyperion Gray, LLC

Bio

Amanda is the Co-Founder and Principal Investigator at Hyperion Gray, LLC, a technology R&D small business working primarily with the Defense Advanced Research Projects Agency (DARPA). She has a decade of experience spanning OSINT, offensive security, data science, and software development. She has consulted with law enforcement on several high profile dark web child exploitation cases.
Tues 10th Nov 2020
Time: TBC
ivan
Introducing AutoCrawl - The AI-Powered Crawler
AI is disrupting the ecosystem, altering every single process with new machine-learning powered approaches. In this talk, Iván will show how this impacts the world of data crawling by introducing AutoCrawl, an AI-powered crawler capable of gathering data from websites automatically.
Presented by Iván de Prado Alonso, Data Scientist, ScraphingHub

Bio

Iván is a Data Scientist at Scrapinhub who loves Deep Learning and Computer Vision. He has 10+ years of experience working for and with startups, dealing with the greatest technical challenges at each.
Tues 10th Nov 2020
Time: TBC
victor
Separating Extraction From Crawling Logic With Web-Poet
What are Web-poet and Scrapy-poet projects? How do they work and how could they be helpful? Victor will take you through the state of development, the foreseeable future, and their relation with AutoExtract and AutoCrawl projects.
Presented by Victor Torres, Web Scraping Python and Scrapy Guru

Bio

Victor Torres is Full-stack developer with 5+ years of experience leading agile teams and building web applications. Currently works with Python and web scraping at Scrapinghub.
Tues 10th Nov 2020
Time: TBC

Want to be Involved?

Register for 2020

Web Data Extraction Summit is organised by Scrapinghub.
Scrapinghub delivers world class web data extraction products and services.
© Web Data Extraction Summit 2020

info@www.extractsummit.io
linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram