The Web Scraping Club(@webscrapingclub) 's Twitter Profile Photo

Do you want to talk about in a fireside chat with me? Or do you want to see someone in particular in these videos? Write here in the comments!

Do you want to talk about #webscraping in a fireside chat with me? Or do you want to see someone in particular in these videos? Write here in the comments!
account_circle
Rik 🦭(@rikvk01) 's Twitter Profile Photo

NEW DEMO 💥

Insert website URL -> Get Python scraper 👀

For those who are into webscraping, check it.

App generates a Python script that scrapes your target website, in seconds. Scrape away 🫡

account_circle
ozymandias(@ozymandias_py) 's Twitter Profile Photo

Estruturei as ausências e presenças de deputados federais em sessões oficiais do Congresso com webscraping e NoSQL e criei uma API para distribuir esses dados com Python

Link para os dados atualizados no próximo tweet (você pode utilizar para fazer suas próprias análises)

Estruturei as ausências e presenças de deputados federais em sessões oficiais do Congresso com webscraping e NoSQL e criei uma API para distribuir esses dados com Python

Link para os dados atualizados no próximo tweet (você pode utilizar para fazer suas próprias análises)
account_circle
Ruben Galicia 🦊(@RubenGaliciaB) 's Twitter Profile Photo

Minería de datos de la página del INE ⛏️⛏️🗳️, geolocalización de las casillas electorales.
Código en Python con librerías de para hacer el webscraping y para generar un csv con toda la información.
Todo el proyecto lo subo a GitHub

account_circle
epctex(@epctex) 's Twitter Profile Photo

Adopt an 'automation-first' mindset. Always ask, 'Can this process be automated?' You'll be surprised how often the answer is yes.

Adopt an 'automation-first' mindset. Always ask, 'Can this process be automated?' You'll be surprised how often the answer is yes. #automation #webscraping #epctex
account_circle
🌿 lithos(@lithos_graphein) 's Twitter Profile Photo

So if you do webscraping you may have noticed that when Google updates their browser it breaks the selenium and undetected chrome python calls. I noticed my site was running thin on updates jn and sure enough, all the sites using selenium (not a simple headless call) are running

So if you do webscraping you may have noticed that when Google updates their browser it breaks the selenium and undetected chrome python calls. I noticed my site was running thin on updates jn and sure enough, all the sites using selenium (not a simple headless call) are running
account_circle
Julian von der Goltz(@jhvdgoltz) 's Twitter Profile Photo

So Cloudflare has Turnstile and all sorts of tools to prevent webscraping but also has a headless browser API to do... webscraping???

So Cloudflare has Turnstile and all sorts of tools to prevent webscraping but also has a headless browser API to do... webscraping???
account_circle
Fabricio Lennart Flores Ledezma(@FabricioLennart) 's Twitter Profile Photo

Día 68 del reto 📊 ¡Ya logré extraer todo el HTML de la web que quiero escrapear! 💻 Ahora me falta seleccionar la tabla de esa web y escoger sus columnas y filas para poder convertirla en un dataframe de Python. 👇🏼

Día 68 del reto #365DayOfData 📊 ¡Ya logré extraer todo el HTML de la web que quiero escrapear! 💻 Ahora me falta seleccionar la tabla de esa web y escoger sus columnas y filas para poder convertirla en un dataframe de Python. 👇🏼 #WebScraping
account_circle
Jérémy De Campos(@jeremy_D_C) 's Twitter Profile Photo

🚨🚨VideoDuMercredi🚨🚨
La vidéo sur et vient de sortir :)
Pousser le webscraping sur au plus haut niveau avec ;)

Et j'ai changer le setup du son, je trouve ça mieux ! Dites moi ce que vous en pensez ;)

Vidéo → youtube.com/watch?v=Cr8y2v…

🚨🚨VideoDuMercredi🚨🚨
La vidéo sur #n8n et #puppeteer vient de sortir :)
Pousser le webscraping sur #n8n au plus haut niveau avec #puppeteer ;)

Et j'ai changer le setup du son, je trouve ça mieux ! Dites moi ce que vous en pensez ;)

Vidéo → youtube.com/watch?v=Cr8y2v…
account_circle
bankole Collins(@OCHHQ) 's Twitter Profile Photo

Day 48 continued: Delved into Selenium WebDriver to navigate and interact with elements on a website. Leveraging its capabilities to find and select elements, paving the way for enhanced web scraping projects.

account_circle
Kalyan KS(@kalyan_kpl) 's Twitter Profile Photo

𝐒𝐜𝐫𝐚𝐩𝐞𝐆𝐫𝐚𝐩𝐡𝐀𝐈 - 𝐖𝐞𝐛𝐬𝐜𝐫𝐚𝐩𝐢𝐧𝐠 𝐮𝐬𝐢𝐧𝐠 𝐋𝐋𝐌𝐬

ScrapeGraphAI is a LLM-based python library for webscraping.

This library uses and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.).

This library

𝐒𝐜𝐫𝐚𝐩𝐞𝐆𝐫𝐚𝐩𝐡𝐀𝐈 - 𝐖𝐞𝐛𝐬𝐜𝐫𝐚𝐩𝐢𝐧𝐠 𝐮𝐬𝐢𝐧𝐠 𝐋𝐋𝐌𝐬

ScrapeGraphAI is a LLM-based python library for webscraping.

This library uses and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.).

This library
account_circle
bankole Collins(@OCHHQ) 's Twitter Profile Photo

Day 46: Started working on scraping data from the Billboard Top 100 using Beautiful Soup. Excited to dive into this project! 🎵💻

account_circle
Max Mynter(@MaxMynter) 's Twitter Profile Photo

There must be some people using LLMs/seq2seq models for webscraping, right?

Should be doable even with BERT-sized models.

There must be some people using LLMs/seq2seq models for webscraping, right?

Should be doable even with BERT-sized models.
account_circle
Emmanuel Ejeagha(@Emma_Ejeagha) 's Twitter Profile Photo

Day 88 of and Day 62 of :

- Completed web scraping project using BeautifulSoup module
- Scraped a Nigerian job website
- Extracted the job title, description, and date posted
- Stored the extracted data in text files

account_circle
Micha(el) Bladowski 🇩🇪 🇺🇦(@michabbb) 's Twitter Profile Photo

this might be interesting 🤔

18:00 - 18:30 MESZ - Beyond IP Bans & CAPTCHAs: Advanced Techniques for Unblocking Difficult Websites Delve into the latest challenges posed by advanced anti-bot technologies, and.....



buff.ly/3U0Ja6o

this might be interesting 🤔 

18:00 - 18:30 MESZ - Beyond IP Bans & CAPTCHAs: Advanced Techniques for Unblocking Difficult Websites Delve into the latest challenges posed by advanced anti-bot technologies, and.....

#webscraping #scraping

buff.ly/3U0Ja6o
account_circle