Back to portfolio

BODACC Scraper to Google Sheets

2
0

About this video

Automated BODACC Data Collection

This video showcases an application designed for the automated collection of judicial recovery announcements published on the Official Bulletin of Civil and Commercial Announcements (BODACC). The tool extracts key information and exports it directly to a Google Sheet.

Precise Announcement Filtering

The application provides detailed filters to target relevant data. Users can specify the period of search (e.g., 7 days), workforce size (e.g., 20 to 49 employees), NACE code (economic activities like transport, construction), and company type (all).

Extraction Process and Logging

Once the filters are defined, the user initiates the scraping process. The "Execution logs" section displays the real-time progress, including the number of identified companies, details of each announcement (SIREN number, company name, etc.), and pauses due to rate limiting.

Data Export and Visualization

The collected data is then organized and automatically exported into a Google Sheets spreadsheet. This structured sheet contains columns such as company name, SIREN number, publication date, procedure type, NACE code, workforce, legal form, city, and postal code, allowing for easy analysis and efficient information management.

Tags

  • Scraping de données
  • BODACC
  • Redressements judiciaires
  • Google Sheets
  • Automation
  • API INSEE