BODACC Scraper to Google Sheets
About this video
Automated BODACC Data Collection
This video showcases an application designed for the automated collection of judicial recovery announcements published on the Official Bulletin of Civil and Commercial Announcements (BODACC). The tool extracts key information and exports it directly to a Google Sheet.
Precise Announcement Filtering
The application provides detailed filters to target relevant data. Users can specify the period of search (e.g., 7 days), workforce size (e.g., 20 to 49 employees), NACE code (economic activities like transport, construction), and company type (all).
Extraction Process and Logging
Once the filters are defined, the user initiates the scraping process. The "Execution logs" section displays the real-time progress, including the number of identified companies, details of each announcement (SIREN number, company name, etc.), and pauses due to rate limiting.
Data Export and Visualization
The collected data is then organized and automatically exported into a Google Sheets spreadsheet. This structured sheet contains columns such as company name, SIREN number, publication date, procedure type, NACE code, workforce, legal form, city, and postal code, allowing for easy analysis and efficient information management.
Tags
- Scraping de données
- BODACC
- Redressements judiciaires
- Google Sheets
- Automation
- API INSEE