2025-04-16 00:10:52 - INFO - Reading input data files...
2025-04-16 00:11:35 - INFO - Input data files read successfully.
2025-04-16 00:11:35 - INFO - Filtering for final presentation transcripts with valid company IDs...
2025-04-16 00:11:37 - INFO - Extracted 'year' from 'mostimportantdateutc'
2025-04-16 00:11:40 - INFO - Sorted transcripts to keep newest version
2025-04-16 00:11:41 - INFO - Dropped duplicate transcripts per event
2025-04-16 00:11:42 - INFO - Filtered events with valid gvkeys. Remaining rows: 479475
2025-04-16 00:11:42 - INFO - Number of unique transcript IDs: 466985
2025-04-16 00:11:42 - INFO - Querying analyst data from database...
2025-04-16 00:12:53 - INFO - Retrieved 7985448 analyst records from database
2025-04-16 00:12:56 - INFO - Merged analyst data with event data and dropped duplicate analyst-transcript observations. Remaining rows: 2238573 out of 8205955
2025-04-16 00:13:02 - INFO - Merged analyst/company names and cleaned up temp variables
2025-04-16 00:13:03 - INFO - Created 'companyofperson' and dropped redundant columns
2025-04-16 00:13:04 - INFO - Reduced to unique firm-analyst-year observations. Remaining rows: 1144977 out of 2908172
2025-04-16 00:13:04 - INFO - Filtered out unidentified analysts. Remaining rows: 1095149
2025-04-16 00:13:07 - INFO - Saved CIQAnalystFirmYear.pkl with 1095149 rows and 9 columns
2025-04-16 00:13:08 - INFO - Saved CIQAnalysts.csv with 136436 rows and 3 columns
2025-04-16 00:13:08 - INFO - Saved CIQBrokers.csv with 16718 rows and 1 columns
