Commit Graph

36 Commits

Author SHA1 Message Date
Leonard Excoffier e17a54ba3c feat: advancing on new script 2024-09-10 22:23:14 -04:00
Leonard Excoffier e586c2b202 feat: downloads and extract the zip. 2024-09-10 21:38:43 -04:00
Leonard Excoffier 5e38f46e8f Temp expirimentation, problem with duplicate primary key lowercase uppercase and insert on duplicate, need to find way to bulk insert but still not insert rows that have duplicate. 2024-08-31 22:34:04 -04:00
Leonard Excoffier 162106c8e0 e 2024-08-31 19:50:15 -04:00
Leonard Excoffier fa64f81cb8 clean slate 2024-08-31 19:14:02 -04:00
Leonard Excoffier 91272506e3 feat: drops columns that are not supposed to be there 2024-08-31 18:54:28 -04:00
Leonard Excoffier 3f5f6d5569 fix: remove not null in some places 2024-08-31 18:26:54 -04:00
Leonard Excoffier 84b20e1496 fix: removed NOT NULL specification for non primary keys 2024-08-31 18:26:14 -04:00
Leonard Excoffier 4c9b2e46c1 fix: put the doc type back to TEXT 2024-08-31 18:23:48 -04:00
Leonard Excoffier c9612b35ae fix: added some special fixe that are not taken into account by dataset documentation 2024-08-31 17:18:33 -04:00
Leonard Excoffier 025ba9b913 fix: fixed coreg being a null value issue 2024-08-31 17:17:17 -04:00
Leonard Excoffier 57b13dbec5 feat: based on the sec doc match the sql schema 2024-08-31 16:02:39 -04:00
Leonard Excoffier 00a59b27ce feat: now writes all datasets to db 2024-08-31 15:52:37 -04:00
Leonard Excoffier 015879ef4d feat: added reset block 2024-08-31 15:45:53 -04:00
Leonard Excoffier 3a60674b7c feaet: now cycles through wanted files 2024-08-31 15:41:33 -04:00
Leonard Excoffier f31d901201 refactor: total change, no longer ripping gigabytes but going directly to datasets, must find way to automate download. 2024-08-31 15:32:57 -04:00
Leonard Excoffier b5e0af3ca5 feat: added some helpfull commands 2024-08-31 13:00:04 -04:00
Leonard Excoffier ab8f1fcf8f feat: now use batches but seem to write less rows, not sure 2024-08-31 00:50:17 -04:00
Leonard Excoffier 9c5afa7670 feat: script cycle works 2024-08-31 00:38:50 -04:00
Leonard Excoffier 765272f896 feat: seem to be able to write to a db. 2024-08-31 00:17:37 -04:00
Leonard Excoffier 0fbbb37ef7 feat: design new schema 2024-08-30 19:59:42 -04:00
Leonard Excoffier 71a054e2b6 feat: new script seem to be super efficient 2024-08-30 19:54:37 -04:00
Leonard Excoffier bfa36303af feat: ignored defautl sql file 2024-08-30 18:35:14 -04:00
Leonard Excoffier 6fcbc73a4f fix: correctly adapted script for new primary key 2024-08-30 08:16:05 -04:00
Leonard Excoffier 184f82a688 fix: correctly set primary key for data table 2024-08-30 07:56:02 -04:00
Leonard Excoffier 0f9a42eb7e feat: now checks if cik is already present before writing to db, can be deactivated if want to update data. 2024-08-29 22:04:47 -04:00
Leonard Excoffier a17d73f336 feat: now loops over all the facts jsons 2024-08-29 22:03:05 -04:00
Leonard Excoffier 5946ff73bd feat: script to write info to db seems to work, have no idea how tho, need to generalize with a for loop for all files 2024-08-29 21:51:05 -04:00
Leonard Excoffier 750d33b976 feat: added stuff and corrected the schema creation. 2024-08-29 21:39:57 -04:00
Leonard Excoffier d5d823b290 feat: changed primary key of facts table 2024-08-29 21:35:30 -04:00
Leonard Excoffier 97bd73c4a3 fix: stopped download and extraction of submissions, only keep company facts 2024-08-27 22:11:27 -04:00
Leonard Excoffier 4266e09691 feat: script to output all the labels , initiation to parsing jsons, may use jquery on mac. 2024-08-27 21:58:16 -04:00
Leonard Excoffier ce0152bbba feat: completed db schema 2024-08-27 19:13:07 -04:00
Leonard Excoffier 19a5579fb0 feat: db schema has now a base 2024-08-27 19:09:12 -04:00
Leonard Excoffier c02fea1417 feat: initiated description of db schema based on facts json. 2024-08-26 22:08:28 -04:00
Leonard Excoffier 20a2866ecd batman 2024-08-25 19:52:34 -04:00