├── .gitattributes ├── LICENSE ├── README.md └── assets ├── BML@2x-704x454.png ├── empty.ipynb └── image /.gitattributes: -------------------------------------------------------------------------------- 1 | # Auto detect text files and perform LF normalization 2 | * text=auto 3 | -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- 1 | Creative Commons Legal Code 2 | 3 | CC0 1.0 Universal 4 | 5 | CREATIVE COMMONS CORPORATION IS NOT A LAW FIRM AND DOES NOT PROVIDE 6 | LEGAL SERVICES. DISTRIBUTION OF THIS DOCUMENT DOES NOT CREATE AN 7 | ATTORNEY-CLIENT RELATIONSHIP. CREATIVE COMMONS PROVIDES THIS 8 | INFORMATION ON AN "AS-IS" BASIS. CREATIVE COMMONS MAKES NO WARRANTIES 9 | REGARDING THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS 10 | PROVIDED HEREUNDER, AND DISCLAIMS LIABILITY FOR DAMAGES RESULTING FROM 11 | THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS PROVIDED 12 | HEREUNDER. 13 | 14 | Statement of Purpose 15 | 16 | The laws of most jurisdictions throughout the world automatically confer 17 | exclusive Copyright and Related Rights (defined below) upon the creator 18 | and subsequent owner(s) (each and all, an "owner") of an original work of 19 | authorship and/or a database (each, a "Work"). 20 | 21 | Certain owners wish to permanently relinquish those rights to a Work for 22 | the purpose of contributing to a commons of creative, cultural and 23 | scientific works ("Commons") that the public can reliably and without fear 24 | of later claims of infringement build upon, modify, incorporate in other 25 | works, reuse and redistribute as freely as possible in any form whatsoever 26 | and for any purposes, including without limitation commercial purposes. 27 | These owners may contribute to the Commons to promote the ideal of a free 28 | culture and the further production of creative, cultural and scientific 29 | works, or to gain reputation or greater distribution for their Work in 30 | part through the use and efforts of others. 31 | 32 | For these and/or other purposes and motivations, and without any 33 | expectation of additional consideration or compensation, the person 34 | associating CC0 with a Work (the "Affirmer"), to the extent that he or she 35 | is an owner of Copyright and Related Rights in the Work, voluntarily 36 | elects to apply CC0 to the Work and publicly distribute the Work under its 37 | terms, with knowledge of his or her Copyright and Related Rights in the 38 | Work and the meaning and intended legal effect of CC0 on those rights. 39 | 40 | 1. Copyright and Related Rights. A Work made available under CC0 may be 41 | protected by copyright and related or neighboring rights ("Copyright and 42 | Related Rights"). Copyright and Related Rights include, but are not 43 | limited to, the following: 44 | 45 | i. the right to reproduce, adapt, distribute, perform, display, 46 | communicate, and translate a Work; 47 | ii. moral rights retained by the original author(s) and/or performer(s); 48 | iii. publicity and privacy rights pertaining to a person's image or 49 | likeness depicted in a Work; 50 | iv. rights protecting against unfair competition in regards to a Work, 51 | subject to the limitations in paragraph 4(a), below; 52 | v. rights protecting the extraction, dissemination, use and reuse of data 53 | in a Work; 54 | vi. database rights (such as those arising under Directive 96/9/EC of the 55 | European Parliament and of the Council of 11 March 1996 on the legal 56 | protection of databases, and under any national implementation 57 | thereof, including any amended or successor version of such 58 | directive); and 59 | vii. other similar, equivalent or corresponding rights throughout the 60 | world based on applicable law or treaty, and any national 61 | implementations thereof. 62 | 63 | 2. Waiver. To the greatest extent permitted by, but not in contravention 64 | of, applicable law, Affirmer hereby overtly, fully, permanently, 65 | irrevocably and unconditionally waives, abandons, and surrenders all of 66 | Affirmer's Copyright and Related Rights and associated claims and causes 67 | of action, whether now known or unknown (including existing as well as 68 | future claims and causes of action), in the Work (i) in all territories 69 | worldwide, (ii) for the maximum duration provided by applicable law or 70 | treaty (including future time extensions), (iii) in any current or future 71 | medium and for any number of copies, and (iv) for any purpose whatsoever, 72 | including without limitation commercial, advertising or promotional 73 | purposes (the "Waiver"). Affirmer makes the Waiver for the benefit of each 74 | member of the public at large and to the detriment of Affirmer's heirs and 75 | successors, fully intending that such Waiver shall not be subject to 76 | revocation, rescission, cancellation, termination, or any other legal or 77 | equitable action to disrupt the quiet enjoyment of the Work by the public 78 | as contemplated by Affirmer's express Statement of Purpose. 79 | 80 | 3. Public License Fallback. Should any part of the Waiver for any reason 81 | be judged legally invalid or ineffective under applicable law, then the 82 | Waiver shall be preserved to the maximum extent permitted taking into 83 | account Affirmer's express Statement of Purpose. In addition, to the 84 | extent the Waiver is so judged Affirmer hereby grants to each affected 85 | person a royalty-free, non transferable, non sublicensable, non exclusive, 86 | irrevocable and unconditional license to exercise Affirmer's Copyright and 87 | Related Rights in the Work (i) in all territories worldwide, (ii) for the 88 | maximum duration provided by applicable law or treaty (including future 89 | time extensions), (iii) in any current or future medium and for any number 90 | of copies, and (iv) for any purpose whatsoever, including without 91 | limitation commercial, advertising or promotional purposes (the 92 | "License"). The License shall be deemed effective as of the date CC0 was 93 | applied by Affirmer to the Work. Should any part of the License for any 94 | reason be judged legally invalid or ineffective under applicable law, such 95 | partial invalidity or ineffectiveness shall not invalidate the remainder 96 | of the License, and in such case Affirmer hereby affirms that he or she 97 | will not (i) exercise any of his or her remaining Copyright and Related 98 | Rights in the Work or (ii) assert any associated claims and causes of 99 | action with respect to the Work, in either case contrary to Affirmer's 100 | express Statement of Purpose. 101 | 102 | 4. Limitations and Disclaimers. 103 | 104 | a. No trademark or patent rights held by Affirmer are waived, abandoned, 105 | surrendered, licensed or otherwise affected by this document. 106 | b. Affirmer offers the Work as-is and makes no representations or 107 | warranties of any kind concerning the Work, express, implied, 108 | statutory or otherwise, including without limitation warranties of 109 | title, merchantability, fitness for a particular purpose, non 110 | infringement, or the absence of latent or other defects, accuracy, or 111 | the present or absence of errors, whether or not discoverable, all to 112 | the greatest extent permissible under applicable law. 113 | c. Affirmer disclaims responsibility for clearing rights of other persons 114 | that may apply to the Work or any use thereof, including without 115 | limitation any person's Copyright and Related Rights in the Work. 116 | Further, Affirmer disclaims responsibility for obtaining any necessary 117 | consents, permissions or other rights required for any use of the 118 | Work. 119 | d. Affirmer understands and acknowledges that Creative Commons is not a 120 | party to this document and has no duty or obligation with respect to 121 | this CC0 or use of the Work. 122 | 123 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Business Machine Learning and Data Science Applications 2 | 3 | --------- 4 | 5 | 6 | ## 🌟 We Are Growing! 7 | 8 | We're seeking to collaborate with motivated, independent PhD graduates or doctoral students on approximately seven new projects in 2024. If you’re interested in contributing to cutting-edge investment insights and data analysis, please get in touch! This could be in colaboration with a university or as independent study. 9 | 10 | ![image](https://github.com/user-attachments/assets/da97663a-b63f-4286-94cc-fcd168905109) 11 | 12 | 13 | ### 🚀 About Sov.ai 14 | 15 | Sov.ai is at the forefront of integrating advanced machine learning techniques with financial data analysis to revolutionize investment strategies. We are working with **three of the top 10** quantitative hedge funds, and with many mid-sized and boutique firms. 16 | 17 | Our platform leverages diverse data sources and innovative algorithms to deliver actionable insights that drive smarter investment decisions. 18 | 19 | By joining Sov.ai, you'll be part of a dynamic research team dedicated to pushing the boundaries of what's possible in finance through technology. Before expressing your interest, please be aware that the research will be predominantly challenging and experimental in nature. 20 | 21 | 22 | ### 🔍 Research and Project Opportunities 23 | 24 | We offer a wide range of projects that cater to various interests and expertise within machine learning and finance. Some of the exciting recent projects include: 25 | 26 | - **Predictive Modeling with GitHub Logs:** Develop models to predict market trends and investment opportunities using GitHub activity and developer data. 27 | - **Satallite Data Analysis:** Explore non-traditional data sources such as social media sentiment, satellite imagery, or web traffic to enhance financial forecasting. 28 | - **Data Imputation Techniques:** Investigate new methods for handling missing or incomplete data to improve the robustness and accuracy of our models. 29 | 30 | Please visit [docs.sov.ai](https://docs.sov.ai) for more information on public projects that have made it into the subscription product. If you already have a corporate sponsor, we are also happy to work with them. 31 | 32 | ### 🌐 Why Join Sov.ai? 33 | 34 | - **Innovative Environment:** Engage with the latest technologies and methodologies in machine learning and finance. 35 | - **Collaborative Team:** Work alongside a team of experts passionate about driving innovation in investment insights. 36 | - **Flexible Projects:** Tailor your research to align with your interests and expertise, with the freedom to explore new ideas. 37 | - **Experienced Researchers:** Experts previously from NYU, Columbia, Oxford-Man Institute, Alan Turing Institute, and Cambridge. 38 | - **Post Research:** Connect with alumni that has moved on to DRW, Citadel Securities, Virtu Financial, Akuna Capital, HRT. 39 | 40 | 41 | ### 🤝 How to Apply 42 | 43 | If you’re excited about leveraging your expertise in machine learning and finance to drive impactful research and projects, we’d love to hear from you! Please reach out to us at [research@sov.ai](mailto:research@sov.ai) with your resume and a brief description of your research interests. 44 | 45 | Join us in shaping the future of investment insights and making a meaningful impact in the world of finance! 46 | 47 | 48 | 49 | 50 | ## Table of Contents 51 | 52 | ### Department Applications 53 | 54 | 55 | - [Accounting](#accounting) 56 | - [Machine Learning](#accounting-ml) 57 | - [Analytics](#accounting-analytics) 58 | - [Textual Analysis](#accounting-text) 59 | - [Data](#accounting-data) 60 | - [Research and Articles](#accounting-ra) 61 | - [Websites](#accounting-web) 62 | - [Courses](#accounting-course) 63 | - [Customer](#customer) 64 | - [Lifetime Value](#customer-clv) 65 | - [Segmentation](#customer-seg) 66 | - [Behaviour](#customer-behave) 67 | - [Recommender](#customer-rec) 68 | - [Churn Prediction](#customer-cp) 69 | - [Sentiment](#customer-sent) 70 | - [Employee](#employee) 71 | - [Management](#employee-man) 72 | - [Performance](#employee-perf) 73 | - [Turnover](#employee-general-turn) 74 | - [Conversations](#employee-con) 75 | - [Physical](#employee-ph) 76 | - [Legal](#legal) 77 | - [Tools](#legal-tools) 78 | - [Policy and Regulatory](#legal-pr) 79 | - [Judicial](#legal-judicial) 80 | - [Management](#management) 81 | - [Strategy](#management-strat) 82 | - [Decision Optimisation](#management-do) 83 | - [Causal Inference](#management-causal) 84 | - [Statistics](#management-stat) 85 | - [Quantitative](#management-quant) 86 | - [Data](#management-data) 87 | - [Operations](#operations) 88 | - [Failures and Anomalies](#operations-fail) 89 | - [Load and Capacity Management](#operations-load) 90 | - [Prediction Management](#operations-predict) 91 | 92 | 93 | 94 | #### Also see [Python Business Analytics](https://github.com/firmai/python-business-analytics) 95 | 96 | 97 | ## Accounting 98 | 99 | 100 | #### Machine Learning 101 | * [Chart of Account Prediction](https://github.com/agdgovsg/ml-coa-charging ) - Using labeled data to suggest the account name for every transaction. 102 | * [Accounting Anomalies](https://github.com/GitiHubi/deepAI/blob/master/GTC_2018_Lab-solutions.ipynb) - Using deep-learning frameworks to identify accounting anomalies. 103 | * [Financial Statement Anomalies](https://github.com/rameshcalamur/fin-stmt-anom) - Detecting anomalies before filing, using R. 104 | * [Useful Life Prediction (FirmAI)](http://www.firmai.org/documents/Aged%20Debtors/) - Predict the useful life of assets using sensor observations and feature engineering. 105 | * [AI Applied to XBRL](https://github.com/Niels-Peter/XBRL-AI) - Standardized representation of XBRL into AI and Machine learning. 106 | 107 | 108 | #### Analytics 109 | 110 | * [Forensic Accounting](https://github.com/mschermann/forensic_accounting) - Collection of case studies on forensic accounting using data analysis. On the lookout for more data to practise forensic accounting, *please get in [touch](https://github.com/mschermann/)* 111 | * [General Ledger (FirmAI)](http://www.firmai.org/documents/General%20Ledger/) - Data processing over a general ledger as exported through an accounting system. 112 | * [Bullet Graph (FirmAI)](http://www.firmai.org/documents/Bullet-Graph-Article/) - Bullet graph visualisation helpful for tracking sales, commission and other performance. 113 | * [Aged Debtors (FirmAI)](http://www.firmai.org/documents/Aged%20Debtors/) - Example analysis to invetigate aged debtors. 114 | * [Automated FS XBRL](https://github.com/CharlesHoffmanCPA/charleshoffmanCPA.github.io) - XML Language, however, possibly port analysis into Python. 115 | 116 | 117 | #### Textual Analysis 118 | 119 | * [Financial Sentiment Analysis](https://github.com/EricHe98/Financial-Statements-Text-Analysis) - Sentiment, distance and proportion analysis for trading signals. 120 | * [Extensive NLP](https://github.com/TiesdeKok/Python_NLP_Tutorial/blob/master/NLP_Notebook.ipynb) - Comprehensive NLP techniques for accounting research. 121 | 122 | 123 | #### Data, Parsing and APIs 124 | 125 | * [EDGAR](https://github.com/TiesdeKok/UW_Python_Camp/blob/master/Materials/Session_5/EDGAR_walkthrough.ipynb) - A walk-through in how to obtain EDGAR data. 126 | * [IRS](http://social-metrics.org/sox/) - Acessing and parsing IRS filings. 127 | * [Financial Corporate](http://raw.rutgers.edu/Corporate%20Financial%20Data.html) - Rutgers corporate financial datasets. 128 | * [Non-financial Corporate](http://raw.rutgers.edu/Non-Financial%20Corporate%20Data.html) - Rutgers non-financial corporate dataset. 129 | * [PDF Parsing](https://github.com/danshorstein/python4cpas/blob/master/03_parsing_pdf_files/AR%20Aging%20-%20working.ipynb) - Extracting useful data from PDF documents. 130 | * [PDF Tabel to Excel](https://github.com/danshorstein/ficpa_article) - How to output an excel file from a PDF. 131 | 132 | 133 | #### Research And Articles 134 | 135 | * [Understanding Accounting Analytics](http://social-metrics.org/accountinganalytics/) - An article that tackles the importance of accounting analytics. 136 | * [VLFeat](http://www.vlfeat.org/) - VLFeat is an open and portable library of computer vision algorithms, which has Matlab toolbox. 137 | 138 | 139 | #### Websites 140 | 141 | * [Rutgers Raw](http://raw.rutgers.edu/) - Good digital accounting research from Rutgers. 142 | 143 | 144 | #### Courses 145 | 146 | * [Computer Augmented Accounting](https://www.youtube.com/playlist?list=PLauepKFT6DK8TaNaq_SqZW4LIDJhCkZe2) - A video series from Rutgers University looking at the use of computation to improve accounting. 147 | * [Accounting in a Digital Era](https://www.youtube.com/playlist?list=PLauepKFT6DK8_Xun584UQPPsg1grYkWw0) - Another series by Rutgers investigating the effects the digital age will have on accounting. 148 | 149 | 150 | ## Customer 151 | 152 | 153 | #### Lifetime Value 154 | * [Pareto/NBD Model](https://github.com/zabahana/Customer-LifeTime-Value-Analysis/blob/master/CLV%20Analysis.ipynb) - Calculate the CLV using a Pareto/NBD model. 155 | * [Cohort Analysis](https://github.com/iris9112/Customer-Segmentation/blob/master/Chapter1-Cohort_Analysis.ipynb) - Cohort analysis to group customers into mutually exclusive cohorts measured over time. 156 | 157 | 158 | 159 | #### Segmentation 160 | 161 | * [E-commerce](https://github.com/jalajthanaki/Customer_segmentation/blob/master/Cust_segmentation_online_retail.ipynb ) - E-commerce customer segmentation. 162 | * [Groceries](https://github.com/harry329/CustomerFinding/blob/master/customer_segments.ipynb ) - Segmentation for grocery customers. 163 | * [Online Retailer](https://github.com/Vinayak02/CustomerCentricRetail/blob/master/CustomerSegmentation/Customer_Segmentation_Online_Retail.ipynb) - Online retailer segmentation. 164 | * [Bank](https://github.com/Mogbo/Customer-Clustering-Segmentation-Challenge) - Bank customer segmentation. 165 | * [Wholesale](https://github.com/SyedAdilAli93/Identifying-Customers/blob/master/customer_segments.ipynb) - Clustering of wholesale customers. 166 | * [Various](https://github.com/abalaji-blr/CustomerSegments/tree/master/deliver ) - Multiple types of segmentation and clustering techniques. 167 | 168 | 169 | 170 | #### Behaviour 171 | 172 | * [RNN](https://github.com/DaniSanchezSantolaya/RNN-customer-behavior/tree/master/src) - Investigating customer behaviour over time with sequential analysis using an RNN model. 173 | * [Neural Net](https://github.com/Vinayak02/CustomerCentricRetail/blob/master/DemandForecasting/NeuralNetworks.ipynb) - Demand forecasting using artificial neural networks. 174 | * [Temporal Analytics](https://github.com/riccotti/CustomerTemporalRegularities) - Investigating customer temporal regularities. 175 | * [POS Analytics](https://github.com/IBM/customer_pos_analytics/blob/master/code/Customer%20Ranking%20POS%20wip.ipynb) - Analytics driven customer behaviour ranking for retail promotions using POS data. 176 | * [Wholesale Customer](https://github.com/kralmachine/WholesaleCustomerAnalysis/blob/master/WhosaleCustomerAnalysis.ipynb) - Wholesale customer exploratory data analysis. 177 | * [RFM](https://github.com/espin086/customer_growth/blob/master/rfm/rfm.ipynb) - Doing a RFM (recency, frequency, monetary) analysis. 178 | * [Returns Behaviour](https://github.com/adarsh2111/Customer-Returns-Analysis-Customer-Fraud-Detection-/blob/master/Returns%20Analysis.ipynb) - Predicting total returns and fraudulent returns. 179 | * [Visits](https://github.com/Ryanfras/Customer-Visits/blob/master/Customer%20Visits.ipynb) - Predicting which day of week a customer will visit. 180 | * [Bank: Next Purchase](https://github.com/albertcdc/Project_CAJAMAR) - A project to predict bank customers' most probable next purchase. 181 | * [Bank: Customer Prediction](https://github.com/rohangawade/Predicting-Target-customers-for-Bank-Policy-subscribtion-using-Logistic_Regression_Transparency) - Predicting Target customers who will subscribe the new policy of the bank. 182 | * [Next Purchase](https://github.com/Featuretools/predict-next-purchase) - Predict a customers’ next purchase also using feature engineering. 183 | * [Customer Purchase Repeats](https://github.com/kpei/Customer-Analytics/blob/master/customer_zakka.ipynb) - Using the lifetimes python library and real jewellery retailer data analyse customer repeat purchases. 184 | * [AB Testing](https://github.com/sushant2811/customerAnalyticsWithA-BTesting/blob/master/customerAnalyticsWithA-BTesting.ipynb) - Find the best KPI and do A/B testing. 185 | * [Customer Survey (FirmAI)](http://www.firmai.org/documents/Customer%20Survey/) - Example of parsing and analysing a customer survey. 186 | * [Happiness](https://github.com/rohit6205/predictHappiness/blob/master/predictingHapiness.ipynb) - Analysing customer happiness from hotel stays using reviews. 187 | * [Miscellaneous Customer Analytics](https://github.com/mapr-demos/customer360) - Various tools and techniques for customer analysis. 188 | 189 | 190 | 191 | #### Recommender 192 | 193 | * [Recommendation](https://github.com/annalucia1/Customer-Behavior-Analysis-Recommendation/blob/master/recomendation_by_RatingScore.ipynb) - Recommend the songs that a customer on a music app would prefer listening to. 194 | * [General Recommender](https://github.com/Vinayak02/CustomerCentricRetail/blob/master/RecommenderSystem/Recommender.ipynb) - Identifying which products to recommend to which customers. 195 | * [Collaborative Filtering](https://github.com/codeBehindMe/CustomerIntelligence/blob/master/CollaborativeFiltering.ipynb ) - Customer recommendation using collaborative filtering. 196 | * [Up-selling (FirmAI)](http://www.firmai.org/documents/Expected%20Value%20Business%20Model%20Performance/ ) - Analysis to identify up-selling opportunities. 197 | 198 | 199 | 200 | #### Churn Prediction 201 | 202 | * [Ride Sharing](https://github.com/MSopranoInTech/Churn-prediction/blob/master/Churn%20Prediction.ipynb) - Identify customer churn rates in order to target customers for retention campaigns. 203 | * [KKDBox I](https://github.com/naomifridman/Deep-VAE-prediction-of-churn-customer) - Variational deep autoencoder to predict churn customer 204 | * [KKDBox II](https://github.com/Featuretools/predict-customer-churn) - A three step customer churn prediction framework using feature engineering. 205 | * [Personal Finance](https://github.com/smit5490/CustomerChurn) - Predict customer subscription churn for a personal finance business. 206 | * [ANN](https://github.com/AgarwalGeeks/customer-churn-Analysis/blob/master/ANN.ipynb) - Churn analysis using artificial neural networks. 207 | * [Bike](http://www.firmai.org/documents/Customer%20Segmentation/) - Customer bike churn analysis. 208 | * [Cost Sensitive](https://nbviewer.jupyter.org/github/albahnsen/ML_RiskManagement/blob/master/exercises/10_CS_Churn.ipynb) - Cost sensitive churn analysis drivenby economic performance. 209 | 210 | 211 | #### Sentiment 212 | 213 | * [Topic Modelling](https://github.com/Chrisjw42/ZLSurveyAnalysis) - Topic modelling on a corpus of customer surveys from the VR industry. 214 | * [Customer Satisfaction](https://github.com/BoulderDataScience/kaggle-santander) - Predict customer satisfaction using Kaggle data. 215 | 216 | 217 | ## Employee 218 | 219 | 220 | #### Management 221 | * [Personality Prediction ](https://github.com/jcl132/personality-prediction-from-text) - Predict Big 5 Personality from text. 222 | * [Salary Prediction Resume](https://github.com/Artifelse/Prediction-salary-on-the-base-of-the-resume/blob/master/NLP.ipynb) - Textual analyses over resume to predict appropriate salary [Project Disappeared, still a cool idea] 223 | * [Employee Review Analysis](https://github.com/jackyip1/Indeed-Reviews/blob/master/Python%20scripts/Indeed%20-%20Main.ipynb) - Review analytics for top 50 retail companies on Indeed. 224 | * [Diversity Analysis](https://github.com/mtfaye/Employee-Diversity-in-Tech/blob/master/Data%20Viz%20Special%20Edition.ipynb) - A simple analysis of gender and race disparity in the tech industry. 225 | * [Occupation Prediction](https://github.com/RashmiSingh24/OccuptionPrediction/blob/master/BurningGlass.ipynb) - Predict the likelihood that an occupation is analytical. 226 | 227 | 228 | 229 | #### Performance 230 | * [Training Hours Performance](https://github.com/niqueerdo/MLpredictemployeedevelopment/blob/master/Working%20Ntbk_MODELS_Clustering.ipynb) - The impact of training ours on employee performance. 231 | * [Promotion Prediction](https://github.com/AbinSingh/Employee-Promotion-Prediction/blob/master/Employee_Promotion_Prediction.ipynb) - Analysing promotion patterns. 232 | * [Employee Attendance prediction](https://github.com/lokesh1233/Employee_Attendance/tree/master/notebooks) - Various tools to predict employee attendance. 233 | 234 | 235 | 236 | #### Turnover 237 | * [Early Leaving Employees](https://github.com/anushuk/ML-Human-Resources-Analytics/blob/master/Human%20Resources%20Analytics.ipynb ) - Identifying why the best and most experienced employees leaving prematurely. 238 | * [Employee Turnover](https://github.com/randylaosat/Predicting-Employee-Turnover-Complete-Guide-Analysis/blob/master/HR%20Analytics%20Employee%20Turnover/HR_Analytics_Report.ipynb) - Identifying factors associated with employee turnover. 239 | 240 | 241 | 242 | #### Conversations 243 | * [Slack Communication Analysis](https://github.com/stiebels/slack_nlp/blob/master/Slack%20Analytics.ipynb) - Producing meaningful visualisations from slack conversations. 244 | * [Employee Relationships from Conversations ](https://github.com/yuwie10/cultivate) - Identifying employee relationships from emails for improved HR analytics. 245 | * [Categorise Employee Requests](https://github.com/denizn/Request-classification-via-TFIDF) - Classifying employee requests via TFDIF Vectorizer and RandomForestClassifier. 246 | 247 | 248 | 249 | #### Physical 250 | * [Employee Face Recognition](https://github.com/ckarthic/Face-Recognition) - A face recognition implementation. 251 | * [Attendance Management System](https://github.com/mrsaicharan1/face-rec-a) - An attendance management system using face recognition. 252 | 253 | 254 | ## Legal 255 | 256 | 257 | 258 | #### Tools 259 | * [LexPredict](https://github.com/LexPredict/lexpredict-contraxsuite ) - Software package and library. 260 | * [AI Para-legal](https://github.com/davidawad/lobe) - Lobe is the world's first AI paralegal. 261 | * [Legal Entity Detection](https://github.com/hockeyjudson/Legal-Entity-Detection/blob/master/Dataset_conv.ipynb) - NER For Legal Documents. 262 | * [Legal Case Summarisation](https://github.com/Law-AI/summarization) - Implementation of different summarisation algorithms applied to legal case judgements. 263 | * [Legal Documents Google Scholar](https://github.com/GirrajMaheshwari/Web-scrapping-/blob/master/Google_scholar%2BExtract%2Bcase%2Bdocument.ipynb ) - Using Google scholar to extract cases programatically. 264 | * [Chat Bot](https://github.com/akarazeev/LegalTech) - Chat-bot and email notifications. 265 | 266 | 267 | 268 | #### Policy and Regulatory 269 | * [GDPR scores](https://github.com/erickjtorres/AI_LegalDoc_Hackathon) - Predicting GDPR Scores for Legal Documents. 270 | * [Driving Factors FINRA](https://github.com/siddhantmaharana/text-analysis-on-FINRA-docs) - Identify the driving factors that influence the FINRA arbitration decisions. 271 | * [Securities Bias Correction](https://github.com/davidsontheath/bias_corrected_estimators/blob/master/bias_corrected_estimators.ipynb ) - Bias-Corrected Estimation of Price Impact in Securities Litigation. 272 | * [Public Firm to Legal Decision](https://github.com/anshu3769/FirmEmbeddings) - Embed public firms based on their reaction to legal decisions. 273 | 274 | 275 | 276 | #### Judicial Applied 277 | * [Supreme Court Prediction](https://github.com/davidmasse/US-supreme-court-prediction) - Predicting the ideological direction of Supreme Court decisions: ensemble vs. unified case-based model. 278 | * [Supreme Court Topic Modeling](https://github.com/AccelAI/AI-Law-Minicourse/tree/master/Supreme_Court_Topic_Modeling) - Multiple steps necessary to implement topic modeling on supreme court decisions. 279 | * [Judge Opinion](https://github.com/GirrajMaheshwari/Legal-Analytics-project---Court-misclassification) - Using text mining and machine learning to analyze judges’ opinions for a particular concern. 280 | * [ML Law Matching](https://github.com/whs2k/GPO-AI) - A machine learning law match maker. 281 | * [Bert Multi-label Classification](https://github.com/brightmart/sentiment_analysis_fine_grain) - Fine Grained Sentiment Analysis from AI. 282 | * [Some Computational AI Course](https://www.youtube.com/channel/UC5UHm2J9pbEZmWl97z_0hZw) - Video series Law MIT. 283 | 284 | 285 | ## Management 286 | 287 | 288 | #### Strategy 289 | * [Topic Model Reviews](https://github.com/chrisjcc/DataInsight/blob/master/Topic_Analysis/Topic_modeling_Amazon_Reviews.ipynb) - Amazon reviews for product development. 290 | * [Patents](https://github.com/agdal1125/patent_analysis) - Forecasting strategy using patents. 291 | * [Networks](https://github.com/JohnAnthonyBowllan/BusinessAI/blob/master/DataAnalysis_FeatureEngineering/businessCommunitiesMethod.ipynb) - Business categories from Yelp reviews using networks can help to identify pockets of demand. 292 | * [Company Clustering](https://github.com/DistrictDataLabs/company-clustering) - Hierarchical clusters and topics from companies by extracting information from their descriptions on their websites 293 | * [Marketing Management](https://github.com/Jiseong-Michael-Yang/Marketing-Management) - Programmatic marketing management. 294 | 295 | 296 | 297 | 298 | #### Decision Optimisation 299 | * [Constraint Learning](https://github.com/abrahami/Constraint-Learning) - Machine learning that takes into account constraints. 300 | * [Fairlearn](https://github.com/Microsoft/fairlearn) - I think it is called cost-sensitive machine learning. 301 | * [Multi-label Classification](https://github.com/ej0cl6/csmlc) - Cost-Sensitive Multi-Label Classification 302 | * [Multi-class Classification](https://github.com/david-cortes/costsensitive) - Cost-sensitive multi-class classification (Weighted-All-Pairs, Filter-Tree & others) 303 | * [CostCla](http://albahnsen.github.io/CostSensitiveClassification/) - Costcla is a Python module for cost-sensitive machine learning (classification) built on top of Scikit-Learn 304 | * [DEA Software](https://araith.github.io/pyDEA/) - pyDEA is a software package developed in Python for conducting data envelopment analysis (DEA). 305 | * [Covering Set (FirmAI)](http://www.firmai.org/documents/Covering%20Set/) - Constraint programming analysis. 306 | * [Insurance (FirmAI)](http://www.firmai.org/documents/Insurance/) - CP Insurance analysis. 307 | * [Machine Learning + CP (FirmAI)](http://www.firmai.org/documents/MachineLearningand%20Optimisation/) - Machine Learning + Optimisation. 308 | * [Post Office (FirmAI)](http://www.firmai.org/documents/Post%20Office/) - Post Office optimisation. 309 | * [Soda - CP (FirmAI)](http://www.firmai.org/documents/soda_promotion-adapted-cp/) - Constraint Programming + ML. 310 | * [Soda - Knapsack (FirmAI)](http://www.firmai.org/documents/soda_promotion-adapted-knapsack/) - Knapsack algorithm + ML. 311 | * [Soda - MLP (FirmAI)](http://www.firmai.org/documents/soda_promotion-adapted-mip/) - MLP analysis + ML. 312 | 313 | 314 | #### Casual Inference 315 | * [Marketing AB Testing](https://github.com/chrisjcc/DataInsight/tree/master/ABtesting) - A/B Testing Experiment. 316 | * [Legal Studies](https://github.com/Akesari12/Intro_Causal_Inference) - Instrumental and discontinuity causal approach. 317 | * [A-B Test Result (FirmAI)](http://www.firmai.org/documents/Analyze_ab_test_results/) - Initial A-B Results. 318 | * [Causal Regression (FirmAI) ](http://www.firmai.org/documents/causal_regression/) - Regression technique for causal estimate. 319 | * [Frequentist vs Bayesian A-B Test (FirmAI)](http://www.firmai.org/documents/frequentist-bayesian-ab-testing/) - Comparison between frequentist and bayesian A-B testing. 320 | * [A-B Test Power Analysis (FirmAI)](http://www.firmai.org/documents/Power%20analysis%20for%20AB%20tests/) - Sample size estimation to match testing power. 321 | * [Variance Reduction A-B test (FirmAI)](http://www.firmai.org/documents/variance-reduction/) - Techniques to reduce variance in A-B tests. 322 | 323 | 324 | 325 | #### Statistics 326 | * [Various](https://github.com/khanhnamle1994/applied-machine-learning/tree/master/Statistics) - Various applies statistical solutions 327 | 328 | 329 | 330 | #### Quantitative 331 | * [Applied RL](https://github.com/mimoralea/applied-reinforcement-learning) - Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks 332 | * [Process Mining](https://github.com/yesanton/Process-Sequence-Prediction-with-A-priori-knowledge) - Leveraging A-priori Knowledge in Predictive Business Process Monitoring 333 | * [TS Forecasting](https://github.com/khanhnamle1994/applied-machine-learning/tree/master/Time-Series-Forecasting) - Time series forecasting for important business applications. 334 | 335 | #### 336 | 337 | 338 | #### Data 339 | * [Web Scraping (FirmAI)](www.firmai.org/data/) - Web scraping solutions for Facebook, Glassdoor, Instagram, Morningstar, Similarweb, Yelp, Spyfu, Linkedin, Angellist. 340 | 341 | 342 | 343 | ## Operations 344 | 345 | 346 | #### Failure and Anomalies 347 | * [Anomalies](https://github.com/yzhao062/anomaly-detection-resources) - Anomaly detection resources. 348 | * [Intrusion Detection](https://nbviewer.jupyter.org/github/albahnsen/ML_SecurityInformatics/blob/master/exercises/05-IntrusionDetection.ipynb) - Detecting network intrusions. 349 | * [APS Failure](https://github.com/Nisarg9795/Anomaly-Detection-APS-failures-in-Scania-trucks/blob/master/1_LR_Final_Code.py ), [Data](https://archive.ics.uci.edu/ml/datasets/APS+Failure+at+Scania+Trucks) - Investigating APS failures in Scania trucks. 350 | * [Hardware Failure](https://github.com/AbertayMachineLearningGroup/machine-learning-SIEM-water-infrastructure) - Using different machine learning techniques in detecting anomalies. 351 | * [Anomaly KIs](https://github.com/haowen-xu/donut),[Paper](https://arxiv.org/abs/1802.03903) - Anomaly detection algorithm for seasonal KPIs. 352 | 353 | 354 | #### Load and Capacity Management 355 | * [House Load Energy](https://github.com/giorgosfatouros/Appliances-Energy-Load-Prediction) - Linear, SVR and Random Forest models to predict house's appliances energy Load. 356 | * [Uber Load Management](https://github.com/brianallen131/Uber-Predictive-Load-Management) - Uber predictive load management. 357 | * [Capacity Management](https://github.com/nerdiejack/capacity_management/blob/master/notebooks/MyWebshopAssignmentWithSolution.ipynb) - Investigating IT stability issues are caused by capacity constraints. 358 | * [Bike Sharing](https://github.com/chrisjcc/DataInsight/blob/master/DataChallenge/BikeShare_Challenge.ipynb) - XGBRegressor, RandomForestRegressor, GradientBoostingRegressor combined with feature selection. 359 | * [Airline Fleet Segmentation](http://htmlpreview.github.io/?https://github.com/atul-shukla-INSEAD/GroupProjectBDA/blob/master/GroupProject.html) - Analysis of Delta airlines. 360 | * [Airbnb](http://inseaddataanalytics.github.io/INSEADAnalytics/groupprojects/AirbnbReport2016Jan.html) - Airbnb Booking Analysis. 361 | 362 | 363 | 364 | #### Prediction Management 365 | * [Dispute Prediction](https://github.com/zhanghaizhen/Financial-Service-Complaint-Management/tree/master/ipynb) - Financial service complaint management. 366 | * [Fight Delay Prediction](https://github.com/cavaunpeu/flight-delays/blob/master/notebooks/flight-prediction.ipynb) - Transfer learning for flight-delay prediction via variational autoencoders in Keras. 367 | * [Electric Fault Prediction](https://github.com/susano0/Electric-Fault-Prediction/blob/master/Fault_pred.ipynb) - Predict tripping at grid stations by applying simple machine learning algorithms. 368 | * [Popularity Prediction in R](https://github.com/s-mishra/featuredriven-hawkes/blob/master/code/marked_hawkes_point_process.ipynb) - Marked Hawkes Point Process . 369 | -------------------------------------------------------------------------------- /assets/BML@2x-704x454.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/firmai/business-machine-learning/daebdd691a25ba648dfbc2a39a464b42543466a1/assets/BML@2x-704x454.png -------------------------------------------------------------------------------- /assets/empty.ipynb: -------------------------------------------------------------------------------- 1 | { 2 | "nbformat": 4, 3 | "nbformat_minor": 0, 4 | "metadata": { 5 | "colab": { 6 | "name": "empty.ipynb", 7 | "provenance": [] 8 | }, 9 | "kernelspec": { 10 | "name": "python3", 11 | "display_name": "Python 3" 12 | } 13 | }, 14 | "cells": [ 15 | { 16 | "cell_type": "code", 17 | "metadata": { 18 | "id": "TFSfpbfOfrbg", 19 | "colab_type": "code", 20 | "colab": {} 21 | }, 22 | "source": [ 23 | "" 24 | ], 25 | "execution_count": 0, 26 | "outputs": [] 27 | } 28 | ] 29 | } -------------------------------------------------------------------------------- /assets/image: -------------------------------------------------------------------------------- 1 | 2 | --------------------------------------------------------------------------------