PinnedPublished inTowards Data SciencePolars: Pandas DataFrame but Much FasterPerform multithreaded, optimized pandas operationsJan 3, 20235Jan 3, 20235
Published inTowards Data ScienceClass Imbalance Strategies — A Visual Guide with CodeUnderstand Random Undersampling, Oversampling, SMOTE, ADASYN, and Tomek LinksApr 24, 2023Apr 24, 2023
Published inDataDrivenInvestor60 ChatGPT Prompts for Data Science (Tried, Tested, and Rated)Automate data science tasks with ChatGPTApr 11, 202329Apr 11, 202329
Published inTowards AICleanlab: Correct your data labels automatically and quicklyData-centric AI without manually relabeling your dataJan 11, 20235Jan 11, 20235
Published inTowards AILazypredict: Run All Sklearn Algorithms With a Line Of CodeHow to (and why you shouldn’t) use itDec 26, 20221Dec 26, 20221
Published inTowards Data ScienceConvert Jupyter Notebooks into FunctionsParameterize notebooks so you can programmatically run themDec 19, 20223Dec 19, 20223
Published inTowards Data Science4x Faster Pandas Operations with Minimal Code ChangeStop waiting on pandas operations. Parallelize them.Dec 13, 20221Dec 13, 20221
Published inTowards Data ScienceUsing an Out-of-Core Approach to Process Large DatasetsFaster big-data analysis workflows with an open-source libraryDec 9, 20221Dec 9, 20221
Published inTowards Data ScienceUnit Testing for Data Science with PythonCatch mistakes early with nose2 and parameterized testsOct 25, 20221Oct 25, 20221
Published inAnalytics VidhyaAutomate Your Machine Learning Training Process with TPOTStop rewriting the same code for model selection and hyper-parameter searchNov 10, 20212Nov 10, 20212