PinnedPublished inTDS ArchivePolars: Pandas DataFrame but Much FasterPerform multithreaded, optimized pandas operationsJan 3, 2023A response icon5Jan 3, 2023A response icon5
Published inTDS ArchiveClass Imbalance Strategies — A Visual Guide with CodeUnderstand Random Undersampling, Oversampling, SMOTE, ADASYN, and Tomek LinksApr 24, 2023Apr 24, 2023
Published inDataDrivenInvestor60 ChatGPT Prompts for Data Science (Tried, Tested, and Rated)Automate data science tasks with ChatGPTApr 11, 2023A response icon28Apr 11, 2023A response icon28
Published inTowards AICleanlab: Correct your data labels automatically and quicklyData-centric AI without manually relabeling your dataJan 11, 2023A response icon5Jan 11, 2023A response icon5
Published inTowards AILazypredict: Run All Sklearn Algorithms With a Line Of CodeHow to (and why you shouldn’t) use itDec 26, 2022A response icon1Dec 26, 2022A response icon1
Published inTDS ArchiveConvert Jupyter Notebooks into FunctionsParameterize notebooks so you can programmatically run themDec 19, 2022A response icon3Dec 19, 2022A response icon3
Published inTDS Archive4x Faster Pandas Operations with Minimal Code ChangeStop waiting on pandas operations. Parallelize them.Dec 13, 2022A response icon1Dec 13, 2022A response icon1
Published inTDS ArchiveUsing an Out-of-Core Approach to Process Large DatasetsFaster big-data analysis workflows with an open-source libraryDec 9, 2022A response icon1Dec 9, 2022A response icon1
Published inTDS ArchiveUnit Testing for Data Science with PythonCatch mistakes early with nose2 and parameterized testsOct 25, 2022A response icon1Oct 25, 2022A response icon1
Published inAnalytics VidhyaAutomate Your Machine Learning Training Process with TPOTStop rewriting the same code for model selection and hyper-parameter searchNov 10, 2021A response icon2Nov 10, 2021A response icon2