Pinned
Travis Tang in Towards Data Science
Polars: Pandas DataFrame but Much Faster
Perform multithreaded, optimized pandas operations
11 min read · Jan 3, 2023

Travis Tang in Towards Data Science
Class Imbalance Strategies — A Visual Guide with Code
Understand Random Undersampling, Oversampling, SMOTE, ADASYN, and Tomek Links
13 min read · Apr 24, 2023

Travis Tang in DataDrivenInvestor
60 ChatGPT Prompts for Data Science (Tried, Tested, and Rated)
Automate data science tasks with ChatGPT
27 min read · Apr 11, 2023

Travis Tang in Towards AI
Cleanlab: Correct your data labels automatically and quickly
Data-centric AI without manually relabeling your data
8 min read · Jan 11, 2023

Travis Tang in Towards AI
Lazypredict: Run All Sklearn Algorithms With a Line Of Code
How to (and why you shouldn’t) use it
10 min read · Dec 26, 2022

Travis Tang in Towards Data Science
Convert Jupyter Notebooks into Functions
Parameterize notebooks so you can programmatically run them
7 min read · Dec 19, 2022

Travis Tang in Towards Data Science
4x Faster Pandas Operations with Minimal Code Change
Stop waiting on pandas operations. Parallelize them.
6 min read · Dec 13, 2022

Travis Tang in Towards Data Science
Using an Out-of-Core Approach to Process Large Datasets
Faster big-data analysis workflows with an open-source library
5 min read · Dec 9, 2022

Travis Tang in Towards Data Science
Unit Testing for Data Science with Python
Catch mistakes early with nose2 and parameterized tests
6 min read · Oct 25, 2022

Travis Tang in Analytics Vidhya
Automate Your Machine Learning Training Process with TPOT
Stop rewriting the same code for model selection and hyper-parameter search
6 min read · Nov 10, 2021