Lmst

🧵 Thread on sorting algorithms.

Starting with #StupidSort 🤪, an inefficient algorithm that sorts by randomly shuffling until the list is ordered.

(1/3) ⬇️

#computerscience #algorithm #sortingalgorithm #datastructure #coding #programming #software #softwaredevelopment #bigo #learntocode #codenewbie

Python code snippet demonstrating the Stupid Sort algorithm. The code includes a function that shuffles a list until it is sorted, with an example usage and expected result

Shoutout to my friend Daniel, who is not on mastodon. He created an uber efficient slice implementation for joining/modifying/copying lists at scale. The blogpost and github README describe the algorithm. It's an interesting technical read if you have 10-20 minutes of spare time.

https://daniel.avery.io/writing/fork-join-data-structures

#code #algorithm #datastructure #random

Mean imputation is a straightforward method for handling missing values in numerical data, but it can significantly distort the relationships between variables.

For a detailed explanation of mean imputation, its drawbacks, and better alternatives, check out my full tutorial here: https://statisticsglobe.com/mean-imputation-for-missing-data/

More details are available at this link: http://eepurl.com/gH6myT

#research #datastructure #businessanalyst #data

Designing an Efficient Tree Index on Disaggregated Memory

https://cacm.acm.org/research-highlights/designing-an-efficient-tree-index-on-disaggregated-memory/

#computing #datastructure #algorithm #software

gganimate is a powerful extension for ggplot2 that transforms static visualizations into dynamic animations. By adding a time dimension, it allows you to illustrate trends, changes, and patterns in your data more effectively.

The attached animated visualization, which I created with gganimate, showcases a ranked bar chart of the top 3 countries for each year based on inflation since 1980.

More information: https://statisticsglobe.com/online-course-data-visualization-ggplot2-r

#datastructure #datavisualization #tidyverse #ggplot2

Visualizing gene structures in R? gggenes, an extension of ggplot2, simplifies the process of creating clear and informative gene diagrams, making genomic data easier to interpret and share.

Visualization: https://cran.r-project.org/web/packages/gggenes/vignettes/introduction-to-gggenes.html

Click this link for detailed information: https://statisticsglobe.com/online-course-data-visualization-ggplot2-r

#datastructure #datavisualization #dataanalytics #data #tidyverse #datascientists #ggplot2

Red Green Syntax Trees - an Overview | by Will Speak (aka Plingdollar):

https://willspeak.me/2021/11/24/red-green-syntax-trees-an-overview.html

#Parser #Compiler #DataStructure

@FizzyOrange

Wow, this crate looks like the most feature-rich tree crate I've ever seen!

It seems very underrated (only ~1000 downloads and one star on GitHub (by me)).

Thank you for the suggestion!😊

#Rust #RustLang #DataStructure #Tree #Algorithms

#ReleaseMonday — One of the recent (already very useful!) new package additions to #ThingUmbrella is:

https://thi.ng/leaky-bucket

Leaky buckets are commonly used in communication networks for rate limiting, traffic shaping and bandwidth control, but are equally useful in other domains requiring similar constraints.

A Leaky Bucket is a managed counter with an enforced maximum value (i.e. bucket capacity). The counter is incremented for each a new event to check if it can/should be processed. If the bucket capacity has already been reached, the bucket will report an overflow, which we can then handle accordingly (e.g. by dropping or queuing events). The bucket also has a configurable time interval at which the counter is decreasing (aka the "leaking" behavior) until it reaches zero again (i.e. until the bucket is empty). Altogether, this setup can be utilized to ensure both an average rate, whilst also supporting temporary bursting in a controlled fashion...

Related, I've also updated/simplified the rate limiter interceptor in https://thi.ng/server to utilize this new package...

#ThingUmbrella #DataStructure #RateLimiting #OpenSource #TypeScript #JavaScript

#Development #Guides
Bloom filter · What they are and why they are so powerful https://ilo.im/162mpl

_____
#Programming #Coding #BloomFilter #HashTable #DataStructure #JavaScript #Database #WebDev #Frontend #Backend

I used to think that writing sophisticated R code meant using all the advanced features and chaining long functions together...

Fancy code can be fun, but clean code makes collaboration and debugging so much easier.

Stay informed on data science by joining my free newsletter. Check out this link for more details: http://eepurl.com/gH6myT

#datastructure #datasciencecourse #datasciencetraining

Ordered map на Go

Omap — это пакет Golang для работы с потокобезопасными упорядоченными map. Упорядоченная map содержит map golang, list и mutex для выполнения функций упорядоченной map. Упорядоченная map— это map, которая запоминает порядок элементов. Map можно итерировать для извлечения элементов в том порядке, в котором они были добавлены.

https://habr.com/ru/articles/882828/

#go #map #caching #datastructure #index #dataprocessing #orderedmap #omap

In missing data imputation, it is crucial to compare the distributions of imputed values against the observed data to better understand the structure of the imputed values.

The visualization below can be generated using the following R code:

library(mice)
my_imp <- mice(boys)
densityplot(my_imp)

Take a look here for more details: https://statisticsglobe.com/online-workshop-missing-data-imputation-r

#datastructure #statisticalanalysis #dataanalytics #visualanalytics #pythoncoding #package #datavisualization #datascience

#ITByte: Algorithms and data structures are central to #ComputerScience.

Here is a quick refresher on #DataStructure and #Algorithms. #CS101

https://knowledgezone.co.in/trends/explorer?topic=Data-Structure-Algorithms

Avoiding text overlap in plots is essential for clarity, and R offers a great solution with the ggplot2 and ggrepel packages. By automatically repositioning labels, ggrepel keeps your plot clean and easy to interpret.

Video: https://www.youtube.com/watch?v=5lu4h_CPhi0
Website: https://statisticsglobe.com/avoid-overlap-text-labels-ggplot2-plot-r

Take a look here for more details: https://statisticsglobe.com/online-course-data-visualization-ggplot2-r

#pythonprogramminglanguage #statisticalanalysis #datascience #datastructure #package #rstudio

Is there a data structure that can sensibly handle multiple hierarchical classification systems?

e.g. an Orange, in terms of phylogeny is
Plantae->Eudicot->...->Citrus->sinensis

and in terms of usefulness, is
Thing->Food->fruit->orange
(and it could have multiple parents in this taxonomy, e.g. cleaning product)

Bonus points for cool visualisations of this kind information.

#data #dataScience #dataStructure #information #hierarchy #taxonomy #classification #visualisation #dataViz

In statistics, Frequentist and Bayesian approaches are two major methods of inference. While they aim to solve similar problems, they differ in their interpretation of probability and handling of uncertainty.

Frequentists interpret probability as the long-run frequency of events. Parameters (like the mean) are fixed but unknown, and inference relies on analyzing repeated samples.

Learn more: http://eepurl.com/gH6myT

#datascience #datavisualization #datastructure #bigdata #rstats #analysisskill

Bring your visualizations to life with see, a dynamic R package from the easystats ecosystem that extends ggplot2 to create modern and intuitive graphics. Whether you're visualizing statistical models or exploring data, see simplifies the process and enhances the presentation of your insights.

Visualizations: https://github.com/easystats/see

Take a look here for more details: https://statisticsglobe.com/online-course-data-visualization-ggplot2-r

#datastructure #rprogramming #tidyverse #coding

Dimensionality reduction simplifies high-dimensional data while retaining its essential features. It’s a powerful tool for improving data analysis, visualization, and machine learning performance.

Image credit to Wikipedia: https://en.wikipedia.org/wiki/Dimensionality_reduction#/media/File:PCA_Projection_Illustration.gif

I've developed an in-depth course on PCA theory and its application in R programming. Check out this link for more details: https://statisticsglobe.com/online-course-pca-theory-application-r

#rstudio #datastructure #programming #package #statistical #bigdata

Understanding the difference between Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) can be challenging!

Visualization source: https://en.wikipedia.org/wiki/Deep_learning#/media/File:AI-ML-DL.svg

#database #datastructure #datascience #dataanalytic

#datastructure

Client Info