#datastructure

brozu ▪️brozu@mastodon.uno
2025-05-14

🧵 Thread on sorting algorithms.

Starting with #StupidSort 🤪, an inefficient algorithm that sorts by randomly shuffling until the list is ordered.

(1/3) ⬇️

#computerscience #algorithm #sortingalgorithm #datastructure #coding #programming #software #softwaredevelopment #bigo #learntocode #codenewbie

Python code snippet demonstrating the Stupid Sort algorithm. The code includes a function that shuffles a list until it is sorted, with an example usage and expected result
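The snippet itself was an image and is not reproduced here. A minimal sketch of the idea described in the post (shuffle until sorted) might look like the following; the names `is_sorted` and `stupid_sort`, and the optional `rng` parameter, are assumptions for illustration, not the author's code.

```python
import random


def is_sorted(items):
    """Return True if items are in non-decreasing order."""
    return all(items[i] <= items[i + 1] for i in range(len(items) - 1))


def stupid_sort(items, rng=None):
    """Shuffle a copy of items until it happens to be sorted.

    Returns the sorted copy and the number of shuffles performed.
    The expected number of shuffles grows factorially with the list
    length, so this is only usable on very short lists.
    """
    rng = rng or random.Random()
    result = list(items)  # work on a copy, leave the input untouched
    shuffles = 0
    while not is_sorted(result):
        rng.shuffle(result)
        shuffles += 1
    return result, shuffles


if __name__ == "__main__":
    ordered, attempts = stupid_sort([3, 1, 2])
    print(ordered, "after", attempts, "shuffles")
```

The shuffle count varies from run to run, which is the point: nothing bounds the worst-case running time.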
2025-05-02

Shoutout to my friend Daniel, who is not on Mastodon. He created an uber-efficient slice implementation for joining/modifying/copying lists at scale. The blog post and GitHub README describe the algorithm. It's an interesting technical read if you have 10-20 minutes of spare time.

daniel.avery.io/writing/fork-j

#code #algorithm #datastructure #random

Statistics GlobeStatisticsGlobe
2025-04-29

Mean imputation is a straightforward method for handling missing values in numerical data, but it can significantly distort the relationships between variables.
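As a rough illustration of that distortion, the sketch below uses made-up data where y is exactly 2x, hides three y values, fills them with the mean of the observed values, and compares the correlation and spread before and after. The data, the choice of hidden positions, and the helper `pearson` are all assumptions for the example.

```python
from statistics import mean, pstdev


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


# Hypothetical complete data: y depends perfectly on x.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y_full = [2 * value for value in x]

# Pretend three y values are missing (positions chosen arbitrarily).
missing = {1, 4, 6}
observed = [y for i, y in enumerate(y_full) if i not in missing]

# Mean imputation: every missing value becomes the observed mean.
fill = mean(observed)
y_imputed = [fill if i in missing else y for i, y in enumerate(y_full)]

r_full = pearson(x, y_full)
r_imputed = pearson(x, y_imputed)
sd_full = pstdev(y_full)
sd_imputed = pstdev(y_imputed)

if __name__ == "__main__":
    print("correlation:", r_full, "->", r_imputed)
    print("std dev:    ", sd_full, "->", sd_imputed)
```

Because the imputed points all sit at the same y value regardless of x, both the correlation with x and the spread of y shrink.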

For a detailed explanation of mean imputation, its drawbacks, and better alternatives, check out my full tutorial here: statisticsglobe.com/mean-imput

More details are available at this link: eepurl.com/gH6myT

Statistics GlobeStatisticsGlobe
2025-04-01

gganimate is a powerful extension for ggplot2 that transforms static visualizations into dynamic animations. By adding a time dimension, it allows you to illustrate trends, changes, and patterns in your data more effectively.

The attached animated visualization, which I created with gganimate, showcases a ranked bar chart of the top 3 countries for each year based on inflation since 1980.

More information: statisticsglobe.com/online-cou

Statistics GlobeStatisticsGlobe
2025-03-25

Visualizing gene structures in R? gggenes, an extension of ggplot2, simplifies the process of creating clear and informative gene diagrams, making genomic data easier to interpret and share.

Visualization: cran.r-project.org/web/package

Click this link for detailed information: statisticsglobe.com/online-cou

Jan :rust: :ferris:janriemer@floss.social
2025-03-22

Red Green Syntax Trees - an Overview | by Will Speak (aka Plingdollar):

willspeak.me/2021/11/24/red-gr

#Parser #Compiler #DataStructure

Jan :rust: :ferris:janriemer@floss.social
2025-03-16

@FizzyOrange

Wow, this crate looks like the most feature-rich tree crate I've ever seen!

It seems very underrated (only ~1000 downloads and one star on GitHub (by me)).

Thank you for the suggestion!😊

#Rust #RustLang #DataStructure #Tree #Algorithms

Karsten Schmidttoxi@mastodon.thi.ng
2025-03-10

#ReleaseMonday — One of the recent (already very useful!) new package additions to #ThingUmbrella is:

thi.ng/leaky-bucket

Leaky buckets are commonly used in communication networks for rate limiting, traffic shaping and bandwidth control, but are equally useful in other domains requiring similar constraints.

A Leaky Bucket is a managed counter with an enforced maximum value (i.e. bucket capacity). The counter is incremented for each new event to check if it can/should be processed. If the bucket capacity has already been reached, the bucket will report an overflow, which we can then handle accordingly (e.g. by dropping or queuing events). The bucket also has a configurable time interval at which the counter is decreased (aka the "leaking" behavior) until it reaches zero again (i.e. until the bucket is empty). Altogether, this setup can be utilized to ensure an average rate, whilst also supporting temporary bursting in a controlled fashion...
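To make the counter-plus-leak behavior concrete, here is a minimal sketch in Python. It is not the thi.ng/leaky-bucket API: the class name `LeakyBucket`, the method `try_add`, and the choice to pass the current time in explicitly (so the example is deterministic) are all assumptions for illustration.

```python
class LeakyBucket:
    """Counter with a maximum value that drains by one per leak interval."""

    def __init__(self, capacity, leak_interval):
        self.capacity = capacity            # maximum counter value
        self.leak_interval = leak_interval  # seconds between leaks
        self.level = 0                      # current counter value
        self.last_leak = 0.0                # time of the last applied leak

    def _leak(self, now):
        # Apply every whole leak interval that has elapsed since last_leak.
        ticks = int((now - self.last_leak) // self.leak_interval)
        if ticks > 0:
            self.level = max(0, self.level - ticks)
            self.last_leak += ticks * self.leak_interval

    def try_add(self, now):
        """Register one event at time `now`.

        Returns True if the event fits in the bucket, or False on
        overflow (the caller decides whether to drop or queue it).
        """
        self._leak(now)
        if self.level >= self.capacity:
            return False
        self.level += 1
        return True


if __name__ == "__main__":
    bucket = LeakyBucket(capacity=2, leak_interval=1.0)
    for t in (0.0, 0.1, 0.2, 1.2, 1.3, 5.0):
        print(t, bucket.try_add(t))
```

A burst of up to `capacity` events is accepted immediately, after which events are only accepted as fast as the bucket leaks, which is how the average rate is enforced.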

Related, I've also updated/simplified the rate limiter interceptor in thi.ng/server to utilize this new package...

#ThingUmbrella #DataStructure #RateLimiting #OpenSource #TypeScript #JavaScript

Statistics GlobeStatisticsGlobe
2025-02-28

I used to think that writing sophisticated R code meant using all the advanced features and chaining long functions together...

Fancy code can be fun, but clean code makes collaboration and debugging so much easier.

Stay informed on data science by joining my free newsletter. Check out this link for more details: eepurl.com/gH6myT

2025-02-15

Ordered map in Go

Omap is a Go package for working with thread-safe ordered maps. The ordered map combines a Go map, a list, and a mutex to provide ordered-map behavior. An ordered map is a map that remembers the order of its elements. It can be iterated to retrieve the elements in the order in which they were added.
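The map-plus-list-plus-mutex layout described above can be sketched as follows. This is an illustration in Python, not the Omap package's API: the class `OrderedMap` and its method names are assumptions. Python's built-in `dict` already preserves insertion order, so the separate key list is kept here only to mirror the described structure.

```python
import threading


class OrderedMap:
    """Thread-safe map that remembers the insertion order of its keys."""

    def __init__(self):
        self._values = {}              # key -> value lookup
        self._order = []               # keys in insertion order
        self._lock = threading.Lock()  # guards both structures

    def set(self, key, value):
        with self._lock:
            if key not in self._values:
                self._order.append(key)  # only new keys extend the order
            self._values[key] = value

    def get(self, key, default=None):
        with self._lock:
            return self._values.get(key, default)

    def delete(self, key):
        with self._lock:
            if key in self._values:
                del self._values[key]
                # O(n) on a Python list; a linked list avoids the scan.
                self._order.remove(key)

    def items(self):
        """Return a snapshot of (key, value) pairs in insertion order."""
        with self._lock:
            return [(key, self._values[key]) for key in self._order]


if __name__ == "__main__":
    m = OrderedMap()
    m.set("b", 1)
    m.set("a", 2)
    m.set("b", 3)  # updating a key keeps its original position
    print(m.items())
```

Returning a snapshot from `items` means callers iterate without holding the lock, at the cost of copying the entries.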

habr.com/ru/articles/882828/

#go #map #caching #datastructure #index #dataprocessing #orderedmap #omap

Statistics GlobeStatisticsGlobe
2025-02-11

In missing data imputation, it is crucial to compare the distributions of imputed values against the observed data to better understand the structure of the imputed values.

The visualization below can be generated using the following R code:

library(mice)
my_imp <- mice(boys)
densityplot(my_imp)

Take a look here for more details: statisticsglobe.com/online-wor

Knowledge Zonekzoneind@mstdn.social
2025-02-07
Statistics GlobeStatisticsGlobe
2025-02-04

Avoiding text overlap in plots is essential for clarity, and R offers a great solution with the ggplot2 and ggrepel packages. By automatically repositioning labels, ggrepel keeps your plot clean and easy to interpret.

Video: youtube.com/watch?v=5lu4h_CPhi0
Website: statisticsglobe.com/avoid-over

Take a look here for more details: statisticsglobe.com/online-cou

naught101naught101
2025-01-30

Is there a data structure that can sensibly handle multiple hierarchical classification systems?

e.g. an Orange, in terms of phylogeny is
Plantae->Eudicot->...->Citrus->sinensis

and in terms of usefulness, is
Thing->Food->fruit->orange
(and it could have multiple parents in this taxonomy, e.g. cleaning product)

Bonus points for cool visualisations of this kind of information.
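One possible shape for this, offered as a sketch rather than an established answer, is a directed acyclic graph whose parent edges are labelled by taxonomy, so an item can sit in several hierarchies and have several parents within one. The class `MultiTaxonomy`, the single shared node name "orange", the direct Eudicot-to-Citrus edge (intermediate ranks left out), and placing "cleaning product" under "Thing" are all assumptions for the example.

```python
from collections import defaultdict


class MultiTaxonomy:
    """Nodes shared across several hierarchies, with parents per hierarchy."""

    def __init__(self):
        # taxonomy name -> child node -> set of parent nodes
        self._parents = defaultdict(lambda: defaultdict(set))

    def add_edge(self, taxonomy, parent, child):
        self._parents[taxonomy][child].add(parent)

    def ancestors(self, taxonomy, node):
        """All nodes reachable by following parent edges in one taxonomy."""
        seen = set()
        stack = [node]
        while stack:
            current = stack.pop()
            for parent in self._parents[taxonomy].get(current, ()):
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        return seen


tax = MultiTaxonomy()

# Phylogeny, simplified: intermediate ranks are omitted.
tax.add_edge("phylogeny", "Plantae", "Eudicot")
tax.add_edge("phylogeny", "Eudicot", "Citrus")
tax.add_edge("phylogeny", "Citrus", "orange")

# Usefulness: "orange" has two parents in this taxonomy.
tax.add_edge("usefulness", "Thing", "Food")
tax.add_edge("usefulness", "Food", "fruit")
tax.add_edge("usefulness", "fruit", "orange")
tax.add_edge("usefulness", "Thing", "cleaning product")
tax.add_edge("usefulness", "cleaning product", "orange")

if __name__ == "__main__":
    print(sorted(tax.ancestors("phylogeny", "orange")))
    print(sorted(tax.ancestors("usefulness", "orange")))
```

Keeping the taxonomy name on each edge means one node identity serves every classification, and queries stay scoped to the hierarchy being asked about.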

Statistics GlobeStatisticsGlobe
2025-01-03

In statistics, Frequentist and Bayesian approaches are two major methods of inference. While they aim to solve similar problems, they differ in their interpretation of probability and handling of uncertainty.

Frequentists interpret probability as the long-run frequency of events. Parameters (like the mean) are fixed but unknown, and inference relies on analyzing repeated samples.

Learn more: eepurl.com/gH6myT

Statistics GlobeStatisticsGlobe
2024-12-31

Bring your visualizations to life with see, a dynamic R package from the easystats ecosystem that extends ggplot2 to create modern and intuitive graphics. Whether you're visualizing statistical models or exploring data, see simplifies the process and enhances the presentation of your insights.

Visualizations: github.com/easystats/see

Take a look here for more details: statisticsglobe.com/online-cou

Statistics GlobeStatisticsGlobe
2024-12-24

Dimensionality reduction simplifies high-dimensional data while retaining its essential features. It’s a powerful tool for improving data analysis, visualization, and machine learning performance.

Image credit to Wikipedia: en.wikipedia.org/wiki/Dimensio

I've developed an in-depth course on PCA theory and its application in R programming. Check out this link for more details: statisticsglobe.com/online-cou

Statistics GlobeStatisticsGlobe
2024-12-13

Understanding the difference between Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) can be challenging!

Visualization source: en.wikipedia.org/wiki/Deep_lea
