Lmst

🚀 Cập nhật sklearn‑diagnose: thư viện Python “máy MRI” cho mô hình ML giờ đã có chatbot tương tác! Bạn có thể trò chuyện với LLM để hỏi “Tại sao mô hình overfit?” hoặc nhận code mẫu, nhớ ngữ cảnh và khám phá sâu hơn. Giao diện React chạy locally trong trình duyệt. Đừng quên star repo! #MachineLearning #ML #AI #Python #sklearn #CôngNghệ #TríTuệNhânTạo #MLdiagnose

https://www.reddit.com/r/LocalLLaMA/comments/1qr5804/update_sklearndiagnose_now_has_an_interactive/

Компрессор для данных или как я написал свой первый custom transformer

Эта статья будет полезна DS специалистам, и тем, кто хоть когда-нибудь сталкивался с такой проблемой, как выбросы в данных или OOD (out of distribution), и ищет пути решения проблем, возникающих из-за них.

https://habr.com/ru/articles/988736/

#выбросы #анализ_данных #data_science #preprocessing #compression #outliner #custom_transformer #transformer #sklearn

От «обезьяньей» работы к Smart-анализу: как выполнить предобработку данных для моделей

От «обезьяньей» работы к Smart-анализу: как правильно готовить данные для моделей. Что такое Exploratory Data Analysis и как избежать основных ошибок при его выполнении.

https://habr.com/ru/articles/975082/

#pandas #sklearn #data_science #exploratory_data_analysis #machine_learning #numpy #statistics #feature_engineering

Clasificación SVM de 2 clases

Máquinas de Vectores de Soporte con kernel lineal

Se muestra:
- Puntos de entrenamiento
- Puntos de prueba
- vector de soporte
- Hiperplano
- Margen

#python #ML #sklearn

Clasificación de solo dos características de iris:
- longitud y ancho del sépalo
- Algoritmo de k vecinos

#python #sklearn curso ML Aprendiaje Automatico #Anzoategui #Lecheria

Agrupación de estados meteorológicos:
- Agrupamiento Kmeans 3 grupos
- Se puede separar Heavy Rain de los otros 2 grupos

#python Aprendizaje Automatico #ML Guanacaste Software Abierto Libre #sklearn Compressed Sparse Row Matrix Lecheria #anzoategui

Crear una matriz de confusión a partir de los resultados del experimento
Evaluar los resultados del modelo

#Python #sklearn confusion software soberania Guanacaste #Flisol

Scikit-learn теперь умеет в пайплайны: что изменилось и как работать с библиотекой в 2025 году

Scikit-learn — это одна из основных Python-библиотек для машинного обучения. Её подключают в прикладных проектах, AutoML-системах и учебных курсах — как базовый инструмент для работы с моделями. Даже если вы давно пишете на PyTorch или CatBoost, в задачах с табличными данными, скорее всего, всё ещё вызываете fit , predict , score — через sklearn. В 2025 году в библиотеку добавили несколько важных обновлений: доработали работу с пайплайнами, подключили полную поддержку pandas API, упростили контроль за экспериментами. Мы подготовили гайд, как работать со scikit-learn в 2025 году. Новичкам он поможет собрать первую ML-задачу — с данными, моделью и метриками. А тем, кто уже использует библиотеку, — освежить знания и понять, что изменилось в новых версиях. Почитать гайд →

https://habr.com/ru/companies/netologyru/articles/911216/

#scikitlearn #sklearn #пайплайн #python #pandas #машинное_обучение #machine_learning #ml #классификация #регрессия

@data @datadon 🧵

Redressing #Bias: "Correlation Constraints for Regression Models":
Treder et al (2021) https://doi.org/10.3389/fpsyt.2021.615754

#dataDev #linearRegression #modeling #probability #probabilities #statistics #stats #modelling #regression #correctionRatio #skLearn #scikitLearn #python #AIDev

Scikit Flow #skflow has been moved to @TensorFlo https://goo.gl/WvpO79 and will be maintained there! #deeperlearning #datascience #sklearn

"Feature importance helps in understanding which features contribute most to the prediction"

A few lines with #sklearn: https://mljourney.com/sklearn-linear-regression-feature-importance/

#interpretability #explainability #AIethics #compliance #taxonomy #ethicalAI #AIevaluation #linearRegression #featureEngineering

@datadon

#Lasso #LinearRegression "is useful in some contexts due to its tendency to prefer solutions with fewer non-zero coefficients, effectively reducing the number of features upon which the given solution is dependent"

https://scikit-learn.org/stable/modules/linear_model.html#lasso 🧵

#dataDev #AIDev #ML #sklearn #python #interpretability

I'm playing with the California Housing dataset built into sklearn.

One census block group has an average number of bedrooms per household of 0.83 and an average number of household members of 1243.

Huh?

#DataScience #python #sklearn

I just did my first project using the #mlflow library to track metrics on iterations of manual tuning of an #sklearn pipeline, it works great and gives me some idea of the search space before moving into automated hyperparameter tuning.

I am using it in a super basic way, as an alternative to creating a gazillion cells with comments tracking metrics, does anyone have any favorite features to check out for taking mlflow to the next level?
#machinelearning #python #MLOps #scikitlearn

[Перевод] Линейная регрессия и её регуляризация в Scikit-learn

Создание модели линейной регрессии относится к задачам обучения с учителем, цель которых — предсказать значение непрерывной зависимой переменной (y) на основе набора признаков (X). Одним из ключевых допущений любой модели линейной регрессии является предположение, что зависимая переменная (y) в некоторой степени линейно зависит от независимых переменных (Xi). Это означает, что мы можем оценить значение y, используя математическое выражение:

https://habr.com/ru/articles/850168/

#python #машинное_обучение #линейная_регрессия #для_начинающих #руководство #туториал #machine_learning #data_science #регуляризация #sklearn

Our molpipeline paper is out: https://pubs.acs.org/doi/10.1021/acs.jcim.4c00863

The presented code (https://github.com/basf/MolPipeline) integrates #RDKit functionality in #sklearn like objects, allowing to chain multiple steps in a single pipeline. Pipelines can even include ML models, allowing to obtain predictions directly from SMILES strings.

I genuinely miss PyMC2. The #PyMC and #Arviz APIs changes so frequently, that it's impossible to know what the standard approach to anything is.

#Bayesian #Statistics in #Python should be easy.

To be honest, I'd really like a well maintained #SkLearn module for it.

Uhm... if I get a decision tree like the one shown in the picture, does it mean that I only need the columns shown in the tree for training and validation, right? I would only need the columns 2 and 3 (x[2], x[3]), isn't it? Or am I missing something else?

#Sklearn #MachineLearning #ML #DecisionTree

While tackling a Kaggle competition for mushroom classification (to eat or not to eat? 🍄 ), I implemented Classifier Stacking. My blog post explores how combining various models and a meta-learner led to better results, with some trade-offs in computation time.

Combining diverse models can enhance overall performance, at the cost of calculation time.

https://www.briaslab.fr/blog/?action=view&url=combining-models-for-better-predictions-a-guide-to-stacking-in-machine-learning

#MachineLearning #Stacking #Kaggle #sklearn

#LinearRegression #Python #Sklearn
Dive into predictive modeling with our comprehensive guide on linear regression using Python and sklearn. Learn step-by-step implementation, result interpretation, and data visualization techniques. Perfect for beginners

https://teguhteja.id/mastering-linear-regression-with-python-and-sklearn-a-step-by-step-guide/

#SKLearn

Client Info