#Datascience

Classification based on Topological Data Analysis

Classification based on Topological Data Analysis

Topological Data Analysis (TDA) is an emergent field that aims to discover topological information hidden in a dataset. TDA tools have been commonly used to create filters and topological descriptors to improve Machine Learning (ML) methods. This paper proposes an algorithm that applies TDA directly to multi-class classification problems, even imbalanced datasets, without any further ML stage

Hugging Face datasets

Hugging Face datasets

One-line dataloaders for many public datasets & Efficient data pre-processing

R0: dde004c79ac901067ab1189ea01b8ac7-Data Science: A First Introduction

Data Science: A First Introduction

The book is structured so that learners spend the first four chapters learning how to use the R programming language and Jupyter notebooks to load, wrangle/clean, and visualize data, while answering descriptive and exploratory data analysis questions. The remaining chapters illustrate how to solve four common problems in data science, which are useful for answering predictive and inferential data analysis questions[…]

R0_fe33488e78e8d3bac711f1ffb6ea5a48-Bayesian-Data-Analysis-course

Bayesian Data Analysis: book & course

This book is intended to have three roles and to serve three associated audiences: an introductory text on Bayesian inference starting from first principles, a graduate text on effective current approaches to Bayesian modeling and computation in statistics and related fields, and a handbook of Bayesian methods in applied statistics for general users of and researchers in applied statistics. Although introductory in its early sections, the book is definitely not elementary in the sense of a first text in statistics

https://editorialia.com/wp-content/uploads/2020/09/tidy-modeling-with-r.jpg

Tidy Modeling with R

This book provides an introduction to how to use our software to create models. We focus on a dialect of R called the tidyverse that is designed to be a better interface for common tasks using R. If you’ve never heard of or used the tidyverse, Chapter 2 provides an introduction. In this book, we demonstrate how the tidyverse can be used to produce high quality models. The tools used to do this are referred to as the tidymodels packages

https://editorialia.com/wp-content/uploads/2020/09/by-danny-friedman-machine-learning-from-scratch-1.jpg

Machine Learning from scratch (by Danny Friedman)

This book covers the building blocks of the most common methods in machine learning. This set of methods is like a toolbox for machine learning engineers. Those entering the field of machine learning should feel comfortable with this toolbox so they have the right tool for a variety of tasks.

https://editorialia.com/wp-content/uploads/2020/08/ieee-use-casee28093criteria-for-addressing-ethical-challenges-in-transparency-accountability-and-privacy-of-cta_ctt.jpg

IEEE Use Case–Criteria for Addressing Ethical Challenges in Transparency, Accountability, and Privacy of CTA/CTT

There are substantial public health benefits gained through successfully alerting individuals and relevant public health institutions of a person’s exposure to a communicable disease. Contact tracing techniques have been applied to epidemiology for centuries, traditionally involving a manual process of interview and follow-up. This is time-consuming, difficult, and dangerous work. Manual processes are also open to incomplete information because they rely on individuals being willing and able to remember and report all contact possibilities.

https://editorialia.com/wp-content/uploads/2020/07/mastering-shiny.jpg

Mastering Shiny

This book complements Shiny’s online documentation and is intended to help app authors develop a deeper understanding of Shiny. After reading this book, you’ll be able to write apps that have more customized UI, more maintainable code, and better performance and scalability.

https://editorialia.com/wp-content/uploads/2020/06/the-art-of-machine-learning-algorithms-data-r.jpg

The Art of Machine Learning (Algorithms + Data + R)

I wrote this book because: • ML is not a recipe. It is not a matter of knowing the syntax and mechanics of various software packages.• ML is an art, not a science. (Hence the title of this book). • One does not have to be a math whiz or know advanced math in orer to use ML effectively, but one does need to understand the concepts well — the Why? and How? of ML methods

https://editorialia.com/wp-content/uploads/2020/06/best-practives-in-dataviz_-an-r-perspective.jpg

Best Practices in Dataviz: An R Perspective

By the end of this you will have had a whirlwind tour of the very tip of the data visualization best-practices iceberg. We will go over a broad range of topics generally applicable to data science usecases but not dive too deep into any single one. One thing to keep in mind the whole time is none of this is absolutely set in stone, most often in the real world you have to bend or break some of these rules to do what you want.