Federated Learning: Issues in Medical Application

29 May 2022 | 9 January 2022

This presentation briefly reviews the open issues that keep federated learning from being reliably useful in the real world: data and system heterogeneity, client management, traceability, and security. We also introduce the modular federated learning framework we are currently developing to experiment with techniques and protocols that address these issues. The framework will be made public once development is complete.

Scientific Visualization: Python + Matplotlib

27 February 2022 | 8 January 2022

The Python scientific visualisation landscape is huge. It is composed of a myriad of tools, ranging from the most versatile and widely used down to the more specialised and confidential. Some of these tools are community based while others are developed by companies. Some are made specifically for the web, others are for the desktop only, some deal with 3D and large data, while others target flawless 2D rendering.

Introduction to Datascience: Learn Julia Programming, Math & Datascience from Scratch

30 January 2022 | 8 January 2022

I was emboldened to write this book after my video series, Data Science With Julia, got some traction, and after a tweet of mine about decision trees was liked by the Julia Language account itself. So I thought, why not give it more?

The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming

7 November 2021 | 27 August 2021

We analyze the mathematical fundamentals behind data programming (DP) and demonstrate its power by applying it to two real-world text classification tasks. Furthermore, we compare DP with the pointillistic active and semi-supervised learning techniques traditionally applied in data-sparse settings.

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

26 September 2021 | 22 August 2021

CARLA (Counterfactual And Recourse LibrAry) is a Python library for benchmarking counterfactual explanation methods across different data sets and different machine learning models. In summary, our work provides the following contributions: (i) an extensive benchmark of 11 popular counterfactual explanation methods, (ii) a benchmarking framework for research on future counterfactual explanation methods, and (iii) a standardized set of integrated evaluation measures and data sets for transparent and extensive comparisons of these methods. We have open-sourced CARLA and our experimental results on GitHub, making them available as competitive baselines. We welcome contributions from other research groups and practitioners.

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

18 September 2021 | 18 September 2021

This paper provides a succinct overview of this emerging theory of overparameterized ML (henceforth abbreviated as TOPML) that explains these recent findings through a statistical signal processing perspective. We emphasize the unique aspects that define the TOPML research area as a subfield of modern ML theory and outline interesting open questions that remain.

Data as the main focus of “State of the art of data science in Spanish language and its application in the field of Artificial Intelligence”

15 August 2021 | 2 August 2021

According to the results, there is evidence of cultural bias in data science with respect to the Spanish language. The consultation, carried out on 12 April 2021, confirms that only 10 out of 23,771 datasets “speak” Spanish.

State of the art of data science in Spanish language and its application in the field of AI

2 May 2021 | 8 August 2021

The state-of-the-art study yields results that indicate a lack of involvement of the Spanish language in AI and all of its subareas, which in turn adversely affects the education of future professionals.

El estado del arte de la ciencia de datos en el idioma español y su aplicación en el campo de la Inteligencia Artificial

30 March 2021 | 31 March 2021

The study yields results that indicate a lack of involvement of Spanish in AI and all of its subareas, negatively affecting the training of future professionals.

Classification based on Topological Data Analysis

28 February 2021 | 1 April 2021

Topological Data Analysis (TDA) is an emergent field that aims to discover topological information hidden in a dataset. TDA tools have commonly been used to create filters and topological descriptors to improve Machine Learning (ML) methods. This paper proposes an algorithm that applies TDA directly to multi-class classification problems, including imbalanced datasets, without any further ML stage.
