Ciencia de Datos | 🇬🇧 Data Science
Advanced R
Advanced R helps you understand how R works at a fundamental level. It is designed for R programmers who want to deepen their understanding of the language, and programmers experienced in other languages who want to understand what makes R different and special. This book will teach you the foundations of R; three fundamental programming paradigms (functional, object-oriented, and metaprogramming); and powerful techniques for debugging and optimising your code.
IPython Interactive Computing and Visualization Cookbook
IPython Interactive Computing and Visualization Cookbook, Second Edition contains many ready-to-use, focused recipes for high-performance scientific computing and data analysis, from the latest IPython/Jupyter features to the most advanced tricks, to help you write better and faster code.
Text Mining with R (A Tidy Approach)
If you work in analytics or data science, like we do, you are familiar with the fact that data is being generated all the time at ever faster rates. (You may even be a little weary of people pontificating about this fact.) Analysts are often trained to handle tabular or rectangular data that is mostly numeric, but much of the data proliferating today is unstructured and text-heavy. Many of us who work in analytical fields are not trained in even simple interpretation of natural language.
We developed the tidytext (Silge and Robinson 2016) R package because we were familiar with many methods for data wrangling and visualization, but couldn’t easily apply these same methods to text.
Data Science at the Command Line
Today, data scientists can choose from an overwhelming collection of exciting technologies and programming languages. Python, R, Hadoop, Julia, Pig, Hive, and Spark are but a few examples. You may already have experience in one or more of these. If so, then why should you still care about the command line for doing data science? What does the command line have to offer that these other technologies and programming languages do not?
Open Research Dataset (CORD-19): Semantic Scholar has partnered with leading research groups to release the COVID-19
The Allen Institute just published the #covid19 open research #dataset. In addition, they are sponsoring a related Kaggle competition. The dataset contains almost 30k scholarly articles related to the virus. The goal is to use #NLP to advance our understanding.
Efficient R programming
There are many excellent R resources for visualization, data science, and package development. Hundreds of scattered vignettes, web pages, and forums explain how to use R in particular domains. But little has been written on how to simply make R work effectively-until now.
Artificial Intelligence use for help. The coronavirus R dataset package
These data can be used in the area of artificial intelligence (machine learning and deeplearning) and among all obtain more efficient and faster results regarding coronavirus.









