|
|
February 16 · Issue #2 · View online
A weekly digest of all the best Data Science related news and blog posts.
|
|
Welcome to the 2nd issue of the Best of Data Science Weekly! I’m excited to share links that were of interest to me this week . If you have any questions or there’s something you’d like to see in the newsletter please let me know. You’re encouraged to reply to this mail.😊 Regards,
Luis de Sousa
|
|
|
What does Microsoft do with R?
Microsoft’s developments with R are designed to work with the entire R ecosystem rather than be distinct from it. This article contains an overview of what Microsoft has developed around the R ecosystem.
|
Python in Visual Studio Code - Jan 2018 Release
The Microsoft Python extension for Visual Studio code has been updated.
|
Using Azure and AI to Explore the JFK Files
A demo containing sample code using Azure Search and Cognitive Services to provide insights and analysis around the JFK Files.
|
Applying NLP in Sentiment Classification & Entity Recognition Using Azure ML and the Team Data Science Process
This blog post provides a summary of two real-world scenarios demonstrating how to use Azure Machine Learning alongside the Team Data Science Process (TDSP) to execute AI projects involving Natural Language Processing (NLP) use-cases, namely, for sentiment classification and entity extraction. Links to the GitHub code are provided.
|
Custom Speech Recognition, Voice Output, and Video Indexing
A quick recap of three recent posts about Microsoft AI platform developments.
|
|
Azure/DecisionTreeExplorer
DecisionTreeExplorer - Simple Shiny App for visualizing simplicity/performance tradeoffs in decision trees.
|
Microsoft Cognitive Toolkit v2.4
2018-01-31 Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit version 2.4 released.
|
Machine Learning for Apache Spark v0.11
2018-02-08 Machine Learning for Apache Spark (MMLSpark) for Multi-GPU Distributed Training of Deep Networks version 0.11 released.
|
AirSim v1.1.8
2018-02-16 Version 1.18 released. AirSim - Open source simulator based on Unreal Engine for autonomous vehicles from Microsoft AI & Research
|
|
SLAC Dataset From MIT and Facebook
This project presents a novel video dataset, named SLAC (Sparsely Labeled ACtions), for action recognition and localization. Links to the paper are in the YouTube video description.
|
Google's Text Reader AI: Almost Perfect
A brief summary of the paper “Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions”. Links to the paper and an unofficial implementation are in the YouTube video description.
|
Deep Learning for Music Generation
Part 1: Erika explains how to create deep learning models with music as the input. She begins by describing the problem of generating music by specifically describing how she generated the appropriate features from a midi file. She then describes the deep learning model she used in order to generate music.
|
Deep Learning for Music Generation - The Code
Part 2: Erika follows up her previous episode by showing the actual code behind training and using the music generation model. This includes the code for both creating the features from the midi file and returning a midi file from the features. She also shows how Keras is used to generate the actual model.
|
|
Data Skeptic - Deploying Machine Learning to Production with MS SQL Server
A detailed discussion about the questions a practioneer would have when considering how they might use Microsoft SQL Server as the right tool for their production machine learning model deployments.
|
Not So Standard Deviations
Podcast mentioned in first article. The Data Science Podcast
Roger Peng and Hilary Parker talk about the latest in data science and data analysis in academia and industry.
Co-hosts: Roger Peng of the Johns Hopkins Bloomberg School of Public Health and Hilary Parker of Stitch Fix.
|
|
Microsoft Developer Immersion "Developing with Data"
An example of application modernization practice centered around data-driven intelligent apps (using SQL Server 2016 and Azure Database Services), and re-architect on-premises ISV/SaaS apps to Azure for scale.
|
|
The Future Computed: Artificial Intelligence and its role in society
A free book on Artificial Intelligence and its role in society. 🤓
|
|
Gene name errors are widespread in the scientific literature | Genome Biology | Full Text
“The spreadsheet software Microsoft Excel, when used with default settings, is known to convert gene names to dates and floating-point numbers. A programmatic scan of leading genomics journals reveals that approximately one-fifth of papers with supplementary Excel gene lists contain erroneous gene name conversions.”
|
|
That’s all for the second issue. Until next week. EOF
|
Did you enjoy this issue?
|
|
|
|
In order to unsubscribe, click here.
If you were forwarded this newsletter and you like it, you can subscribe here.
|
|
Johannesburg, South Africa, 2020
|