Guardant Health is the world leader in comprehensive liquid biopsy. Oncologists order our blood test to help determine if their advanced cancer patients are eligible for certain drugs that target specific genomic alterations in tumour DNA. Each test produces huge amounts of genomic data that we process into easily interpretable test results.
Proceedings of the National Academy of Sciences, Qian Zhang et al.
Mathematical and computational modeling approaches can be essential in providing quantitative scenarios of disease spreading, as well as projecting the impact in the population. Here we analyze the spatial and temporal dynamics of the Zika virus epidemic in the Americas with a microsimulation approach informed by high-definition demographic, mobility, and epidemic data. The model provides probability distributions for the time and place of introduction of Zika in Brazil, the estimate of the attack rate, timing of the epidemic in the affected countries, and the projected number of newborns from women infected by Zika. These results are potentially relevant in the preparation and analysis of contingency plans aimed at Zika virus control.
Humans, modern and otherwise, have lived in Denisova Cave in Siberia for tens of thousands of years, where they left behind a treasury of archaeological artifacts. The cave is famous for giving its name to Denisovans, a species of human closely related to Neanderthals. But Neanderthals have lived there, too.
In the cave’s Main Gallery, stone tools had been left behind by people who lived thousands of years ago. Those people were probably Neanderthals, according to a paper in Science this week: The soil says so. Even though no Neanderthal bones have been found with the tools, the paper’s authors are the first to be able to detect the presence of humans based on DNA found in the soil. This allows them to paint a much more detailed picture of the past, in Denisova Cave and elsewhere.
Cloudera has announced the general availability of its Data Science Workbench, a new self-service tool that could help speed the time to value for advanced analytics and deep learning.
Many of the papers now judged most original and significant rely on massive compute resources, usually beyond the financial reach of academia. So where does that leave academic research?
The Allen Institute for Artificial Intelligence is working to teach computers to answer science questions at a grade-school level, a task that might sound simple but requires the computer to decipher images and diagrams and to understand the contextual meaning of what is written.
Facebook showed advertisers how it has the capacity to identify when teenagers feel “insecure”, “worthless” and “need a confidence boost”, according to a leaked document based on research quietly conducted by the social network.
The internal report produced by Facebook executives, and obtained by the Australian, states that the company can monitor posts and photos in real time to determine when young people feel “stressed”, “defeated”, “overwhelmed”, “anxious”, “nervous”, “stupid”, “silly”, “useless” and a “failure”.
The Australian reported that the document was prepared by two top Australian executives, David Fernandez and Andy Sinn.
University of Texas, Texas Advanced Computing Center
Finding new drugs that can more effectively kill cancer cells or disrupt the growth of tumors is one way to improve survival rates for ailing patients. Researchers are using supercomputers to find new chemotherapy drugs and to test known compounds to determine if they can fight different types of cancer. Recent efforts have yielded promising drug candidates, potential plant-derived compounds and insights into how to design more effective drugs.
In my previous post, I talked about scraping Indeed.com for Data Scientist jobs across the United States. While I was able to scrape a little over 10,500 listings, few of them contained salary data, and many of the salaries were hourly, monthly or weekly. After running a massive cleanup on the data, I was left with 493 salaries to use for the modeling. The median salary was $100K, with 236 of the listings above the median and 257 below. I was excited to get to the modeling. However, before jumping to the grand finale, I wanted to see what other insights I could gain from the data. This task called for me to pull one of my favorite data exploration tools from my data scientist toolbox: Tableau!
The overall theme of the ICLR conference setting this year could be summarized as “finger food and ships”. More importantly, there were a lot of interesting papers, especially on machine learning security, which will be the focus of this post.
Columbia University President Lee C. Bollinger today announced that Jeannette Wing, currently corporate vice president of Microsoft Research, will become the Avanessians Director of Columbia’s Data Science Institute and Professor of Computer Science.
“Jeannette Wing is a pioneering figure in the world of computer science research and education. Her addition to the University’s academic leadership team reflects the continuing expansion of our work in this field,” said Bollinger. “Our Data Science Institute is indispensable to virtually every scholarly initiative at the University dedicated to addressing a societal problem. The benefits to be derived from Jeannette’s leadership and her presence here will be immense.”
Ann Arbor, MI. The University of Michigan Exercise & Sport Science Initiative, in collaboration with the Michigan Institute for Data Science, will be hosting a data science summer camp for high-school students who are interested in sport analytics. Deadline to apply is May 20.
It’s the perfect tool for most testing situations. Unfortunately, if you’re doing tests for a product that relies heavily on interaction between users — such as a dating app — doing random assignment on a per-user basis can lead to unreliable experiments and misleading conclusions.
Many organisations develop successful proofs of concept but then don’t manage to move the models beyond their laptops. Taking models into production requires a professional workflow, high quality standards, and scalable code and infrastructure. Data Science in Production is dedicated to reaping benefit from data by taking data-driven applications into production. [pdf download]
Forecasting data collected during the Intelligence Advanced Research Projects Activity’s (IARPA’s) Aggregative Contingent Estimation (ACE) program by team Good Judgment is now available for use by the public and the research community.