NYU Data Science newsletter – May 17, 2016

NYU Data Science Newsletter features journalism, research papers, events, tools/software, and jobs for May 17, 2016

GROUP CURATION: N/A

 
Data Science News



Deep Learning and Neuromorphic Chips

KDnuggets, Peter Morgan


from May 16, 2016

There are three main ingredients to creating artificial intelligence: hardware (compute and memory), software (or algorithms), and data. We’ve heard a lot of late about deep learning algorithms that are achieving superhuman level performance in various tasks, but what if we changed the hardware?

 

Understanding how news cycles unfold from the original source

Research at Facebook; Chenhao Tan, Adrien Friggeri, Lada Adamic


from May 06, 2016


We get news from many media sources, and also through our friends, online and offline. By the time the news reaches us, it may have been retold in interesting ways, which so far have typically not been quantified. Normally it would be difficult to tell how the information that reaches us differs from its original source, because the sharing of the information is dispersed, or the situation itself is evolving. However, in a few cases, the source is better-defined, for example, when a public entity issues a press release.

In a recent study, we collected a sample of press releases by the U.S. Federal Open Market Committee, published speeches by President Barack Obama, as well as press releases from several tech companies and universities. We then gathered de-identified Facebook data, analyzed in aggregate, on shares of the articles covering the source and the corresponding comments, as shown in the diagram above.

 

Kitt.ai can add voice controls to devices

Seattle Times


from May 15, 2016

A Seattle startup gets on the artificial-intelligence train with work in natural language understanding and by helping developers create voice controls used on devices.

 

PC culture conquers Barclays! Stadium to host massive video game tournament

Brooklyn Daily


from May 12, 2016

Forget Nets games — ’net games are coming to Barclays Center!

The organizers of an international video-game tournament circuit expect thousands of fans to cram into Brooklyn’s biggest area to watch a high-stakes “Counter Strike” contest in October. Staring at a screen might sound like something you can just do at home, but experts say it is genuinely thrilling to cheer, gasp, and groan alongside other spectators while watching contestants fight to the digital death.

“It’s a shared experience with a massive amount of other human beings,” said Andy Nealen, a professor of game engineering and computer graphics at New York University. “Just to straight up be in the atmosphere, it’s pretty exhilarati­ng.”

 

Cybersecurity sleuths learn to think like hackers

CNET


from May 15, 2016

… “Everyone should know cybersecurity,” says William Yang, an 18-year-old from the Arkansas School for Mathematics, Sciences and the Arts in Hot Springs, Arkansas, who tells me security needs to be baked into more software. In the meantime, he’s having fun just figuring out the solution to the cyberpuzzle.

Yang is here at New York University’s Brooklyn campus for Cyber Security Awareness Week. The largest event of its kind, CSAW comprises six competitions for high-school and university students and a career fair. At stake is more than $450,000 in college scholarships for the high schoolers and more than $11,000 in cash prizes for university winners.

 

Imker to Lead Illinois Efforts in Multi-Institution Data Curation Network Funded by Sloan Foundation

University of Illinois Library


from May 12, 2016

Heidi Imker, Director of the Research Data Service and Associate Professor at the University of Illinois Library, will lead local participation in a new Data Curation Network funded by the Sloan Foundation. The grant will enable six institutions, including Illinois, to pilot a “network of expertise” model for data curation services.

 

[1605.04462] Natural Language Processing for Mental Health: Large Scale Discourse Analysis of Counseling Conversations

arXiv, Computer Science > Computation and Language; Tim Althoff, Kevin Clark, Jure Leskovec


from May 14, 2016

Mental illness is one of the most pressing public health issues of our time. While counseling and psychotherapy can be effective treatments, our knowledge about how to conduct successful counseling conversations has been limited due to lack of large-scale data with labeled outcomes of the conversations. In this paper, we present a large-scale, quantitative study on the discourse of text-message-based counseling conversations.

 

Deep learning accelerator and neural network software framework introduced by Movidius

Vision Systems Design


from May 16, 2016

Movidius has announced the release of the Fathom Neural Compute Stick—a deep learning acceleration module—as well as Fathom deep learning software framework, both of which will enable neural networks to be moved out of the cloud and deployed natively in end-user devices.

The Fathom Neural Compute Stick, a USB stick embedded neural network accelerator, features the Myriad 2 vision processor and can run fully-trained neural networks at under 1 Watt of power. Targeted at deep learning product developers and researchers, the Fathom Neural Compute Stick accepts networks defined by Caffe or TensorFlow and their accompanying dataset. It then uses the Movidius Fathom Tool to prepare and execute the convolutional neural network on Myriad 2.

When connected to a PC, the Fathom Neural Compute Stick behaves as a neural network profiling and evaluation tool, meaning companies will be able to prototype faster and more efficiently, reducing time to market for products requiring artificial intelligence.

 

Bossy girls, Parser McParseface, and why deep learning is not just another fad

Pete Warden's blog


from May 15, 2016

Deep learning is different, and I believe this fervently because I’ve seen the approach deliver record-beating results in practical applications across an amazing variety of different problems. That’s why TensorFlow is so important to me personally. …
It’s also why I was over the moon to see another Google research team release Parsey McParseface!

 
Events



Program – NLP+CSS: Workshops on Natural Language Processing and Computational Social Science



Language is perhaps the most salient outcome of complex social processes. We do not expect teenagers to speak like senior citizens, and we recognize the mutual dependency between language and social factors. Although this interdependence is at the core of models in both natural language processing (NLP) and (computational) social sciences (CSS), these two fields still exist largely in parallel, holding back research insights and potential applications.

This workshops aims to advance the joint computational analysis of social sciences and language by explicitly connecting social scientists, network scientists, NLP researchers, and industry partners.

Hannover, Germany May 22, 2016, Hannover, Germany. This is one of two companion workshops on NLP+CSS in 2016.

 

Information+ Conference Tickets



The inaugural Information+ conference will bring together researchers and practitioners in information design and information visualization to discuss common questions and challenges in these rapidly changing fields.

Vancouver, British Columbia Thursday, June 16, at Emily Carr University of Art + Design.

 
Deadlines



OPODIS 2016 – The 20th International Conference on Principles of Distributed Systems

deadline: subsection?

OPODIS is an open forum for the exchange of state-of-the-art knowledge on distributed computing and distributed computer systems. All aspects of distributed systems are within the scope of OPODIS.

Madrid, Spain Tuesday-Friday, December 13-16.

Deadline for submissions is Monday, August 22.

 
Tools & Resources



Troubleshooting Neural Networks: What is Wrong When My Error Increases?

KDnuggets, Sebastian Raschka


from May 15, 2016

There are many possible reasons that could explain this problem. There could be a technical explanation — we implemented backpropagation incorrectly — or, we chose a learning rate that was too high, which in turn let to the problem that we were overshooting the local minima of the cost function.

 

Deep Learning Software

NVIDIA Developer


from May 01, 2016

Deep learning algorithms use large amounts of data and the computational power of the GPU to learn information directly from data such as images, signals, and text. NVIDIA DIGITS offers an interactive workflow-based solution for image classification. Deep learning frameworks offer more flexibility with designing and training custom deep neural networks and provide interfaces to common programming language. The NVIDIA Deep Learning SDK offers powerful tools and libraries for the development of deep learning frameworks such as Caffe, CNTK, TensorFlow, Theano, and Torch.

 

Marco Bonzanini – Building Data Pipelines in Python

YouTube, PyData


from May 11, 2016

I presented “Building Data Pipelines in Python”, with a focus on the need to bring R&D and Engineering together, and how basic engineering principles can be beneficial even if your job is not all about writing code. After presenting a very similar talk at PyCon Italy, I found the audience in London to be a bit more on the academic side than I initially thought, which was perfect for my engineering rants. After the usual first few minutes of feeling awkward when speaking publicly, I started my discussion on unit testing and asked how many in the audience write unit tests regularly. Random guy from the audience: “What’s a unit test?”. Thank you kind stranger, you lifted my spirit and the rest of the talk was a breeze

 
Careers



Postdoctoral Scholar – Research Associate
 

University of Southern California, Information Sciences Institute
 

Leave a Comment

Your email address will not be published.