Tag Data Science

Instacart Data Science with Jeremy Stanley

http://traffic.libsyn.com/sedaily/InstacartDataScience.mp3Podcast: Play in new window | Download Instacart is a grocery delivery service. Customers log onto the website or mobile app and pick their groceries. Shoppers at the store get those groceries off the shelves. Drivers pick up the groceries and drive them to the customer. This is an infinitely complex set of logistics problems, paired with a rich data set given by the popularity of Instacart. Jeremy Stanley is

Continue reading…

Data Skepticism with Kyle Polich

http://traffic.libsyn.com/sedaily/dataskeptic_edited.mp3Podcast: Play in new window | Download With a fast-growing field like data science, it is important to keep some amount of skepticism. Tools can be overhyped, buzzwords can be overemphasized, and people can forget the fundamentals. If you have bad data, you will get bad results in your experimentation. If you don’t know what statistical approach you want to take to your data, it doesn’t matter how well you

Continue reading…

Artificial Intelligence Implications with Rumman Chowdhury

http://traffic.libsyn.com/sedaily/aiwithrumman_edited_1.mp3Podcast: Play in new window | Download Machine learning has improved both in tools and accessibility. Frameworks like TensorFlow create the right abstractions for developers to work efficiently. Educational programs like Metis and Insight Data Science provide a place for developers to learn these tools. As a result, artificial intelligence is becoming easier to develop and more widespread. Rumman Chowdhury works on artificial intelligence at Accenture. Before her current role,

Continue reading…

Data Applications With Dave King

http://traffic.libsyn.com/sedaily/DataApps.mp3Podcast: Play in new window | Download Data scientists need flexible interfaces for displaying and manipulating data sets. Data engineers need to be able to visualize how their data pipelines wire together databases and data processing frameworks. DevOps engineers need dashboards to understand their monitoring data at a high level. All of these programmers are building data applications. Data applications let us visualize and manipulate data sets effectively. In today’s

Continue reading…

Go Data Science with Daniel Whitenack

http://traffic.libsyn.com/sedaily/Go_Data_Science.mp3Podcast: Play in new window | Download Data science is typically done by engineers writing code in Python, R, or another scripting language. Lots of engineers know these languages, and their ecosystems have great library support. But these languages have some issues around deployment, reproducibility, and other areas. The programming language Golang presents an appealing alternative for data scientists. Daniel Whitenack transitioned from doing most of his data science work

Continue reading…

Data Engineering with Pete Soderling

http://traffic.libsyn.com/sedaily/hakkalabs_edited.mp3Podcast: Play in new window | Download In the last five years, companies started hiring data engineers. A data engineer creates the systems that manage and access the huge volumes of data that are accumulating on cheap cloud servers. As the saying goes, “it’s more expensive to throw out the data than to store it.” Pete Soderling joins the show to discuss the rise of the data engineer, and how

Continue reading…

Simpsons Data Science with Todd Schneider

http://traffic.libsyn.com/sedaily/simpsons_data_science_edited.mp3Podcast: Play in new window | Download The Simpsons is a classic, beloved television show. The scripts of The Simpsons have been made publicly available, and include dialogue, location, and character information. Todd Schneider used these scripts and other information sources as a corpus to analyze The Simpsons and find interesting statistics–such as who the most important supporting characters were, and how the ratings of the show have trended over

Continue reading…

PANCAKE STACK Data Engineering with Chris Fregly

http://traffic.libsyn.com/sedaily/pancakestack_edited_fixed.mp3Podcast: Play in new window | Download Data engineering is the software engineering that enables data scientists to work effectively. In today’s episode, we explore the different sides of data engineering–the data science algorithms that need to be processed and the implementation of software architectures that enable those algorithms to run smoothly. The PANCAKE STACK is a 12-letter acronym that Chris Fregly gave to a collection of data engineering technologies

Continue reading…

Data Validation with Dan Morris

http://traffic.libsyn.com/sedaily/datavalidation_edited_2.mp3Podcast: Play in new window | Download Data Validation is the process of ensuring that data is accurate. In many software domains, an application is pulling in large quantities of data from external sources. That data will eventually be exposed to users, and it needs to be correct. Radius Intelligence is a company that aggregates data on small businesses. In order to ensure that business addresses and phone numbers are

Continue reading…

Using Software to Discover Rare Diseases with Matt Might

http://traffic.libsyn.com/sedaily/Might_Edited_2.mp3Podcast: Play in new window | Download “In many ways, nature is still the fastest computer we have when it comes to studying disease.” Software engineering is a deterministic field. We write lines of code, and feed data into that code, expecting to get a certain answer. Computing is deterministic because humans developed it–we understand computers from top to bottom. The same cannot be said about biology. Matt Might is

Continue reading…

Data Visualization and Mapping with Aurelia Moser

http://traffic.libsyn.com/sedaily/Mapping_Edited.mp3Podcast: Play in new window | Download “I’m always worried that if you teach too much magic, people don’t learn the basics – they don’t know why something is working, they just know the documentation said it should work that way.” On Software Engineering Daily, we often discuss big data in terms of data engineering and data science. Data engineering is the infrastructure and pipelines that handle massive amounts of

Continue reading…

Machine Learning in Healthcare with David Kale

http://traffic.libsyn.com/sedaily/healthcareML_Edited.mp3Podcast: Play in new window | Download “Building a model to predict disease and deploying that in the wild – the bar for success is much higher there than, say, deciding what ad to show you.” Diagnosing illness today requires the trained eye of a doctor. With machine learning, we might someday be able to diagnose illness using only a data set. Today on Software Engineering Daily, we are joined

Continue reading…

Data Science at Monsanto with Tim Williamson

http://traffic.libsyn.com/sedaily/Monsanto_Edited_FInal.mp3Podcast: Play in new window | Download “Nothing’s cool unless you call it ‘as a service.’ ” Monsanto is a company that is known for its chemical and biological engineering. It is less well known for its data science and software engineering teams. Tim Williamson is a data scientist at Monsanto, and on today’s show he talked about how he and a small group of engineers at Monsanto dramatically shifted

Continue reading…

Matplotlib with Ben Root

“My eyes just roll whenever I see a table just full of numbers – they don’t mean anything to me, I don’t immediately grok it. But if I see a line plot, I get it, right away.”

Continue reading…

Deep Learning and Keras with François Chollet

“I definitely think we can try to abstract away the first principles of intelligence and then try to go from these principles to an intelligent machine that might look nothing like the brain.”

Continue reading…

Mesosphere and Tech Journalism with Derrick Harris

“The business of technology and the technology of technology are kind of converging if you ask me. And there is definitely a space for some publications that don’t have decades of technical debt in the software space.”

Continue reading…

Spark in Practice with Holden Karau

“I found Spark and I was really excited because I’m a functional programming nerd, and it was written in Scala.”

Continue reading…

Machine Learning for Businesses with Joshua Bloom

“You’ve got software engineers who are interested in machine learning, and think what they need to do is just bring in another module and then that will solve their problem. It’s particularly important for those people to understand that this is a different type of beast.”

Continue reading…

Data Science with Srini Kadamati

“I really think that data science is like design in the sense that it’s a way of thinking.”

Continue reading…

Hiring Engineers with Ammon Bartram

http://traffic.libsyn.com/sedaily/Triplebyte_Edited.mp3Podcast: Play in new window | Download “Humans are the most complicated thing out there – judging human skill is extremely hard, there’s all kinds of ways that people can be good.” Triplebyte is a technical hiring platform that vets engineers using a comprehensive evaluation platform and connects them to companies that are interesting in hiring them. Triplebyte was part of the Y Combinator summer class of 2015. Ammon Bartram

Continue reading…

  • 1 2