Online Learning • Conferences • Radar |
|
|
|
|
1. ML requires a fundamentally different deployment approach As organizations embrace machine learning, the need for new deployment tools and strategies grows. Mike Loukides explains. |
|
2. Bloomberg offers clients historic data for ML purposes Bloomberg will migrate its historical market data to the cloud so that the company’s clients can use the data to test algorithms and train machine learning models. Bloomberg’s chief information officer, Tony McManus, said the company had a growing number of hedge fund clients requesting access to its databases for data analytics purposes. + Check out Findata Day at Strata in NY, Sept 24. |
|
3. Clojure conundrums This intro (with examples) to Kafka Streams with Clojure covers the main Kafka Streams AP, the Willa library for writing idiomatic Clojure, and transducers in Clojure. + Kafka cocreator Neha Narkhede outlines her top resources to help you understand Kafka in this O'Reilly expert playlist: Understanding Streaming Data with Apache Kafka. |
|
4. Evading ML malware classifiers At this year’s DEFCON hacking conference, an MLSEC competition challenged hackers to work out how to smuggle 50 distinct malicious executables past machine learning classifiers. Here, William Fleshman, the winner of the competition, explains how he won. |
|
5. Dropbox’s journey to type checking 4 million lines of Python “At our scale—millions of lines of Python—the dynamic typing in Python made code needlessly hard to understand and started to seriously impact productivity. To mitigate this, we have been gradually migrating our code to static type checking using mypy, likely the most popular standalone type checker for Python.” Here’s Dropbox’s story. |
|
|
IN COLLABORATION WITH Google Cloud
Get early access to Google Cloud Dataproc on Kubernetes
Containerize, isolate, and run Apache Spark jobs on Kubernetes with Dataproc.
|
|
|
6. Scraping a public website doesn’t violate the CFAA, Ninth Circuit (mostly) holds In a major ruling handed down by the US Ninth Circuit Court of Appeals, you can now legally scrape data that’s publicly available—for example, LinkedIn profiles. Orin Kerr, professor of law at UC Berkeley, explains the case and what it means (and what it doesn’t). |
|
7. Obsessed with numbers? Us too. But this Harvard Business Review article argues that tying performance metrics to strategy has a downside. |
|
8. 7 Apache Cassandra mistakes Micha? Mat?oka lists seven common mistakes people make when using Apache Cassandra. + Berglund and McCullough on Mastering Cassandra for Architects (video training course on O’Reilly learning) |
|
|
IN COLLABORATION WITH Neo4j
Free ebook: Graph Algorithms—Practical Examples in Apache Spark and Neo4j
Graph algorithms can be incredibly useful, from finding vulnerabilities and bottlenecks to detecting communities and improving machine learning predictions. This free ebook, courtesy of Neo4j, gives you hands-on examples of how to use graph algorithms in Apache Spark and Neo4j along with sample code and tips for over 20 practical graph algorithms that cover optimal pathfinding, importance through centrality, and community detection using methods like clustering and partitioning.
|
|
|
9. What statistics can and can’t tell us about ourselves “Harold Eddleston, a seventy-seven-year-old from Greater Manchester, was still reeling from a cancer diagnosis he had been given that week when, on a Saturday morning in February 1998, he received the worst possible news. He would have to face the future alone: his beloved wife had died unexpectedly, from a heart attack. Eddleston’s daughter, concerned for his health, called their family doctor, a well-respected local man named Harold Shipman. He came to the house, sat with her father, held his hand, and spoke to him tenderly. Pushed for a prognosis as he left, Shipman replied portentously, ‘I wouldn’t buy him any Easter eggs.’ By Wednesday, Eddleston was dead; Dr Shipman had murdered him.” Shipman was one of the world’s most prolific serial killers. But stats wouldn’t have told you that. |
|
10. Imposter syndrome in data visualization What do you do when the boss says, “Make it look like this”? Phil Hawkins gets it. |
|
What will $245 buy you at Strata in New York?
If you’ve heard about the O’Reilly Strata Data Conference in NY and you’re curious but not quite ready to take the plunge, check out the Expo Plus pass. You’ll be able to test-drive new tools, compare products, attend all the sponsored sessions (and two technical sessions), meet with speakers and authors, get access to 90 days of O’Reilly online learning (a $117 value), and more. It’s a great opportunity to see what’s new in data tools and technologies, network, and get a taste of Strata.
|
|
+ Want to speak at Strata in London? The Call for Speakers is now open. Check out our tips for submitting a great proposal and send us your best ideas by the Oct 1 deadline. |
|
|
|
|
Want your own copy of this newsletter? Sign up here. |
|
Get more data insight and analysis at oreilly.com |
|
|
|
|
|
|
|
This message was sent to @. You are receiving this because you’re a customer of O’Reilly Media, or you’ve signed up to receive email from us. We hope you found this message to be useful. However, if you’d rather not receive future emails of this type from O’Reilly, please manage your preferences or unsubscribe here. Read our Privacy Policy. O’Reilly Media, Inc. 1005 Gravenstein Highway North, Sebastopol, CA 95472 (707) 827-7000 |
|
|