Tagged | data-analytics
-
Recognize Class Imbalance with Baselines and Better Metrics
(engineering.indeedblog.com) -
Monitoring to prevent game cheating
(engineering.linecorp.com)#software-engineering #software-design #monitoring #data-analytics
-
Detecting Bias with SHAP
(databricks.com) -
ROC Curves and the Efficient Frontier
(towardsdatascience.com) -
How GPU Computing literally saved me at work?
(medium.com) -
Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask
(eng.uber.com) -
Analyzing Spatial Patterns in Life Expectancy with Python
(www.azavea.com) -
Understanding Dynamic Time Warping
(databricks.com) -
Presentation: Winning Ways for Your Visualization Plays
(www.infoq.com) -
Introducing PySurvival
(medium.com) -
How We Detect Anomalies in Our Product Recommendations Metrics
(tech.wayfair.com)#data-science #software-architecture #machine-learning #data-analytics
-
Using Reinforcement Learning to Design a Better Rocket Engine
(blog.insightdatascience.com) -
A Visual Exploration of Gaussian Processes
(distill.pub)#data-science #data-analytics #visualisation #math #statistics
-
Design Principles for Mathematical Engineering in Experimentation Platform at Netflix
(medium.com) -
Modeling Censored Time-to-Event Data Using Pyro, an Open Source Probabilistic Programming Language
(eng.uber.com)#data-analytics #Probabilistic-programming #time-series #statistics
-
Presentation: Data-driven Decision Making
(www.infoq.com) -
Anomaly Detection with Isolation Forest & Visualization
(medium.com) -
Why Financial Planning is Exciting… At Least for a Data Scientist
(eng.uber.com) -
Understanding Customer Churning with Big Data Analytics
(towardsdatascience.com) -
Analyzing Twitch chat during a Pokémon Marathon
(blog.twitch.tv) -
Exploratory Design in Data Visualization
(towardsdatascience.com) -
Generating Twitter Ego-Networks & Detecting Ego-Communities
(towardsdatascience.com)#data-analytics #big-data #graph-processing #visualisation #social-networks
-
Monte Carlo Power Analysis
(deliveroo.engineering) -
Predicting real-time availability of 200 million grocery items in US/Canada stores
(tech.instacart.com) -
Presentation: The State of AI Marketing
(www.infoq.com) -
Exploring & Machine Learning for Airbnb Listings in Toronto
(towardsdatascience.com) -
Linear Regression in Real Life
(www.dataquest.io) -
Pylift: A Fast Python Package for Uplift Modeling
(tech.wayfair.com) -
Deduplicating files in Public Git Archive
(blog.sourced.tech) -
Risk Detection Infrastructure @ Postmates
(blog.postmates.com) -
Towards Natural Language Semantic Code Search
(githubengineering.com) -
Introducing Oak: an Open Source Scalable Key-Value Map for Big Data Analytics
(yahooeng.tumblr.com) -
Data Warehousing and ETLs
(medium.com) -
Real-time Streaming Pattern: Analyzing Trends
(blog.wallaroolabs.com) -
What to Consider When Choosing Colors for Data Visualization
(www.dataquest.io) -
Optimizing TV Advertising Toward Return on Investment
(tech.wayfair.com) -
Leveraging Elastic Demand for Forecasting
(tech.instacart.com) -
Learning Market Dynamics for Optimal Pricing
(medium.com) -
Machine Learning in Google BigQuery
(ai.googleblog.com) -
Presentation: R for AI developers
(www.infoq.com) -
Visualize your real-time data
(itnext.io) -
From shallow to deep learning in fraud
(eng.lyft.com) -
Time-Series Analysis Using Recurrent Neural Networks in Tensorflow
(towardsdatascience.com) -
The Design of Statistical Graphics
(towardsdatascience.com) -
How are Logistic Regression & Ordinary Least Squares Regression Related?
(towardsdatascience.com) -
Overscripted! Digging into JavaScript execution at scale
(hacks.mozilla.org) -
From Beautiful Maps to Actionable Insights: Introducing kepler.gl, Uber’s Open Source Geospatial Toolbox
(eng.uber.com) -
A modified Artificial Bee Colony algorithm to solve Clustering problems
(towardsdatascience.com) -
The Duplicate Review Tool: Incorporating Visual Search into Merchandising Operations
(tech.wayfair.com) -
Keeping 2 billion Android devices safe with machine learning
(android-developers.googleblog.com) -
Categorizing Listing Photos at Airbnb
(medium.com) -
Clustering Cryptocurrencies with Affinity Propagation and the RAD 30 Crypto Composite
(hackernoon.com) -
Two things about power
(multithreaded.stitchfix.com) -
Predicting Ethereum prices with Long Short Term Memory (LSTM)
(towardsdatascience.com) -
How to build analytic products in an age when data privacy has become critical
(www.oreilly.com) -
Uplift Modeling in Display Remarketing
(tech.wayfair.com) -
Gimel: PayPal’s Analytics Data Processing Platform
(www.paypal-engineering.com) -
Quantifying Effort through Heart Rate Data
(medium.com) -
Simon Moss on using artificial intelligence to fight financial crimes
(www.oreilly.com) -
MobileNetV2: Inverted Residuals and Linear Bottlenecks
(towardsdatascience.com) -
Give Meaning to 100 billion Events a Day - The Analytics Pipeline at Teads
(highscalability.com) -
Lumpers and Splitters: Tensions in Taxonomies
(multithreaded.stitchfix.com) -
Information Theory of Neural Networks
(hackernoon.com) -
Extracting Signals From the News
(eng.datafox.com) -
Renko brick size optimization
(towardsdatascience.com) -
A Look Behind the AI that Powers LinkedIn’s Feed: Sifting through Billions of Conversations to Create Personalized News Feeds for Hundreds of Millions of Members
(engineering.linkedin.com) -
Analysing 1.4 billion rows with python
(hackernoon.com) -
[Podcast] The Rising Threat of Content Abuse
(blog.siftscience.com) -
Finding Desirable Items in eBay Search by a Deep Dive into Skipped Items
(www.ebayinc.com) -
Real-Time Hotspot Detection in Amazon Kinesis Analytics
(aws.amazon.com) -
Common Patterns for Analyzing Data
(towardsdatascience.com) -
Listing Embeddings for Similar Listing Recommendations and Real-time Personalization in Search
(medium.com) -
Data Pre-Processing in Python: How I learned to love parallelized applies with Dask and Numba
(towardsdatascience.com) -
Data Analysis with Spark
(jobs.zalando.com) -
How to use deep-learning to quantify pollinator behavior I
(towardsdatascience.com) -
Intro to Descriptive Statistics
(towardsdatascience.com) -
Using Synthetic Data Modeling to Enhance Machine Learning
(engineering.salesforce.com) -
Time Series Forecasting with Splunk. Part I. Intro & Kalman Filter.
(towardsdatascience.com) -
Omphalos, Uber’s Parallel and Language-Extensible Time Series Backtesting Tool
(eng.uber.com) -
Improving the Random Forest in Python Part 1
(towardsdatascience.com) -
Understanding Feature Engineering (Part 2) — Categorical Data
(towardsdatascience.com) -
Big Data: Information visualization techniques
(towardsdatascience.com) -
Congressional Partisanship: A Visualization
(towardsdatascience.com) -
Out of Core Genomics
(towardsdatascience.com) -
Introducing pydqc
(towardsdatascience.com) -
Let’s talk about Advanced Analytics: A brief look at Artificial Intelligence
(becominghuman.ai) -
Numerai walkthrough: Quantitative Analysis & Machine learning for fun and profit.
(hackernoon.com) -
Large-Scale Health Data Analytics with OHDSI
(blog.cloudera.com) -
Uncovering hidden patterns through machine learning
(www.oreilly.com) -
The Statistical Modeling System Powering LinkedIn Salary
(engineering.linkedin.com) -
Stopping fraudsters by changing products
(eng.lyft.com) -
Exploratory Data Analysis of orders on Grofers
(lambda.grofers.com) -
What the SATs Taught Us about Finding the Perfect Fit
(multithreaded.stitchfix.com) -
Using Excel with pandas
(www.dataquest.io) -
DDoS Attack Detection with Wallaroo: A Real-time Time Series Analysis Example
(blog.wallaroolabs.com) -
The Incredible Convergence Of Deep Learning And Genomics
(hackernoon.com) -
Matching Albums through Cover Art Fingerprinting
(deezer.io)#machine-learning #image-processing #data-analytics #classifier
-
Box Graph: how we built a spontaneous social network
(blog.box.com)#machine-learning #data-analytics #graph-processing #social-networks
-
A sub-optimal approach for predicting real estate prices by Zillow
(becominghuman.ai) -
Our Discovery of Cramming
(blog.twitter.com) -
So You Have Some Clusters, Now What?
(medium.com) -
A Neural Network Primer
(technology.condenast.com) -
Machine Learning Algorithms: Which One to Choose for Your Problem
(blog.statsbot.co) -
Kaggle Fundamentals: The Titanic Competition
(www.dataquest.io) -
StreamING Machine Learning Models: How ING Adds Fraud Detection Models at Runtime with Apache Flink®
(data-artisans.com) -
Why Violin Plots are Awesome for Feature Engineering: An Example Using NLP to Identify Similar Products
(engineering.wayfair.com) -
Zalando's Smart Product Platform
(jobs.zalando.com) -
Brain MRI image segmentation using Stacked Denoising Autoencoders
(blog.insightdatascience.com) -
Building Brundage Bot
(hackernoon.com) -
A Brief Tour of Grouping and Aggregating in Pandas
(intoli.com) -
Bayesian Nonparametrics
(blog.statsbot.co) -
Getting Started Analyzing Twitter Data in Apache Kafka through KSQL
(www.confluent.io) -
SoundCloud's Data Science Process
(developers.soundcloud.com) -
enry: detecting languages
(blog.sourced.tech) -
The Product Possibilities of Interpretability
(blog.fastforwardlabs.com) -
Explore Happiness Data Using Python Pivot Tables
(www.dataquest.io) -
Using social media data to help measure smoke exposure
(research.fb.com) -
Fantasy Football for Hackers II — An Interactive Visualization of Average Draft Position vs Season Projections
(intoli.com) -
Zalando Fulfillment Solutions and our FAST Replenishment Algorithm
(jobs.zalando.com) -
Taste Graph part 1: Assigning interests to Pins
(medium.com) -
Serving Top Comments in Professional Social Networks
(engineering.linkedin.com) -
Analyzing One Million robots.txt Files
(intoli.com) -
Pre-Processing GeoTIFF files and training DeepMask/SharpMask model
(software.intel.com) -
Druid and Spark Together – Mixing Analytics Workflows
(metamarkets.com) -
Interpretability in conversation with Patrick Hall and Sameer Singh
(blog.fastforwardlabs.com) -
Dynamic Information Retrieval Modeling
(becominghuman.ai) -
Boosting Product Categorization with Machine Learning
(techblog.commercetools.com) -
Product Matching in eCommerce
(medium.com) -
Time Dependent Classification
(multithreaded.stitchfix.com) -
Engineering Uncertainty Estimation in Neural Networks for Time Series Prediction at Uber
(eng.uber.com) -
Meet Michelangelo: Uber’s Machine Learning Platform
(eng.uber.com) -
Why your relationship is likely to last (or not): using Local Interpretable Model-Agnostic Explanations (LIME)
(blog.fastforwardlabs.com) -
Leveraging Machine Learning to Rank Buying Intent
(code.hootsuite.com) -
Tessa: 1,000,000,000 Strava Activities, 1 Spatiotemporal Dataset
(medium.com) -
Exploring and Visualizing an Open Global Dataset
(research.googleblog.com) -
Strategic Pricing in Retail with Machine Learning
(blog.statsbot.co) -
Implementing Temporal Graphs with Apache TinkerPop and HGraphDB
(blog.cloudera.com) -
TimescaleDB vs. Postgres for time-series
(blog.timescale.com) -
Periscope Data | Geographic Analysis in SQL: Measuring Polygon Area from Latitude and Longitude
(webflow-blog.periscopedata.com) -
Cube Planner – Build an Apache Kylin OLAP Cube Efficiently and Intelligently
(www.ebaytechblog.com) -
Breaking the “curse of dimensionality” in Genomics using “wide” Random Forests
(databricks.com) -
Closing the Data-Quality Loop
(jobs.zalando.com) -
Using Machine Learning to Predict Value of Homes On Airbnb
(medium.com) -
Engineering Data Analytics with Presto and Parquet at Uber
(eng.uber.com) -
Text Mining of Stack Overflow Questions
(stackoverflow.blog) -
Setting up Spark Streaming - Part II
(tech.showmax.com) -
Democratizing Kaplan-Meier
(engineering.harrys.com) -
We Analyzed 100 Million Headlines. Here’s What We Learned (New Research)
(buzzsumo.com)