May 25, 2017
Using Spline Interpolation in SQL to Analyze Sparse DataData scientists use spline interpolation to produce smooth graphs and estimate missing values by “filling in” the space between discrete points of data. We can use SQL to better approximate our real world data.
Read MoreMay 23, 2017
How ZeroCater Made Our Lunch Data-Driven: Survey Analysis and the IoTHere at Periscope we absolutely love data, so I couldn’t help but notice when our catering company, ZeroCater, brought in an amazing device to collect survey data about our lunches in real time. ZeroCater has always delivered us a great meal, but for a data driven analyst like myself this this was too interesting to ignore...
Read MoreMay 16, 2017
Feature Release: Query ManagementOver the past few months we’ve been working with customers to empower a new level of visibility and control over their databases. With great joy we’re now introducing the incredible power of Query Management.
Read MoreMay 12, 2017
Precision Recall and ROC Curves for Pregnancy TestsAs a data nerd who hates taking chances, I thought I’d perform an example analysis on pregnancy tests to figure out whether or not the current standard of measuring the hormone hCG (human chorionic gonadotropin) is the most optimal method. After all, no matter if you are a woman who wants to be pregnant or a woman who doesn’t, having an incorrect prediction is not something you want to risk. With Periscope Data, use SQL to map out Precision-Recall and Receiver Operating Characteristic (ROC) curves.
Read MoreMay 10, 2017
What You Need To Know About SQL's GROUP BYThe SQL 'group by' is one of the most common keywords. Here is how it works, and a few special cases you want to be aware of
Read MoreApril 13, 2017
The War SQL Won: Your Database Isn't Really RelationalRather than a true relational database, SQL is a compromise between convenience and the rigour of relational algebra. Here's how it's different.
Read MoreMarch 16, 2017
Introducing Periscope Data for SlackOver the past year our customers have asked us to improve our link sharing functionality in a place where they are increasingly working and communicating more: Slack.
Read MoreMarch 2, 2017
SQL Text Analysis with Donald Trump’s TweetsThere were many unique aspects of the 2016 U.S. presidential election, with one of the most interesting being Donald Trump’s use of Twitter. As luck would have it, tweets happen to be very easy to extract and put into relational database format.
Read MoreFebruary 28, 2017
Net Promoter Score and Survey Analysis in SQLIntroduced by Fred Reichheld in 2003, Net Promoter Score (NPS) is a simple method for measuring the likelihood your customers will recommend your product or service. It requires that you survey respondents and ask one simple question: “How likely are you to recommend [Company/Product/Service] to a friend or colleague?”
Read MoreFebruary 1, 2017
How Joins WorkThe SQL join operation is one of the most powerful and commonly used SQL operations, but little attention is paid to how the internal SQL engine breaks down the tasks of join operations.
Read MoreMay 25, 2017
Using Spline Interpolation in SQL to Analyze Sparse DataData scientists use spline interpolation to produce smooth graphs and estimate missing values by “filling in” the space between discrete points of data. We can use SQL to better approximate our real world data.
Read MoreMay 23, 2017
How ZeroCater Made Our Lunch Data-Driven: Survey Analysis and the IoTHere at Periscope we absolutely love data, so I couldn’t help but notice when our catering company, ZeroCater, brought in an amazing device to collect survey data about our lunches in real time. ZeroCater has always delivered us a great meal, but for a data driven analyst like myself this this was too interesting to ignore...
Read MoreMay 12, 2017
Precision Recall and ROC Curves for Pregnancy TestsAs a data nerd who hates taking chances, I thought I’d perform an example analysis on pregnancy tests to figure out whether or not the current standard of measuring the hormone hCG (human chorionic gonadotropin) is the most optimal method. After all, no matter if you are a woman who wants to be pregnant or a woman who doesn’t, having an incorrect prediction is not something you want to risk. With Periscope Data, use SQL to map out Precision-Recall and Receiver Operating Characteristic (ROC) curves.
Read MoreMay 10, 2017
What You Need To Know About SQL's GROUP BYThe SQL 'group by' is one of the most common keywords. Here is how it works, and a few special cases you want to be aware of
Read MoreApril 13, 2017
The War SQL Won: Your Database Isn't Really RelationalRather than a true relational database, SQL is a compromise between convenience and the rigour of relational algebra. Here's how it's different.
Read MoreMarch 2, 2017
SQL Text Analysis with Donald Trump’s TweetsThere were many unique aspects of the 2016 U.S. presidential election, with one of the most interesting being Donald Trump’s use of Twitter. As luck would have it, tweets happen to be very easy to extract and put into relational database format.
Read MoreFebruary 28, 2017
Net Promoter Score and Survey Analysis in SQLIntroduced by Fred Reichheld in 2003, Net Promoter Score (NPS) is a simple method for measuring the likelihood your customers will recommend your product or service. It requires that you survey respondents and ask one simple question: “How likely are you to recommend [Company/Product/Service] to a friend or colleague?”
Read MoreFebruary 1, 2017
How Joins WorkThe SQL join operation is one of the most powerful and commonly used SQL operations, but little attention is paid to how the internal SQL engine breaks down the tasks of join operations.
Read MoreOctober 21, 2016
Data Warehouse Management: Redshift, Bigquery, and SnowflakeIn our post comparing Redshift, BigQuery, and Snowflake on query performance and cost for interactive analytics, we looked at the trade-offs across different data warehouses from a performance perspective.
Read MoreOctober 20, 2016
Interactive Analytics: Redshift vs Snowflake vs BigQueryEvery day customers ask us what the best data warehouse technology is for handling the workload of their growing analytics department: they want to run queries at interactive, real-time speeds at a price-point that fits in their budget.
Read MoreMay 16, 2017
Feature Release: Query ManagementOver the past few months we’ve been working with customers to empower a new level of visibility and control over their databases. With great joy we’re now introducing the incredible power of Query Management.
Read MoreMarch 16, 2017
Introducing Periscope Data for SlackOver the past year our customers have asked us to improve our link sharing functionality in a place where they are increasingly working and communicating more: Slack.
Read MoreNovember 7, 2016
Periscope Data’s $25M Series BWe’re excited to announce that we’ve raised $25M in Series B Funding!
Read MoreOctober 21, 2015
Announcing Periscope Data’s $9.5M Series AWe could not be more excited to announce that we have raised $9.5M in Series A funding from Josh Stein at DFJ!
Read MoreArchives
What you Need to Know about SQLs Group By
SQL Text Analysis with Donald Trump’s Tweets
Net Promoter Score and Survey Analysis in SQL
Data Warehouse Management: Redshift, Bigquery, and Snowflake
Interactive Analytics: Redshift vs Snowflake vs BigQuery
Tracking KPIs using SQL for Effective Revenue Growth Analysis
The Best Redshift Optimization Blog Posts of 2016
Calculating Proportional Values in SQL
Valuing User Subscriptions Over Time with SQL
Safety in San Francisco: A Numbers Game
Creating Ranked Position Time Series with SQL
Building a Better Pokédex with SQL
IMDb vs RottenTomatoes Ratings with SQL Trendlines
Cohorted Engagement: 7-day and 30-day Metrics
Feature Analysis with Mean-Time-Between-Usage Calculation
Cohorted Engagement with Comparable Time Windows in SQL
Common Salesforce Reports In SQL
Calculating Distance Between Data Centers On A Globe
Converting MySQL Syntax and Functions into Redshift
Understanding Outliers with Skew and Kurtosis in SQL
Predicting Churn Using Hypothesis Testing
Calculating Money Growth using PL/pgSQL
Coalesce Down Using Correlated Subqueries
Getting Started With Regular Expressions
Analyzing Disparate Data With Segment Sources
Simple Lead Scoring With Enrichment
Impact or No? -- Simple Event Study in SQL Part I
Rolling Classifications by X Previous Aggregations
Managing A Six-Figure Ad Budget With SQL
Portfolio Risk Analysis In SQL
Bigram Frequencies In Pure SQL
Calculating Expected Value vs. Actual Results for Super Bowl Contenders
Binomial Options Pricing In Postgres
Finding Nearest Neighbors In SQL
Using CTEs and Unions to Compute Running Totals
Funnel Metrics - Step-over-Step Drop Rates
Multi-dimensional Clustering Using K-Means in Postgres SQL
One Dimensional Clustering In Postgres
Creating a Histogram in Redshift
Adding Summary Statistics to your SQL Table
Concatenating Rows in Redshift, Postgres, & MySQL
Calculating Year-over-Year Metrics
Building The Periscope Cache With Amazon Redshift
Analyzing Your Olark History with SQL
Calculating Significance of A/B Tests in Redshift
Understanding Distributed Analytics Databases, Part 2 — Data Distribution
Understanding Distributed Analytics Databases, Part 1: Query Strategies
Analyzing Your Salesforce Data With SQL
Solving the Traveling Salesman Problem with Postgres Recursive CTEs
Connecting to a Redshift Cluster with a Private IP
Setting up Your Redshift Cluster
Game Revenue Metrics — What Matters, and When
International Analytics In SQL
How to ETL Data into and out of Amazon Redshift
Building Your Own User Analytics System In SQL
Thinking Critically About Mobile Game Retention
Explain And Other Tools For Query Optimization
The Top SQL Analysis Errors We See
10 Rules for a Better SQL Schema
Optimizing Query Management on Redshift
Getting the First Row per Group 5X Faster
Extrapolating Data with Day-of-Week Effects
Daily, Weekly and Monthly Charts On The Same Graph
Helpful Redshift Admin Queries
Analyzing Our Long Term Blog Metrics With SQL
Single Event Tables and Common Analysis Queries
Splitting Comma-Separated Values In MySQL
Redshift User Defined Functions in Python
SQL for Marketers — Your Most Profitable Channels
The Lazy Analyst's Guide To Postgres JSON
Exact Row Counts for All Tables in MySQL and Postgres
$3000 Data Warehouse — Redshift vs. Postgres
Importing Data into Redshift from MySQL and Postgres
SQL for Marketers — 3 Ways to Slice Your Acquisition Data
Counting Comma-Delimited Values in Postgres, MySQL, Amazon Redshift and MS SQL Server.
Reuse Calculations in the Same Query with Lateral Joins
Demystifying Redshift — What's Up with My Disks?
SQL for Marketers — Making Where More Powerful
Aggregating into Strings without String_agg or Array_agg in Amazon Redshift
Changing Dist and Sort Keys on Giant Tables in Redshift
Rolling Averages in MySQL and SQL Server
Extrapolating to the End of the Month in SQL
SQL for Marketers Part 4 — Common Analysis Queries
SQL for Marketers Part 3 — Our Most Valuable Traffic Source
Day-Over-Day Changes Without Window Functions
SQL for Marketers Part 2 — Users over the Last 30 Days
Comparing Hourly Revenue In SQL
Using Row Numbering And Full Joins To Transpose Wide Tables
Estimating Conversions With Poisson Distributions
Beyond Random() — Normal Distributions in SQL
Selecting Only One Row Per Group
Use Subqueries and Window Functions to Compute Running Averages
Computing Day-Over-Day Changes With Window Functions
Computing Day-Over-Day Changes With Window Functions
Using Self Joins To Calculate Your Retention, Churn, And Reactivation Metrics
How To Optimize Lifetime Distinct Counts Using Window Functions
Predicting Exponential Growth With SQL
How To Calculate Cohort Retention In SQL
Hashing Tables to Ensure Consistency in Postgres, Redshift and MySQL
How to Format Numbers as Currency in Postgres, MySQL and Redshift
Generate Series in Redshift and MySQL
Faster Redshift Queries with Materialized Views — Lifetime Daily ARPU
How To Sample Rows in SQL 273X Faster
How To Calculate Confidence Intervals In SQL
Use window functions for time-series percentage
4 Ways to Join Only The First Row in SQL
Count Distinct Compared on Top 4 SQL Databases
2X Your Redshift Speed With Sortkeys and Distkeys
Use generate_series to get continuous results
How To Speed Up Where Clauses By 3,000X