Using Spline Interpolation in SQL to Analyze Sparse Data

Data scientists use spline interpolation to produce smooth graphs and estimate missing values by “filling in” the space between discrete points of data. We can use SQL to better approximate our real world data.

Read More
How ZeroCater Made Our Lunch Data-Driven: Survey Analysis and the IoT

Here at Periscope we absolutely love data, so I couldn’t help but notice when our catering company, ZeroCater, brought in an amazing device to collect survey data about our lunches in real time. ZeroCater has always delivered us a great meal, but for a data driven analyst like myself this this was too interesting to ignore...

Read More
Feature Release: Query Management

Over the past few months we’ve been working with customers to empower a new level of visibility and control over their databases. With great joy we’re now introducing the incredible power of Query Management.

Read More
Precision Recall and ROC Curves for Pregnancy Tests

As a data nerd who hates taking chances, I thought I’d perform an example analysis on pregnancy tests to figure out whether or not the current standard of measuring the hormone hCG (human chorionic gonadotropin) is the most optimal method. After all, no matter if you are a woman who wants to be pregnant or a woman who doesn’t, having an incorrect prediction is not something you want to risk. With Periscope Data, use SQL to map out Precision-Recall and Receiver Operating Characteristic (ROC) curves.

Read More
What You Need To Know About SQL's GROUP BY

The SQL 'group by' is one of the most common keywords. Here is how it works, and a few special cases you want to be aware of

Read More
The War SQL Won: Your Database Isn't Really Relational

Rather than a true relational database, SQL is a compromise between convenience and the rigour of relational algebra. Here's how it's different.

Read More
Introducing Periscope Data for Slack

Over the past year our customers have asked us to improve our link sharing functionality in a place where they are increasingly working and communicating more: Slack.

Read More
SQL Text Analysis with Donald Trump’s Tweets

There were many unique aspects of the 2016 U.S. presidential election, with one of the most interesting being Donald Trump’s use of Twitter. As luck would have it, tweets happen to be very easy to extract and put into relational database format.

Read More
Net Promoter Score and Survey Analysis in SQL

Introduced by Fred Reichheld in 2003, Net Promoter Score (NPS) is a simple method for measuring the likelihood your customers will recommend your product or service. It requires that you survey respondents and ask one simple question: “How likely are you to recommend [Company/Product/Service] to a friend or colleague?”

Read More
How Joins Work

The SQL join operation is one of the most powerful and commonly used SQL operations, but little attention is paid to how the internal SQL engine breaks down the tasks of join operations.

Read More
Using Spline Interpolation in SQL to Analyze Sparse Data

Data scientists use spline interpolation to produce smooth graphs and estimate missing values by “filling in” the space between discrete points of data. We can use SQL to better approximate our real world data.

Read More
How ZeroCater Made Our Lunch Data-Driven: Survey Analysis and the IoT

Here at Periscope we absolutely love data, so I couldn’t help but notice when our catering company, ZeroCater, brought in an amazing device to collect survey data about our lunches in real time. ZeroCater has always delivered us a great meal, but for a data driven analyst like myself this this was too interesting to ignore...

Read More
Precision Recall and ROC Curves for Pregnancy Tests

As a data nerd who hates taking chances, I thought I’d perform an example analysis on pregnancy tests to figure out whether or not the current standard of measuring the hormone hCG (human chorionic gonadotropin) is the most optimal method. After all, no matter if you are a woman who wants to be pregnant or a woman who doesn’t, having an incorrect prediction is not something you want to risk. With Periscope Data, use SQL to map out Precision-Recall and Receiver Operating Characteristic (ROC) curves.

Read More
What You Need To Know About SQL's GROUP BY

The SQL 'group by' is one of the most common keywords. Here is how it works, and a few special cases you want to be aware of

Read More
The War SQL Won: Your Database Isn't Really Relational

Rather than a true relational database, SQL is a compromise between convenience and the rigour of relational algebra. Here's how it's different.

Read More
SQL Text Analysis with Donald Trump’s Tweets

There were many unique aspects of the 2016 U.S. presidential election, with one of the most interesting being Donald Trump’s use of Twitter. As luck would have it, tweets happen to be very easy to extract and put into relational database format.

Read More
Net Promoter Score and Survey Analysis in SQL

Introduced by Fred Reichheld in 2003, Net Promoter Score (NPS) is a simple method for measuring the likelihood your customers will recommend your product or service. It requires that you survey respondents and ask one simple question: “How likely are you to recommend [Company/Product/Service] to a friend or colleague?”

Read More
How Joins Work

The SQL join operation is one of the most powerful and commonly used SQL operations, but little attention is paid to how the internal SQL engine breaks down the tasks of join operations.

Read More
Data Warehouse Management: Redshift, Bigquery, and Snowflake

In our post comparing Redshift, BigQuery, and Snowflake on query performance and cost for interactive analytics, we looked at the trade-offs across different data warehouses from a performance perspective.

Read More
Interactive Analytics: Redshift vs Snowflake vs BigQuery

Every day customers ask us what the best data warehouse technology is for handling the workload of their growing analytics department: they want to run queries at interactive, real-time speeds at a price-point that fits in their budget.

Read More
Feature Release: Query Management

Over the past few months we’ve been working with customers to empower a new level of visibility and control over their databases. With great joy we’re now introducing the incredible power of Query Management.

Read More
Introducing Periscope Data for Slack

Over the past year our customers have asked us to improve our link sharing functionality in a place where they are increasingly working and communicating more: Slack.

Read More
No items found.
Periscope Data’s $25M Series B

We’re excited to announce that we’ve raised $25M in Series B Funding!

Read More
Announcing Periscope Data’s $9.5M Series A

We could not be more excited to announce that we have raised $9.5M in Series A funding from Josh Stein at DFJ!

Read More
No items found.

Archives

What you Need to Know about SQLs Group By

SQL Text Analysis with Donald Trump’s Tweets

Net Promoter Score and Survey Analysis in SQL

How Joins Work

Data Warehouse Management: Redshift, Bigquery, and Snowflake

Interactive Analytics: Redshift vs Snowflake vs BigQuery

Tracking KPIs using SQL for Effective Revenue Growth Analysis

SQL Symbol Cheat Sheet

The Best Redshift Optimization Blog Posts of 2016

Calculating Proportional Values in SQL

Valuing User Subscriptions Over Time with SQL

Safety in San Francisco: A Numbers Game

Creating Ranked Position Time Series with SQL

Building a Better Pokédex with SQL

IMDb vs RottenTomatoes Ratings with SQL Trendlines

Cohorted Engagement: 7-day and 30-day Metrics

Feature Analysis with Mean-Time-Between-Usage Calculation

Cohorted Engagement with Comparable Time Windows in SQL

Common Salesforce Reports In SQL

Calculating Distance Between Data Centers On A Globe

Converting MySQL Syntax and Functions into Redshift

Understanding Outliers with Skew and Kurtosis in SQL

Predicting Churn Using Hypothesis Testing

Calculating Money Growth using PL/pgSQL

Coalesce Down Using Correlated Subqueries

Getting Started With Regular Expressions

Analyzing Disparate Data With Segment Sources

Simple Lead Scoring With Enrichment

Impact or No? -- Simple Event Study in SQL Part I

Analyzing Your Email With SQL

Rolling Classifications by X Previous Aggregations

Managing A Six-Figure Ad Budget With SQL

Portfolio Risk Analysis In SQL

Bigram Frequencies In Pure SQL

String Parsing In SQL

Calculating Expected Value vs. Actual Results for Super Bowl Contenders

Binomial Options Pricing In Postgres

Finding Nearest Neighbors In SQL

Outlier Detection In SQL

Using CTEs and Unions to Compute Running Totals

How to Really Use SQL Views

Funnel Metrics - Step-over-Step Drop Rates

Multi-dimensional Clustering Using K-Means in Postgres SQL

One Dimensional Clustering In Postgres

Creating a Histogram in Redshift

Adding Summary Statistics to your SQL Table

SQL Joins For Data Analysis

Concatenating Rows in Redshift, Postgres, & MySQL

Marketing Attribution In SQL

Calculating Year-over-Year Metrics

Building The Periscope Cache With Amazon Redshift

Analyzing Your Olark History with SQL

Calculating Significance of A/B Tests in Redshift

Understanding Distributed Analytics Databases, Part 2 — Data Distribution

Understanding Distributed Analytics Databases, Part 1: Query Strategies

Cash Flow Modeling In SQL

Analyzing Your Salesforce Data With SQL

Solving the Traveling Salesman Problem with Postgres Recursive CTEs

Connecting to a Redshift Cluster with a Private IP

Setting up Your Redshift Cluster

Game Revenue Metrics — What Matters, and When

International Analytics In SQL

How to ETL Data into and out of Amazon Redshift

Building Your Own User Analytics System In SQL

Thinking Critically About Mobile Game Retention

Explain And Other Tools For Query Optimization

Understanding Website Response Time with SQL

Comparing IP Addresses In SQL

4 Reasons Not To Use MySQL For Analysis

The Top SQL Analysis Errors We See

10 Rules for a Better SQL Schema

Optimizing Query Management on Redshift

Getting the First Row per Group 5X Faster

Extrapolating Data with Day-of-Week Effects

Daily, Weekly and Monthly Charts On The Same Graph

Helpful Redshift Admin Queries

Analyzing Our Long Term Blog Metrics With SQL

Single Event Tables and Common Analysis Queries

Splitting Comma-Separated Values In MySQL

Redshift User Defined Functions in Python

SQL for Marketers — Your Most Profitable Channels

Understanding SQL's Null

The Lazy Analyst's Guide To Postgres JSON

Exact Row Counts for All Tables in MySQL and Postgres

$3000 Data Warehouse — Redshift vs. Postgres

Importing Data into Redshift from MySQL and Postgres

SQL for Marketers — 3 Ways to Slice Your Acquisition Data

Counting Comma-Delimited Values in Postgres, MySQL, Amazon Redshift and MS SQL Server.

Reuse Calculations in the Same Query with Lateral Joins

Demystifying Redshift — What's Up with My Disks?

Redshift Maintenance 101

SQL for Marketers — Making Where More Powerful

Aggregating into Strings without String_agg or Array_agg in Amazon Redshift

Changing Dist and Sort Keys on Giant Tables in Redshift

Rolling Averages in MySQL and SQL Server

Extrapolating to the End of the Month in SQL

SQL for Marketers Part 4 — Common Analysis Queries

Medians In SQL

Cross-Database Joins

SQL for Marketers Part 3 — Our Most Valuable Traffic Source

Day-Over-Day Changes Without Window Functions

SQL for Marketers Part 2 — Users over the Last 30 Days

Optimizing Your PSQL

Comparing Hourly Revenue In SQL

SQL For Marketers

Using Row Numbering And Full Joins To Transpose Wide Tables

Fun with Window Functions

Estimating Conversions With Poisson Distributions

Beyond Random() — Normal Distributions in SQL

Selecting Only One Row Per Group

Use Subqueries and Window Functions to Compute Running Averages

Computing Day-Over-Day Changes With Window Functions

Date Parts In SQL Server

Computing Day-Over-Day Changes With Window Functions

Using Self Joins To Calculate Your Retention, Churn, And Reactivation Metrics

How To Optimize Lifetime Distinct Counts Using Window Functions

Predicting Exponential Growth With SQL

How To Calculate Cohort Retention In SQL

Hashing Tables to Ensure Consistency in Postgres, Redshift and MySQL

Hyperloglog In Pure SQL

How to Format Numbers as Currency in Postgres, MySQL and Redshift

Generate Series in Redshift and MySQL

Faster Redshift Queries with Materialized Views — Lifetime Daily ARPU

How To Sample Rows in SQL 273X Faster

Counting Conditionally In SQL

How To Calculate Confidence Intervals In SQL

Use window functions for time-series percentage

4 Ways to Join Only The First Row in SQL

Count Distinct Compared on Top 4 SQL Databases

2X Your Redshift Speed With Sortkeys and Distkeys

Use generate_series to get continuous results

How To Speed Up Where Clauses By 3,000X

Use Subqueries To Count Distinct 50X Faster

Range joins give you accurate histories