layout | title | nav |
---|---|---|
default |
Query Language |
queries |
The query language is based on Socrata's SoQL language. If you're already comfortable with the basics you can learn how to construct more advanced queries by familiarizing yourself with all the HMDA fields on the field reference page, or by using the metadata endpoints on the interactive console. Otherwise, we have organized this page into two sections to familiarize yourself:
Querying data - An overview of the clauses you can use to specify what you want and do not want returned in a slice. If our API was the barista at your favorite coffee shop, it could handle even the most persnickety of drink orders.
Summarizing or aggregating data - Once you have the data you need to work with, our API can crunch all the numbers for you. This section explains the ins and outs of how to gain more insight into large and overwhelming data.
Use these clauses to filter through millions of records to get only the data you want in a matter of seconds. See the API in action on the explore the data section of our HMDA page.
Clause | What it does |
---|---|
$select |
Specifies which fields to return, separated by commas. If not specified, all columns will be returned. Use this to aggregate or summarize data. A list of all fields available is here or at the concept endpoint. |
$where |
Filter the results using SQL WHERE query syntax. If not specified, the results will not be filtered. Learn more. |
$orderBy |
Fields to order by, separated by commas, or ASC and DESC . If not specified, the order will be consistent, but unspecified. Learn more. |
$group |
Fields to group by, separated by commas. Learn more. |
$limit |
Maximum number of results to return. If not specified, defaults to 100 . Enter 0 for no limit. |
$offset |
Number of records to skip. If not specified, defaults to 0 . |
$callback |
The name of the JavaScript callback to invoke in a query. Only used with JSONP. |
This clause supports a mini-language for writing queries. It's a subset of SQL WHERE
clauses, with the addition of function
support. A $where
clause is made up of one or more comparisons, joined by boolean operators.
Operator | What it means | Example |
---|---|---|
= |
equality | name = "Phillip" |
!= |
inequality | state != "Alaska" |
> |
greater than | age > 18 |
>= |
greater than or equal | square_miles >= 1000 |
< |
less than | age < 65 |
<= |
less than or equal | square_miles <= 1000 |
LIKE |
matches strings | name LIKE "Pete%" would match "Pete, "Peter," or anything that starts with "Pete" |
ILIKE |
matches case-insensitive strings | name ILIKE "%rick" would match "Rick" as well as "Yorick," "Harrick," or anything else with "rick" in it |
IS NULL |
existence of a value | city is NULL |
IS NOT NULL |
non-existence of a value | city IS NOT NULL |
Operator | What it means | Example |
---|---|---|
AND |
logical AND of two comparisons | state = "Alaska" AND age > 18 |
OR |
logical OR of two comparisons | state = "Alaksa" OR state = "Hawaii" |
NOT |
negation of a comparison | NOT (state = "Alaska" OR state = "Hawaii" |
() |
grouping or order of operations | (state = "Alaska" OR state = "Hawaii") AND age > 18 |
In addition to powerful data filtering, you can also aggregate data, as seen in the summary tables section of our HMDA page. The three clauses used in these queries are:
This clause takes a list of the fields you want returned, separated by commas. Unlike the SQL version of SELECT
, it does not allow for AS
aliasing, so:
Right: state_abbr
, county
Wrong: state_abbr AS state
, county
This clause determines the order of the results returned. It takes a list of columns, separated by commas, with an optional suffix of ASC
or DESC
to indicate that you want the data in ascending or descending order. For example, if you wanted to see loan applications ordered by gender, and age in descending order, you would use:
$orderBy = gender, age DESC
This clause lists the fields you want to group results by. For example, if you wanted to see loan applications organized by denial reason, you would use:
$group = denial_reason
$group
usually requires a $select
clause, where you need to specify the fields you are grouping on, as well as any aggregations you want.
Aggregations are functions run on fields of grouped data in order to reduce them to an associated value. All functions are called with the name of the field to aggregate in parentheses. For example, if you wanted to know the total number of owner-occupied properties per metro area in HMDA, you would use:
$select: msamd, SUM(numer_of_owner_occupied_units)
$group: msamd
Function | What it does |
---|---|
SUM |
Totals the values |
MIN |
Returns the minimum value in the results |
MAX |
Returns the maximum value in the results |
COUNT |
Returns the number of rows in the set, i.e., the number of results. You can use COUNT for any field |
When summarizing and aggregating data, take care to know the order in which the clauses and names of the aggregated fields are applied.
$where
is applied first in order to reduce the amount of data being aggregated- The data is then aggregated using
$group
and$select
clauses $orderBy
$limit
,$offset
Then, when you want to use aggregated fields for grouping, name them according to the function you used following this format:
function_field_name
and
sum_number_of_owner_occupied_units
You're ready to start building your own queries, and bring HMDA data into your own apps and tools!
Get familiar with the information contained in HMDA LAR data on the field reference page, and head on over to API calls to test out your queries in the console.