Healthcare provider data and medical marketing lists

The CoLoCode (Co-Location Code)

The CoLoCode is a CarePrecise proprietary identifier of each location where medical services are performed, making it possible to link providers by their place of practice, despite variations in their self-reported address data. That makes a lot of things possible, including the operations described below.

CoLoCode creation is not based on geospacial data, i.e., latitude and longitide (although though CarePrecise offers geocoded addresses for all U.S. provier practice addresses). Nor is it based on the Placekey POI (Point of Interest) code, which is included with all CarePrecise products. Rather, the CoLoCode is calculated from the practice address listed on every healthcare provider record in the CarePrecise Complete (all HIPAA-covered U.S. healthcare providers), and the extended provider dataset, CarePrecise Advanced.

The CoLoCode is constructed using a proprietary algorithm that hyper-conforms practice location data for the 7.4 million US healthcare providers present in the federal NPI Registry. Going well beyond the US Postal Service's Pub 28 standard, the CoLoCode accounts for addresses that may have been entered very differently. For example, "2525 North 33rd Street, Suite 205" is understood to be the same location as "2525 N 33 #205," as well as many other variations.

The CoLoCode is a great asset when merging lists of individuals with lists of their companies, or de-duplicating databases, and is a key element of CarePrecise's Qo-Relate record-linkage processing.

 

 

People Working at the Same Locations: The CoLoCode Method

Here's how a query is structured that links all providers having the same CoLoCode, in a manner like this:

Click to view graphic full size...

Find practitioners working at the same location.


Think of the table on the far left as the "input" table, and the table on the far right as the table(s) that you're gathering the "output" from. The "Names tables" are shown in the example, but you can also include address, phone/fax, and other tables. If you're using CP ListMaker, you might use the tSelect_Matches table on the left, and perhaps the tNames and tAddresses tables on the right. But in every case, the two iterations of tCoLoCode are what makes the query work.

The "CoLoCode table" is used twice in the query; the first one is the CoLoCode for the provider you want to find co-workers for, and the second one is going to pull all of the NPI numbers for people (and companies) listing that location for their place of practice. A count of the workers having an organization's CoLoCode indicates the size of the organization.

This method has a wide range of applications. For example, it can be used to infer a healthcare worker's coworker network for a range of purposes. Companies who utilize CarePrecise data and tools for marketing can identify prospective customers in the cohort of existing customers' coworkers. The method has been used in contact tracing efforts to identify potentially infected workers within healthcare service populations.

 

Worker-to-Workplace Linkage

The same configuration works to locate the workplace (organization) name for a given worker. On the left side we specify Type 1 records (individuals), and on the right side we specify Type 2 records (organizations), to arrive at the "worker-to-workplace" connection, and a row that contains both the individual's record and the organization's record. Once again, the graphic shows only Names tables, but other tables would be included; such as address, phone/fax, and taxonomy/specialty/license.

Click to view graphic full size...

Worket to Workplace linkage diagram.

 

A Way to De-duplicate Healthcare Provider Records

There are three common issues in healthcare provider data that cause de-duplication of records to be challenging.

  1. In most cases involving federal healthcare provider data, the providers themselves supply the information. As with all data entered by people who are not primarily focused on data entry tasks, the result will be inconsistent information. For example, a practice address entered as "2345 West 39th" may appear on another record as 2345 W. 39 St," or a number of other common representations of the address, resisting deduplication efforts.
  2. In some provider datasets, an individual provider (Type 1 providers in the National Plan and Provider Enumeration System, known colloquially as the "NPI Registry") may also have a record for their personal business entity (a Type 2 record in the NPPES). For example, a physician will have his one and only one personal Type 1 record as required by law, and also a Type 2 record tied to their corporate or single-partner LLC business entity. And this is all within the same dataset.
  3. Also within the same dataset, a physician practice organization may have many Type 2 records, all belonging to the same business entity. In fact, a business entity is entitled to have as many Type 2 NPI registry records as they deem desirable for their operations.

The CarePrecise CoLoCode system hyper-conforms location data, going well beyond USPS Pub 28 standardization, to provide the key missing element in de-duplicating similar records.

 

Using the CoLoCode in Excel

If you have an Excel file containing a CoLoCode column, simply sort the file on the CoLoCode column. Where you see a cluster of the same CoLoCodes, these are records with the same practice location. If your Excel file also contains the EntityType field, then the record(s) of Type 2 identify the organization (workplace), and the records of Type 1 identify the individuals (workers). Like this:

Click to view graphic full size...

ColoCode to place practitioners at their organizations.

 

 

Considerations

The example above illustrates a characteristic of the CoLoCode. Notice that both a physician group and a hospital have been captured as a single workplace. This can happen when the postal addresses are too similar, and do not have standard secondary identifiers. To determine one from the other, use the hospital Taxonomy Codes (in the Taxo table) to identify the hospitals. The same logic applies if you're looking for clinical pharmacy or lab workers.

There's an upside here; the fact that so many labs, groups and pharmacies are co-located with hospitals makes it possible to roll up some of these allied organizations to their hospital connection, and by extension, the health systems they're part of (using the hospital-to-health system connections in the Authoritative Hospital Database).

 

Creating an Accurate NPI Record Rollup for a particular Type of Organization

The CoLoCode can be used as shown above to locate all NPI records for every physical location. As explained, more than one business entity may occupy a given postal address. For example, a clinical lab or pharmacy may be co-located within a hospital, but may be externally owned.

To create the most accurate results in a rollup of hospitals' NPI records ("NPI Rollup"), we want to limit the Type 2/Organization side of the query by including only the records of hospitals, and not the records of any other kinds of co-located business. This can done by using our facility type codes in the tTaxo table (providers' reported taxonomy codes) to include only NPI records that report one or more of the hospital facility types. The same can be done for pharmacies, laboratories, primary care clinics, physical therapy locations, and all across the healthcare spectrum. This is the method used to produce the Hospital Rollup packaged with the CarePrecise Authoritative Hospital Database.

In another scenario, we may want to identify records belonging to a physician practice which may be co-located within a hospital. While the CoLoCode will usually, by itself, contain adequate suite-level data to disambiguate this circumstance, in some cases it will not. By removing Type 2 records that are associated with hospital taxonomy codes, and using data in the tTaxoGroup table, we are able to pinpoint the records associated with the practice group. A number of CarePrecise clients use these tables, along with the CoLoCode table, in just this way.

 

CoLoCoding Beyond Healthcare Provider Data

The powerful CoLoCode feature is available only from CarePrecise. It is present in CarePrecise datasets wherever location data is available, (with the exception of the OtherPracticeLocations table at this time). But it can also be used on any dataset that contains street address field(s) (line 1 only, or lines 1 and 2), Zip Code® (5-digit or Zip+4) fields, and a unique ID field (such as the NPI). CarePrecise makes its proprietary CoLoCoding system available for a wide range of applications, such as developing business intelligence on health plans, and populating web applications with location-related services.