Skip to main content

Getting Started

Welcome! This guide will walk you through subscribing to CodeRx and accessing your pharmaceutical data.

Subscription

CodeRx is available as an annual subscription starting at $5,500/year. This gives you access to weekly-updated drug data marts hosted on AWS S3.

Payment

To subscribe, complete payment through our secure Stripe checkout:

Subscribe Now

Ready to get started? Complete your subscription through our secure Stripe checkout to access CodeRx's comprehensive drug database with weekly updates, complete RxNorm mappings, and rich drug knowledge graphs.

Subscribe Now →

Once your payment is confirmed, you'll receive:

  • AWS S3 Access Credentials - Access key ID and secret access key
  • S3 Bucket Information - Bucket name and region details
  • Connection Instructions - Step-by-step setup guide
  • Welcome Email - Additional resources and documentation links

Note: Access credentials are typically provided within 24 hours of payment confirmation.

Accessing Your Data

After receiving your credentials, you can access your data from AWS S3 using Python:

S3 Bucket Details

  • Bucket Name: coderx
  • Region: us-east-1
  • Format: CSV and Parquet files organized by data mart folder

Authentication

You'll receive:

  • Access Key ID: Your AWS access key
  • Secret Access Key: Your AWS secret key

Keep these credentials secure and never commit them to version control.

Querying Data with Python

Here's how to access and query CodeRx data using Python with s3fs:

Note: You'll need to install s3fs, pandas, and pyarrow to work with parquet files. Install them with:

pip install s3fs pandas pyarrow
import s3fs
import pandas as pd

# Create filesystem interface
fs = s3fs.S3FileSystem(
key='YOUR_ACCESS_KEY_ID',
secret='YOUR_SECRET_ACCESS_KEY'
)

# Read parquet file directly
df = pd.read_parquet(
's3://coderx/drugs/drugs.parquet',
filesystem=fs
)

# Filter and analyze
print(df.head())
print(f"Total drugs: {len(df)}")

Data Mart Structure

Your S3 bucket contains the following data marts, each organized in its own folder:

  • drugs/ - Drug products with names, RXCUIs, dose forms
  • packages/ - NDC packages with pricing and pack sizes
  • ingredients/ - Active and inactive ingredients
  • classes/ - Drug classification systems
  • excipients/ - Inactive ingredients with safety data
  • synonyms/ - Drug name synonyms and aliases

Each data mart folder contains:

  • Latest snapshot: {data_mart}/{data_mart}.csv or {data_mart}/{data_mart}.parquet (e.g., drugs/drugs.parquet)
  • Dated snapshots: {data_mart}/{data_mart}_YYYY-MM-DD.csv or {data_mart}/{data_mart}_YYYY-MM-DD.parquet (e.g., drugs/drugs_2026-01-16.parquet)

Files are updated weekly, with new dated snapshots added while the latest file is always updated to point to the most recent data.

Next Steps

Support

Need help? Contact us at support@coderx.io or visit our Slack community.