Getting Started

Welcome! This guide will walk you through subscribing to CodeRx and accessing your pharmaceutical data.

Subscription

CodeRx is available as an annual subscription starting at $5,500/year. This gives you access to weekly-updated drug data marts hosted on AWS S3.

Payment

To subscribe, complete payment through our secure Stripe checkout:

Subscribe Now

Ready to get started? Complete your subscription through our secure Stripe checkout to access CodeRx's comprehensive drug database with weekly updates, complete RxNorm mappings, and rich drug knowledge graphs.

Subscribe Now →

Once your payment is confirmed, you'll receive:

AWS S3 Access Credentials - Access key ID and secret access key
S3 Bucket Information - Bucket name and region details
Connection Instructions - Step-by-step setup guide
Welcome Email - Additional resources and documentation links

Note: Access credentials are typically provided within 24 hours of payment confirmation.

Accessing Your Data

After receiving your credentials, you can access your data from AWS S3 using Python:

S3 Bucket Details

Bucket Name: coderx
Region: us-east-1
Format: CSV and Parquet files organized by data mart folder

Authentication

You'll receive:

Access Key ID: Your AWS access key
Secret Access Key: Your AWS secret key

Keep these credentials secure and never commit them to version control.

Querying Data with Python

Here's how to access and query CodeRx data using Python with s3fs:

Note: You'll need to install s3fs, pandas, and pyarrow to work with parquet files. Install them with:
pip install s3fs pandas pyarrow

import s3fs
import pandas as pd

# Create filesystem interface
fs = s3fs.S3FileSystem(
    key='YOUR_ACCESS_KEY_ID',
    secret='YOUR_SECRET_ACCESS_KEY'
)

# Read parquet file directly
df = pd.read_parquet(
    's3://coderx/drugs/drugs.parquet',
    filesystem=fs
)

# Filter and analyze
print(df.head())
print(f"Total drugs: {len(df)}")

Data Mart Structure

Your S3 bucket contains the following data marts, each organized in its own folder:

drugs/ - Drug products with names, RXCUIs, dose forms
packages/ - NDC packages with pricing and pack sizes
ingredients/ - Active and inactive ingredients
classes/ - Drug classification systems
excipients/ - Inactive ingredients with safety data
synonyms/ - Drug name synonyms and aliases

Each data mart folder contains:

Latest snapshot: {data_mart}/{data_mart}.csv or {data_mart}/{data_mart}.parquet (e.g., drugs/drugs.parquet)
Dated snapshots: {data_mart}/{data_mart}_YYYY-MM-DD.csv or {data_mart}/{data_mart}_YYYY-MM-DD.parquet (e.g., drugs/drugs_2026-01-16.parquet)

Files are updated weekly, with new dated snapshots added while the latest file is always updated to point to the most recent data.

Next Steps

Concepts - Learn about the data models
Tutorials - Step-by-step guides
Data Schema - Explore the database schema

Support

Need help? Contact us at support@coderx.io or visit our Slack community.

Subscription​

Payment​

Accessing Your Data​

S3 Bucket Details​

Authentication​

Querying Data with Python​

Data Mart Structure​

Next Steps​

Support​