Getting Started
Welcome! This guide will walk you through subscribing to CodeRx and accessing your pharmaceutical data.
Subscription
CodeRx is available as an annual subscription starting at $5,500/year. This gives you access to weekly-updated drug data marts hosted on AWS S3.
Payment
To subscribe, complete payment through our secure Stripe checkout:
Ready to get started? Complete your subscription through our secure Stripe checkout to access CodeRx's comprehensive drug database with weekly updates, complete RxNorm mappings, and rich drug knowledge graphs.
Once your payment is confirmed, you'll receive:
- AWS S3 Access Credentials - Access key ID and secret access key
- S3 Bucket Information - Bucket name and region details
- Connection Instructions - Step-by-step setup guide
- Welcome Email - Additional resources and documentation links
Note: Access credentials are typically provided within 24 hours of payment confirmation.
Accessing Your Data
After receiving your credentials, you can access your data from AWS S3 using Python:
S3 Bucket Details
- Bucket Name:
coderx - Region:
us-east-1 - Format: CSV and Parquet files organized by data mart folder
Authentication
You'll receive:
- Access Key ID: Your AWS access key
- Secret Access Key: Your AWS secret key
Keep these credentials secure and never commit them to version control.
Querying Data with Python
Here's how to access and query CodeRx data using Python with s3fs:
Note: You'll need to install
s3fs,pandas, andpyarrowto work with parquet files. Install them with:pip install s3fs pandas pyarrow
import s3fs
import pandas as pd
# Create filesystem interface
fs = s3fs.S3FileSystem(
key='YOUR_ACCESS_KEY_ID',
secret='YOUR_SECRET_ACCESS_KEY'
)
# Read parquet file directly
df = pd.read_parquet(
's3://coderx/drugs/drugs.parquet',
filesystem=fs
)
# Filter and analyze
print(df.head())
print(f"Total drugs: {len(df)}")
Data Mart Structure
Your S3 bucket contains the following data marts, each organized in its own folder:
- drugs/ - Drug products with names, RXCUIs, dose forms
- packages/ - NDC packages with pricing and pack sizes
- ingredients/ - Active and inactive ingredients
- classes/ - Drug classification systems
- excipients/ - Inactive ingredients with safety data
- synonyms/ - Drug name synonyms and aliases
Each data mart folder contains:
- Latest snapshot:
{data_mart}/{data_mart}.csvor{data_mart}/{data_mart}.parquet(e.g.,drugs/drugs.parquet) - Dated snapshots:
{data_mart}/{data_mart}_YYYY-MM-DD.csvor{data_mart}/{data_mart}_YYYY-MM-DD.parquet(e.g.,drugs/drugs_2026-01-16.parquet)
Files are updated weekly, with new dated snapshots added while the latest file is always updated to point to the most recent data.
Next Steps
- Concepts - Learn about the data models
- Tutorials - Step-by-step guides
- Data Schema - Explore the database schema
Support
Need help? Contact us at support@coderx.io or visit our Slack community.