Documentation Index
Fetch the complete documentation index at: https://docs.synthesize.bio/llms.txt
Use this file to discover all available pages before exploring further.
pysynthbio is a Python package for the Synthesize Bio API. It lets researchers easily access AI-generated transcriptomic data across modalities including bulk and single-cell RNA-seq.
To generate datasets without code, use the web platform.
Authentication
Get your API key
Visit the API keys page to generate a key. Click + Create API Key, then Create Key, and copy your key. There are several ways to makepysynthbio aware of your token.
- Interactive
- Environment variable
- Non-interactive
- System keyring
use_keyring=True, the token persists across sessions; with use_keyring=False, it’s only set for the current session. Keyring support is included by default in pysynthbio 2.2.1 and later.Available model types
Synthesize Bio provides several model types for different use cases.Baseline models
Generate synthetic gene expression data from metadata alone. Describe the biological conditions and the model generates realistic expression profiles.gem-1-bulk: Bulk RNA-seq baseline modelgem-1-sc: Single-cell RNA-seq baseline model
Reference conditioning models
Generate expression data conditioned on a real reference sample. This lets you anchor to an existing expression profile while applying perturbations or modifications.gem-1-bulk_reference-conditioning: Bulk RNA-seq reference conditioning modelgem-1-sc_reference-conditioning: Single-cell RNA-seq reference conditioning model
Metadata prediction models
Infer metadata from observed expression data. Given a gene expression profile, predict likely biological characteristics such as cell type, tissue, or disease state.gem-1-bulk_predict-metadata: Bulk RNA-seq metadata prediction modelgem-1-sc_predict-metadata: Single-cell RNA-seq metadata prediction model
Only baseline models are available to all users. Check programmatically with
list_models(). Contact support@synthesize.bio if you have questions.Listing available models
Exploring available metadata
Each model accepts a specific set of metadata fields with defined vocabularies (valid ontology IDs, cell lines, tissues, etc.). Browse and download these vocabularies at app.synthesize.bio/docs/vocab. See Available metadata for more details.Quick start
A minimal example using a baseline model:Security notes
- The API token provides full access to your Synthesize Bio account.
- When using
use_keyring=True, your token is stored securely in your system’s credential manager. - For production environments, prefer environment variables or a secrets management tool.
Cleanup
When you’re done using the API, you can clear the token from your environment:Rate limits
Free usage of Synthesize Bio is limited. If you exceed the limit, the API returns an error explaining it. For higher limits, contact support@synthesize.bio.Troubleshooting
Keychain access on Mac
If you get this error on macOS when usinguse_keyring=True: