Submit New Annotation

Bystro annotates sequencing data rapidly using multiple databases and provides powerful filtering and natural-language search capabilities. Upload your VCF or SNP files to get comprehensive genomic annotations.

Getting Started

Bystro's submission process supports genomic annotation, data imputation, multi-omics integration, and merging multiple datasets. Follow these steps to submit your data:

Step 1: Start Your Submission

Click Start and choose your reference genome from the available options.

Step 2: Set Up Notifications (Optional)

Enter an email address to receive annotation progress updates and completion notifications. This helps you track long-running annotations.

Step 3: Upload Your Data

Upload genomic files (VCF/SNP format) and optionally add proteomics data (TMT or SomaScan). Bystro supports compressed files and can handle multiple files in a single submission.

Step 4: Configure Advanced Options

Optionally enable missing data imputation for SNP array data, add covariate information to link samples to phenotypes, or merge multiple chromosome files from the same subjects into a complete annotation.

Upload Methods

🖥️ Desktop Upload

Upload files directly from your computer. Best for smaller files and simple workflows. Features include drag-and-drop interface, progress tracking, and support for multiple files.

☁️ Amazon S3 Upload

Connect directly to your S3 buckets. Ideal for large files and cloud-based workflows. No file size limits, faster transfers for large datasets, and direct bucket browsing.

Advanced Features

🧬 Multi-Omics Integration

Upload both genomic and proteomics data in a single submission to create integrated analyses.

  • Proteomics formats: TMT (Tandem Mass Tags) and SomaScan data
  • Automatic linking: Bystro attempts to match samples between genomic and proteomic datasets
  • Covariate integration: Shared covariates are automatically identified when possible

🔍 Missing Data Imputation

Enhance SNP array datasets by imputing missing genotype data.

This feature fills in gaps in your genetic data using statistical methods, improving the completeness and analytical power of your dataset for downstream analyses.

🔗 Chromosome Merging

Combine multiple chromosome files from the same subjects into one complete annotation.

⚠️ Important: After selecting your chromosome files with checkboxes, you must click the "Merge" button. Simply checking the boxes does not initiate the merge!

Video Tutorials

Desktop Upload Walkthrough

Learn how to upload files directly from your computer and navigate the submission interface.

Amazon S3 Integration

Set up your AWS credentials and upload files directly from your S3 storage.

S3 Setup Requirements

  • AWS Access Key ID
  • AWS Secret Access Key
  • Appropriate bucket permissions for file access

Supported File Formats

Genomic Data

VCF files (.vcf, .vcf.gz), SNP format files, SNP array data, and compressed formats. Multiple files per submission supported.

Proteomics Data

TMT (Tandem Mass Tags) and SomaScan proteomics datasets. Can be linked to genomic data during submission for integrated analysis.

💡 Pro Tips

  • Large files: Use S3 upload for files over 1GB for better reliability
  • Multi-omics: Upload proteomics and genomics data together for integrated analysis
  • Email notifications: Highly recommended for large datasets that take time to process
  • Chromosome merging: Don't forget to click "Merge" after selecting your files!
  • Sample information: Add covariates during submission or afterward using the sample info feature

What Happens Next?

After submitting your files, Bystro will:

  • 1.Validate your files and check formatting
  • 2.Process advanced features like imputation, merging, or multi-omics integration if selected
  • 3.Queue your annotation for processing
  • 4.Annotate variants using comprehensive databases
  • 5.Generate integrated results available for download and analysis