Submit New Annotation
Bystro annotates sequencing data rapidly using multiple databases and provides powerful filtering and natural-language search capabilities. Upload your VCF or SNP files to get comprehensive genomic annotations.
Getting Started
Bystro's submission process supports genomic annotation, data imputation, multi-omics integration, and merging multiple datasets. Follow these steps to submit your data:
Step 1: Start Your Submission
Click Start and choose your reference genome from the available options.
Step 2: Set Up Notifications (Optional)
Enter an email address to receive annotation progress updates and completion notifications. This helps you track long-running annotations.
Step 3: Upload Your Data
Upload genomic files (VCF/SNP format) and optionally add proteomics data (TMT or SomaScan). Bystro supports compressed files and can handle multiple files in a single submission.
Step 4: Configure Advanced Options
Optionally enable missing data imputation for SNP array data, add covariate information to link samples to phenotypes, or merge multiple chromosome files from the same subjects into a complete annotation.
Upload Methods
🖥️ Desktop Upload
Upload files directly from your computer. Best for smaller files and simple workflows. Features include drag-and-drop interface, progress tracking, and support for multiple files.
☁️ Amazon S3 Upload
Connect directly to your S3 buckets. Ideal for large files and cloud-based workflows. No file size limits, faster transfers for large datasets, and direct bucket browsing.
Advanced Features
🧬 Multi-Omics Integration
Upload both genomic and proteomics data in a single submission to create integrated analyses.
- ▶Proteomics formats: TMT (Tandem Mass Tags) and SomaScan data
- ▶Automatic linking: Bystro attempts to match samples between genomic and proteomic datasets
- ▶Covariate integration: Shared covariates are automatically identified when possible
🔍 Missing Data Imputation
Enhance SNP array datasets by imputing missing genotype data.
This feature fills in gaps in your genetic data using statistical methods, improving the completeness and analytical power of your dataset for downstream analyses.
🔗 Chromosome Merging
Combine multiple chromosome files from the same subjects into one complete annotation.
⚠️ Important: After selecting your chromosome files with checkboxes, you must click the "Merge" button. Simply checking the boxes does not initiate the merge!
Video Tutorials
Desktop Upload Walkthrough
Learn how to upload files directly from your computer and navigate the submission interface.
Amazon S3 Integration
Set up your AWS credentials and upload files directly from your S3 storage.
S3 Setup Requirements
- ▶AWS Access Key ID
- ▶AWS Secret Access Key
- ▶Appropriate bucket permissions for file access
Supported File Formats
Genomic Data
VCF files (.vcf, .vcf.gz), SNP format files, SNP array data, and compressed formats. Multiple files per submission supported.
Proteomics Data
TMT (Tandem Mass Tags) and SomaScan proteomics datasets. Can be linked to genomic data during submission for integrated analysis.
💡 Pro Tips
- ▶Large files: Use S3 upload for files over 1GB for better reliability
- ▶Multi-omics: Upload proteomics and genomics data together for integrated analysis
- ▶Email notifications: Highly recommended for large datasets that take time to process
- ▶Chromosome merging: Don't forget to click "Merge" after selecting your files!
- ▶Sample information: Add covariates during submission or afterward using the sample info feature
What Happens Next?
After submitting your files, Bystro will:
- 1.Validate your files and check formatting
- 2.Process advanced features like imputation, merging, or multi-omics integration if selected
- 3.Queue your annotation for processing
- 4.Annotate variants using comprehensive databases
- 5.Generate integrated results available for download and analysis