Examples¶
This section provides practical examples of using AugChem for molecular data augmentation. Choose the appropriate section based on your data type and augmentation needs.
๐งช Quick Overview¶
AugChem supports two main types of molecular data augmentation:
- ๐ค SMILES Augmentation: String-based molecular representation augmentation
- ๐ Graph Augmentation: Graph neural network-ready molecular graph augmentation
๐ Available Example Collections¶
SMILES Examples¶
Comprehensive examples for SMILES-based molecular augmentation including:
- Basic SMILES manipulation techniques
- Dataset-level augmentation strategies
- Quality control and validation
- Real-world drug discovery applications
- Integration with cheminformatics workflows
Graph Examples¶
Detailed examples for graph-based molecular augmentation including: - PyTorch Geometric integration - Individual augmentation techniques - Machine learning pipeline integration - Comparative analysis and visualization - Advanced pharmaceutical applications
๐ Getting Started¶
If you're new to AugChem, we recommend:
- Start with Prerequisites: Install required packages
- Choose Your Data Type: SMILES strings or molecular graphs
- Follow Relevant Examples: Pick examples that match your use case
- Experiment: Modify parameters to suit your specific needs
Prerequisites¶
pip install augchem torch torch-geometric rdkit pandas matplotlib
Basic Usage Pattern¶
from augchem import Augmentator
# Initialize with reproducible seed
augmentator = Augmentator(seed=42)
# For SMILES data
smiles_result = augmentator.SMILES.augment_data(
dataset="your_data.csv",
augmentation_methods=["fusion", "enumeration"],
augment_percentage=0.5
)
# For Graph data (when available)
# graph_result = augmentator.Graph.augment_dataset(...)
๐ฏ Example Categories¶
Beginner Examples¶
- Basic augmentation setup
- Single molecule processing
- Simple dataset expansion
Intermediate Examples¶
- Parameter optimization
- Quality control implementation
- Integration with ML pipelines
Advanced Examples¶
- Custom augmentation strategies
- Large-scale processing
- Research-grade applications
๐ก Tips for Using Examples¶
- Modify Parameters: Adjust augmentation rates based on your data
- Validate Results: Always check output quality
- Set Seeds: Use random seeds for reproducible experiments
- Start Small: Test with small datasets first
- Monitor Performance: Track augmentation impact on model performance
๐ฌ Real-World Applications¶
Our examples cover scenarios from: - Academic Research: Dataset expansion for publications - Drug Discovery: Virtual compound generation - Chemical Informatics: Property prediction enhancement - Materials Science: Novel structure exploration
๐ Additional Resources¶
- SMILES Tutorial - Step-by-step learning guide
- Graph Tutorial - Comprehensive graph augmentation guide
- API Reference - Complete function documentation
Ready to augment your molecular data? Choose your examples and start exploring! ๐งฌโจ