Furkejuvvon:
Bibliográfalaš dieđut
Váldodahkkit: Guo, Fang, Huang
Materiálatiipa: Recurso digital
Giella:
Almmustuhtton: Zenodo 2025
Liŋkkat:https://doi.org/10.5281/zenodo.16949673
Fáddágilkorat: Lasit fáddágilkoriid
Eai fáddágilkorat, Lasit vuosttaš fáddágilkora!
Sisdoallologahallan:
  • <p># scGEN: Single-Cell Gene-Aware Embedded Network</p> <p>## Introduction</p> <p>Recent advancements in single-cell RNA sequencing have greatly enhanced our ability to dissect cellular heterogeneity. However, unsupervised clustering often struggles to identify transitional or developmental boundary cells, as existing methods rely on highly variable genes without considering expression levels, thereby overlooking subtle but crucial signals.</p> <p>To address this challenge, we developed **scGEN** (single-cell Gene-aware Embedded Network), which captures complex cellular relationships among cells. scGEN employs adaptive feature weighting and iterative fine-tuning to prioritize ambiguous or transitional cells with overlapping transcriptional profiles. </p> <p>### Key Features<br>- Adaptive feature weighting for better cell type identification<br>- Iterative fine-tuning to capture transitional cell states<br>- Superior performance on ambiguous cell classification<br>- Enhanced detection of subtle biological differences</p> <p>### Performance<br>Evaluation across eight distinct scRNA-seq datasets demonstrated that scGEN consistently outperformed nine leading clustering approaches. Additionally, scGEN refined the classification of ~10% ambiguous cells and uncovered biologically significant differences, providing a more comprehensive view of cellular heterogeneity in the human fetal pituitary than existing methods.</p> <p>## Installation</p> <p>```bash<br>git clone https://github.com/hurlab/scGEN.git<br>cd scGEN<br>```</p> <p>## Data Preparation</p> <p>scGEN accepts input data in `.mat` (MATLAB) format. You can convert your data to the required format using the provided `sv2mat.m` script in MATLAB.</p> <p>## Usage</p> <p>### Step-by-step workflow:</p> <p>1. **Select HVGs**: Use the `hvgs2csv.py` file in the scGEN directory to filter the normalized data with top 2000 highly variable genes.</p> <p>2. **Create .mat file**: Use the `csv2mat.m` file in the scGEN directory to create a `.mat` file in MATLAB.</p> <p>3. **Place your data**: Put your `.mat` file in the `dataset` folder under the scGEN directory.</p> <p>4. **Run scGEN**: Execute the main training script:<br>   ```bash<br>   python3 train.py<br>   ```</p> <p>### Data Download<br>You can download example datasets from: <a href="https://zenodo.org/records/16949673">https://zenodo.org/uploads/16949673</a></p> <p>## Hyperparameter Configuration</p> <p>scGEN utilizes two key hyperparameters:<br>- **α**: Balances the contributions of the Regularized ZINB loss and the structure-guided hard-sample contrastive loss functions<br>- **γ**: Adjusts the attention weight assigned to hard samples in the learning process</p> <p>### Best-performing Parameters by Dataset</p> <p>Based on extensive parameter sensitivity analyses (α: 0.01-100, γ: 1-5), the optimal parameters for benchmark datasets are:</p> <p>| Dataset      | γ (gamma) | α (alpha) |<br>|--------------|-----------|-----------|<br>| Bell         | 1         | 1         |<br>| hrvatin_B1   | 1         | 1         |<br>| hrvatin_B2   | 1         | 1         |<br>| pbmc3k       | 4         | 0.1       |<br>| Savas        | 4         | 0.1       |<br>| Scala        | 2         | 1         |<br>| Schwalbe     | 4         | 100       |<br>| zhang        | 4         | 10        |</p> <p>### Parameter Tuning Guidelines</p> <p>1. **Start with default parameters**: α=1, γ=1<br>2. **If results are unsatisfactory**:<br>   - Adjust γ for better hard-sample mining<br>   - Modify α based on dataset complexity</p> <p>## Output and Results</p> <p>The output file `result.csv` contains performance metrics (ACC, NMI, ARI, and F1 values) for each dataset across 20 runs, including the top two best-performing seeds with their average and standard deviation values.</p> <p>## Contact</p> <p>For questions or issues, please contact guokai8@gmail.com or open an issue on GitHub.</p>