• About
  • Documentation

  • More Universes
  • Recent Updates
  • Leader board

  • All repositories
  • All packages
  • All articles
  • All datasets
  • All system Libraries
selcukorkmaz
  • Builds
  • Packages
  • Articles
  • Datasets
  • Contribution
  • Badges
  • API
  • Feed

Links toselcukorkmaz

MVN - Multivariate Normality Tests

A comprehensive suite for assessing multivariate normality using six statistical tests (Mardia, Henze–Zirkler, Henze–Wagner, Royston, Doornik–Hansen, Energy). Also includes univariate diagnostics, bivariate density visualization, robust outlier detection, power transformations (e.g., Box–Cox, Yeo–Johnson), and imputation strategies ("mean", "median", "mice") for handling missing data. Bootstrap resampling is supported for selected tests to improve p-value accuracy in small samples. Diagnostic plots are available via both 'ggplot2' and interactive 'plotly' visualizations. See Korkmaz et al. (2014) <https://journal.r-project.org/articles/RJ-2014-031/RJ-2014-031.pdf>.

Last updated

9.59 score 4 stars 5 dependents 1.3k scripts 12k downloads

fastml - Guarded Resampling Workflows for Safe and Automated Machine Learning in R

Provides a guarded resampling workflow for training and evaluating machine-learning models. When the guarded resampling path is used, preprocessing and model fitting are re-estimated within each resampling split to reduce leakage risk. Supports multiple resampling schemes, integrates with established engines in the 'tidymodels' ecosystem, and aims to improve evaluation reliability by coordinating preprocessing, fitting, and evaluation within supported workflows. Offers a lightweight AutoML-style workflow by automating model training, resampling, and tuning across multiple algorithms, while keeping evaluation design explicit and user-controlled.

Last updated

6.27 score 110 stars 13 scripts 547 downloads

PubChemR - Interface to the 'PubChem' Database for Chemical Data Retrieval

Provides an interface to the 'PubChem' database via the PUG REST <https://pubchem.ncbi.nlm.nih.gov/docs/pug-rest> and PUG View <https://pubchem.ncbi.nlm.nih.gov/docs/pug-view> services. This package allows users to automatically access chemical and biological data from 'PubChem', including compounds, substances, assays, and various other data types. Functions are available to retrieve data in different formats, perform searches, and access detailed annotations.

Last updated

5.90 score 4 stars 40 scripts 304 downloads

bioLeak - Leakage-Safe Modeling and Auditing for Genomic and Clinical Data

Prevents and detects information leakage in biomedical machine learning. Provides leakage-resistant split policies (subject-grouped, batch-blocked, study leave-out, time-ordered), guarded preprocessing (train-only imputation, normalization, filtering, feature selection), cross-validated fitting with common learners, permutation-gap auditing, batch and fold association tests, and duplicate detection.

Last updated

5.88 score 5 stars 16 scripts 589 downloads

splitGraph - Dataset Dependency Graphs for Leakage-Aware Evaluation

Represent biomedical dataset structure as typed dependency graphs so that sample provenance, repeated-measure structure, study design, batch effects, and temporal relationships are explicit and inspectable. Validates dataset structure, detects sample-level overlap, derives deterministic split constraints, and produces a tool-agnostic split specification for leakage-aware evaluation workflows.

Last updated

4.48 score 1 stars 5 scripts 456 downloads

rPDBapi - A Comprehensive Interface for Accessing the Protein Data Bank

Provides an R interface to the 'RCSB' Protein Data Bank ('PDB') Search and Data APIs (<https://www.rcsb.org/>). Supports full-text, attribute, sequence, motif, structure, and chemical searches; retrieval of entry-, assembly-, polymer-entity-, and chemical-component-level metadata; and conversion of API responses into analysis-ready tables and typed R objects for reproducible structural bioinformatics workflows.

Last updated

3.11 score 13 scripts 548 downloads

genSEIR - Predict Epidemic Curves with Generalized SEIR Modeling

Performs generalized Susceptible-Exposed-Infected-Recovered (SEIR) modeling to predict epidemic curves. The method is described in Peng et al. (2020) <doi:10.1101/2020.02.16.20023465>.

Last updated

1.70 score 239 downloads