Agent Skills
Data-analysisStatistical

Outlier Detection & Handling

AIPOCH-AI

Scan the dataset, identify outliers based on statistical rules (e.g., 3-Sigma, IQR), and provide processing suggestions (reasons for removal or retention)

65
1
FILES
outlier-detection-handler/
skill.md
scripts
main.py
test_data.txt

SKILL.md

Outlier Detection & Handling

Identify and manage statistical outliers.

Use Cases

  • Data quality control
  • Pre-analysis screening
  • Regulatory compliance (FDA data integrity)

Parameters

ParameterTypeRequiredDefaultDescription
datastrYes-Path to dataset file (CSV/Excel)
methodstrNo"3-sigma"Detection method: "3-sigma", "IQR", or "Grubbs"
actionstrNo"flag"Handling action: "flag", "remove", or "winsorize"

Returns

  • Outlier flagging with method details
  • Handling recommendations
  • Documentation for regulatory submission

Example

Input: Biomarker measurements from 200 patients Output: 5 outliers identified (2.5%), recommended action: investigate then winsorize

Risk Assessment

Risk IndicatorAssessmentLevel
Code ExecutionPython/R scripts executed locallyMedium
Network AccessNo external API callsLow
File System AccessRead input files, write output filesMedium
Instruction TamperingStandard prompt guidelinesLow
Data ExposureOutput files saved to workspaceLow

Security Checklist

  • No hardcoded credentials or API keys
  • No unauthorized file system access (../)
  • Output does not expose sensitive information
  • Prompt injection protections in place
  • Input file paths validated (no ../ traversal)
  • Output directory restricted to workspace
  • Script execution in sandboxed environment
  • Error messages sanitized (no stack traces exposed)
  • Dependencies audited

Prerequisites

# Python dependencies
pip install -r requirements.txt

Evaluation Criteria

Success Metrics

  • Successfully executes main functionality
  • Output meets quality standards
  • Handles edge cases gracefully
  • Performance is acceptable

Test Cases

  1. Basic Functionality: Standard input → Expected output
  2. Edge Case: Invalid input → Graceful error handling
  3. Performance: Large dataset → Acceptable processing time

Lifecycle Status

  • Current Stage: Draft
  • Next Review Date: 2026-03-06
  • Known Issues: None
  • Planned Improvements:
    • Performance optimization
    • Additional feature support