// TOPIC

Data Scientist

Skills designed for data scientists to accelerate data analysis and model development workflows

Common Pain Points

Heavy workload in data cleaning and preprocessing

Feature engineering requires repeated experimentation and validation

Time-consuming model selection and hyperparameter tuning

Repetitive visualization chart creation

Difficult to manage and compare experiment results

Complex model interpretability analysis

Recommended Skills

Cocoindex

Comprehensive toolkit for developing with the CocoIndex library. Use when users need to create data transformation pipelines (flows), write custom functions, or operate flows via CLI or API. Covers building ETL workflows for AI data processing, including embedding documents into vector databases, building knowledge graphs, creating search indexes, or processing data streams with incremental updates.

16036
etl data-transformation vector-database embeddings knowledge-graph ai-pipeline incremental-processing real-time

Author: davila7

Dnanexus Integration

DNAnexus cloud genomics platform. Build apps/applets, manage data (upload/download), dxpy Python SDK, run workflows, FASTQ/BAM/VCF, for genomics pipeline development and execution.

16036
genomics bioinformatics cloud-computing dnanexus pipeline fastq bam vcf

Author: davila7

Excel Analysis

Analyze Excel spreadsheets, create pivot tables, generate charts, and perform data analysis. Use when analyzing Excel files, spreadsheets, tabular data, or .xlsx files.

16036
excel data-analysis spreadsheets pivot-tables charts pandas xlsx data-visualization

Author: davila7

Latchbio Integration

Latch platform for bioinformatics workflows. Build pipelines with Latch SDK, @workflow/@task decorators, deploy serverless workflows, LatchFile/LatchDir, Nextflow/Snakemake integration.

16036
bioinformatics workflow serverless python cloud-computing genomics data-pipeline docker

Author: davila7

Modal

Run Python code in the cloud with serverless containers, GPUs, and autoscaling. Use when deploying ML models, running batch processing jobs, scheduling compute-intensive tasks, or serving APIs that require GPU acceleration or dynamic scaling.

16036
serverless gpu-computing machine-learning batch-processing auto-scaling cloud-deployment python ml-inference

Author: davila7

Phoenix Observability

Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running evaluations on datasets, or monitoring production AI systems with real-time insights.

16036
observability tracing llm-monitoring evaluation debugging opentelemetry phoenix ai-ops

Author: davila7

Tensorboard

Visualize training metrics, debug models with histograms, compare experiments, visualize model graphs, and profile performance with TensorBoard - Google's ML visualization toolkit

16036
tensorboard ml-visualization training-metrics model-debugging pytorch tensorflow experiment-tracking performance-profiling

Author: davila7

Uv Package Manager

Master the uv package manager for fast Python dependency management, virtual environments, and modern Python project workflows. Use when setting up Python projects, managing dependencies, or optimizing Python development workflows with uv.

25450
python package-manager dependencies virtual-environment pip-alternative rust performance development

Author: wshobson

ML Model Debugger

4.6

Quickly diagnose machine learning model issues with detailed debugging info and fix suggestions

3800
280
Machine Learning Debugging Model Optimization TensorFlow PyTorch

Author: AI Tools

Feedback

Found an issue or have a better suggestion?

Submit Feedback