help@rskworld.in +91 93305 39277
RSK World
  • Home
  • Development
    • Web Development
    • Mobile Apps
    • Software
    • Games
    • Project
  • Technologies
    • Data Science
    • AI Development
    • Cloud Development
    • Blockchain
    • Cyber Security
    • Dev Tools
    • Testing Tools
  • About
  • Contact

Theme Settings

Color Scheme
Display Options
Font Size
100%

Healthcare Patients Dataset

Comprehensive Healthcare Patients dataset with medical records, diagnostic information, treatment outcomes, and 26 features for building healthcare analytics models. Includes patient demographics, medical history, diagnostic tests, treatment plans, Python scripts for analysis, machine learning models, and visualization tools. Perfect for healthcare analytics, disease prediction, medical research, and predictive modeling.

Medical Analytics 26 Features Treatment Outcomes Download Machine Learning Python Scripts Diagnostic Tests HIPAA Compliant
Download Free Source Code Live Demo RSK View Files
Healthcare Patients Dataset - RSK World
Healthcare Patients Dataset - RSK World
Medical Analytics 26 Features Treatment Outcomes Machine Learning Python Healthcare Analytics

This project features a comprehensive Healthcare Patients dataset designed for professional medical analytics, disease prediction, and healthcare research. The dataset includes 30 patient records with 26 features including patient demographics, medical history, diagnostic tests, treatment plans, treatment outcomes, length of stay, readmission status, and medical charges. Includes powerful Python scripts: analyze_patients.py for data analysis, advanced_analysis.py for machine learning models, export_to_excel.py for Excel export, export_to_json.py for JSON export, and validate_data.py for data validation. Also includes multiple visualization scripts and interactive demo website. The package includes interactive demo website, comprehensive README.md, and MIT License. Perfect for data scientists, researchers, students, and developers working on healthcare analytics, disease prediction, medical research, and predictive modeling projects.

If you find this Healthcare Patients Dataset useful, you can support with a small contribution.

Secure Fast Trusted
Pay via UPI QR
Scan or tap an amount to auto-generate
UPI QR
₹
Open UPI app
GPay PhonePe Paytm
Download Free Source Code

Dataset Overview

Complete healthcare patients dataset with medical records, diagnostic information, treatment outcomes, and 26 features for healthcare analytics.

  • Patient records with comprehensive medical features
  • Medical history (diagnosis, comorbidities, medications)
  • Diagnostic tests (lab tests with results)
  • 30 patient records with 26 features
  • Treatment outcomes (Recovered/Improved/Stable)
  • Length of stay and readmission status
  • Medical charges and insurance information
  • Ready for machine learning and predictive modeling
  • HIPAA compliant format
  • Multiple analysis scripts included
  • Perfect for healthcare analytics & medical research

Dataset Structure & Files

Well-organized project structure with patient data, Python scripts for analysis and machine learning, visualization tools, and interactive demo.

  • healthcare_patients.csv - Main dataset file (30 patients, 26 features)
  • healthcare_patients.xlsx - Excel format with 6 sheets (Data, Statistics, Diagnosis, Treatment, Demographics, Outcomes)
  • healthcare_patients.json - JSON export with metadata
  • analyze_patients.py - Comprehensive data analysis script with statistics and visualizations
  • advanced_analysis.py - Machine learning analysis with Random Forest models and statistical tests
  • export_to_excel.py - Excel export utility with multiple sheets
  • export_to_json.py - JSON export utility with structured data
  • validate_data.py - Data validation and quality checks script
  • index.html - Interactive demo website with Chart.js visualizations
  • README.md - Comprehensive project documentation
  • QUICKSTART.md - Quick start guide for beginners
  • PROJECT_SUMMARY.md - Complete project overview and status
  • ADVANCED_FEATURES.md - Advanced features documentation
  • FILES_SUMMARY.md - Detailed file descriptions
  • PROJECT_INFO.md - Project metadata and information
  • RELEASE_NOTES.md - Version history and updates
  • IMAGE_DESCRIPTION.md - Image requirements documentation
  • requirements.txt - Python dependencies (pandas, numpy, scikit-learn, matplotlib, seaborn, scipy)
  • LICENSE - MIT License file
  • .gitignore - Git ignore configuration
  • visualizations/ - Directory with 15+ charts (correlation matrix, feature importance, predictions, etc.)
  • advanced_analysis_report.txt - Generated ML analysis report
  • Consistent naming convention across all files
  • Easy to load with pandas (pd.read_csv)
  • Scikit-learn ready format for ML models
  • Multiple export formats (CSV, Excel, JSON)
  • HIPAA compliant data structure

Healthcare Analytics & Machine Learning

Complete analysis pipeline with support for healthcare analytics, machine learning models, and predictive modeling.

  • Random Forest Classifier - 30-day readmission prediction model
  • Random Forest Regressor - Medical charges prediction model
  • Model performance metrics - Accuracy, precision, recall, F1-score for classification
  • Regression metrics - R² score, RMSE (Root Mean Squared Error) for predictions
  • Feature importance analysis - Identify key factors affecting outcomes
  • Statistical analysis - T-tests for group comparisons (charges by gender, stay by outcome)
  • Chi-square tests - Categorical relationship analysis (outcome vs treatment type)
  • P-value calculations - Statistical significance testing
  • Correlation matrix analysis - Identify relationships between numerical features
  • Strong correlation detection - Features with |r| > 0.3 identified
  • Classification reports - Detailed performance breakdown by class
  • Confusion matrix visualization - Model performance visualization
  • Scikit-learn integration - Ready-to-use ML pipeline
  • Model training and validation - Train/test split methodology
  • Healthcare analytics utilities - Custom functions for medical data analysis
  • Diagnostic test analysis - Lab test result interpretation
  • Patient outcome patterns - Recovered/Improved/Stable analysis
  • Readmission risk factors - Identify high-risk patient characteristics
  • Cost prediction models - Predict medical charges accurately

Multiple File Formats

Dataset available in multiple formats for maximum compatibility with different data science tools and ML frameworks.

  • CSV format - healthcare_patients.csv (comma-separated values, UTF-8 encoded)
  • Excel format - healthcare_patients.xlsx with 6 organized sheets
  • Excel Sheets - Data, Statistics, Diagnosis Summary, Treatment Summary, Demographics, Outcome Analysis
  • JSON format - healthcare_patients.json with structured metadata
  • JSON Structure - Nested format with metadata, statistics, and patient records
  • Pandas DataFrame ready - Direct loading with pd.read_csv() or pd.read_excel()
  • NumPy array compatible - Easy conversion to numpy arrays for ML
  • Scikit-learn compatible - Ready for train_test_split, preprocessing, models
  • XGBoost/LightGBM ready - Compatible with gradient boosting frameworks
  • TensorFlow/PyTorch ready - Can be converted for deep learning models
  • Standard data science formats - Widely supported formats
  • Easy to import and process - One-line data loading
  • Compatible with all ML libraries - Universal format support
  • Jupyter Notebook ready - Perfect for interactive analysis
  • Python pandas ready - Native pandas support
  • R compatible - Can be imported into R for analysis
  • SQL import ready - Can be imported into databases
  • API integration ready - JSON format for web services
  • Excel formulas and charts - Use Excel for quick analysis
  • Data validation support - Easy to validate and clean data

Analysis & Visualization

Comprehensive analysis tools with visualization capabilities and interactive patient data explorer.

  • Interactive Patient Data Explorer - Chart.js powered dashboard
  • 15+ Visualization Charts - age_distribution, gender_distribution, top_diagnoses, length_of_stay
  • Charges distribution charts - Visualize medical cost patterns
  • Outcome distribution charts - Treatment outcome analysis (Recovered/Improved/Stable)
  • Correlation matrix heatmap - Feature relationship visualization
  • Feature importance chart - ML model feature rankings
  • Charges prediction scatter plot - Predicted vs actual charges comparison
  • Age vs Charges by Outcome - Multi-colored scatter visualization
  • Charges by Treatment Type - Box plots for cost analysis
  • Length of Stay by Outcome - Box plots for duration analysis
  • Charges heatmap - Diagnosis vs Treatment Type matrix
  • Monthly admissions time series - Trend analysis over time
  • Treatment effectiveness visualization - Outcome by treatment comparison
  • Dataset statistics - Comprehensive summary statistics
  • Interactive Chart.js charts - Doughnut, Bar, and Line charts
  • Real-time chart updates - Dynamic filtering and visualization
  • Diagnosis and treatment filtering - Filter data by medical conditions
  • Performance benchmarking - Model evaluation and comparison
  • Model evaluation metrics - Classification and regression metrics
  • Export visualization data - Download chart data in multiple formats
  • Responsive design - Works on desktop, tablet, and mobile devices

Compatible Frameworks

Works with all major data science and analytics frameworks out of the box.

  • Scikit-learn ML library - Classification, regression, clustering, preprocessing
  • Random Forest models - Ensemble learning for predictions
  • Gradient Boosting - XGBoost, LightGBM, CatBoost support
  • Deep Learning - TensorFlow, PyTorch, Keras compatibility
  • Healthcare predictive models - Disease prediction, readmission risk, cost forecasting
  • NumPy numerical computing - Array operations and mathematical functions
  • pandas data manipulation - DataFrames, grouping, filtering, merging
  • matplotlib visualization - Static charts and graphs
  • seaborn statistical visualization - Advanced statistical plots
  • Statistical analysis (scipy) - T-tests, chi-square, correlation, ANOVA
  • Plotly interactive charts - Dynamic and interactive visualizations
  • Jupyter Notebook support - Interactive data science environment
  • Google Colab ready - Works in cloud-based notebooks
  • VS Code integration - Python extension support
  • PyCharm compatible - Full IDE support
  • Healthcare analytics models - Custom ML models for medical data
  • Medical research tools - Statistical analysis for research papers
  • Tableau/Power BI ready - Can be imported for business intelligence
  • R/RStudio compatible - Cross-platform data science
  • SQL databases - PostgreSQL, MySQL, SQLite import support

What You Get

Complete package with all files needed for professional healthcare analytics and medical research projects.

  • 30 Patient records with comprehensive medical data (26 features per patient)
  • 5 Python analysis scripts - analyze_patients.py, advanced_analysis.py, export_to_excel.py, export_to_json.py, validate_data.py
  • analyze_patients.py - Comprehensive data analysis with statistics and 8+ visualizations
  • advanced_analysis.py - Machine learning models (Random Forest), statistical tests, correlation analysis
  • export_to_excel.py - Excel export with 6 organized sheets and formatting
  • export_to_json.py - JSON export with structured metadata and statistics
  • validate_data.py - Data quality checks, validation, and data integrity verification
  • healthcare_patients.csv - Main dataset file (30 patients, 26 features, UTF-8 encoded)
  • healthcare_patients.xlsx - Excel format with 6 sheets (Data, Statistics, Diagnosis, Treatment, Demographics, Outcomes)
  • healthcare_patients.json - JSON format with nested structure and metadata
  • index.html - Interactive demo website with Chart.js visualizations
  • 15+ Visualization charts - Correlation matrix, feature importance, predictions, distributions, heatmaps
  • Complete documentation - README.md, QUICKSTART.md, PROJECT_SUMMARY.md, ADVANCED_FEATURES.md
  • 8 Documentation files - Comprehensive guides and project information
  • requirements.txt - All Python dependencies listed and versioned
  • LICENSE - MIT License (free for commercial and non-commercial use)
  • advanced_analysis_report.txt - Generated ML analysis report
  • visualizations/ directory - All generated charts and graphs
  • HIPAA compliant data structure - De-identified patient information
  • Ready-to-use code examples - Copy and run scripts immediately

Interactive Demo Website

Beautiful demo website with patient data explorer, healthcare analytics dashboard, and comprehensive guide.

  • Modern animated design - Smooth transitions and visual effects
  • Interactive Patient Data Explorer - Filter and search patient records
  • Healthcare Analytics Dashboard - Real-time statistics and metrics
  • Chart.js Integration - Interactive charts (Doughnut, Bar, Line charts)
  • Treatment Outcome Metrics - Visual representation of recovery rates
  • Filter by diagnosis and treatment - Dynamic data filtering
  • Patient feature visualization - Age, gender, charges, outcomes visualization
  • Outcome distribution charts - Recovered/Improved/Stable breakdown
  • Dataset statistics display - Total patients, features, date ranges
  • Real-time chart updates - Charts update when filters change
  • Step-by-step usage guide - Comprehensive instructions
  • Dark theme with gradients - Modern, professional appearance
  • Fully responsive layout - Mobile, tablet, and desktop support
  • Data export options - Download CSV, Excel, JSON formats
  • Python scripts download - Access to all analysis scripts
  • Interactive filters - Filter by outcome, diagnosis, treatment type
  • Patient detail view - Individual patient record display
  • Statistics summary - Quick overview of dataset metrics
  • No backend required - Pure HTML, CSS, JavaScript
  • Cross-browser compatible - Works on Chrome, Firefox, Safari, Edge

Python Scripts Included

Professional Python scripts for healthcare analysis, machine learning, and data visualization.

  • analyze_patients.py - Comprehensive data analysis with statistics, visualizations, and reports
  • advanced_analysis.py - Machine learning models (Random Forest), statistical tests, correlation analysis
  • export_to_excel.py - Excel export utility with multiple sheets, formatting, and styling
  • export_to_json.py - JSON export utility with structured data and metadata
  • validate_data.py - Dataset validation, quality checks, and data integrity verification
  • Statistical analysis integration - T-tests, chi-square tests, correlation, p-values
  • Healthcare analytics utilities - Custom functions for medical data processing
  • Dataset verification - Data type checking, missing value detection, range validation
  • Batch processing support - Process multiple files or datasets efficiently
  • ML model utilities - Model training, evaluation, and prediction functions
  • Visualization generation - Automatic chart creation and saving
  • Report generation - Text-based analysis reports with insights
  • Error handling - Comprehensive error checking and informative messages
  • Logging support - Track script execution and debugging
  • Configurable parameters - Customize analysis settings easily
  • Medical research tools - Statistical methods for research papers
  • Code comments and documentation - Well-documented code for learning
  • Complete code examples - Ready-to-run scripts with examples
  • Modular design - Reusable functions and classes
  • Best practices - Follows Python coding standards (PEP 8)

Dataset Features

Comprehensive healthcare patients dataset with patient records and 26 features for analytics.

  • Patient Demographics - patient_id, first_name, last_name, age, gender, blood_type
  • Medical History - admission_date, discharge_date, diagnosis, diagnosis_code, primary_condition, comorbidities
  • Medications - medication_1, medication_2, medication_3 (prescribed medications)
  • Diagnostic Tests - lab_test_1, lab_test_1_result, lab_test_2, lab_test_2_result, lab_test_3, lab_test_3_result
  • Treatment Information - treatment_type, length_of_stay (days), readmission_30_days (Yes/No)
  • Outcome Labels - outcome (Recovered/Improved/Stable)
  • Financial Data - insurance_type, charges (total medical charges)
  • 30 patient records - Balanced dataset with diverse medical conditions
  • 26 features - Comprehensive medical information for analysis
  • Date information - Admission and discharge dates for trend analysis
  • ICD-10 codes - Standard diagnosis codes for medical classification
  • Comorbidity data - Multiple conditions per patient for complexity analysis
  • Lab test results - Quantitative and qualitative test outcomes
  • Treatment outcomes - Three-tier classification system
  • Length of stay - Continuous variable for regression analysis
  • Readmission flag - Binary classification target for prediction
  • Insurance types - Categorical variable for cost analysis
  • High-quality patient data - Clean, validated, and consistent
  • HIPAA compliant format - De-identified patient information
  • No missing values - Complete dataset ready for analysis
  • Ready for machine learning - Feature engineering and model training
  • Predictive modeling ready - Multiple target variables available
  • Healthcare analytics utilities - Pre-built analysis functions
  • Easy to extend dataset - Add more patients or features
  • Organized project structure - Clear file organization

Credits & Acknowledgments

This dataset is provided for educational and research purposes. Core technologies and libraries are credited below.

  • Python 3.8+ - Programming language (PSF License)
  • Scikit-learn - Machine learning library (BSD License)
  • XGBoost - Gradient boosting framework (Apache 2.0)
  • NumPy - Numerical computing (BSD License)
  • pandas - Data manipulation (BSD License)
  • matplotlib - Data Visualization (PSF License)
  • RSK World - Dataset creator and provider
  • GitHub Repository - Source code and releases
  • Author: Molla Samser | Designer: Rima Khatun
  • MIT License - Free for learning & research

Support & Contact

For commercial use, custom datasets, or integration help, please contact us.

  • Email: help@rskworld.in
  • Phone: +91 93305 39277
  • Website: RSKWORLD.in
  • Location: Nutanhat, Mongolkote, West Bengal, India
  • Author: Molla Samser
  • Designer & Tester: Rima Khatun
  • GitHub: Coming Soon
  • Healthcare Patients Dataset Documentation
  • Technical Support Available
  • Custom Dataset Requests Welcome
Featured Content
Additional Sponsored Content

Download Free Source Code

Get the complete dataset bundle. You can view the files or download the dataset directly.

Download Free Source Code

Quick Links

Live Demo - Try Healthcare Patients Dataset Click to explore
Download Free Source Code Click to explore
View Files (Browser) Click to explore
Explore All Dataset Projects by RSK World Click to explore
Explore All Data Science Projects by RSK World Click to explore

Categories

Medical Analytics 26 Features Treatment Outcomes Machine Learning Python Healthcare Analytics

Technologies

Medical Analytics
Healthcare Analytics
SQL
Python
Data Science

Explore More Datasets

Healthcare & Medical Analytics

Dataset Learning Dataset Computer Vision Python Image Classification
Voice Cloning Dataset - rskworld.in
Voice Cloning Dataset
Audio Data

Voice cloning dataset with speaker recordings and voice characteristics for text...

View Project
Sports Analysis Dataset - rskworld.in
Sports Analysis Dataset
Video Data

Sports video dataset with game footage, player tracking, and event annotations f...

View Project
Video Classification Dataset - rskworld.in
Video Classification Dataset
Video Data

Video classification dataset with labeled video clips across multiple categories...

View Project
Medical Imaging Dataset - rskworld.in
Medical Imaging Dataset
Image Data

Medical image dataset with X-rays, CT scans, and MRI images with diagnostic labe...

View Project
Environmental Sound Dataset - rskworld.in
Environmental Sound Dataset
Audio Data

Environmental sound classification dataset with audio samples of natural and urb...

View Project
View All Projects

About RSK World

Founded by Molla Samser, with Designer & Tester Rima Khatun, RSK World is your one-stop destination for free programming resources, source code, and development tools.

Founder: Molla Samser
Designer & Tester: Rima Khatun

Development

  • Game Development
  • Web Development
  • Mobile Development
  • AI Development
  • Development Tools

Legal

  • Terms & Conditions
  • Privacy Policy
  • Disclaimer

Contact Info

Nutanhat, Mongolkote
Purba Burdwan, West Bengal
India, 713147

+91 93305 39277

hello@rskworld.in
support@rskworld.in

© 2026 RSK World. All rights reserved.

Content used for educational purposes only. View Disclaimer

Support This Free Project

This project is completely free to download!

If you find it useful, consider supporting us with a small donation. Your support helps us create more free projects.

Pay via Razorpay

If you find this Healthcare Patients Dataset useful, you can support with a small contribution.

Secure Fast Trusted
Payment Successful! Your download will start automatically...
Pay via UPI QR
Scan or tap an amount to auto-generate
UPI QR
₹
Open UPI app
GPay PhonePe Paytm